[v10,12/16] sched/core: uclamp: Extend CPU's cgroup controller

The cgroup CPU bandwidth controller allows assigning a specified
(maximum) bandwidth to the tasks of a group. However, this bandwidth is
defined and enforced only on a temporal basis, without considering the
actual frequency a CPU is running at. Thus, the amount of computation
completed by a task within an allocated bandwidth can vary widely
depending on the frequency at which the CPU runs that task.
The amount of computation can also be affected by the specific CPU a
task is running on, especially on asymmetric capacity systems like
Arm's big.LITTLE.

With the availability of schedutil, the scheduler is now able
to drive frequency selections based on actual task utilization.
Moreover, the utilization clamping support provides a mechanism to
bias the frequency selection operated by schedutil depending on
constraints assigned to the tasks currently RUNNABLE on a CPU.

Given the mechanisms described above, it is now possible to extend the
cpu controller to specify the minimum (or maximum) utilization which
should be considered for tasks RUNNABLE on a cpu.
This makes it possible to better define the actual computational
power assigned to task groups, thus improving the cgroup CPU bandwidth
controller, which is currently based just on time constraints.

Extend the CPU controller with two new attributes, uclamp.{min,max},
which allow enforcing utilization boosting and capping for all the
tasks in a group.

Specifically:

- uclamp.min: defines the minimum utilization which should be considered,
              i.e. the RUNNABLE tasks of this group will run at least at a
              minimum frequency which corresponds to the uclamp.min
              utilization

- uclamp.max: defines the maximum utilization which should be considered,
              i.e. the RUNNABLE tasks of this group will run up to a
              maximum frequency which corresponds to the uclamp.max
              utilization
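
As a rough illustration of the two attributes above, the following
Python sketch clamps a utilization value and maps the result to a
frequency request. The 0..1024 utilization scale, the 1.25 headroom
factor and every function name are assumptions made for this example;
none of them is taken from this patch.

```python
# Illustrative only: names, scale and headroom factor are assumptions.
SCHED_CAPACITY_SCALE = 1024  # assumed full-scale utilization


def clamp_util(util, uclamp_min, uclamp_max):
    """Clamp a utilization value into [uclamp_min, uclamp_max]."""
    return max(uclamp_min, min(util, uclamp_max))


def target_freq_khz(util, max_freq_khz, headroom=1.25):
    """Map a utilization to a frequency request, capped at max_freq_khz."""
    freq = int(headroom * max_freq_khz * util / SCHED_CAPACITY_SCALE)
    return min(freq, max_freq_khz)


# A lightly loaded task (util=100) boosted by a minimum clamp of 512.
boosted = clamp_util(100, uclamp_min=512, uclamp_max=1024)
# A heavy task (util=900) capped by a maximum clamp of 256.
capped = clamp_util(900, uclamp_min=0, uclamp_max=256)

print(boosted, target_freq_khz(boosted, 2_000_000))
print(capped, target_freq_khz(capped, 2_000_000))
```

In this sketch the minimum clamp raises the utilization seen by the
frequency selection even though the task itself is small, while the
maximum clamp hides most of the heavy task's demand.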

These attributes:

a) are available only for non-root nodes, both on default and legacy
   hierarchies, while system wide clamps are defined by a generic
   interface which does not depend on cgroups. This system wide
   interface enforces constraints on tasks in the root node.

b) enforce effective constraints at each level of the hierarchy which
   are a restriction of the group requests considering its parent's
   effective constraints. Root group effective constraints are defined
   by the system wide interface.
   This mechanism allows each (non-root) level of the hierarchy to:
   - request whatever clamp values it would like to get
   - effectively get only up to the maximum amount allowed by its parent

c) have higher priority than task-specific clamps, defined via
   sched_setattr(), thus allowing task requests to be controlled and
   restricted.
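
The restriction described in b) can be sketched as follows, assuming
that "restriction" means taking the smaller of the group's request and
its parent's effective value. The Group class, its fields and the
numeric values are hypothetical and exist only for this example; the
root group's values stand in for the system wide interface.

```python
# Illustrative only: the class, its fields and the values are hypothetical.
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Group:
    name: str
    req_min: int  # requested uclamp.min
    req_max: int  # requested uclamp.max
    parent: Optional["Group"] = None

    def effective(self) -> Tuple[int, int]:
        """Return (min, max) after restricting by the parent's effective values."""
        if self.parent is None:
            # Root: stands in for the system wide clamps.
            return self.req_min, self.req_max
        parent_min, parent_max = self.parent.effective()
        # A group may request anything, but gets at most what its parent allows.
        return min(self.req_min, parent_min), min(self.req_max, parent_max)


root = Group("root", req_min=512, req_max=800)
a = Group("a", req_min=600, req_max=1024, parent=root)
b = Group("b", req_min=200, req_max=400, parent=a)

for group in (root, a, b):
    print(group.name, group.effective())
```

Group "a" asks for more than the root allows and is cut back on both
clamps, while group "b" asks for less than its parent's effective values
and gets exactly what it requested.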

Add two new attributes to the cpu controller to collect "requested"
clamp values. Allow that at each non-root level of the hierarchy.
Validate local consistency by enforcing uclamp.min < uclamp.max.
Keep it simple by not caring now about "effective" values computation
and propagation along the hierarchy.
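
A minimal sketch of collecting and validating "requested" values,
assuming that writes arrive as percentage strings and are stored on a
0..1024 scale. The function names, the scale and the rounding mode are
assumptions for this example. The strict min < max check follows the
wording above literally; how equal values are treated is not something
this sketch settles.

```python
# Illustrative only: names, the 1024 scale and the rounding are assumptions.
from decimal import Decimal, InvalidOperation, ROUND_HALF_UP

SCHED_CAPACITY_SCALE = 1024  # assumed internal full-scale value


def percent_to_util(text):
    """Convert a percentage string such as "37.50" to the 0..1024 scale."""
    try:
        percent = Decimal(text.strip())
    except InvalidOperation:
        raise ValueError(f"not a number: {text!r}") from None
    if not percent.is_finite() or percent < 0 or percent > 100:
        raise ValueError(f"out of range: {text!r}")
    scaled = percent * SCHED_CAPACITY_SCALE / 100
    return int(scaled.to_integral_value(rounding=ROUND_HALF_UP))


def write_clamp(requested, clamp, text):
    """Return updated requested values, rejecting a write that breaks min < max."""
    updated = dict(requested, **{clamp: percent_to_util(text)})
    if not updated["min"] < updated["max"]:
        raise ValueError("uclamp.min must stay below uclamp.max")
    return updated


requested = {"min": 0, "max": SCHED_CAPACITY_SCALE}
requested = write_clamp(requested, "max", "50")
print(requested)

try:
    write_clamp(requested, "min", "75")
except ValueError as err:
    print("rejected:", err)
```

Only the group's own pair of values is checked here, which mirrors the
"local consistency" wording: nothing in this sketch looks at the parent
or computes effective values.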

Signed-off-by: Patrick Bellasi <patrick.bellasi@arm.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Tejun Heo <tj@kernel.org>

---
Changes in v10:
 Message-ID: <https://lore.kernel.org/lkml/20190603122422.GA19426@darkstar/>
 - rename cgroup attributes to be cpu.uclamp.{min,max}
 Message-ID: <https://lore.kernel.org/lkml/20190605152754.GO374014@devbig004.ftw2.facebook.com/>
 - use percentage rational numbers for clamp attributes
 Message-ID: <https://lore.kernel.org/lkml/20190605153955.GP374014@devbig004.ftw2.facebook.com/>
 - update initialization of subgroups clamps to be none by default
---
 Documentation/admin-guide/cgroup-v2.rst |  29 ++++
 init/Kconfig                            |  22 +++
 kernel/sched/core.c                     | 181 +++++++++++++++++++++++-
 kernel/sched/sched.h                    |   6 +
 4 files changed, 237 insertions(+), 1 deletion(-)

Message ID: 20190621084217.8167-13-patrick.bellasi@arm.com
State: Not Applicable, archived
From: Patrick Bellasi <patrick.bellasi@arm.com>
To: linux-kernel@vger.kernel.org, linux-pm@vger.kernel.org
Cc: Ingo Molnar <mingo@redhat.com>, Peter Zijlstra <peterz@infradead.org>,
    Tejun Heo <tj@kernel.org>, "Rafael J . Wysocki" <rafael.j.wysocki@intel.com>,
    Vincent Guittot <vincent.guittot@linaro.org>,
    Viresh Kumar <viresh.kumar@linaro.org>, Paul Turner <pjt@google.com>,
    Quentin Perret <quentin.perret@arm.com>,
    Dietmar Eggemann <dietmar.eggemann@arm.com>,
    Morten Rasmussen <morten.rasmussen@arm.com>,
    Juri Lelli <juri.lelli@redhat.com>, Todd Kjos <tkjos@google.com>,
    Joel Fernandes <joelaf@google.com>, Steve Muckle <smuckle@google.com>,
    Suren Baghdasaryan <surenb@google.com>,
    Alessio Balsini <balsini@android.com>
Subject: [PATCH v10 12/16] sched/core: uclamp: Extend CPU's cgroup controller
Date: Fri, 21 Jun 2019 09:42:13 +0100
In-Reply-To: <20190621084217.8167-1-patrick.bellasi@arm.com>
Series: Add utilization clamping support
