mbox series

[v3,0/2] cpufreq/amd-pstate: Set initial min_freq to lowest_nonlinear_freq

Message ID 20241017053927.25285-1-Dhananjay.Ugwekar@amd.com (mailing list archive)
Headers show
Series cpufreq/amd-pstate: Set initial min_freq to lowest_nonlinear_freq | expand

Message

Dhananjay Ugwekar Oct. 17, 2024, 5:39 a.m. UTC
According to the AMD architectural programmer's manual volume 2 [1], 
in section "17.6.4.1 CPPC_CAPABILITY_1" lowest_nonlinear_perf is described 
as "Reports the most energy efficient performance level (in terms of 
performance per watt). Above this threshold, lower performance levels 
generally result in increased energy efficiency. Reducing performance 
below this threshold does not result in total energy savings for a given 
computation, although it reduces instantaneous power consumption". So 
lowest_nonlinear_perf is the most power efficient performance level, and 
going below that would lead to a worse performance/watt.

Also setting the minimum frequency to lowest_nonlinear_freq (instead of
lowest_freq) allows the CPU to idle at a higher frequency which leads
to more time being spent in a deeper idle state (as trivial idle tasks
are completed sooner). This has shown a power benefit in some systems.
In other systems, power consumption has increased but so has the
throughput/watt.

Our objective here is to update the initial lower frequency limit to 
lowest_nonlinear_freq, while allowing the user to later update the lower 
limit to anywhere between lowest_freq to highest_freq for the platform.

So, set the policy->min to lowest_nonlinear_freq in the ->verify() 
callback, only if the original value is equal to FREQ_QOS_MIN_DEFAULT_VALUE
(i.e. 0). Merge the two identical verify functions while at it.

Link: https://www.amd.com/content/dam/amd/en/documents/processor-tech-docs/programmer-references/24593.pdf [1]

Changes from v2:
* Fix the misplaced NULL pointer check (Mario)
* Move all new code inside the if condition
* Add comment to explain the rationale

v2 Link: https://lore.kernel.org/linux-pm/20241016144639.135610-1-Dhananjay.Ugwekar@amd.com/

Changes from v1:
* Modify the initial min_freq from verify callback, instead of adding a
  new callback in cpufreq_driver struct (Rafael)

v1 Link: https://lore.kernel.org/linux-pm/20241003083952.3186-1-Dhananjay.Ugwekar@amd.com/

Dhananjay Ugwekar (2):
  cpufreq/amd-pstate: Remove the redundant verify() function
  cpufreq/amd-pstate: Set the initial min_freq to lowest_nonlinear_freq

 drivers/cpufreq/amd-pstate.c | 34 +++++++++++++++++++++++-----------
 1 file changed, 23 insertions(+), 11 deletions(-)

Comments

Hanabishi Dec. 8, 2024, 7:54 a.m. UTC | #1
Hello. Maybe I'm too late on this, but I have some concerns.

On 10/17/24 05:39, Dhananjay Ugwekar wrote:
> In other systems, power consumption has increased but so has the
> throughput/watt.

I just want to bring up the fact that this change affects all governors. It sounds good for the performance governor, but not so much for the powersave governor.

So the question is: don't we want the lowest power consumption possible in the powersave mode? Even if it means decreased efficiency. Powersave by definition supposed to make battery last as long as possible no matter what, isn't it?
Mario Limonciello Dec. 8, 2024, 4:35 p.m. UTC | #2
On 12/8/2024 01:54, Hanabishi wrote:
> Hello. Maybe I'm too late on this, but I have some concerns.
> 
> On 10/17/24 05:39, Dhananjay Ugwekar wrote:
>> In other systems, power consumption has increased but so has the
>> throughput/watt.
> 
> I just want to bring up the fact that this change affects all governors. 
> It sounds good for the performance governor, but not so much for the 
> powersave governor.
> 
> So the question is: don't we want the lowest power consumption possible 
> in the powersave mode? Even if it means decreased efficiency. Powersave 
> by definition supposed to make battery last as long as possible no 
> matter what, isn't it?
> 

No, the powersave governor isn't a one stop shop to bring everything to 
longest battery.

By your argument we should set the EPP to "power" by default and "boost" 
to off by default when the powersave governor is enacted?

All of those are far too aggressive for a default behavior.  Setting the 
lowest nonlinear frequency as the default lowest scaling frequency is 
about having a good default that balances responsiveness, battery life 
and performance.

Like all knobs anyone that doesn't agree with it can of course modify it 
from sysfs.