Message ID | 12333551.k204JYSgao@aspire.rjw.lan (mailing list archive) |
---|---|
State | New, archived |
Headers | show |
On Thursday, August 24, 2017 6:35:49 PM CEST Rafael J. Wysocki wrote: > On Thursday, August 24, 2017 11:15:26 AM CEST Ulf Hansson wrote: [cut] > [BTW, it is not entirely clear to me why it ever is necessary to runtime resume > a device with direct_complete set after __device_suspend(), because it can only > have direct_complete set at that point if all of the hierarchy below it has > this flag set too and so runtime PM has to be disabled for all of those > devices as well.] Which makes me realize that we should take a step back and look at what problems there are. First, there are devices (I know about two examples so far and both are PCI) that may need to be runtime resumed during system suspend for reasons other than the ones checked by the ACPI PM domain (or the PCI bus type). There needs to be a way to indicate that from the driver side. However, it still may be valuable to check the power-related conditions for leaving the device in runtime suspend over system suspend/resume in case it actually doesn't need to be runtime resumed during system suspend after all. That's what the majority of my patch was about. The second problem is that the ACPI PM domain (and the PCI bus type) runtime resumes all devices unconditionally in its ->suspend callback, even though that may not be necessary for some devices. Therefore there needs to be a way to indicate that too. That still would be good to have *regardless* of the direct_complete mechanism, because the direct_complete flag may not be set very often due to dependencies and then the resume-during-suspend will take place unnecessarily. Accordingly, it looks like we need a "no need to resume me" flag in the first place. That would indicate to interested pieces of code that, from the driver perspective, the device doesn't need to be runtime resumed before invoking its system suspend callbacks. This should be clear enough to everyone IMO. [Note that if that flag is set for all devices, we may drop it along with direct_complete, but before that happens both are needed.] To address the first issue I would add something like the flag in the patches I sent (but without the ACPI PM domain part which should be covered by the "no need to resume me" flag above), because that allows the device's ->suspend callback to run in principle and the driver may use that callback even to runtime resume the device if that's what it wants to do. So something like "run my ->suspend callback even though I might stay in runtime suspend". I would probably add driver_flags to dev_pm_info for that to set at the probe time (and I would make the core clear that on driver removal). The complexity concern is there, but honestly I don't see a better way at this point. Thanks, Rafael
[...] >> >> The above change would offer an improvement, however there are more >> benefits from the runtime PM centric path that it doesn't cover. > > The idea, though, is not to make it instead of, but in addition to the > runtime PM centric path. > > So a driver using the runtime PM centric (or rather runtime PM aware) > callbacks can set power.force_suspend and benefit from having direct_complete > set conditional on whether or not the device is runtime resumed before the > late suspend. > > If device_complete remains set in __device_suspend_late(), no system > suspend/resume callbacks will be invoked from that point on. If it isn't > set, pm_runtime_force_suspend() should work just fine and then > pm_runtime_force_resume() should do the right thing on the way back. > > All boils down to (a) setting a flag and (b) using the right callbacks > for system suspend which is the case with your patches either. You make it sound simple. :-) I try to get inspired of your ideas and see what I can come up with. However, I really want us to avoid making it more difficult to understand the behavior of the core, so I probably look start changing the ACPI PM domain first, then trying to adopt the behavior of the core on top of those changes. Does that sound reasonable to you? > >> The below is pasted from the changelog for patch9 (I will make sure to >> fold in this in the changelog for $subject patch next version). >> >> In case the device is/gets runtime resumed before the device_suspend() >> phase is entered, the PM core doesn't run the direct_complete path for >> the device during system sleep. During system resume, this lead to >> that the device will always be brought back to full power when the >> i2c-dw-plat driver's ->resume() callback is invoked. This may not be >> necessary, thus increasing the resume time for the device and wasting >> power. > > Well, I guess that depends on what is there in its resume callbacks. > >> In the runtime PM centric path, the pm_runtime_force_resume() helper >> takes better care, as it resumes the device only in case it's really >> needed. Normally this means it can be postponed to be dealt with via >> regular calls to runtime PM (pm_runtime_get*()) instead. In other >> words, the device remains in its low power state until someone request >> a new i2c transfer, whenever that may be. >> >> As far as I can think of, the direct_complete path can't be adjusted >> to support this. Or can it? > > It surely can, it's just software. :-) The question is how complicated that > is going to be in the end, however. > > So the patch I sent didn't address the dependency issue, but it allows the > core to deal with the issue which is a core issue, not an ACPI PM domain > issue, because the core disables runtime PM for devices with direct_complete > set, not the ACPI PM domain or PCI etc. > > [BTW, it is not entirely clear to me why it ever is necessary to runtime resume > a device with direct_complete set after __device_suspend(), because it can only > have direct_complete set at that point if all of the hierarchy below it has > this flag set too and so runtime PM has to be disabled for all of those > devices as well.] > > It can be made slightly better by adding a piece to avoid clearing the > parent's direct_complete if it had the new flag set, so below is a new > version (with the new flag renamed to safe_suspend which seemed better to > me). > > Of course, having safe_suspend set will still cause the parent's > direct_complete to be cleared unless the parent has safe_suspend set too, > but that's better than not allowing the parent to use direct_complete at all. Well, as I replied in the earlier response, deploying the runtime PM centric path also for the parent, offers additional optimizations while comparing to try to make the direct_complete path still to work. We could also pick that approach, which would mean additional arguments of moving drivers to the runtime PM centric path, of course only where it makes sense. > > --- > drivers/acpi/device_pm.c | 8 ++++++++ > drivers/base/power/main.c | 15 ++++++++++++--- > include/linux/pm.h | 1 + > 3 files changed, 21 insertions(+), 3 deletions(-) > > Index: linux-pm/drivers/base/power/main.c > =================================================================== > --- linux-pm.orig/drivers/base/power/main.c > +++ linux-pm/drivers/base/power/main.c > @@ -1271,9 +1271,16 @@ static int __device_suspend_late(struct > goto Complete; > } > > - if (dev->power.syscore || dev->power.direct_complete) > + if (dev->power.syscore) > goto Complete; > > + if (dev->power.direct_complete) { > + if (pm_runtime_status_suspended(dev)) > + goto Complete; > + > + dev->power.direct_complete = false; > + } > + > if (dev->pm_domain) { > info = "late power domain "; > callback = pm_late_early_op(&dev->pm_domain->ops, state); > @@ -1482,7 +1489,7 @@ static int __device_suspend(struct devic > if (dev->power.syscore) > goto Complete; > > - if (dev->power.direct_complete) { > + if (dev->power.direct_complete && !dev->power.safe_suspend) { > if (pm_runtime_status_suspended(dev)) { > pm_runtime_disable(dev); > if (pm_runtime_status_suspended(dev)) > @@ -1549,7 +1556,9 @@ static int __device_suspend(struct devic > if (parent) { > spin_lock_irq(&parent->power.lock); > > - dev->parent->power.direct_complete = false; > + if (!dev->parent->power.safe_suspend) > + dev->parent->power.direct_complete = false; > + > if (dev->power.wakeup_path > && !dev->parent->power.ignore_children) > dev->parent->power.wakeup_path = true; > Index: linux-pm/include/linux/pm.h > =================================================================== > --- linux-pm.orig/include/linux/pm.h > +++ linux-pm/include/linux/pm.h > @@ -554,6 +554,7 @@ struct dev_pm_info { > pm_message_t power_state; > unsigned int can_wakeup:1; > unsigned int async_suspend:1; > + unsigned int safe_suspend:1; > bool in_dpm_list:1; /* Owned by the PM core */ > bool is_prepared:1; /* Owned by the PM core */ > bool is_suspended:1; /* Ditto */ > Index: linux-pm/drivers/acpi/device_pm.c > =================================================================== > --- linux-pm.orig/drivers/acpi/device_pm.c > +++ linux-pm/drivers/acpi/device_pm.c > @@ -1025,9 +1025,17 @@ EXPORT_SYMBOL_GPL(acpi_subsys_prepare); > * > * Follow PCI and resume devices suspended at run time before running their > * system suspend callbacks. > + * > + * If the power.direct_complete flag is set for the device, though, skip all > + * that, because the device doesn't need to be resumed then and if it is > + * resumed via runtime PM, the core will notice that and will carry out the > + * late suspend for it. > */ > int acpi_subsys_suspend(struct device *dev) > { > + if (dev->power.direct_complete) > + return 0; > + > pm_runtime_resume(dev); > return pm_generic_suspend(dev); > } > Kind regards Uffe
On Friday, August 25, 2017 11:28:25 AM CEST Ulf Hansson wrote: > [...] > > >> > >> The above change would offer an improvement, however there are more > >> benefits from the runtime PM centric path that it doesn't cover. > > > > The idea, though, is not to make it instead of, but in addition to the > > runtime PM centric path. > > > > So a driver using the runtime PM centric (or rather runtime PM aware) > > callbacks can set power.force_suspend and benefit from having direct_complete > > set conditional on whether or not the device is runtime resumed before the > > late suspend. > > > > If device_complete remains set in __device_suspend_late(), no system > > suspend/resume callbacks will be invoked from that point on. If it isn't > > set, pm_runtime_force_suspend() should work just fine and then > > pm_runtime_force_resume() should do the right thing on the way back. > > > > All boils down to (a) setting a flag and (b) using the right callbacks > > for system suspend which is the case with your patches either. > > You make it sound simple. :-) > > I try to get inspired of your ideas and see what I can come up with. > However, I really want us to avoid making it more difficult to > understand the behavior of the core, so I probably look start changing > the ACPI PM domain first, then trying to adopt the behavior of the > core on top of those changes. Does that sound reasonable to you? Well, I don't think we need to change the ACPI PM domain at all. :-) However, let me contiune in a reply to https://marc.info/?l=linux-arm-kernel&m=150361200030186&w=2 for more complete context. Thanks, Rafael
Index: linux-pm/drivers/base/power/main.c =================================================================== --- linux-pm.orig/drivers/base/power/main.c +++ linux-pm/drivers/base/power/main.c @@ -1271,9 +1271,16 @@ static int __device_suspend_late(struct goto Complete; } - if (dev->power.syscore || dev->power.direct_complete) + if (dev->power.syscore) goto Complete; + if (dev->power.direct_complete) { + if (pm_runtime_status_suspended(dev)) + goto Complete; + + dev->power.direct_complete = false; + } + if (dev->pm_domain) { info = "late power domain "; callback = pm_late_early_op(&dev->pm_domain->ops, state); @@ -1482,7 +1489,7 @@ static int __device_suspend(struct devic if (dev->power.syscore) goto Complete; - if (dev->power.direct_complete) { + if (dev->power.direct_complete && !dev->power.safe_suspend) { if (pm_runtime_status_suspended(dev)) { pm_runtime_disable(dev); if (pm_runtime_status_suspended(dev)) @@ -1549,7 +1556,9 @@ static int __device_suspend(struct devic if (parent) { spin_lock_irq(&parent->power.lock); - dev->parent->power.direct_complete = false; + if (!dev->parent->power.safe_suspend) + dev->parent->power.direct_complete = false; + if (dev->power.wakeup_path && !dev->parent->power.ignore_children) dev->parent->power.wakeup_path = true; Index: linux-pm/include/linux/pm.h =================================================================== --- linux-pm.orig/include/linux/pm.h +++ linux-pm/include/linux/pm.h @@ -554,6 +554,7 @@ struct dev_pm_info { pm_message_t power_state; unsigned int can_wakeup:1; unsigned int async_suspend:1; + unsigned int safe_suspend:1; bool in_dpm_list:1; /* Owned by the PM core */ bool is_prepared:1; /* Owned by the PM core */ bool is_suspended:1; /* Ditto */ Index: linux-pm/drivers/acpi/device_pm.c =================================================================== --- linux-pm.orig/drivers/acpi/device_pm.c +++ linux-pm/drivers/acpi/device_pm.c @@ -1025,9 +1025,17 @@ EXPORT_SYMBOL_GPL(acpi_subsys_prepare); * * Follow PCI and resume devices suspended at run time before running their * system suspend callbacks. + * + * If the power.direct_complete flag is set for the device, though, skip all + * that, because the device doesn't need to be resumed then and if it is + * resumed via runtime PM, the core will notice that and will carry out the + * late suspend for it. */ int acpi_subsys_suspend(struct device *dev) { + if (dev->power.direct_complete) + return 0; + pm_runtime_resume(dev); return pm_generic_suspend(dev); }