diff mbox series

blk-mq: Remove get_cpu() in __blk_mq_delay_run_hw_queue().

Message ID YgtuX9dDpXe8Xbs+@linutronix.de (mailing list archive)
State New, archived
Headers show
Series blk-mq: Remove get_cpu() in __blk_mq_delay_run_hw_queue(). | expand

Commit Message

Sebastian Andrzej Siewior Feb. 15, 2022, 9:11 a.m. UTC
The callchain:
  __blk_mq_delay_run_hw_queue(x, false, x)
    cpu = get_cpu();
    cpumask_test_cpu(cpu, hctx->cpumask);
      __blk_mq_run_hw_queue(hctx);
        blk_mq_run_dispatch_ops();
	  __blk_mq_run_dispatch_ops(x, true, x);
	    might_sleep_if(true);

will trigger the might_sleep warning since get_cpu() disables
preemption. Based on my understanding, __blk_mq_run_hw_queue() should
run on the requested CPU for the benefit of cache locality. The system
won't crash if it runs on another CPU but the performance will be
probably less than optimal.

If the current CPU matches the requested cpumask then it may remain and
fulfill the requirement. If the scheduler moves the task to another CPU
then it will run on the wrong CPU but no harm is done in terms of
correctness.

Remove get_cpu() from __blk_mq_delay_run_hw_queue().

Signed-off-by: Sebastian Andrzej Siewior <bigeasy@linutronix.de>
---
I have nothing to trigger that path, this is based on review. I see the
callchain but with blk_queue_has_srcu == 0.
The calls I see are from per-CPU threads and from unbound CPU threads.

If the correct CPU is important then migrate_disable() could be used
(slightly higher overhead than preempt_disable() but preemptible) or
unconditionally pass to kblockd_mod_delayed_work_on() (higher overhead
than migrate_disable() but will be done anyway if the current CPU is
wrong).

 block/blk-mq.c | 6 +-----
 1 file changed, 1 insertion(+), 5 deletions(-)

Comments

Ming Lei Feb. 15, 2022, 9:23 a.m. UTC | #1
On Tue, Feb 15, 2022 at 10:11:59AM +0100, Sebastian Andrzej Siewior wrote:
> The callchain:
>   __blk_mq_delay_run_hw_queue(x, false, x)
>     cpu = get_cpu();
>     cpumask_test_cpu(cpu, hctx->cpumask);
>       __blk_mq_run_hw_queue(hctx);
>         blk_mq_run_dispatch_ops();
> 	  __blk_mq_run_dispatch_ops(x, true, x);
> 	    might_sleep_if(true);
> 
> will trigger the might_sleep warning since get_cpu() disables
> preemption. Based on my understanding, __blk_mq_run_hw_queue() should
> run on the requested CPU for the benefit of cache locality. The system
> won't crash if it runs on another CPU but the performance will be
> probably less than optimal.
> 
> If the current CPU matches the requested cpumask then it may remain and
> fulfill the requirement. If the scheduler moves the task to another CPU
> then it will run on the wrong CPU but no harm is done in terms of
> correctness.
> 
> Remove get_cpu() from __blk_mq_delay_run_hw_queue().
> 
> Signed-off-by: Sebastian Andrzej Siewior <bigeasy@linutronix.de>
> ---
> I have nothing to trigger that path, this is based on review. I see the
> callchain but with blk_queue_has_srcu == 0.
> The calls I see are from per-CPU threads and from unbound CPU threads.
> 
> If the correct CPU is important then migrate_disable() could be used
> (slightly higher overhead than preempt_disable() but preemptible) or
> unconditionally pass to kblockd_mod_delayed_work_on() (higher overhead
> than migrate_disable() but will be done anyway if the current CPU is
> wrong).
> 
>  block/blk-mq.c | 6 +-----
>  1 file changed, 1 insertion(+), 5 deletions(-)
> 
> diff --git a/block/blk-mq.c b/block/blk-mq.c
> index 1adfe4824ef5e..90217f1b09add 100644
> --- a/block/blk-mq.c
> +++ b/block/blk-mq.c
> @@ -2040,14 +2040,10 @@ static void __blk_mq_delay_run_hw_queue(struct blk_mq_hw_ctx *hctx, bool async,
>  		return;
>  
>  	if (!async && !(hctx->flags & BLK_MQ_F_BLOCKING)) {
> -		int cpu = get_cpu();

Did you observe the warning triggered?  might_sleep_if(true) in
__blk_mq_run_dispatch_ops() is only supposed to be run in case of
(hctx->flags & BLK_MQ_F_BLOCKING).

Thanks, 
Ming
Sebastian Andrzej Siewior Feb. 15, 2022, 9:31 a.m. UTC | #2
On 2022-02-15 17:23:45 [+0800], Ming Lei wrote:
> > diff --git a/block/blk-mq.c b/block/blk-mq.c
> > index 1adfe4824ef5e..90217f1b09add 100644
> > --- a/block/blk-mq.c
> > +++ b/block/blk-mq.c
> > @@ -2040,14 +2040,10 @@ static void __blk_mq_delay_run_hw_queue(struct blk_mq_hw_ctx *hctx, bool async,
> >  		return;
> >  
> >  	if (!async && !(hctx->flags & BLK_MQ_F_BLOCKING)) {
> > -		int cpu = get_cpu();
> 
> Did you observe the warning triggered?  might_sleep_if(true) in
> __blk_mq_run_dispatch_ops() is only supposed to be run in case of
> (hctx->flags & BLK_MQ_F_BLOCKING).

I haven't seen it triggering but if that call chain is possible then it
will triggert due to get_cpu().

> Thanks, 
> Ming

Sebastian
Ming Lei Feb. 15, 2022, 11:58 a.m. UTC | #3
On Tue, Feb 15, 2022 at 10:31:49AM +0100, Sebastian Andrzej Siewior wrote:
> On 2022-02-15 17:23:45 [+0800], Ming Lei wrote:
> > > diff --git a/block/blk-mq.c b/block/blk-mq.c
> > > index 1adfe4824ef5e..90217f1b09add 100644
> > > --- a/block/blk-mq.c
> > > +++ b/block/blk-mq.c
> > > @@ -2040,14 +2040,10 @@ static void __blk_mq_delay_run_hw_queue(struct blk_mq_hw_ctx *hctx, bool async,
> > >  		return;
> > >  
> > >  	if (!async && !(hctx->flags & BLK_MQ_F_BLOCKING)) {
> > > -		int cpu = get_cpu();
> > 
> > Did you observe the warning triggered?  might_sleep_if(true) in
> > __blk_mq_run_dispatch_ops() is only supposed to be run in case of
> > (hctx->flags & BLK_MQ_F_BLOCKING).
> 
> I haven't seen it triggering but if that call chain is possible then it
> will triggert due to get_cpu().

As I mentioned, the call chain isn't possible.

Thanks,
Ming
Sebastian Andrzej Siewior Feb. 15, 2022, 12:11 p.m. UTC | #4
On 2022-02-15 19:58:03 [+0800], Ming Lei wrote:
> On Tue, Feb 15, 2022 at 10:31:49AM +0100, Sebastian Andrzej Siewior wrote:
> > On 2022-02-15 17:23:45 [+0800], Ming Lei wrote:
> > > > diff --git a/block/blk-mq.c b/block/blk-mq.c
> > > > index 1adfe4824ef5e..90217f1b09add 100644
> > > > --- a/block/blk-mq.c
> > > > +++ b/block/blk-mq.c
> > > > @@ -2040,14 +2040,10 @@ static void __blk_mq_delay_run_hw_queue(struct blk_mq_hw_ctx *hctx, bool async,
> > > >  		return;
> > > >  
> > > >  	if (!async && !(hctx->flags & BLK_MQ_F_BLOCKING)) {
> > > > -		int cpu = get_cpu();
> > > 
> > > Did you observe the warning triggered?  might_sleep_if(true) in
> > > __blk_mq_run_dispatch_ops() is only supposed to be run in case of
> > > (hctx->flags & BLK_MQ_F_BLOCKING).
> > 
> > I haven't seen it triggering but if that call chain is possible then it
> > will triggert due to get_cpu().
> 
> As I mentioned, the call chain isn't possible.

Why not? Is BLK_MQ_F_BLOCKING and blk_queue_has_srcu() exclusive?

> Thanks,
> Ming

Sebastian
Ming Lei Feb. 15, 2022, 12:22 p.m. UTC | #5
On Tue, Feb 15, 2022 at 01:11:24PM +0100, Sebastian Andrzej Siewior wrote:
> On 2022-02-15 19:58:03 [+0800], Ming Lei wrote:
> > On Tue, Feb 15, 2022 at 10:31:49AM +0100, Sebastian Andrzej Siewior wrote:
> > > On 2022-02-15 17:23:45 [+0800], Ming Lei wrote:
> > > > > diff --git a/block/blk-mq.c b/block/blk-mq.c
> > > > > index 1adfe4824ef5e..90217f1b09add 100644
> > > > > --- a/block/blk-mq.c
> > > > > +++ b/block/blk-mq.c
> > > > > @@ -2040,14 +2040,10 @@ static void __blk_mq_delay_run_hw_queue(struct blk_mq_hw_ctx *hctx, bool async,
> > > > >  		return;
> > > > >  
> > > > >  	if (!async && !(hctx->flags & BLK_MQ_F_BLOCKING)) {
> > > > > -		int cpu = get_cpu();
> > > > 
> > > > Did you observe the warning triggered?  might_sleep_if(true) in
> > > > __blk_mq_run_dispatch_ops() is only supposed to be run in case of
> > > > (hctx->flags & BLK_MQ_F_BLOCKING).
> > > 
> > > I haven't seen it triggering but if that call chain is possible then it
> > > will triggert due to get_cpu().
> > 
> > As I mentioned, the call chain isn't possible.
> 
> Why not? Is BLK_MQ_F_BLOCKING and blk_queue_has_srcu() exclusive?

blk_queue_has_srcu() returns true iff BLK_MQ_F_BLOCKING is set.

thanks,
Ming
diff mbox series

Patch

diff --git a/block/blk-mq.c b/block/blk-mq.c
index 1adfe4824ef5e..90217f1b09add 100644
--- a/block/blk-mq.c
+++ b/block/blk-mq.c
@@ -2040,14 +2040,10 @@  static void __blk_mq_delay_run_hw_queue(struct blk_mq_hw_ctx *hctx, bool async,
 		return;
 
 	if (!async && !(hctx->flags & BLK_MQ_F_BLOCKING)) {
-		int cpu = get_cpu();
-		if (cpumask_test_cpu(cpu, hctx->cpumask)) {
+		if (cpumask_test_cpu(raw_smp_processor_id(), hctx->cpumask)) {
 			__blk_mq_run_hw_queue(hctx);
-			put_cpu();
 			return;
 		}
-
-		put_cpu();
 	}
 
 	kblockd_mod_delayed_work_on(blk_mq_hctx_next_cpu(hctx), &hctx->run_work,