Message ID | aa1d8314-3c5e-4554-61e1-95c61aa11624@sandisk.com (mailing list archive) |
---|---|
State | New, archived |
Headers | show |
On Wed, Aug 31, 2016 at 03:03:24PM -0700, Bart Van Assche wrote: > Protect sw-ctx busy bit manipulations via the sw queue lock. > This allows to convert the atomic bit operations into slightly > faster non-atomic operations. Document that blk_mq_run_hw_queues() > tests these bits without holding the sw queue lock. > > Signed-off-by: Bart Van Assche <bart.vanassche@sandisk.com> > Cc: Christoph Hellwig <hch@lst.de> > Cc: Sagi Grimberg <sagi@grimberg.me> > --- > block/blk-mq.c | 16 +++++++++++++--- > 1 file changed, 13 insertions(+), 3 deletions(-) > > diff --git a/block/blk-mq.c b/block/blk-mq.c > index 13f5a6c..0dcafa6 100644 > --- a/block/blk-mq.c > +++ b/block/blk-mq.c > @@ -66,8 +66,9 @@ static void blk_mq_hctx_mark_pending(struct blk_mq_hw_ctx *hctx, > { > struct blk_align_bitmap *bm = get_bm(hctx, ctx); > > + lockdep_assert_held(&ctx->lock); > if (!test_bit(CTX_TO_BIT(hctx, ctx), &bm->word)) > - set_bit(CTX_TO_BIT(hctx, ctx), &bm->word); > + __set_bit(CTX_TO_BIT(hctx, ctx), &bm->word); > } > > static void blk_mq_hctx_clear_pending(struct blk_mq_hw_ctx *hctx, > @@ -75,7 +76,8 @@ static void blk_mq_hctx_clear_pending(struct blk_mq_hw_ctx *hctx, > { > struct blk_align_bitmap *bm = get_bm(hctx, ctx); > > - clear_bit(CTX_TO_BIT(hctx, ctx), &bm->word); > + lockdep_assert_held(&ctx->lock); > + __clear_bit(CTX_TO_BIT(hctx, ctx), &bm->word); > } NAK, we can't protect each bit with a separate lock, multiple software queues share the bitmap. There's a race if we do non-atomic bit operations on two software queues with bits in the same word.
On 08/31/16 18:18, Omar Sandoval wrote: > NAK, we can't protect each bit with a separate lock, multiple software > queues share the bitmap. There's a race if we do non-atomic bit > operations on two software queues with bits in the same word. Hi Omar, You are right of course. This is something I should have realized myself. Jens, please ignore this patch. Bart. -- To unsubscribe from this list: send the line "unsubscribe linux-block" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
diff --git a/block/blk-mq.c b/block/blk-mq.c index 13f5a6c..0dcafa6 100644 --- a/block/blk-mq.c +++ b/block/blk-mq.c @@ -66,8 +66,9 @@ static void blk_mq_hctx_mark_pending(struct blk_mq_hw_ctx *hctx, { struct blk_align_bitmap *bm = get_bm(hctx, ctx); + lockdep_assert_held(&ctx->lock); if (!test_bit(CTX_TO_BIT(hctx, ctx), &bm->word)) - set_bit(CTX_TO_BIT(hctx, ctx), &bm->word); + __set_bit(CTX_TO_BIT(hctx, ctx), &bm->word); } static void blk_mq_hctx_clear_pending(struct blk_mq_hw_ctx *hctx, @@ -75,7 +76,8 @@ static void blk_mq_hctx_clear_pending(struct blk_mq_hw_ctx *hctx, { struct blk_align_bitmap *bm = get_bm(hctx, ctx); - clear_bit(CTX_TO_BIT(hctx, ctx), &bm->word); + lockdep_assert_held(&ctx->lock); + __clear_bit(CTX_TO_BIT(hctx, ctx), &bm->word); } void blk_mq_freeze_queue_start(struct request_queue *q) @@ -768,8 +770,8 @@ static void flush_busy_ctxs(struct blk_mq_hw_ctx *hctx, struct list_head *list) break; ctx = hctx->ctxs[bit + off]; - clear_bit(bit, &bm->word); spin_lock(&ctx->lock); + __clear_bit(bit, &bm->word); list_splice_tail_init(&ctx->rq_list, list); spin_unlock(&ctx->lock); @@ -940,6 +942,13 @@ void blk_mq_run_hw_queue(struct blk_mq_hw_ctx *hctx, bool async) &hctx->run_work, 0); } +/** + * blk_mq_run_hw_queues - run all hardware queues + * + * Note: this function checks the SW and HW busy status without serialization + * against the functions that modify that status information. The caller is + * responsible for realizing that serialization. + */ void blk_mq_run_hw_queues(struct request_queue *q, bool async) { struct blk_mq_hw_ctx *hctx; @@ -1055,6 +1064,7 @@ static void __blk_mq_insert_request(struct blk_mq_hw_ctx *hctx, { struct blk_mq_ctx *ctx = rq->mq_ctx; + lockdep_assert_held(&ctx->lock); __blk_mq_insert_req_list(hctx, rq, at_head); blk_mq_hctx_mark_pending(hctx, ctx); }
Protect sw-ctx busy bit manipulations via the sw queue lock. This allows to convert the atomic bit operations into slightly faster non-atomic operations. Document that blk_mq_run_hw_queues() tests these bits without holding the sw queue lock. Signed-off-by: Bart Van Assche <bart.vanassche@sandisk.com> Cc: Christoph Hellwig <hch@lst.de> Cc: Sagi Grimberg <sagi@grimberg.me> --- block/blk-mq.c | 16 +++++++++++++--- 1 file changed, 13 insertions(+), 3 deletions(-)