Message ID | 83915a4be672d99729029800196008c3b39c7a3a.1504748195.git.shli@fb.com (mailing list archive) |
---|---|
State | New, archived |
Headers | show |
Hello, On Wed, Sep 06, 2017 at 07:00:53PM -0700, Shaohua Li wrote: > diff --git a/drivers/block/loop.c b/drivers/block/loop.c > index 9d4545f..9850b27 100644 > --- a/drivers/block/loop.c > +++ b/drivers/block/loop.c > @@ -482,6 +482,8 @@ static void lo_rw_aio_complete(struct kiocb *iocb, long ret, long ret2) > { > struct loop_cmd *cmd = container_of(iocb, struct loop_cmd, iocb); > > + if (cmd->css) > + css_put(cmd->css); > cmd->ret = ret > 0 ? 0 : ret; > lo_rw_aio_do_completion(cmd); The fact that we're forwarding explicitly in loop still bothers me a bit. Can you please elaborate why we don't want to do this generically through aio? Thanks.
On Fri, Sep 08, 2017 at 07:48:09AM -0700, Tejun Heo wrote: > Hello, > > On Wed, Sep 06, 2017 at 07:00:53PM -0700, Shaohua Li wrote: > > diff --git a/drivers/block/loop.c b/drivers/block/loop.c > > index 9d4545f..9850b27 100644 > > --- a/drivers/block/loop.c > > +++ b/drivers/block/loop.c > > @@ -482,6 +482,8 @@ static void lo_rw_aio_complete(struct kiocb *iocb, long ret, long ret2) > > { > > struct loop_cmd *cmd = container_of(iocb, struct loop_cmd, iocb); > > > > + if (cmd->css) > > + css_put(cmd->css); > > cmd->ret = ret > 0 ? 0 : ret; > > lo_rw_aio_do_completion(cmd); > > The fact that we're forwarding explicitly in loop still bothers me a > bit. Can you please elaborate why we don't want to do this > generically through aio? I think we must forward in loop, because each cmd could come from different cgroup, so we must explicitly forward for each cmd. The main reason not to do the forward in aio is complexity. We at least have 3 different implementations for dio: - __blockdev_direct_IO for ext4 and btrfs - iomap dio for xfs - blockdev dio implementation Forwarding in dio means hooking the cgroup association for each bio dispatched in the implementations, which is a little messy. I'd like to avoid this if there is no strong reason to do it. Thanks, Shaohua
Hello, Shaohua. On Fri, Sep 08, 2017 at 10:07:15AM -0700, Shaohua Li wrote: > > The fact that we're forwarding explicitly in loop still bothers me a > > bit. Can you please elaborate why we don't want to do this > > generically through aio? > > I think we must forward in loop, because each cmd could come from different > cgroup, so we must explicitly forward for each cmd. > > The main reason not to do the forward in aio is complexity. We at least have 3 > different implementations for dio: > - __blockdev_direct_IO for ext4 and btrfs > - iomap dio for xfs > - blockdev dio implementation > > Forwarding in dio means hooking the cgroup association for each bio dispatched > in the implementations, which is a little messy. I'd like to avoid this if > there is no strong reason to do it. I see. I think the important questions is whether we're failing to forward io cgroup membership propagation on some aios? If we are, that is an obvious bug which should be addressed one way or the other, and there's a fair chance that we wouldn't need to do anything special for loop. Given how simple the loop changes are, we sure can go with loop specific changes for now; however, I'm a bit unconvinced that aio changes would be that much more complex. Can you please look into it? If it is actually complex, we sure can do it later but I'd much prefer to plug the hole as soon as possible. Thanks.
diff --git a/drivers/block/loop.c b/drivers/block/loop.c index 9d4545f..9850b27 100644 --- a/drivers/block/loop.c +++ b/drivers/block/loop.c @@ -482,6 +482,8 @@ static void lo_rw_aio_complete(struct kiocb *iocb, long ret, long ret2) { struct loop_cmd *cmd = container_of(iocb, struct loop_cmd, iocb); + if (cmd->css) + css_put(cmd->css); cmd->ret = ret > 0 ? 0 : ret; lo_rw_aio_do_completion(cmd); } @@ -541,6 +543,8 @@ static int lo_rw_aio(struct loop_device *lo, struct loop_cmd *cmd, cmd->iocb.ki_filp = file; cmd->iocb.ki_complete = lo_rw_aio_complete; cmd->iocb.ki_flags = IOCB_DIRECT; + if (cmd->css) + kthread_set_orig_css(cmd->css); if (rw == WRITE) ret = call_write_iter(file, &cmd->iocb, &iter); @@ -548,6 +552,7 @@ static int lo_rw_aio(struct loop_device *lo, struct loop_cmd *cmd, ret = call_read_iter(file, &cmd->iocb, &iter); lo_rw_aio_do_completion(cmd); + kthread_reset_orig_css(); if (ret != -EIOCBQUEUED) cmd->iocb.ki_complete(&cmd->iocb, ret, 0); @@ -1692,6 +1697,14 @@ static blk_status_t loop_queue_rq(struct blk_mq_hw_ctx *hctx, break; } + /* always use the first bio's css */ +#ifdef CONFIG_CGROUPS + if (cmd->use_aio && cmd->rq->bio && cmd->rq->bio->bi_css) { + cmd->css = cmd->rq->bio->bi_css; + css_get(cmd->css); + } else +#endif + cmd->css = NULL; kthread_queue_work(&lo->worker, &cmd->work); return BLK_STS_OK; diff --git a/drivers/block/loop.h b/drivers/block/loop.h index f68c1d5..d93b669 100644 --- a/drivers/block/loop.h +++ b/drivers/block/loop.h @@ -74,6 +74,7 @@ struct loop_cmd { long ret; struct kiocb iocb; struct bio_vec *bvec; + struct cgroup_subsys_state *css; }; /* Support for loadable transfer modules */