From patchwork Wed Jun 27 19:09:39 2018 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Josef Bacik X-Patchwork-Id: 10492441 Return-Path: Received: from mail.wl.linuxfoundation.org (pdx-wl-mail.web.codeaurora.org [172.30.200.125]) by pdx-korg-patchwork.web.codeaurora.org (Postfix) with ESMTP id B12D960325 for ; Wed, 27 Jun 2018 19:11:02 +0000 (UTC) Received: from mail.wl.linuxfoundation.org (localhost [127.0.0.1]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id C254229DF5 for ; Wed, 27 Jun 2018 19:11:02 +0000 (UTC) Received: by mail.wl.linuxfoundation.org (Postfix, from userid 486) id BE7CA27FAC; Wed, 27 Jun 2018 19:11:02 +0000 (UTC) X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on pdx-wl-mail.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-7.9 required=2.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID, MAILING_LIST_MULTI, RCVD_IN_DNSWL_HI autolearn=unavailable version=3.3.1 Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 32A0529D04 for ; Wed, 27 Jun 2018 19:11:02 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S966303AbeF0TKq (ORCPT ); Wed, 27 Jun 2018 15:10:46 -0400 Received: from mail-qt0-f193.google.com ([209.85.216.193]:38401 "EHLO mail-qt0-f193.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S966253AbeF0TKE (ORCPT ); Wed, 27 Jun 2018 15:10:04 -0400 Received: by mail-qt0-f193.google.com with SMTP id c5-v6so2635906qth.5 for ; Wed, 27 Jun 2018 12:10:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=toxicpanda-com.20150623.gappssmtp.com; s=20150623; h=from:to:cc:subject:date:message-id:in-reply-to:references; bh=pyFcv48pUR78rBxKbFf7BQky/rW/J0zmzMN1zG2R2fY=; b=gpyVKjm4xN7Bs0BUpV0v7U/lyu4UASMqGmI34dq8jEaQ10FBbqQppMLcmL0fGl3qX3 vIpoAO5l6gbYghrdg33c0Yfowjqtm0m9/VNnOdJkEaptfNWhpsN3A2MFruEWV7iqwys0 qkkJ0pUdYaWYOEYC6o4geNxeOIeAgobIqiixRG1LRJ7tPw/tkD16kMDvW6nWion9ZFbi pamfSa1NPiu1Fsf+mpbzQl1n+LLAShO5mzI9rxI0hUdOwaWvsnj6I+dysRQ9Fm5OQ/hR EknIvCti/0feHPQztp6exT0zcI5bIZJXDsntV7wQkgvDydey2H8XigFhZs6CbiJv7A5R 5krg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references; bh=pyFcv48pUR78rBxKbFf7BQky/rW/J0zmzMN1zG2R2fY=; b=Fx8pPmtFucN24X8iefB/zaeSp7D4sA1gF6HiECklRCB/G6IgY/wUyyrcos6d3i9GqI Zc22kxy03H8AYlFVvz3ebOD3yhOvT7yQIkjDMrSCZ0Si6+rJpiwwDfxi+DIRccro3fko T2qDZ+xj+A786cNJh+0JiaRZy8fM62n9LjL+fNdAvnS93brICEmDECUlleEz9SJyS9QK lMO3nNR1tKNB5RcrlqTINfQDts66pSanRPahUCs1ug3p5LGvcfbbK7DAMR2gvzzSK2yu G9Bsgq3VOSS68T5pp3bLpnLGbezB1UbUC0n3jupgcJrXgtzBdddhsCCQc8ohlpKYof1k hxgQ== X-Gm-Message-State: APt69E2V4n4mHLTB11b06MeaJpZPQ7fqrEXg5wsVnKtAF1l6lAxuHGj4 cWD+/cBkcW7hq6pZ6nIW694zgw== X-Google-Smtp-Source: AAOMgpdPIV96krA4LjHFQ24FRjtFLlXVK3DBSgRhsD/JTm2ObGzFuUTPcEvqquWjiPMRdWmj/rsuoA== X-Received: by 2002:a0c:9b93:: with SMTP id o19-v6mr6623570qve.89.1530126602745; Wed, 27 Jun 2018 12:10:02 -0700 (PDT) Received: from localhost ([107.15.81.208]) by smtp.gmail.com with ESMTPSA id d6-v6sm4231341qta.61.2018.06.27.12.10.01 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Wed, 27 Jun 2018 12:10:02 -0700 (PDT) From: Josef Bacik To: axboe@kernel.dk, linux-block@vger.kernel.org, kernel-team@fb.com, akpm@linux-foundation.org, hannes@cmpxchg.org, tj@kernel.org, linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org Cc: Josef Bacik Subject: [PATCH 13/14] Documentation: add a doc for blk-iolatency Date: Wed, 27 Jun 2018 15:09:39 -0400 Message-Id: <20180627190940.4696-14-josef@toxicpanda.com> X-Mailer: git-send-email 2.14.3 In-Reply-To: <20180627190940.4696-1-josef@toxicpanda.com> References: <20180627190940.4696-1-josef@toxicpanda.com> Sender: linux-block-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-block@vger.kernel.org X-Virus-Scanned: ClamAV using ClamSMTP From: Josef Bacik A basic documentation to describe the interface, statistics, and behavior of io.latency. Signed-off-by: Josef Bacik --- Documentation/cgroup-v2.txt | 79 +++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 79 insertions(+) diff --git a/Documentation/cgroup-v2.txt b/Documentation/cgroup-v2.txt index 74cdeaed9f7a..06e36f3e3f9f 100644 --- a/Documentation/cgroup-v2.txt +++ b/Documentation/cgroup-v2.txt @@ -51,6 +51,9 @@ v1 is available under Documentation/cgroup-v1/. 5-3. IO 5-3-1. IO Interface Files 5-3-2. Writeback + 5-3-3. IO Latency + 5-3-3-1. How IO Latency Throttling Works + 5-3-3-2. IO Latency Interface Files 5-4. PID 5-4-1. PID Interface Files 5-5. Device @@ -1395,6 +1398,82 @@ writeback as follows. vm.dirty[_background]_ratio. +IO Latency +~~~~~~~~~~ + +This is a cgroup v2 controller for IO workload protection. You provide a group +with a latency target, and if the average latency exceeds that target the +controller will throttle any peers that have a lower latency target than the +protected workload. + +The limits are only applied at the peer level in the hierarchy. This means that +in the diagram below, only groups A, B, and C will influence each other, and +groups D and F will influence each other. Group G will influence nobody. + + [root] + / | \ + A B C + / \ | + D F G + + +So the ideal way to configure this is to set io.latency in groups A, B, and C. +Generally you do not want to set a value lower than the latency your device +supports. Experiment to find the value that works best for your workload, start +at higher than the expected latency for your device and watch the total_lat_avg +value in io.stat for your workload group to get an idea of the latency you see +during normal operation. Use this value as a basis for your real setting, +setting at 10-15% higher than the value in io.stat. Experimentation is key here +because total_lat_avg is a running total, so is the "statistics" portion of +"lies, damned lies, and statistics." + +How IO Latency Throttling Works +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ + +io.latency is work conserving; so as long as everybody is meeting their latency +target the controller doesn't do anything. Once a group starts missing its +target it begins throttling any peer group that has a higher target than itself. +This throttling takes 2 forms: + +- Queue depth throttling. This is the number of outstanding IO's a group is + allowed to have. We will clamp down relatively quickly, starting at no limit + and going all the way down to 1 IO at a time. + +- Artificial delay induction. There are certain types of IO that cannot be + throttled without possibly adversely affecting higher priority groups. This + includes swapping and metadata IO. These types of IO are allowed to occur + normally, however they are "charged" to the originating group. If the + originating group is being throttled you will see the use_delay and delay + fields in io.stat increase. The delay value is how many microseconds that are + being added to any process that runs in this group. Because this number can + grow quite large if there is a lot of swapping or metadata IO occurring we + limit the individual delay events to 1 second at a time. + +Once the victimized group starts meeting its latency target again it will start +unthrottling any peer groups that were throttled previously. If the victimized +group simply stops doing IO the global counter will unthrottle appropriately. + +IO Latency Interface Files +~~~~~~~~~~~~~~~~~~~~~~~~~~ + + io.latency + This takes a similar format as the other controllers. + + "MAJOR:MINOR target=