From patchwork Tue Dec 5 14:29:55 2017 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Johannes Thumshirn X-Patchwork-Id: 10093017 Return-Path: Received: from mail.wl.linuxfoundation.org (pdx-wl-mail.web.codeaurora.org [172.30.200.125]) by pdx-korg-patchwork.web.codeaurora.org (Postfix) with ESMTP id 3017A60329 for ; Tue, 5 Dec 2017 14:30:17 +0000 (UTC) Received: from mail.wl.linuxfoundation.org (localhost [127.0.0.1]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 15D07296EE for ; Tue, 5 Dec 2017 14:30:17 +0000 (UTC) Received: by mail.wl.linuxfoundation.org (Postfix, from userid 486) id 0970F28A29; Tue, 5 Dec 2017 14:30:17 +0000 (UTC) X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on pdx-wl-mail.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-5.9 required=2.0 tests=BAYES_00,HEXHASH_WORD, RCVD_IN_DNSWL_HI autolearn=unavailable version=3.3.1 Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 34198296D2 for ; Tue, 5 Dec 2017 14:30:15 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752339AbdLEO37 convert rfc822-to-8bit (ORCPT ); Tue, 5 Dec 2017 09:29:59 -0500 Received: from mx2.suse.de ([195.135.220.15]:39248 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752210AbdLEO36 (ORCPT ); Tue, 5 Dec 2017 09:29:58 -0500 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (charybdis-ext.suse.de [195.135.220.254]) by mx2.suse.de (Postfix) with ESMTP id 179D5AE5F; Tue, 5 Dec 2017 14:29:56 +0000 (UTC) From: Johannes Thumshirn To: Ming Lei , Omar Sandoval Cc: Jens Axboe , linux-block@vger.kernel.org, Christoph Hellwig , linux-scsi@vger.kernel.org, "Martin K . Petersen" , "James E . J . Bottomley" , Bart Van Assche , linux-kernel@vger.kernel.org, Hannes Reinecke , Holger =?utf-8?Q?Hoffst=C3=A4tte?= Subject: Re: [PATCH] SCSI: run queue if SCSI device queue isn't ready and queue is idle References: <20171205075256.10319-1-ming.lei@redhat.com> Date: Tue, 05 Dec 2017 15:29:55 +0100 In-Reply-To: <20171205075256.10319-1-ming.lei@redhat.com> (Ming Lei's message of "Tue, 5 Dec 2017 15:52:56 +0800") Message-ID: User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.3 (gnu/linux) MIME-Version: 1.0 Sender: linux-scsi-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-scsi@vger.kernel.org X-Virus-Scanned: ClamAV using ClamSMTP [ +Cc Omar ] Ming Lei writes: > Before commit 0df21c86bdbf ("scsi: implement .get_budget and .put_budget > for blk-mq"), we run queue after 3ms if queue is idle and SCSI device > queue isn't ready, which is done in handling BLK_STS_RESOURCE. After > commit 0df21c86bdbf is introduced, queue won't be run any more under > this situation. > > IO hang is observed when timeout happened, and this patch fixes the IO > hang issue by running queue after delay in scsi_dev_queue_ready, just like > non-mq. This issue can be triggered by the following script[1]. > > There is another issue which can be covered by running idle queue: > when .get_budget() is called on request coming from hctx->dispatch_list, > if one request just completes during .get_budget(), we can't depend on > SCSI's restart to make progress any more. This patch fixes the race too. > > With this patch, we basically recover to previous behaviour(before commit > 0df21c86bdbf) of handling idle queue when running out of resource. > > [1] script for test/verify SCSI timeout > rmmod scsi_debug > modprobe scsi_debug max_queue=1 > > DEVICE=`ls -d /sys/bus/pseudo/drivers/scsi_debug/adapter*/host*/target*/*/block/* | head -1 | xargs basename` > DISK_DIR=`ls -d /sys/block/$DEVICE/device/scsi_disk/*` > > echo "using scsi device $DEVICE" > echo "-1" >/sys/bus/pseudo/drivers/scsi_debug/every_nth > echo "temporary write through" >$DISK_DIR/cache_type > echo "128" >/sys/bus/pseudo/drivers/scsi_debug/opts > echo none > /sys/block/$DEVICE/queue/scheduler > dd if=/dev/$DEVICE of=/dev/null bs=1M iflag=direct count=1 & > sleep 5 > echo "0" >/sys/bus/pseudo/drivers/scsi_debug/opts > wait > echo "SUCCESS" SO I turned the above into a blktest but have found some shortcommings of my bash skills. Maybe you or Omar has a soution for it: --- 8< --- From 80e5810011d52bc188cd858962ce202bfd4dbee5 Mon Sep 17 00:00:00 2001 From: Johannes Thumshirn Date: Tue, 5 Dec 2017 15:21:08 +0100 Subject: [PATCH blktests] block/013: add test for scsi_device queue starvation Add a test for Ming Lei's patch titled "SCSI: run queue if SCSI device queue isn't ready and queue is idle" Signed-off-by: Johannes Thumshirn --- This test case has two shortcommings, which need to be addressed I'm just lacking a bit of the shell magic to address them properly. 1) Testing without the patch applied hangs the test forever as it doesn't get killed after a specific timeout (I think this should be solved in a common function). 2) It has a nasty sleep at it's end to wait for scsi_debug's refcounts to drop to 0 before removing the module or removing will fail and thus the test case. This as well should be solved in a more generic way. --- tests/block/013 | 63 +++++++++++++++++++++++++++++++++++++++++++++++++++++ tests/block/013.out | 2 ++ 2 files changed, 65 insertions(+) create mode 100755 tests/block/013 create mode 100644 tests/block/013.out diff --git a/tests/block/013 b/tests/block/013 new file mode 100755 index 000000000000..f73724fc9ed2 --- /dev/null +++ b/tests/block/013 @@ -0,0 +1,63 @@ +#!/bin/bash +# +# Regression test for patch "SCSI: delay run queue if device is +# blocked in scsi_dev_queue_ready()" +# +# Copyright (C) 2017 Johannes Thumshirn +# +# This program is free software: you can redistribute it and/or modify +# it under the terms of the GNU General Public License as published by +# the Free Software Foundation, either version 3 of the License, or +# (at your option) any later version. +# +# This program is distributed in the hope that it will be useful, +# but WITHOUT ANY WARRANTY; without even the implied warranty of +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the +# GNU General Public License for more details. +# +# You should have received a copy of the GNU General Public License +# along with this program. If not, see . + +. common/scsi_debug + +DESCRIPTION="Test if a SCSI device's queue can be run if it isn't ready but the device is idle" +TIMED=1 + +requires() { + _have_scsi_debug && _have_module sd_mod && \ + grep -q Y /sys/module/scsi_mod/parameters/use_blk_mq +} + +test_one_device() +{ + local device=$1 + + echo "-1" > /sys/bus/pseudo/drivers/scsi_debug/every_nth + echo "temporary write through" > \ + /sys/block/"${device}"/device/scsi_disk/"$(basename $(readlink /sys/block/${device}/device))"/cache_type + echo "128" > /sys/bus/pseudo/drivers/scsi_debug/opts + echo "none" > /sys/block/${device}/queue/scheduler + dd if=/dev/"${device}" of=/dev/null bs=1M iflag=direct \ + count=1 2> /dev/null & + sleep 5 + echo 0 > /sys/bus/pseudo/drivers/scsi_debug/opts + wait +} + +test() { + echo "Running ${TEST_NAME}" + + if ! _init_scsi_debug statistics=1 max_queue=1; then + return + fi + + local device + for device in "${SCSI_DEBUG_DEVICES[@]}"; do + test_one_device ${device} + done + + sleep 5 # to free up all scsi_debug refcnts + _exit_scsi_debug + + echo "Test complete" +} diff --git a/tests/block/013.out b/tests/block/013.out new file mode 100644 index 000000000000..947bd04e2104 --- /dev/null +++ b/tests/block/013.out @@ -0,0 +1,2 @@ +Running block/013 +Test complete