From patchwork Fri Feb 2 14:09:03 2018
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Roman Pen
X-Patchwork-Id: 10196875
From: Roman Pen
To: linux-block@vger.kernel.org, linux-rdma@vger.kernel.org
Cc: Jens Axboe, Christoph Hellwig, Sagi Grimberg, Bart Van Assche,
 Or Gerlitz, Roman Pen, Danil Kipnis, Jack Wang
Subject: [PATCH 23/24] ibnbd: a bit of documentation
Date: Fri, 2 Feb 2018 15:09:03 +0100
Message-Id: <20180202140904.2017-24-roman.penyaev@profitbricks.com>
X-Mailer: git-send-email 2.13.1
In-Reply-To: <20180202140904.2017-1-roman.penyaev@profitbricks.com>
References: <20180202140904.2017-1-roman.penyaev@profitbricks.com>
Sender: linux-block-owner@vger.kernel.org
Precedence: bulk
X-Mailing-List: linux-block@vger.kernel.org
X-Virus-Scanned: ClamAV using
ClamSMTP

README with description of major sysfs entries.

Signed-off-by: Roman Pen
Signed-off-by: Danil Kipnis
Cc: Jack Wang
---
 drivers/block/ibnbd/README | 272 +++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 272 insertions(+)

diff --git a/drivers/block/ibnbd/README b/drivers/block/ibnbd/README
new file mode 100644
index 000000000000..e0feb39fad14
--- /dev/null
+++ b/drivers/block/ibnbd/README
@@ -0,0 +1,272 @@
+***************************************
+InfiniBand Network Block Device (IBNBD)
+***************************************
+
+Introduction
+------------
+
+IBNBD (InfiniBand Network Block Device) is a pair of kernel modules
+(client and server) that allow remote access to a block device on
+the server over the IBTRS protocol using an RDMA (InfiniBand, RoCE, iWARP)
+transport. After being mapped, the remote block devices can be accessed
+on the client side as local block devices.
+
+I/O is transferred between client and server by the IBTRS transport
+modules. The administration of IBNBD and IBTRS modules is done via
+sysfs entries.
+
+Requirements
+------------
+
+  IBTRS kernel modules
+
+Quick Start
+-----------
+
+Server side:
+  # modprobe ibnbd_server
+
+Client side:
+  # modprobe ibnbd_client
+  # echo "sessname=blya path=ip:10.50.100.66 device_path=/dev/ram0" > \
+    /sys/kernel/ibnbd_client/map_device
+
+  Where "sessname=" is a session name, a string that identifies the session
+  on both the client and the server side; "path=" is a destination IP
+  address or a comma-separated pair of source and destination IP addresses.
+  Multiple "path=" options can be specified in order to use multipath (see
+  IBTRS description for details); "device_path=" is the block device to be
+  mapped from the server side. After the session to the server machine is
+  established, the mapped device will appear on the client side under
+  /dev/ibnbd.
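The option string written to map_device in the Quick Start can be assembled
by a small helper. This is a minimal sketch, not part of IBNBD: the function
name ibnbd_map_args and its argument order are assumptions made for
illustration. It only prints the string; writing it to
/sys/kernel/ibnbd_client/map_device is left to the caller.

```shell
# Hypothetical helper (not shipped with IBNBD): print the option string
# that map_device expects, built from a session name, a device path and
# one or more "path=" values.
# Usage: ibnbd_map_args SESSNAME DEVICE_PATH PATH [PATH...]
ibnbd_map_args() {
    if [ "$#" -lt 3 ]; then
        echo "usage: ibnbd_map_args SESSNAME DEVICE_PATH PATH [PATH...]" >&2
        return 1
    fi
    sessname=$1
    device_path=$2
    shift 2
    args="sessname=$sessname"
    # Each remaining argument becomes its own path= option (multipath).
    for p in "$@"; do
        args="$args path=$p"
    done
    printf '%s device_path=%s\n' "$args" "$device_path"
}
```

With the values from the Quick Start, the call
ibnbd_map_args blya /dev/ram0 ip:10.50.100.66 prints the same string that is
echoed into map_device above.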
+
+
+======================
+Client Sysfs Interface
+======================
+
+All sysfs files that are not read-only provide the usage information on read:
+
+Example:
+  # cat /sys/kernel/ibnbd_client/map_device
+
+  > Usage: echo "sessname= path=<[srcaddr,]dstaddr>
+  > [path=<[srcaddr,]dstaddr>] device_path=
+  > [access_mode=] [input_mode=]
+  > [io_mode=]" > map_device
+  >
+  > addr ::= [ ip: | ip: | gid: ]
+
+Entries under /sys/kernel/ibnbd_client/
+=======================================
+
+map_device (RW)
+---------------
+
+Expected format is the following:
+
+    sessname=
+    path=<[srcaddr,]dstaddr> [path=<[srcaddr,]dstaddr> ...]
+    device_path=
+    [access_mode=]
+    [input_mode=]
+    [io_mode=]
+
+Where:
+
+sessname: accepts a string of at most 256 characters, which identifies
+          a given session on the client and on the server.
+          E.g. "clt_hostname-srv_hostname" could be a natural choice.
+
+path:     describes a connection between the client and the server by
+          specifying the destination and, when required, the source address.
+          The addresses are to be provided in the following format:
+
+            ip:
+            ip:
+            gid:
+
+          for example:
+
+          path=ip:10.0.0.66
+                         The single addr is treated as the destination.
+                         The connection will be established to this
+                         server from any client IP address.
+
+          path=ip:10.0.0.66,ip:10.0.1.66
+                         First addr is the source address and the second
+                         is the destination.
+
+          If multiple "path=" options are specified, multiple connections
+          will be established and data will be sent according to
+          the selected multipath policy (see IBTRS mp_policy sysfs entry
+          description).
+
+device_path: Path to the block device on the server side. The path is
+             specified relative to the directory on the server side
+             configured in the 'dev_search_path' module parameter of the
+             ibnbd_server.
+             The ibnbd_server prepends dev_search_path to the device_path
+             received from the client and tries to open the resulting
+             block device. On success, a /dev/ibnbd device file,
+             a /sys/block/ibnbd_client/ibnbd/ directory and an entry in
+             /sys/kernel/ibnbd_client/devices will be created.
+
+access_mode: the access_mode parameter specifies if the device is to be
+             mapped as "ro" read-only or "rw" read-write. The server allows
+             a device to be exported in rw mode only once. The "migration"
+             access mode has to be specified if a second mapping in
+             read-write mode is desired.
+
+             By default "rw" is used.
+
+input_mode: the input_mode parameter specifies the internal I/O
+            processing mode of the block device on the client. Accepts
+            "mq" and "rq".
+
+            By default "mq" mode is used.
+
+io_mode:  the io_mode parameter specifies if the device on the server
+          will be opened as block device "blockio" or as file "fileio".
+          When the device is opened as file, the VFS page cache is used
+          for read I/O operations; write I/O operations bypass the page
+          cache and go directly to disk (except metadata updates, like
+          file access time).
+
+          By default "blockio" mode is used.
+
+Exit Codes:
+
+If the device is already mapped, the write fails with EEXIST. If the input
+has an invalid format, it fails with EINVAL. If the device path cannot
+be found on the server, it fails with ENOENT.
+
+Finding device file after mapping
+---------------------------------
+
+After mapping, the device file can be found by:
+ o  Following the symlink of the device under
+    /sys/kernel/ibnbd_client/devices/, which points to its directory
+    under /sys/block/. The last component of the symlink target is the
+    device name; appending it to /dev/ gives the path to the device file.
+
+ o  /dev/block/$(cat /sys/kernel/ibnbd_client/devices//dev)
+
+How the name of the symlink is derived is described in the next
+section.
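The first lookup method above can be sketched as a shell function. This is
an illustration under assumptions, not part of IBNBD: the function name
ibnbd_dev_node and its ENTRY parameter (the name of the symlink under
devices/) are hypothetical, and the optional second argument exists only so
that the function can be tried against a directory other than the real
sysfs tree.

```shell
# Hypothetical helper (not shipped with IBNBD): resolve the /dev node of
# a mapped device from its symlink under the client "devices" directory.
# Usage: ibnbd_dev_node ENTRY [DEVICES_DIR]
ibnbd_dev_node() {
    entry=$1
    devices_dir=${2:-/sys/kernel/ibnbd_client/devices}
    # The symlink target ends in the block device name.
    target=$(readlink "$devices_dir/$entry") || return 1
    printf '/dev/%s\n' "$(basename "$target")"
}
```

The function returns a non-zero status when no symlink with the given name
exists, which is the case for a device that is not mapped.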
+
+Entries under /sys/kernel/ibnbd_client/devices/
+===============================================
+
+For each device mapped on the client, a new symbolic link is created under
+/sys/kernel/ibnbd_client/devices/, which points to the block
+device created by ibnbd (/sys/block/ibnbd/). The name of each
+symlink is derived as follows:
+
+- If the 'device_path' provided during mapping contains slashes ("/"),
+  they are replaced by exclamation marks ("!") and the result is used as
+  the name. Otherwise, the name is the same as the "device_path"
+  provided.
+
+Entries under /sys/block/ibnbd/ibnbd_client/
+===============================================
+
+unmap_device (RW)
+-----------------
+
+To unmap a volume, "normal" or "force" has to be written to:
+  /sys/block/ibnbd/ibnbd_client/unmap_device
+
+When "normal" is used, the operation will fail with EBUSY if any process
+is using the device. When "force" is used, the device is unmapped even
+if it is in use. All I/Os that are in progress will fail.
+
+Example:
+
+  # echo "normal" > /sys/block/ibnbd0/ibnbd/unmap_device
+
+state (RO)
+----------
+
+The file contains the current state of the block device. The state file
+returns "open" when the device is successfully mapped from the server
+and accepting I/O requests. When the connection to the server is lost
+because of an error (e.g. link failure), the state file returns "closed"
+and all I/O requests submitted to the device will fail with -EIO.
+
+session (RO)
+------------
+
+IBNBD uses an IBTRS session to transport the data between client and
+server. The entry "session" contains the name of the session that
+was used to establish the IBTRS session. It is the same name that
+was passed as the "sessname" parameter to the map_device entry.
+
+mapping_path (RO)
+-----------------
+
+Contains the path that was passed as "device_path" to the map_device
+operation.
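Two of the client-side rules above, the symlink naming under devices/ and
the accepted values of unmap_device, can be sketched as shell functions.
Both function names are hypothetical and not part of IBNBD. The unmap helper
takes the path of the unmap_device file as an argument instead of
constructing it, so no assumption is made about the exact sysfs path.

```shell
# Hypothetical helper: derive the name of the symlink under devices/ from
# a device_path by replacing every "/" with "!".
# Usage: ibnbd_entry_name DEVICE_PATH
ibnbd_entry_name() {
    printf '%s\n' "$1" | tr '/' '!'
}

# Hypothetical helper: write an unmap request to the given unmap_device
# file. Only the two documented modes are accepted.
# Usage: ibnbd_unmap UNMAP_FILE MODE
ibnbd_unmap() {
    case $2 in
        normal|force) printf '%s\n' "$2" > "$1" ;;
        *) echo "mode must be 'normal' or 'force'" >&2; return 1 ;;
    esac
}
```

For a device_path without slashes, ibnbd_entry_name returns its argument
unchanged, which matches the "Otherwise" case of the naming rule.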
+
+======================
+Server Sysfs Interface
+======================
+
+Entries under /sys/kernel/ibnbd_server/
+=======================================
+
+When a client maps a device, a directory entry with the name of the
+block device is created under /sys/kernel/ibnbd_server/devices/.
+
+Entries under /sys/kernel/ibnbd_server/devices//
+=============================================================
+
+block_dev (link)
+----------------
+
+A symlink to the sysfs entry of the exported device.
+
+Example:
+
+  block_dev -> ../../../../devices/virtual/block/ram0
+
+Entries under /sys/kernel/ibnbd_server/devices//sessions/
+======================================================================
+
+For each client a particular device is exported to, the following directory
+is created:
+
+/sys/kernel/ibnbd_server/devices//sessions//
+
+When the device is unmapped by that client, the directory is removed.
+
+Entries under /sys/kernel/ibnbd_server/devices//sessions/
+====================================================================================
+
+read_only (RO)
+--------------
+
+Contains '1' if the device is mapped read-only, otherwise '0'.
+
+mapping_path (RO)
+-----------------
+
+Contains the relative device path provided by the user during mapping.
+
+==============================
+IBNBD-Server Module Parameters
+==============================
+
+dev_search_path
+---------------
+
+When a device is mapped from the client, the server generates the path
+to the block device on the server side by concatenating dev_search_path
+and the "device_path" that was specified in the map_device operation.
+
+The default dev_search_path is: "/".
+
+Contact
+-------
+
+Mailing list: "IBNBD/IBTRS Storage Team"