From patchwork Wed Jan 24 20:03:46 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "Verma, Vishal L" X-Patchwork-Id: 13529615 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id A9572C47E49 for ; Wed, 24 Jan 2024 20:04:28 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 286906B0080; Wed, 24 Jan 2024 15:04:27 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 235F06B0081; Wed, 24 Jan 2024 15:04:27 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id F2E696B0083; Wed, 24 Jan 2024 15:04:26 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0011.hostedemail.com [216.40.44.11]) by kanga.kvack.org (Postfix) with ESMTP id DF8846B0080 for ; Wed, 24 Jan 2024 15:04:26 -0500 (EST) Received: from smtpin27.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay10.hostedemail.com (Postfix) with ESMTP id AFCFDC0C1F for ; Wed, 24 Jan 2024 20:04:26 +0000 (UTC) X-FDA: 81715281732.27.6A3F312 Received: from mgamail.intel.com (mgamail.intel.com [198.175.65.15]) by imf12.hostedemail.com (Postfix) with ESMTP id 6BEE440003 for ; Wed, 24 Jan 2024 20:04:24 +0000 (UTC) Authentication-Results: imf12.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=OJM1bTnG; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf12.hostedemail.com: domain of vishal.l.verma@intel.com designates 198.175.65.15 as permitted sender) smtp.mailfrom=vishal.l.verma@intel.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1706126664; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=x3NMPmSjdzn5PoJGblKVVqBtL8cNU4oLXt55rtggBJM=; b=Yxz69u+5wOjZm3UpKAzlb6ukAyJAvahYVWfBVdfcWXv4vj2/DZDomA6gPUfYMS/DD/7+Lp wxmCG7HFiUFIhLfFxoqwYSbm5SkAdQzt/2Im3l3oBj0n4Wkr3MLFMeysu73eU4IkdERRS8 bbRKguE8sBLmO7qYoSlC0+DI09ZGaek= ARC-Authentication-Results: i=1; imf12.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=OJM1bTnG; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf12.hostedemail.com: domain of vishal.l.verma@intel.com designates 198.175.65.15 as permitted sender) smtp.mailfrom=vishal.l.verma@intel.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1706126664; a=rsa-sha256; cv=none; b=TNp1xqkWd3d+vmHXa21Cjp0w3J6E/Yx1QQXWRy1+g7VWtyaLv44auLHTBJDjUs9/VYrp2B 1cpzUlTMVi0ABZbOPonEpYvivxk1A5XBQVcIWMImFiBVOkPv3TUJFcRpaWeHW1gBPnW6zU fj21L5lofK3fUZQNcSq76YbUfHiWYOY= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1706126664; x=1737662664; h=from:date:subject:mime-version:content-transfer-encoding: message-id:references:in-reply-to:to:cc; bh=z6bfTONvTKy2Dv6F2DgzIglLVmzBa9MpFrVu5VmluWs=; b=OJM1bTnGGPEk2/7vLGvpOj27PIlCIk9Y2FqkXr0Q37eUsKSLqcnebPE6 2NgSUR1QEBnCqwppNct+ZMNsalCSDFX68WOv9bGeUtTL0h9rHSKI/4ksD wQCPBbaJKLnzAO1weqEm3Ckiuk1rhoTd25dfFqsrvVszbgQUbK4KTVex7 H9Z/wrzC2Z1qs2Rn5yOK5+Gtmq8gzoJYLT5KW5z6tsxSc/0/QiYDtMI4N TsMUQT+1jR8a8iV9aLi8V55+wrHJOOjfCggeSROP9LARVtQcr76r1pjaV FPZfaDOWyCvAtCg2RNDIzRABBbcN8k6aL1opRCHw4H+8gu6y9XM4TfEXg w==; X-IronPort-AV: E=McAfee;i="6600,9927,10962"; a="1836096" X-IronPort-AV: E=Sophos;i="6.05,216,1701158400"; d="scan'208";a="1836096" Received: from fmsmga005.fm.intel.com ([10.253.24.32]) by orvoesa107.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Jan 2024 12:04:22 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10962"; a="1117735117" X-IronPort-AV: E=Sophos;i="6.05,216,1701158400"; d="scan'208";a="1117735117" Received: from vverma7-mobl3.amr.corp.intel.com (HELO [10.0.0.223]) ([10.251.14.61]) by fmsmga005-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Jan 2024 12:04:20 -0800 From: Vishal Verma Date: Wed, 24 Jan 2024 12:03:46 -0800 Subject: [PATCH v7 1/5] dax/bus.c: replace driver-core lock usage by a local rwsem MIME-Version: 1.0 Message-Id: <20240124-vv-dax_abi-v7-1-20d16cb8d23d@intel.com> References: <20240124-vv-dax_abi-v7-0-20d16cb8d23d@intel.com> In-Reply-To: <20240124-vv-dax_abi-v7-0-20d16cb8d23d@intel.com> To: Dan Williams , Vishal Verma , Dave Jiang , Andrew Morton , Oscar Salvador Cc: linux-kernel@vger.kernel.org, nvdimm@lists.linux.dev, linux-cxl@vger.kernel.org, David Hildenbrand , Dave Hansen , Huang Ying , Greg Kroah-Hartman , Matthew Wilcox , linux-mm@kvack.org X-Mailer: b4 0.13-dev-a684c X-Developer-Signature: v=1; a=openpgp-sha256; l=15115; i=vishal.l.verma@intel.com; h=from:subject:message-id; bh=z6bfTONvTKy2Dv6F2DgzIglLVmzBa9MpFrVu5VmluWs=; b=owGbwMvMwCXGf25diOft7jLG02pJDKkbc51O5uTq3p8jHv17fob9PPetu5yuljjb5DLHMd+5f VBhS41oRykLgxgXg6yYIsvfPR8Zj8ltz+cJTHCEmcPKBDKEgYtTACbi1cbwP+HLTeWpMhrz4/d/ mrTk9bWt08V2SxzUmr4rpD03Nu90+itGhv67b9d0X7jxQ+7sovrYt5+f1D8/9zj118s/ZvXzZih eXc4NAA== X-Developer-Key: i=vishal.l.verma@intel.com; a=openpgp; fpr=F8682BE134C67A12332A2ED07AFA61BEA3B84DFF X-Rspam-User: X-Rspamd-Server: rspam12 X-Rspamd-Queue-Id: 6BEE440003 X-Stat-Signature: wnqi6bfeyhreakgo8dfrt77iaf8s1fwq X-HE-Tag: 1706126664-729055 X-HE-Meta: U2FsdGVkX1+9s1UQddq4TonyYiNYZxUhWq7MuI8jEzto0RpV7DiYh4gGiWmehUei2LpSNMNfD2kaa0yO/cEb6Rp0EI1XaeBX3uIFnMOrWPDr/NhZP8+hvZN1bcVVHLNI/x42/qzEvbXOJp6+PIyudTOxdH4imIUv251Lam/jmBrLiEAHNPtEMRDXOEoU8PAaNuhrQA2JvcMMn1vBEULLYjdY6rAZcNNaKxu1lkX0zfwLcy73d0SFRw2dU7VYa2qP6zaZK/ReBhCN3UlD3eRzlsg2q+gQ6rWuQ11l23M2JygYh1m72kZVKHM2TCCuZe6HJ82Y2IHyldcZXDWzEF7R/Sb97woO/Nbs/HpuENBTO5+qgs+w6mKaQKjgcl7zWHAU8NZGjtbqfx0lyjbyYe8cPhDjaFGTNQP1+1SouFRLZ2xZnx4lVmBExwLvro80+SDcSLpNyMNVbX+OySHXl36OHTtIxCxMqarZjbRQMRCiqvedtnguNYQvkfEsPiy8isQvrojnlD/bOyeoyr95fjVr6Idqpf3f2EUj8081zrGOxlx+UqQwJ9TCgHO7eyWksZtnHpL5++YKnmfr21+a34QWMTk57t5kyxnQG/Nf0EBiFBgYUBMVz8sg6/FUbUcWQ4v6Uwtx3hVBUcPFTAU1r/7WvE+CZA3kogJH4aTwYF1BTIw7LG4UuN+1QlSAD0rxqoRmkg+JMC/tdTYvv2Yfe+MOyXtZuICkt1Dk3tUrYpgszhauKbeuBX17sa/cM8NFasWLIkOp05Pl8bJ6rSLs5TYP5vaca9/hQ7xfd/yjg5MN0mynuoMyK6IZjc5mWypR//cQERQ5PGkrAUmpOApNb+MqWKcEqFL37oo++76lH7cO9xAbsT05qXVCk2a/QSFgXhfgN+ssqjGasFhZByDl7Agjkz/pl9G5sJhX4jHHn/uuyHW6viOcSBeknw+AUuZbQNAwSXz6ArJzI/e44DzWtmz 2gnf1mSq PsscY5yzLv7fKRgxwVD9XRhfptHXL0rPcs8TToQ+f5G7yHXtoaAFpzzWi3635qN7e+2dYgUxMGKMLdpLMSS9meycCvjLn0h/aCzm8GUzZ3mH0NxM/KehNlECi5rU/f5c5OLGZcsIaJcsihuGYemkKpqBpbEXrIw5jdZWTXGyOdvTBem6WB634WedbCdduZIyerO4fWka2p0A56fibceLZVH+s6RyPJcEJS5xB X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: The dax driver incorrectly used driver-core device locks to protect internal dax region and dax device configuration structures. Replace the device lock usage with a local rwsem, one each for dax region configuration and dax device configuration. As a result of this conversion, no device_lock() usage remains in dax/bus.c. Cc: Dan Williams Reported-by: Greg Kroah-Hartman Signed-off-by: Vishal Verma Reviewed-by: Alison Schofield --- drivers/dax/bus.c | 220 ++++++++++++++++++++++++++++++++++++++---------------- 1 file changed, 157 insertions(+), 63 deletions(-) diff --git a/drivers/dax/bus.c b/drivers/dax/bus.c index 1ff1ab5fa105..cb148f74ceda 100644 --- a/drivers/dax/bus.c +++ b/drivers/dax/bus.c @@ -12,6 +12,18 @@ static DEFINE_MUTEX(dax_bus_lock); +/* + * All changes to the dax region configuration occur with this lock held + * for write. + */ +DECLARE_RWSEM(dax_region_rwsem); + +/* + * All changes to the dax device configuration occur with this lock held + * for write. + */ +DECLARE_RWSEM(dax_dev_rwsem); + #define DAX_NAME_LEN 30 struct dax_id { struct list_head list; @@ -180,7 +192,7 @@ static u64 dev_dax_size(struct dev_dax *dev_dax) u64 size = 0; int i; - device_lock_assert(&dev_dax->dev); + WARN_ON_ONCE(!rwsem_is_locked(&dax_dev_rwsem)); for (i = 0; i < dev_dax->nr_range; i++) size += range_len(&dev_dax->ranges[i].range); @@ -194,8 +206,15 @@ static int dax_bus_probe(struct device *dev) struct dev_dax *dev_dax = to_dev_dax(dev); struct dax_region *dax_region = dev_dax->region; int rc; + u64 size; - if (dev_dax_size(dev_dax) == 0 || dev_dax->id < 0) + rc = down_read_interruptible(&dax_dev_rwsem); + if (rc) + return rc; + size = dev_dax_size(dev_dax); + up_read(&dax_dev_rwsem); + + if (size == 0 || dev_dax->id < 0) return -ENXIO; rc = dax_drv->probe(dev_dax); @@ -283,7 +302,7 @@ static unsigned long long dax_region_avail_size(struct dax_region *dax_region) resource_size_t size = resource_size(&dax_region->res); struct resource *res; - device_lock_assert(dax_region->dev); + WARN_ON_ONCE(!rwsem_is_locked(&dax_region_rwsem)); for_each_dax_region_resource(dax_region, res) size -= resource_size(res); @@ -295,10 +314,13 @@ static ssize_t available_size_show(struct device *dev, { struct dax_region *dax_region = dev_get_drvdata(dev); unsigned long long size; + int rc; - device_lock(dev); + rc = down_read_interruptible(&dax_region_rwsem); + if (rc) + return rc; size = dax_region_avail_size(dax_region); - device_unlock(dev); + up_read(&dax_region_rwsem); return sprintf(buf, "%llu\n", size); } @@ -314,10 +336,12 @@ static ssize_t seed_show(struct device *dev, if (is_static(dax_region)) return -EINVAL; - device_lock(dev); + rc = down_read_interruptible(&dax_region_rwsem); + if (rc) + return rc; seed = dax_region->seed; rc = sprintf(buf, "%s\n", seed ? dev_name(seed) : ""); - device_unlock(dev); + up_read(&dax_region_rwsem); return rc; } @@ -333,14 +357,18 @@ static ssize_t create_show(struct device *dev, if (is_static(dax_region)) return -EINVAL; - device_lock(dev); + rc = down_read_interruptible(&dax_region_rwsem); + if (rc) + return rc; youngest = dax_region->youngest; rc = sprintf(buf, "%s\n", youngest ? dev_name(youngest) : ""); - device_unlock(dev); + up_read(&dax_region_rwsem); return rc; } +static struct dev_dax *__devm_create_dev_dax(struct dev_dax_data *data); + static ssize_t create_store(struct device *dev, struct device_attribute *attr, const char *buf, size_t len) { @@ -358,7 +386,9 @@ static ssize_t create_store(struct device *dev, struct device_attribute *attr, if (val != 1) return -EINVAL; - device_lock(dev); + rc = down_write_killable(&dax_region_rwsem); + if (rc) + return rc; avail = dax_region_avail_size(dax_region); if (avail == 0) rc = -ENOSPC; @@ -369,7 +399,7 @@ static ssize_t create_store(struct device *dev, struct device_attribute *attr, .id = -1, .memmap_on_memory = false, }; - struct dev_dax *dev_dax = devm_create_dev_dax(&data); + struct dev_dax *dev_dax = __devm_create_dev_dax(&data); if (IS_ERR(dev_dax)) rc = PTR_ERR(dev_dax); @@ -387,7 +417,7 @@ static ssize_t create_store(struct device *dev, struct device_attribute *attr, rc = len; } } - device_unlock(dev); + up_write(&dax_region_rwsem); return rc; } @@ -417,7 +447,7 @@ static void trim_dev_dax_range(struct dev_dax *dev_dax) struct range *range = &dev_dax->ranges[i].range; struct dax_region *dax_region = dev_dax->region; - device_lock_assert(dax_region->dev); + WARN_ON_ONCE(!rwsem_is_locked(&dax_region_rwsem)); dev_dbg(&dev_dax->dev, "delete range[%d]: %#llx:%#llx\n", i, (unsigned long long)range->start, (unsigned long long)range->end); @@ -435,7 +465,7 @@ static void free_dev_dax_ranges(struct dev_dax *dev_dax) trim_dev_dax_range(dev_dax); } -static void unregister_dev_dax(void *dev) +static void __unregister_dev_dax(void *dev) { struct dev_dax *dev_dax = to_dev_dax(dev); @@ -447,6 +477,17 @@ static void unregister_dev_dax(void *dev) put_device(dev); } +static void unregister_dev_dax(void *dev) +{ + if (rwsem_is_locked(&dax_region_rwsem)) + return __unregister_dev_dax(dev); + + if (WARN_ON_ONCE(down_write_killable(&dax_region_rwsem) != 0)) + return; + __unregister_dev_dax(dev); + up_write(&dax_region_rwsem); +} + static void dax_region_free(struct kref *kref) { struct dax_region *dax_region; @@ -463,11 +504,10 @@ static void dax_region_put(struct dax_region *dax_region) /* a return value >= 0 indicates this invocation invalidated the id */ static int __free_dev_dax_id(struct dev_dax *dev_dax) { - struct device *dev = &dev_dax->dev; struct dax_region *dax_region; int rc = dev_dax->id; - device_lock_assert(dev); + WARN_ON_ONCE(!rwsem_is_locked(&dax_dev_rwsem)); if (!dev_dax->dyn_id || dev_dax->id < 0) return -1; @@ -480,12 +520,13 @@ static int __free_dev_dax_id(struct dev_dax *dev_dax) static int free_dev_dax_id(struct dev_dax *dev_dax) { - struct device *dev = &dev_dax->dev; int rc; - device_lock(dev); + rc = down_write_killable(&dax_dev_rwsem); + if (rc) + return rc; rc = __free_dev_dax_id(dev_dax); - device_unlock(dev); + up_write(&dax_dev_rwsem); return rc; } @@ -519,8 +560,14 @@ static ssize_t delete_store(struct device *dev, struct device_attribute *attr, if (!victim) return -ENXIO; - device_lock(dev); - device_lock(victim); + rc = down_write_killable(&dax_region_rwsem); + if (rc) + return rc; + rc = down_write_killable(&dax_dev_rwsem); + if (rc) { + up_write(&dax_region_rwsem); + return rc; + } dev_dax = to_dev_dax(victim); if (victim->driver || dev_dax_size(dev_dax)) rc = -EBUSY; @@ -541,12 +588,12 @@ static ssize_t delete_store(struct device *dev, struct device_attribute *attr, } else rc = -EBUSY; } - device_unlock(victim); + up_write(&dax_dev_rwsem); /* won the race to invalidate the device, clean it up */ if (do_del) devm_release_action(dev, unregister_dev_dax, victim); - device_unlock(dev); + up_write(&dax_region_rwsem); put_device(victim); return rc; @@ -658,16 +705,15 @@ static void dax_mapping_release(struct device *dev) put_device(parent); } -static void unregister_dax_mapping(void *data) +static void __unregister_dax_mapping(void *data) { struct device *dev = data; struct dax_mapping *mapping = to_dax_mapping(dev); struct dev_dax *dev_dax = to_dev_dax(dev->parent); - struct dax_region *dax_region = dev_dax->region; dev_dbg(dev, "%s\n", __func__); - device_lock_assert(dax_region->dev); + WARN_ON_ONCE(!rwsem_is_locked(&dax_region_rwsem)); dev_dax->ranges[mapping->range_id].mapping = NULL; mapping->range_id = -1; @@ -675,28 +721,37 @@ static void unregister_dax_mapping(void *data) device_unregister(dev); } +static void unregister_dax_mapping(void *data) +{ + if (rwsem_is_locked(&dax_region_rwsem)) + return __unregister_dax_mapping(data); + + if (WARN_ON_ONCE(down_write_killable(&dax_region_rwsem) != 0)) + return; + __unregister_dax_mapping(data); + up_write(&dax_region_rwsem); +} + static struct dev_dax_range *get_dax_range(struct device *dev) { struct dax_mapping *mapping = to_dax_mapping(dev); struct dev_dax *dev_dax = to_dev_dax(dev->parent); - struct dax_region *dax_region = dev_dax->region; + int rc; - device_lock(dax_region->dev); + rc = down_write_killable(&dax_region_rwsem); + if (rc) + return NULL; if (mapping->range_id < 0) { - device_unlock(dax_region->dev); + up_write(&dax_region_rwsem); return NULL; } return &dev_dax->ranges[mapping->range_id]; } -static void put_dax_range(struct dev_dax_range *dax_range) +static void put_dax_range(void) { - struct dax_mapping *mapping = dax_range->mapping; - struct dev_dax *dev_dax = to_dev_dax(mapping->dev.parent); - struct dax_region *dax_region = dev_dax->region; - - device_unlock(dax_region->dev); + up_write(&dax_region_rwsem); } static ssize_t start_show(struct device *dev, @@ -709,7 +764,7 @@ static ssize_t start_show(struct device *dev, if (!dax_range) return -ENXIO; rc = sprintf(buf, "%#llx\n", dax_range->range.start); - put_dax_range(dax_range); + put_dax_range(); return rc; } @@ -725,7 +780,7 @@ static ssize_t end_show(struct device *dev, if (!dax_range) return -ENXIO; rc = sprintf(buf, "%#llx\n", dax_range->range.end); - put_dax_range(dax_range); + put_dax_range(); return rc; } @@ -741,7 +796,7 @@ static ssize_t pgoff_show(struct device *dev, if (!dax_range) return -ENXIO; rc = sprintf(buf, "%#lx\n", dax_range->pgoff); - put_dax_range(dax_range); + put_dax_range(); return rc; } @@ -775,7 +830,7 @@ static int devm_register_dax_mapping(struct dev_dax *dev_dax, int range_id) struct device *dev; int rc; - device_lock_assert(dax_region->dev); + WARN_ON_ONCE(!rwsem_is_locked(&dax_region_rwsem)); if (dev_WARN_ONCE(&dev_dax->dev, !dax_region->dev->driver, "region disabled\n")) @@ -821,7 +876,7 @@ static int alloc_dev_dax_range(struct dev_dax *dev_dax, u64 start, struct resource *alloc; int i, rc; - device_lock_assert(dax_region->dev); + WARN_ON_ONCE(!rwsem_is_locked(&dax_region_rwsem)); /* handle the seed alloc special case */ if (!size) { @@ -875,13 +930,12 @@ static int adjust_dev_dax_range(struct dev_dax *dev_dax, struct resource *res, r { int last_range = dev_dax->nr_range - 1; struct dev_dax_range *dax_range = &dev_dax->ranges[last_range]; - struct dax_region *dax_region = dev_dax->region; bool is_shrink = resource_size(res) > size; struct range *range = &dax_range->range; struct device *dev = &dev_dax->dev; int rc; - device_lock_assert(dax_region->dev); + WARN_ON_ONCE(!rwsem_is_locked(&dax_region_rwsem)); if (dev_WARN_ONCE(dev, !size, "deletion is handled by dev_dax_shrink\n")) return -EINVAL; @@ -907,10 +961,13 @@ static ssize_t size_show(struct device *dev, { struct dev_dax *dev_dax = to_dev_dax(dev); unsigned long long size; + int rc; - device_lock(dev); + rc = down_write_killable(&dax_dev_rwsem); + if (rc) + return rc; size = dev_dax_size(dev_dax); - device_unlock(dev); + up_write(&dax_dev_rwsem); return sprintf(buf, "%llu\n", size); } @@ -1080,17 +1137,27 @@ static ssize_t size_store(struct device *dev, struct device_attribute *attr, return -EINVAL; } - device_lock(dax_region->dev); + rc = down_write_killable(&dax_region_rwsem); + if (rc) + return rc; if (!dax_region->dev->driver) { - device_unlock(dax_region->dev); - return -ENXIO; + rc = -ENXIO; + goto err_region; } - device_lock(dev); - rc = dev_dax_resize(dax_region, dev_dax, val); - device_unlock(dev); - device_unlock(dax_region->dev); + rc = down_write_killable(&dax_dev_rwsem); + if (rc) + goto err_dev; - return rc == 0 ? len : rc; + rc = dev_dax_resize(dax_region, dev_dax, val); + +err_dev: + up_write(&dax_dev_rwsem); +err_region: + up_write(&dax_region_rwsem); + + if (rc == 0) + return len; + return rc; } static DEVICE_ATTR_RW(size); @@ -1138,18 +1205,24 @@ static ssize_t mapping_store(struct device *dev, struct device_attribute *attr, return rc; rc = -ENXIO; - device_lock(dax_region->dev); + rc = down_write_killable(&dax_region_rwsem); + if (rc) + return rc; if (!dax_region->dev->driver) { - device_unlock(dax_region->dev); + up_write(&dax_region_rwsem); + return rc; + } + rc = down_write_killable(&dax_dev_rwsem); + if (rc) { + up_write(&dax_region_rwsem); return rc; } - device_lock(dev); to_alloc = range_len(&r); if (alloc_is_aligned(dev_dax, to_alloc)) rc = alloc_dev_dax_range(dev_dax, r.start, to_alloc); - device_unlock(dev); - device_unlock(dax_region->dev); + up_write(&dax_dev_rwsem); + up_write(&dax_region_rwsem); return rc == 0 ? len : rc; } @@ -1196,13 +1269,19 @@ static ssize_t align_store(struct device *dev, struct device_attribute *attr, if (!dax_align_valid(val)) return -EINVAL; - device_lock(dax_region->dev); + rc = down_write_killable(&dax_region_rwsem); + if (rc) + return rc; if (!dax_region->dev->driver) { - device_unlock(dax_region->dev); + up_write(&dax_region_rwsem); return -ENXIO; } - device_lock(dev); + rc = down_write_killable(&dax_dev_rwsem); + if (rc) { + up_write(&dax_region_rwsem); + return rc; + } if (dev->driver) { rc = -EBUSY; goto out_unlock; @@ -1214,8 +1293,8 @@ static ssize_t align_store(struct device *dev, struct device_attribute *attr, if (rc) dev_dax->align = align_save; out_unlock: - device_unlock(dev); - device_unlock(dax_region->dev); + up_write(&dax_dev_rwsem); + up_write(&dax_region_rwsem); return rc == 0 ? len : rc; } static DEVICE_ATTR_RW(align); @@ -1325,7 +1404,7 @@ static const struct device_type dev_dax_type = { .groups = dax_attribute_groups, }; -struct dev_dax *devm_create_dev_dax(struct dev_dax_data *data) +static struct dev_dax *__devm_create_dev_dax(struct dev_dax_data *data) { struct dax_region *dax_region = data->dax_region; struct device *parent = dax_region->dev; @@ -1440,6 +1519,21 @@ struct dev_dax *devm_create_dev_dax(struct dev_dax_data *data) return ERR_PTR(rc); } + +struct dev_dax *devm_create_dev_dax(struct dev_dax_data *data) +{ + struct dev_dax *dev_dax; + int rc; + + rc = down_write_killable(&dax_region_rwsem); + if (rc) + return ERR_PTR(rc); + + dev_dax = __devm_create_dev_dax(data); + up_write(&dax_region_rwsem); + + return dev_dax; +} EXPORT_SYMBOL_GPL(devm_create_dev_dax); int __dax_driver_register(struct dax_device_driver *dax_drv, From patchwork Wed Jan 24 20:03:47 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "Verma, Vishal L" X-Patchwork-Id: 13529616 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 1A319C46CD2 for ; Wed, 24 Jan 2024 20:04:31 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 101266B0081; Wed, 24 Jan 2024 15:04:28 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 089376B0083; Wed, 24 Jan 2024 15:04:28 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id E1D416B0085; Wed, 24 Jan 2024 15:04:27 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0016.hostedemail.com [216.40.44.16]) by kanga.kvack.org (Postfix) with ESMTP id D38C56B0081 for ; Wed, 24 Jan 2024 15:04:27 -0500 (EST) Received: from smtpin15.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay06.hostedemail.com (Postfix) with ESMTP id A52EAA2394 for ; Wed, 24 Jan 2024 20:04:27 +0000 (UTC) X-FDA: 81715281774.15.397970E Received: from mgamail.intel.com (mgamail.intel.com [198.175.65.15]) by imf01.hostedemail.com (Postfix) with ESMTP id 6492140011 for ; Wed, 24 Jan 2024 20:04:25 +0000 (UTC) Authentication-Results: imf01.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=IqnnQdny; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf01.hostedemail.com: domain of vishal.l.verma@intel.com designates 198.175.65.15 as permitted sender) smtp.mailfrom=vishal.l.verma@intel.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1706126665; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=AcC6QAOmr8SGg10jtCgJ40Pni6duinW4+w/MW4U+57s=; b=6v2w0zOAjNVHyRaXw6I6Bid8KwrbKstA5HdjzGgUS15LTZwYPPJ4mt++N9PcfSDjP/gKrt VM6WnseYesLDqYl5AEO9TFkXrZzlsvHWefifRJlk0QIO9buLVNzSXEAhmx2+4rfJdAOK/R pBEgDpnw0H3cGyVhS61aQ0NG/bmedl0= ARC-Authentication-Results: i=1; imf01.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=IqnnQdny; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf01.hostedemail.com: domain of vishal.l.verma@intel.com designates 198.175.65.15 as permitted sender) smtp.mailfrom=vishal.l.verma@intel.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1706126665; a=rsa-sha256; cv=none; b=Eb6ij0I0KjFiGdo67omsEpLKejA4sp8+kTmDVNteQfjhQXADiCXo7Cm4KPV3tReH4e5ZwC HK/E4LCl0eMSU1zi76knjXobvJfce65l6mfkCPYFE0SqhsqKyA0HMPvhVY5TMyDatQ9I0w ATMmeFngoStGpZ1Uk3UzHY4sTM9u5xI= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1706126665; x=1737662665; h=from:date:subject:mime-version:content-transfer-encoding: message-id:references:in-reply-to:to:cc; bh=mYxIOVPok7zj5mr8tVu7SKhQQM/JT9jvpgs14KCdT/k=; b=IqnnQdnyYth76mX8uj/d/jLxPZ+9WJSj5lDRgCFJGYLwQSmZCCWmU+xT OPdBpz3gwOEeW6loZNpbOVD9FCw4RzEbcwodZ05tARsCIwbsOibKL47Ks vzqUZppSXPDIB1DWyjr8q4/il3nRX+vYsyxCActpJ6j8VMR6yb2axnnTZ hh/qwi3BJq7jhVdQ2KkbneKUBSQJi5P5dUv2K86d/aVJY9FQmlJNp6ROi LV6J7fe7OhX6FVNvf+evOs3O8p5cRdg1OeDQxXPocehaqRh9gcJHz8I2l 0yw6PsmnHhN+C6bO7J9HUMUOC1pnOE2E7DPW3nxFCoOAE4VFwRXU1M3Vh w==; X-IronPort-AV: E=McAfee;i="6600,9927,10962"; a="1836106" X-IronPort-AV: E=Sophos;i="6.05,216,1701158400"; d="scan'208";a="1836106" Received: from fmsmga005.fm.intel.com ([10.253.24.32]) by orvoesa107.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Jan 2024 12:04:23 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10962"; a="1117735125" X-IronPort-AV: E=Sophos;i="6.05,216,1701158400"; d="scan'208";a="1117735125" Received: from vverma7-mobl3.amr.corp.intel.com (HELO [10.0.0.223]) ([10.251.14.61]) by fmsmga005-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Jan 2024 12:04:21 -0800 From: Vishal Verma Date: Wed, 24 Jan 2024 12:03:47 -0800 Subject: [PATCH v7 2/5] dax/bus.c: replace several sprintf() with sysfs_emit() MIME-Version: 1.0 Message-Id: <20240124-vv-dax_abi-v7-2-20d16cb8d23d@intel.com> References: <20240124-vv-dax_abi-v7-0-20d16cb8d23d@intel.com> In-Reply-To: <20240124-vv-dax_abi-v7-0-20d16cb8d23d@intel.com> To: Dan Williams , Vishal Verma , Dave Jiang , Andrew Morton , Oscar Salvador Cc: linux-kernel@vger.kernel.org, nvdimm@lists.linux.dev, linux-cxl@vger.kernel.org, David Hildenbrand , Dave Hansen , Huang Ying , Greg Kroah-Hartman , Matthew Wilcox , linux-mm@kvack.org X-Mailer: b4 0.13-dev-a684c X-Developer-Signature: v=1; a=openpgp-sha256; l=5299; i=vishal.l.verma@intel.com; h=from:subject:message-id; bh=mYxIOVPok7zj5mr8tVu7SKhQQM/JT9jvpgs14KCdT/k=; b=owGbwMvMwCXGf25diOft7jLG02pJDKkbc51aNcyLd59M+vAnQJuv9cKWC28PyIbMOhQbrPNRb 5Gwv/eUjlIWBjEuBlkxRZa/ez4yHpPbns8TmOAIM4eVCWQIAxenAExkWxfD/+qY2iizk+avOY5J 8LJd2RK2fdbEB2ZbOKZFZ3Xsal646Bgjw5SvBW9Pvevi+zG/+cnxb//auSw3rG44w/3KtfNY85z J+XwA X-Developer-Key: i=vishal.l.verma@intel.com; a=openpgp; fpr=F8682BE134C67A12332A2ED07AFA61BEA3B84DFF X-Rspamd-Server: rspam09 X-Rspamd-Queue-Id: 6492140011 X-Stat-Signature: dd63ui8scfpj6nf5neow95epwgcterre X-Rspam-User: X-HE-Tag: 1706126665-238234 X-HE-Meta: U2FsdGVkX19o6bzQcDdp9FGk+kUdYdkIgq9qshZr7VweOeCv7ivMCKaOgpDz5UPcRq9Ly1nEtOGmLiqiVZGN8FoW9mvYR80aJJA2Fk5IdETjpJH1i/ecgeTzcfujr/0JTplN+6c5TL+3m1MUEcCGyShhwThnt2PeWoGFqyNMKo6ABrID4dalIXmfYyDJUdwiRkx7LBhagvfNxdA3jnLoX9DRKUFygzLOmQgbLVmQ7ANeAL8ONBdZ5/hPEqVX277+0KTh1ns3H2DBPNOMhK6/J1xYMCwauwBsDTKm2HC5UENxkqo1Yg0sNGb9xIyKy3SasmBizEIQtlxuny5vOYn1ez+09RmR78M3TQTYHuIryt6lBYKB8SS7w94G7FC8t2ggdkTJKhGHGAvwKOK01az/I5KZG2urLp23uzaE9hRz41Q8cvqYNmj9XzAN3WC7wZEh1z5rmT6mmmSDlHCAwf1Xg/7GyQMOOjsF129IXzLFv/FHJbC4iX5pZsgDPMUDE/P9JPPGyAfy3dJcVRys7O1E2h9ckdTwT7L8zKQ14KA5/EC4szTSWRgXfj7EbNLzpvrkEUwcUlIUUDxBp3AniV/Ge7sc9db6kh5HV97XZhbV489di66yyhF3aEuigP+RAPUkwEoCs/qnmVjj0LMbzfh2GqLio22AzDbQe8F+Lnd+M8ln0Hk5z7BPEPk/gd6nc+O7+aOI9OVhd244BybUnRmuKWOCkeZdL8czSZ3DwSs5Md0iu2TK6xuZ+yf27WnjoygSrqIc+rXTwSjsgH79YpgVXT0rRSaIQSdjLxjpe6u+5PQRptKG/UlVxPjjWQqbAm6bMICVahJbovOw7fglyty9tiAAwg8AHQd3RSXFrMxOZV+UIkSBrkSfxzBRoFkuvnsWFoTvL/Xr0eLdgoT8NNbQ9y/Tm+mBQN2OY7mCQYcUZ+oAOtHm+FtisukKYBPeRcmqdxB1RXPrFmyrR11ltU/ sbjkB9/D HHVQX/QI/2pWXI9wdzVI8Ov74wH8cwWPFrjN/I0E4uzgkc7N1AwnQ/67XOwVAs/m+f1PnWqbWQyLiwbJftqGb3Zw0coOp2PZeqaituNgLeRUFOZj6FRattqh1+7iQUQfUOutXD0YzcUu4Jxs8h+FHewOelroY1smhCn3SzQkD2s6OcYQsnWjYHgSQgIqdJBj//nYA5EPdsLtZFn05y6OBySeXVM8pBQ3jB3rSK0dyILGekODHaCvt5FhJW1X0zv92Hvu+cxWB7x1P8cZSm7uXT+eTZg== X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: There were several places where drivers/dax/bus.c uses 'sprintf' to print sysfs data. Since a sysfs_emit() helper is available specifically for this purpose, replace all the sprintf() usage for sysfs with sysfs_emit() in this file. Cc: Dan Williams Reported-by: Greg Kroah-Hartman Signed-off-by: Vishal Verma Reviewed-by: Alison Schofield --- drivers/dax/bus.c | 32 ++++++++++++++++---------------- 1 file changed, 16 insertions(+), 16 deletions(-) diff --git a/drivers/dax/bus.c b/drivers/dax/bus.c index cb148f74ceda..0fd948a4443e 100644 --- a/drivers/dax/bus.c +++ b/drivers/dax/bus.c @@ -269,7 +269,7 @@ static ssize_t id_show(struct device *dev, { struct dax_region *dax_region = dev_get_drvdata(dev); - return sprintf(buf, "%d\n", dax_region->id); + return sysfs_emit(buf, "%d\n", dax_region->id); } static DEVICE_ATTR_RO(id); @@ -278,8 +278,8 @@ static ssize_t region_size_show(struct device *dev, { struct dax_region *dax_region = dev_get_drvdata(dev); - return sprintf(buf, "%llu\n", (unsigned long long) - resource_size(&dax_region->res)); + return sysfs_emit(buf, "%llu\n", + (unsigned long long)resource_size(&dax_region->res)); } static struct device_attribute dev_attr_region_size = __ATTR(size, 0444, region_size_show, NULL); @@ -289,7 +289,7 @@ static ssize_t region_align_show(struct device *dev, { struct dax_region *dax_region = dev_get_drvdata(dev); - return sprintf(buf, "%u\n", dax_region->align); + return sysfs_emit(buf, "%u\n", dax_region->align); } static struct device_attribute dev_attr_region_align = __ATTR(align, 0400, region_align_show, NULL); @@ -322,7 +322,7 @@ static ssize_t available_size_show(struct device *dev, size = dax_region_avail_size(dax_region); up_read(&dax_region_rwsem); - return sprintf(buf, "%llu\n", size); + return sysfs_emit(buf, "%llu\n", size); } static DEVICE_ATTR_RO(available_size); @@ -340,7 +340,7 @@ static ssize_t seed_show(struct device *dev, if (rc) return rc; seed = dax_region->seed; - rc = sprintf(buf, "%s\n", seed ? dev_name(seed) : ""); + rc = sysfs_emit(buf, "%s\n", seed ? dev_name(seed) : ""); up_read(&dax_region_rwsem); return rc; @@ -361,7 +361,7 @@ static ssize_t create_show(struct device *dev, if (rc) return rc; youngest = dax_region->youngest; - rc = sprintf(buf, "%s\n", youngest ? dev_name(youngest) : ""); + rc = sysfs_emit(buf, "%s\n", youngest ? dev_name(youngest) : ""); up_read(&dax_region_rwsem); return rc; @@ -763,7 +763,7 @@ static ssize_t start_show(struct device *dev, dax_range = get_dax_range(dev); if (!dax_range) return -ENXIO; - rc = sprintf(buf, "%#llx\n", dax_range->range.start); + rc = sysfs_emit(buf, "%#llx\n", dax_range->range.start); put_dax_range(); return rc; @@ -779,7 +779,7 @@ static ssize_t end_show(struct device *dev, dax_range = get_dax_range(dev); if (!dax_range) return -ENXIO; - rc = sprintf(buf, "%#llx\n", dax_range->range.end); + rc = sysfs_emit(buf, "%#llx\n", dax_range->range.end); put_dax_range(); return rc; @@ -795,7 +795,7 @@ static ssize_t pgoff_show(struct device *dev, dax_range = get_dax_range(dev); if (!dax_range) return -ENXIO; - rc = sprintf(buf, "%#lx\n", dax_range->pgoff); + rc = sysfs_emit(buf, "%#lx\n", dax_range->pgoff); put_dax_range(); return rc; @@ -969,7 +969,7 @@ static ssize_t size_show(struct device *dev, size = dev_dax_size(dev_dax); up_write(&dax_dev_rwsem); - return sprintf(buf, "%llu\n", size); + return sysfs_emit(buf, "%llu\n", size); } static bool alloc_is_aligned(struct dev_dax *dev_dax, resource_size_t size) @@ -1233,7 +1233,7 @@ static ssize_t align_show(struct device *dev, { struct dev_dax *dev_dax = to_dev_dax(dev); - return sprintf(buf, "%d\n", dev_dax->align); + return sysfs_emit(buf, "%d\n", dev_dax->align); } static ssize_t dev_dax_validate_align(struct dev_dax *dev_dax) @@ -1311,7 +1311,7 @@ static ssize_t target_node_show(struct device *dev, { struct dev_dax *dev_dax = to_dev_dax(dev); - return sprintf(buf, "%d\n", dev_dax_target_node(dev_dax)); + return sysfs_emit(buf, "%d\n", dev_dax_target_node(dev_dax)); } static DEVICE_ATTR_RO(target_node); @@ -1327,7 +1327,7 @@ static ssize_t resource_show(struct device *dev, else start = dev_dax->ranges[0].range.start; - return sprintf(buf, "%#llx\n", start); + return sysfs_emit(buf, "%#llx\n", start); } static DEVICE_ATTR(resource, 0400, resource_show, NULL); @@ -1338,14 +1338,14 @@ static ssize_t modalias_show(struct device *dev, struct device_attribute *attr, * We only ever expect to handle device-dax instances, i.e. the * @type argument to MODULE_ALIAS_DAX_DEVICE() is always zero */ - return sprintf(buf, DAX_DEVICE_MODALIAS_FMT "\n", 0); + return sysfs_emit(buf, DAX_DEVICE_MODALIAS_FMT "\n", 0); } static DEVICE_ATTR_RO(modalias); static ssize_t numa_node_show(struct device *dev, struct device_attribute *attr, char *buf) { - return sprintf(buf, "%d\n", dev_to_node(dev)); + return sysfs_emit(buf, "%d\n", dev_to_node(dev)); } static DEVICE_ATTR_RO(numa_node); From patchwork Wed Jan 24 20:03:48 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "Verma, Vishal L" X-Patchwork-Id: 13529617 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 07B13C46CD2 for ; Wed, 24 Jan 2024 20:04:34 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id E52E26B0088; Wed, 24 Jan 2024 15:04:28 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id DA84E6B0087; Wed, 24 Jan 2024 15:04:28 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id BFC626B0088; Wed, 24 Jan 2024 15:04:28 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0013.hostedemail.com [216.40.44.13]) by kanga.kvack.org (Postfix) with ESMTP id A8DEC6B0083 for ; Wed, 24 Jan 2024 15:04:28 -0500 (EST) Received: from smtpin08.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay04.hostedemail.com (Postfix) with ESMTP id 6C29C1A0415 for ; Wed, 24 Jan 2024 20:04:28 +0000 (UTC) X-FDA: 81715281816.08.31B2550 Received: from mgamail.intel.com (mgamail.intel.com [198.175.65.15]) by imf27.hostedemail.com (Postfix) with ESMTP id 4018D4000F for ; Wed, 24 Jan 2024 20:04:26 +0000 (UTC) Authentication-Results: imf27.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b="J/X21R3v"; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf27.hostedemail.com: domain of vishal.l.verma@intel.com designates 198.175.65.15 as permitted sender) smtp.mailfrom=vishal.l.verma@intel.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1706126666; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=XUmZz8pgbOQyFoQipuzs/q5/a1ezSTUJLIhuIbBITmw=; b=gdq+C+ft+iSLmT+0xJxk7+qI5tNbng4Yhu4CmOIQnpfPwJoUocF6EfAPK2iyMTdqGQZdfl a0kax4EHKrqcMXzFb/oWEKcfbDEzOwkaOKKEeJh5PNNvdYhrRUmKE1n/fcYxIXVeP8cfHX 6OWVKNBMlILHgVBNupDH8PIJOBjTYtI= ARC-Authentication-Results: i=1; imf27.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b="J/X21R3v"; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf27.hostedemail.com: domain of vishal.l.verma@intel.com designates 198.175.65.15 as permitted sender) smtp.mailfrom=vishal.l.verma@intel.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1706126666; a=rsa-sha256; cv=none; b=VtHgAGE5ZGVRWN40xVbBdTwPEzbMwCGx0JK4k1MibNtJlJCuKNXBaok16bINwPmkLAd+m9 cwGpSk3xTlnsXJyI2R+M9Av0z3VP2kjTdJNM0DYvzNZcE25GzOn6HsuTWZKwq8Pgz/AAb7 f+k+c5LknCmKZ1njaX6GEKWJOJ7O9cY= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1706126666; x=1737662666; h=from:date:subject:mime-version:content-transfer-encoding: message-id:references:in-reply-to:to:cc; bh=pbx8GN6oF7bpvrPHhUbumLfrIjuzdnl/0AJX9PTq6Qc=; b=J/X21R3vINkmSZ5Gik1UNJWwHK+buzpU22mesBJr3nk3c9m2UMmlg+3Z IOEU/YAsxOhXE6UYcirv1DykOoxj8xtOU7lFdymKw00j2/saERWtzJ5vm l9wFIgx/atsY97JnKLlXte+7cZvU8aywGWTI58oX79zUc6ON+m2OpGeFj vZfHyo56nJuS/ugccIK1Y8/CgAXhdsD+3efoCo0R+OlXwCb4zIXWA1quH +dk3JSwpJ5/tYl7c4Df4UW7ImQTWqOGnLjC4X5w74grg7LYzRrbHfUV91 e5zodPBDVYCcRxNsX6LFpMxFz+u05Gdkxkehh6N9rdSAj3Zbi4PT9EizR g==; X-IronPort-AV: E=McAfee;i="6600,9927,10962"; a="1836115" X-IronPort-AV: E=Sophos;i="6.05,216,1701158400"; d="scan'208";a="1836115" Received: from fmsmga005.fm.intel.com ([10.253.24.32]) by orvoesa107.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Jan 2024 12:04:24 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10962"; a="1117735135" X-IronPort-AV: E=Sophos;i="6.05,216,1701158400"; d="scan'208";a="1117735135" Received: from vverma7-mobl3.amr.corp.intel.com (HELO [10.0.0.223]) ([10.251.14.61]) by fmsmga005-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Jan 2024 12:04:22 -0800 From: Vishal Verma Date: Wed, 24 Jan 2024 12:03:48 -0800 Subject: [PATCH v7 3/5] Documentatiion/ABI: Add ABI documentation for sys-bus-dax MIME-Version: 1.0 Message-Id: <20240124-vv-dax_abi-v7-3-20d16cb8d23d@intel.com> References: <20240124-vv-dax_abi-v7-0-20d16cb8d23d@intel.com> In-Reply-To: <20240124-vv-dax_abi-v7-0-20d16cb8d23d@intel.com> To: Dan Williams , Vishal Verma , Dave Jiang , Andrew Morton , Oscar Salvador Cc: linux-kernel@vger.kernel.org, nvdimm@lists.linux.dev, linux-cxl@vger.kernel.org, David Hildenbrand , Dave Hansen , Huang Ying , Greg Kroah-Hartman , Matthew Wilcox , linux-mm@kvack.org X-Mailer: b4 0.13-dev-a684c X-Developer-Signature: v=1; a=openpgp-sha256; l=5934; i=vishal.l.verma@intel.com; h=from:subject:message-id; bh=pbx8GN6oF7bpvrPHhUbumLfrIjuzdnl/0AJX9PTq6Qc=; b=owGbwMvMwCXGf25diOft7jLG02pJDKkbc505nke437328TqL7m7B7XeMf4c4z14x5ZK/0LvjS 1xWGTi97ChlYRDjYpAVU2T5u+cj4zG57fk8gQmOMHNYmUCGMHBxCsBEvtQxMmzIv34xu9h1/lQW WXGViZ17Pltryjld/3Ve88XRDX2x874y/LM8U6QgKhpdXrq7oI+tdtmOgq1avXd62RekHvr6/FK NPw8A X-Developer-Key: i=vishal.l.verma@intel.com; a=openpgp; fpr=F8682BE134C67A12332A2ED07AFA61BEA3B84DFF X-Rspam-User: X-Rspamd-Server: rspam12 X-Rspamd-Queue-Id: 4018D4000F X-Stat-Signature: xsgehc4dxmnjri3abam6nhr78ojc1gbr X-HE-Tag: 1706126666-452768 X-HE-Meta: U2FsdGVkX18uLZMDQgLGdLPNJDihEkL+ByX06G6Mup7Sdz6WN2T/9MuyBqlVxQe3bb8dJM0MXyL1EZWylvSmYMNyOnrcArLfM/eGvLcQdnNL2qXz9ure7OOiM9r/jHbYQy610ymT10HuZ8T5qAHWk+a47SaY/KwtS14THNollWO1OTo3pEzTFCRLAmdC7iRL9V7UZBF3ZOdXfTJoZuo+dlHO+kMZj+BnRbgxXX4quXz9pQCDEwKroxh8v9vxmidibwGN5bBKediUT54/ZTkICGdDvtrcFUaQaaNdHbH83yxP0d52FZ4ir/LYoC6PfTr20XiOdkCxZckpeM7tIMhuNH8tyUsg0yv7AYwIOcqLosFB7Ly+dXe53fbHMJlj0Q4wJlhblQP/JZSqPVYdBcr+b4llywIYPrvnDbaX1f2QiYPdhtQJJTzxzvAFp7g1j9c9IxYYbaZm3IBNeKnIvkvvwSA2Xxv50BvABPEZJCUlDdvcyVNLgmdPLRfvnFWZx5gYEBSuyfQ8dEVj7GFJMcEGbCKE6i2JvpxeUHj13FnbrKbAR1kSg1f07dpwnNTCXlmuyeC3pjan5dgLNFJvHjVrKJqUAQ5cY+0mN/JznMRQ53jA0fSWTRQiZXGZJ4mS7BV6fcwRoGhSvdBG07tg1nfCBuPeiEL85vQRrAeGyjD4EkwJAYB6HmX7xvDhBodz7HlWSZR13MNTc15mZKHycdncKsp5TzgMh3GMHDG9hD4X7ydVPUvIPemSKPAm6i4CEPAm62+SCFlI02Sw+GGhfAB2LtoHApFSHEe7U1J30wMfg14DfhF2T8zswAzupzIlM+dXZKIq8M4LK3+1W7SoMSBKdgqYmeCIzivVNCdjc8d2XdTcJNNmwqBPI1qaJt9YFpY3CtWbw3IMeVZUNPZHg6GkE0R6UFzDfOqjtiOJNsC+LIM5sARa1a4rIOtg7caTWfZkyBN39QboHE5uh7L8mzm 5eFhvoeW LkmpBc5IUGUNGr4jJ9xYHmE3pjEGWOpQPGWvrK29g5xtnuicgTPVnblXJtpddw4NZE/V2ZyKzeXhQAWvdOEWBmyiUd/5iU097P0sx/aYxaOoCOYktcxKLS3WLOJglpvwT+FT5iB+gFL21+16LRojNPK2g4m1VQxN+67MkQ/VGbmbiY9O//8ns4+cXKjvhE42GyoDB X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: Add the missing sysfs ABI documentation for the device DAX subsystem. Various ABI attributes under this have been present since v5.1, and more have been added over time. In preparation for adding a new attribute, add this file with the historical details. Cc: Dan Williams Signed-off-by: Vishal Verma --- Documentation/ABI/testing/sysfs-bus-dax | 136 ++++++++++++++++++++++++++++++++ 1 file changed, 136 insertions(+) diff --git a/Documentation/ABI/testing/sysfs-bus-dax b/Documentation/ABI/testing/sysfs-bus-dax new file mode 100644 index 000000000000..6359f7bc9bf4 --- /dev/null +++ b/Documentation/ABI/testing/sysfs-bus-dax @@ -0,0 +1,136 @@ +What: /sys/bus/dax/devices/daxX.Y/align +Date: October, 2020 +KernelVersion: v5.10 +Contact: nvdimm@lists.linux.dev +Description: + (RW) Provides a way to specify an alignment for a dax device. + Values allowed are constrained by the physical address ranges + that back the dax device, and also by arch requirements. + +What: /sys/bus/dax/devices/daxX.Y/mapping +Date: October, 2020 +KernelVersion: v5.10 +Contact: nvdimm@lists.linux.dev +Description: + (WO) Provides a way to allocate a mapping range under a dax + device. Specified in the format -. + +What: /sys/bus/dax/devices/daxX.Y/mapping[0..N]/start +What: /sys/bus/dax/devices/daxX.Y/mapping[0..N]/end +What: /sys/bus/dax/devices/daxX.Y/mapping[0..N]/page_offset +Date: October, 2020 +KernelVersion: v5.10 +Contact: nvdimm@lists.linux.dev +Description: + (RO) A dax device may have multiple constituent discontiguous + address ranges. These are represented by the different + 'mappingX' subdirectories. The 'start' attribute indicates the + start physical address for the given range. The 'end' attribute + indicates the end physical address for the given range. The + 'page_offset' attribute indicates the offset of the current + range in the dax device. + +What: /sys/bus/dax/devices/daxX.Y/resource +Date: June, 2019 +KernelVersion: v5.3 +Contact: nvdimm@lists.linux.dev +Description: + (RO) The resource attribute indicates the starting physical + address of a dax device. In case of a device with multiple + constituent ranges, it indicates the starting address of the + first range. + +What: /sys/bus/dax/devices/daxX.Y/size +Date: October, 2020 +KernelVersion: v5.10 +Contact: nvdimm@lists.linux.dev +Description: + (RW) The size attribute indicates the total size of a dax + device. For creating subdivided dax devices, or for resizing + an existing device, the new size can be written to this as + part of the reconfiguration process. + +What: /sys/bus/dax/devices/daxX.Y/numa_node +Date: November, 2019 +KernelVersion: v5.5 +Contact: nvdimm@lists.linux.dev +Description: + (RO) If NUMA is enabled and the platform has affinitized the + backing device for this dax device, emit the CPU node + affinity for this device. + +What: /sys/bus/dax/devices/daxX.Y/target_node +Date: February, 2019 +KernelVersion: v5.1 +Contact: nvdimm@lists.linux.dev +Description: + (RO) The target-node attribute is the Linux numa-node that a + device-dax instance may create when it is online. Prior to + being online the device's 'numa_node' property reflects the + closest online cpu node which is the typical expectation of a + device 'numa_node'. Once it is online it becomes its own + distinct numa node. + +What: $(readlink -f /sys/bus/dax/devices/daxX.Y)/../dax_region/available_size +Date: October, 2020 +KernelVersion: v5.10 +Contact: nvdimm@lists.linux.dev +Description: + (RO) The available_size attribute tracks available dax region + capacity. This only applies to volatile hmem devices, not pmem + devices, since pmem devices are defined by nvdimm namespace + boundaries. + +What: $(readlink -f /sys/bus/dax/devices/daxX.Y)/../dax_region/size +Date: July, 2017 +KernelVersion: v5.1 +Contact: nvdimm@lists.linux.dev +Description: + (RO) The size attribute indicates the size of a given dax region + in bytes. + +What: $(readlink -f /sys/bus/dax/devices/daxX.Y)/../dax_region/align +Date: October, 2020 +KernelVersion: v5.10 +Contact: nvdimm@lists.linux.dev +Description: + (RO) The align attribute indicates alignment of the dax region. + Changes on align may not always be valid, when say certain + mappings were created with 2M and then we switch to 1G. This + validates all ranges against the new value being attempted, post + resizing. + +What: $(readlink -f /sys/bus/dax/devices/daxX.Y)/../dax_region/seed +Date: October, 2020 +KernelVersion: v5.10 +Contact: nvdimm@lists.linux.dev +Description: + (RO) The seed device is a concept for dynamic dax regions to be + able to split the region amongst multiple sub-instances. The + seed device, similar to libnvdimm seed devices, is a device + that starts with zero capacity allocated and unbound to a + driver. + +What: $(readlink -f /sys/bus/dax/devices/daxX.Y)/../dax_region/create +Date: October, 2020 +KernelVersion: v5.10 +Contact: nvdimm@lists.linux.dev +Description: + (RW) The create interface to the dax region provides a way to + create a new unconfigured dax device under the given region, which + can then be configured (with a size etc.) and then probed. + +What: $(readlink -f /sys/bus/dax/devices/daxX.Y)/../dax_region/delete +Date: October, 2020 +KernelVersion: v5.10 +Contact: nvdimm@lists.linux.dev +Description: + (WO) The delete interface for a dax region provides for deletion + of any 0-sized and idle dax devices. + +What: $(readlink -f /sys/bus/dax/devices/daxX.Y)/../dax_region/id +Date: July, 2017 +KernelVersion: v5.1 +Contact: nvdimm@lists.linux.dev +Description: + (RO) The id attribute indicates the region id of a dax region. From patchwork Wed Jan 24 20:03:49 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "Verma, Vishal L" X-Patchwork-Id: 13529619 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 85349C47DDF for ; Wed, 24 Jan 2024 20:04:39 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 631856B0089; Wed, 24 Jan 2024 15:04:31 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 5DEEA6B008A; Wed, 24 Jan 2024 15:04:31 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 483D36B008C; Wed, 24 Jan 2024 15:04:31 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0011.hostedemail.com [216.40.44.11]) by kanga.kvack.org (Postfix) with ESMTP id 348E26B0089 for ; Wed, 24 Jan 2024 15:04:31 -0500 (EST) Received: from smtpin20.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay03.hostedemail.com (Postfix) with ESMTP id 0762CA0C79 for ; Wed, 24 Jan 2024 20:04:29 +0000 (UTC) X-FDA: 81715281900.20.57E70DC Received: from mgamail.intel.com (mgamail.intel.com [198.175.65.15]) by imf12.hostedemail.com (Postfix) with ESMTP id C225240003 for ; Wed, 24 Jan 2024 20:04:26 +0000 (UTC) Authentication-Results: imf12.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=gSWo6nAC; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf12.hostedemail.com: domain of vishal.l.verma@intel.com designates 198.175.65.15 as permitted sender) smtp.mailfrom=vishal.l.verma@intel.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1706126667; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=a6zYS5tHYyD516jLQ0b49vxvCgIzBSqergsiT+dJduA=; b=ZI6qe9sLXq4sJLdJisrf9c5kEg6xehWmFxd2sTtPqzHQFLMKST24Jyo0TqVB+1j6ciB4fy gfqzaBlYuWuccU4En/WGyDB44kka3WozLVs+k/IeqT9ISCja6bEJChUC5d3ZpT3kOjz4ZC rO7POpsiO5y1ga2EqLMbULxv8Sbz0cc= ARC-Authentication-Results: i=1; imf12.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=gSWo6nAC; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf12.hostedemail.com: domain of vishal.l.verma@intel.com designates 198.175.65.15 as permitted sender) smtp.mailfrom=vishal.l.verma@intel.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1706126667; a=rsa-sha256; cv=none; b=43danTPzaV5Sy9+oDfhB7x3156xBJKgxj4aFQZyPYSAJ2qmHRGzlKCtB8mnR5fTBj6IhRU FDQxlhXwaOKv+diBf7m1FBUiDFQN3xkC6GNMjx0xd474ioENX7Ix67BeuIQvlDi8GtgFw7 zX0IIawRLY5XeeiU21/CyHJIYUIsh/8= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1706126667; x=1737662667; h=from:date:subject:mime-version:content-transfer-encoding: message-id:references:in-reply-to:to:cc; bh=Orl+mZtfo3P92OAi6zjw0XQv4aRvq83sE++FPaqZvSw=; b=gSWo6nACemiOSOltcdG1U02I3JRFe56ZrRV3GBN7cNOF7b4SsrUHeFGk fBNDHKY5/+VHyPeIg0zu2Y9xEinbLwzH3guirQzm5ZjFbFVBRW1JM2vvV MGz46Vdyoik65/AzCDsgYx4PsEZNpBf4pxyNj8apcvKUg4eoOp9nM6+4O ew/9m80fmiENv5pobikSE968uQunrgMI7U+eMSic4fOmfzS/n6da8J+If K7t/Cg7N5J+tNW6rmwj46a0ki6M/EkAk+8NPSCvmZSj6W+peomu/oRoe8 yZIYXp/HeTELZasOAzih7ozYvdC0WEqTTmfA4cMA5TXU4GAKLf5ambFx/ g==; X-IronPort-AV: E=McAfee;i="6600,9927,10962"; a="1836122" X-IronPort-AV: E=Sophos;i="6.05,216,1701158400"; d="scan'208";a="1836122" Received: from fmsmga005.fm.intel.com ([10.253.24.32]) by orvoesa107.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Jan 2024 12:04:25 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10962"; a="1117735142" X-IronPort-AV: E=Sophos;i="6.05,216,1701158400"; d="scan'208";a="1117735142" Received: from vverma7-mobl3.amr.corp.intel.com (HELO [10.0.0.223]) ([10.251.14.61]) by fmsmga005-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Jan 2024 12:04:23 -0800 From: Vishal Verma Date: Wed, 24 Jan 2024 12:03:49 -0800 Subject: [PATCH v7 4/5] mm/memory_hotplug: export mhp_supports_memmap_on_memory() MIME-Version: 1.0 Message-Id: <20240124-vv-dax_abi-v7-4-20d16cb8d23d@intel.com> References: <20240124-vv-dax_abi-v7-0-20d16cb8d23d@intel.com> In-Reply-To: <20240124-vv-dax_abi-v7-0-20d16cb8d23d@intel.com> To: Dan Williams , Vishal Verma , Dave Jiang , Andrew Morton , Oscar Salvador Cc: linux-kernel@vger.kernel.org, nvdimm@lists.linux.dev, linux-cxl@vger.kernel.org, David Hildenbrand , Dave Hansen , Huang Ying , Greg Kroah-Hartman , Matthew Wilcox , linux-mm@kvack.org, Michal Hocko X-Mailer: b4 0.13-dev-a684c X-Developer-Signature: v=1; a=openpgp-sha256; l=4673; i=vishal.l.verma@intel.com; h=from:subject:message-id; bh=Orl+mZtfo3P92OAi6zjw0XQv4aRvq83sE++FPaqZvSw=; b=owGbwMvMwCXGf25diOft7jLG02pJDKkbc523Ta9vOi09a3bGqYv8n6JOx6T18S1boOA25RJ7L P+fjzrKHaUsDGJcDLJiiix/93xkPCa3PZ8nMMERZg4rE8gQBi5OAZjItkpGhoPKq2Z5afNceV/D XRrhep3dROdqhKT6hrTy71VRU+ReCTIybOX1yTxzjmHHz5CFdRe+Lru4J7D9BkOBr/zmL9+PJae u5AIA X-Developer-Key: i=vishal.l.verma@intel.com; a=openpgp; fpr=F8682BE134C67A12332A2ED07AFA61BEA3B84DFF X-Rspam-User: X-Rspamd-Server: rspam12 X-Rspamd-Queue-Id: C225240003 X-Stat-Signature: tgio8wqdtixh6e89nqd8qouh9n35mmq6 X-HE-Tag: 1706126666-245908 X-HE-Meta: U2FsdGVkX19PufaVKx3IbGJIQp7ganrmxzRCya3XP4Q23VfQk3vbIIWGs06+/h05tHQNPOxj9Vxeq9vYcFcNCvGqslVYJgzsgoAfV+8BPCoGvY1EtGhxYmyWjhOKRJaQy532pWGf8G+BUZijCKR5RlRZ6KoLnQJnPlTT51xrLuF2tgsjgkbvrMJ0NqY4HNR1FduBSUOW0nOwhtk0SrtFgu53ycPsDriRJb8EM1BYs3Ede9osULSSzLx+OQZUQ1BD1ava0HjV9bzszfVH3CoEqAy4usu0uNWta8fAFQSY3qBHHkF4+QaAFTEirvPr0veSS/bO4n5evdhJPRFgbXBAuNRQgrSvfHbk/5NexjpfzYoelIXsE52OHPg/B1ZjUhkYy6CGiFAZkQQS/9Y/1EQrIw0yFGDiItkngrz+41iHlGp7IzWnTXSIQ03hH1TXNcZWdfRBbNZcrIgkm0RrLirMqUyxEfJDHrx9mmtFWS9F8T0uPV9tIp4Y8RkpPBhbja96h2Cfdf0kzGivqt4JFBAGHknzMU7SrTy+4LmQ3dC3b7Cj6M8oIG0AnAy7rdevPIitmgjfGiUhfKcgy4tiCLGROEJuy4MKxtf+FGAzQuKF+8c7UFCAsmCNWp7v37ms0xULUaNBzjysP2uqDXRYuUxHMXkfKNFJJN4EUuOXpiULb4i+wv5MrJLwD4C8OaADndkyIYkWcBDPfZ6tHIflO3uy3/Fgze27sdiJuj1MEzh02xqmAbssWhtA2YbNIHBNg+GBBGa5g61A/WfT4+Btj80N/KM6+wJH1X7javrUGLGzMVsmif72NyOb66URCNl2oEvR47unKJSBtfTT+fKwZSHH5k6CG0bpZPjTVGquJ2kG3a6niS1PWZW43DWTYzmty1CXmsTMEWBQ5Sm4B0V294nvnlTn9j9J5Z0SZF97mI7KDmAcFBeUyrrfjNS8hW2/Lfbs5C+xvl2mT8VNOqY2cyZ HadApLMe 7NEB2aodaSBvYBfpP4OLLpDKpw27ryQk7au7LDixifthH03NSTxk3KuOp3ROmz3mUtNlrlIkOC1o5WRCVX6nJFEk+VCsyMb/oOSz9c57F2hZHirmEWDRE0/EcYj+mOFrPkvp3sxbV+bkEqWlspSl2tAtvTHVKbotLrangYYOIeUwRYrdWnooGZY7SyIoSj6H9+vWDkX+5CtcDlSUl/JeTDOctVFV1vyV82CQoOSvpyt83S4qfCLLeSDcboa6cP3l29a2nTq+7GZv4JEP2px1w6EkkGTU3GsUJZ6FZub0IzcwtT4xUZDB7I78+z2VKwCX43SSAJX+9JIrcem//tZ/kG39L4KJhl70D7qkuyXVuf84+PqE= X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: In preparation for adding sysfs ABI to toggle memmap_on_memory semantics for drivers adding memory, export the mhp_supports_memmap_on_memory() helper. This allows drivers to check if memmap_on_memory support is available before trying to request it, and display an appropriate message if it isn't available. As part of this, remove the size argument to this - with recent updates to allow memmap_on_memory for larger ranges, and the internal splitting of altmaps into respective memory blocks, the size argument is meaningless. Cc: Andrew Morton Cc: David Hildenbrand Cc: Michal Hocko Cc: Oscar Salvador Cc: Dan Williams Cc: Dave Jiang Cc: Dave Hansen Cc: Huang Ying Suggested-by: David Hildenbrand Acked-by: David Hildenbrand Signed-off-by: Vishal Verma --- include/linux/memory_hotplug.h | 6 ++++++ mm/memory_hotplug.c | 17 ++++++----------- 2 files changed, 12 insertions(+), 11 deletions(-) diff --git a/include/linux/memory_hotplug.h b/include/linux/memory_hotplug.h index 7d2076583494..ebc9d528f00c 100644 --- a/include/linux/memory_hotplug.h +++ b/include/linux/memory_hotplug.h @@ -121,6 +121,7 @@ struct mhp_params { bool mhp_range_allowed(u64 start, u64 size, bool need_mapping); struct range mhp_get_pluggable_range(bool need_mapping); +bool mhp_supports_memmap_on_memory(void); /* * Zone resizing functions @@ -262,6 +263,11 @@ static inline bool movable_node_is_enabled(void) return false; } +static bool mhp_supports_memmap_on_memory(void) +{ + return false; +} + static inline void pgdat_kswapd_lock(pg_data_t *pgdat) {} static inline void pgdat_kswapd_unlock(pg_data_t *pgdat) {} static inline void pgdat_kswapd_lock_init(pg_data_t *pgdat) {} diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c index 21890994c1d3..065fb4804f1b 100644 --- a/mm/memory_hotplug.c +++ b/mm/memory_hotplug.c @@ -1328,7 +1328,7 @@ static inline bool arch_supports_memmap_on_memory(unsigned long vmemmap_size) } #endif -static bool mhp_supports_memmap_on_memory(unsigned long size) +bool mhp_supports_memmap_on_memory(void) { unsigned long vmemmap_size = memory_block_memmap_size(); unsigned long memmap_pages = memory_block_memmap_on_memory_pages(); @@ -1337,17 +1337,11 @@ static bool mhp_supports_memmap_on_memory(unsigned long size) * Besides having arch support and the feature enabled at runtime, we * need a few more assumptions to hold true: * - * a) We span a single memory block: memory onlining/offlinin;g happens - * in memory block granularity. We don't want the vmemmap of online - * memory blocks to reside on offline memory blocks. In the future, - * we might want to support variable-sized memory blocks to make the - * feature more versatile. - * - * b) The vmemmap pages span complete PMDs: We don't want vmemmap code + * a) The vmemmap pages span complete PMDs: We don't want vmemmap code * to populate memory from the altmap for unrelated parts (i.e., * other memory blocks) * - * c) The vmemmap pages (and thereby the pages that will be exposed to + * b) The vmemmap pages (and thereby the pages that will be exposed to * the buddy) have to cover full pageblocks: memory onlining/offlining * code requires applicable ranges to be page-aligned, for example, to * set the migratetypes properly. @@ -1359,7 +1353,7 @@ static bool mhp_supports_memmap_on_memory(unsigned long size) * altmap as an alternative source of memory, and we do not exactly * populate a single PMD. */ - if (!mhp_memmap_on_memory() || size != memory_block_size_bytes()) + if (!mhp_memmap_on_memory()) return false; /* @@ -1382,6 +1376,7 @@ static bool mhp_supports_memmap_on_memory(unsigned long size) return arch_supports_memmap_on_memory(vmemmap_size); } +EXPORT_SYMBOL_GPL(mhp_supports_memmap_on_memory); static void __ref remove_memory_blocks_and_altmaps(u64 start, u64 size) { @@ -1515,7 +1510,7 @@ int __ref add_memory_resource(int nid, struct resource *res, mhp_t mhp_flags) * Self hosted memmap array */ if ((mhp_flags & MHP_MEMMAP_ON_MEMORY) && - mhp_supports_memmap_on_memory(memory_block_size_bytes())) { + mhp_supports_memmap_on_memory()) { ret = create_altmaps_and_memory_blocks(nid, group, start, size); if (ret) goto error; From patchwork Wed Jan 24 20:03:50 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "Verma, Vishal L" X-Patchwork-Id: 13529618 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id CD3C5C46CD2 for ; Wed, 24 Jan 2024 20:04:36 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id B6EB76B0083; Wed, 24 Jan 2024 15:04:30 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id AF5F96B0087; Wed, 24 Jan 2024 15:04:30 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 877CB6B0089; Wed, 24 Jan 2024 15:04:30 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0010.hostedemail.com [216.40.44.10]) by kanga.kvack.org (Postfix) with ESMTP id 709086B0083 for ; Wed, 24 Jan 2024 15:04:30 -0500 (EST) Received: from smtpin18.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay06.hostedemail.com (Postfix) with ESMTP id 1FEE6A2397 for ; Wed, 24 Jan 2024 20:04:30 +0000 (UTC) X-FDA: 81715281900.18.D5A9F24 Received: from mgamail.intel.com (mgamail.intel.com [198.175.65.15]) by imf01.hostedemail.com (Postfix) with ESMTP id BE40440010 for ; Wed, 24 Jan 2024 20:04:27 +0000 (UTC) Authentication-Results: imf01.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=aLCF1bZB; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf01.hostedemail.com: domain of vishal.l.verma@intel.com designates 198.175.65.15 as permitted sender) smtp.mailfrom=vishal.l.verma@intel.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1706126668; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=jL8JuNc8RZCQZhaTEBqkvXrxE+HNH+PtKP+EKzsZHzo=; b=n1DReCIQRghFXyB1sxz9qAlVEciHFc58+DeEWgFBo2lVlJdQQ9IKrbu8byEcn4KUbGxboM bv+x7XiKRYdNgWy3e4w2O5g4A9PGZOycwyYmMdiLqXUB/HtkCsXWt1A0uXaqM1rUAklCDU fjDzVmfnFcYTwGIvgR5jtDVAraEfPYU= ARC-Authentication-Results: i=1; imf01.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=aLCF1bZB; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf01.hostedemail.com: domain of vishal.l.verma@intel.com designates 198.175.65.15 as permitted sender) smtp.mailfrom=vishal.l.verma@intel.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1706126668; a=rsa-sha256; cv=none; b=pI1EvG10+VOJ5ihvut2UfMdp4AMOlGSQ7p+7gkSuXjw1pXWLAHfGI8ymhD1ISUNOF2cmXn Q0FF1DOgJB5dl0N5hJngnj9f3QSsuDth4OpU1aSpHrQ2GU93Ajew+5s8u5W9XfKd/v88+U toOKZd8OWyFgOdzbOid3qBtrwzkU9YI= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1706126668; x=1737662668; h=from:date:subject:mime-version:content-transfer-encoding: message-id:references:in-reply-to:to:cc; bh=LlNwsQ9722KaImXgMJUAbGVELQS9Qj5zVBOISdpjDqE=; b=aLCF1bZBWPOj1tmRUU+59PrnpDp2gSGOICdq5u30xXMgEhCKaKldT/7/ miEAAe7asjfogAEoI0Gn5kPr2igJoT3N7t+NtIZoCBlvQE8ZdVMSDIs3R 5/4iaXB+r5xt1K+W+FYjJOoVEh8hcPinUm77apunlG/ncsMTM4Au1LJuw 3Tjpx3r0/WpSHwG+GsdIUE9f4xSQ+laOsquYOaz6N3pRNcN2BtqqfMWOm /uZeqMzXLqg3ejdPARczqJB/ueRGI7W3DqqnGaTGENFxyodkIYT/OlA5I 8NTDGnvT3WK4DY7OIokOjS9wJFSUxJmLyaD7KjNxSxse68MYRFfNIumuS A==; X-IronPort-AV: E=McAfee;i="6600,9927,10962"; a="1836131" X-IronPort-AV: E=Sophos;i="6.05,216,1701158400"; d="scan'208";a="1836131" Received: from fmsmga005.fm.intel.com ([10.253.24.32]) by orvoesa107.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Jan 2024 12:04:26 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10962"; a="1117735146" X-IronPort-AV: E=Sophos;i="6.05,216,1701158400"; d="scan'208";a="1117735146" Received: from vverma7-mobl3.amr.corp.intel.com (HELO [10.0.0.223]) ([10.251.14.61]) by fmsmga005-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Jan 2024 12:04:25 -0800 From: Vishal Verma Date: Wed, 24 Jan 2024 12:03:50 -0800 Subject: [PATCH v7 5/5] dax: add a sysfs knob to control memmap_on_memory behavior MIME-Version: 1.0 Message-Id: <20240124-vv-dax_abi-v7-5-20d16cb8d23d@intel.com> References: <20240124-vv-dax_abi-v7-0-20d16cb8d23d@intel.com> In-Reply-To: <20240124-vv-dax_abi-v7-0-20d16cb8d23d@intel.com> To: Dan Williams , Vishal Verma , Dave Jiang , Andrew Morton , Oscar Salvador Cc: linux-kernel@vger.kernel.org, nvdimm@lists.linux.dev, linux-cxl@vger.kernel.org, David Hildenbrand , Dave Hansen , Huang Ying , Greg Kroah-Hartman , Matthew Wilcox , linux-mm@kvack.org, Li Zhijian , Jonathan Cameron X-Mailer: b4 0.13-dev-a684c X-Developer-Signature: v=1; a=openpgp-sha256; l=4012; i=vishal.l.verma@intel.com; h=from:subject:message-id; bh=LlNwsQ9722KaImXgMJUAbGVELQS9Qj5zVBOISdpjDqE=; b=owGbwMvMwCXGf25diOft7jLG02pJDKkbc50VeD8v+CjLFhJhfFzu8RSjtH97erzz9l4Qf8Enc LfQ66hyRykLgxgXg6yYIsvfPR8Zj8ltz+cJTHCEmcPKBDKEgYtTACbiH8TwV1Ju2kePSfLnrhke 2viisS8mpjzc/qGgQObvl8E/GFTs9Rn+B7+fyvC/VG61dh1TyJMbdnVpnpttknc4N2ev5nkgpin HBAA= X-Developer-Key: i=vishal.l.verma@intel.com; a=openpgp; fpr=F8682BE134C67A12332A2ED07AFA61BEA3B84DFF X-Rspamd-Server: rspam09 X-Rspamd-Queue-Id: BE40440010 X-Stat-Signature: r1hiu5ahx99tft8p9j58pb3hgfm8bxen X-Rspam-User: X-HE-Tag: 1706126667-576059 X-HE-Meta: U2FsdGVkX1/XYQIubzb65urY0N3M2M3tGzwVRPw/4g7KNHPrHl8Fggcxe2l7NRvRAkPJmbbs6UpojdJVHmQ8Xwpozyt6WyGEs0TBxDqWAhoX/kzvnOQLot64yjR7xFGTbvdEh2pYymd8dyc9BOdzlfABVOgK3i2JHMl5F2vkxnAcoUBxfaQmk97UfDSyrT2uZslDf1rcvWDOMWmT/eq3chljgiiJJO63lJ2MHqO3HTvyVGAB3wZYlYSY5dhO/+fbGsz1MMYRe4UhA/4UbE6qTNmQrZLe7/MSmeOzczNLvFgSZWihJamPHNmVGk0SOLQmaxdB3khEWRoWUHpOQBrqu2mkgdlTqnOSqXMnQadbcsvDoXLByAp/noIgBKdtHHAyhtYKsG+cVerQaciTi4oonrx4BOcT8bq/893/0/cosGE1o5+7+esdZw3g/vAByZEI9Fi0Ckj6sQkQ9vjOSS/p+ZQfK/pswTH81zSvz78swQOl8Wsz5xbnmYBIm30dFeiK6YZPbJRtuWcgv65o5j4HJw2TTvIK5ylec0fLcZMCVTT+EreEXHw0CcfDFq7+jGcW0DUgz3JISnVWhvyilCG5a2rs0enAsrzzyXlBhXP53pmgQ1KLSMWInpcaJcgXY7ZHXWFXaAeIhuvLaBbSwGMlHG0/oi44jwn0xdWQMENFpRQdaaloQC4cOjZ1C3M8XtCJwzTmPYBhf7txFMPLx9bGJXB3YSgEwYbyr4aVF2ZNeJjzVFRXIWnA4uSupNHmU/dU0YjukXoVHH+7gGQGu2RF6/7ffy0p+3Ke+cuSmd6JU8zaaBVXZXhPris/AZi+nAwP4OYB6u1vewoAEcnU8wu1RC4hnaXIatNsGB9DDuy8aXeP0OHvZlqc8MZWkHJ32716gzBB5KdxPHsLRH4bBuD3YFDolTRj+zgubfsFz/cagdz6bVe1L9Cck349blMstHUvDCoKNRhcEGwELiqSFB6 k7bXDJL4 oJQismCpeIBTcvBRzDL6jdskJ6tNZo3rmkzUwyBGue4APZfvnzllqQKXY3eal3v/sD0No2LjX/DOyG/CiNmrAskhoiuI2H8Hl6gPvJfInj6lfMNOmwV0KQFUZpndjhp7vo2FgPZcuGlvWWMQB7g2+npGv32BTchn8DSsfX23nIAC53y6HLsYUd6YxsG+kVEYs9qh54dtkaEB+31rJHY650cXS6arXGs1VGFc5wvjVfxmkvOMU3eB0C8qoM0MXbMajECXBIujeAmYBaGz7ofZ784htm2n62/bWcsEuArqlCDZT+3EmptznRAqYhg== X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: Add a sysfs knob for dax devices to control the memmap_on_memory setting if the dax device were to be hotplugged as system memory. The default memmap_on_memory setting for dax devices originating via pmem or hmem is set to 'false' - i.e. no memmap_on_memory semantics, to preserve legacy behavior. For dax devices via CXL, the default is on. The sysfs control allows the administrator to override the above defaults if needed. Cc: David Hildenbrand Cc: Dan Williams Cc: Dave Jiang Cc: Dave Hansen Cc: Huang Ying Tested-by: Li Zhijian Reviewed-by: Jonathan Cameron Reviewed-by: David Hildenbrand Reviewed-by: Huang, Ying Signed-off-by: Vishal Verma Reviewed-by: Alison Schofield --- drivers/dax/bus.c | 43 +++++++++++++++++++++++++++++++++ Documentation/ABI/testing/sysfs-bus-dax | 17 +++++++++++++ 2 files changed, 60 insertions(+) diff --git a/drivers/dax/bus.c b/drivers/dax/bus.c index 0fd948a4443e..27c86d0ca711 100644 --- a/drivers/dax/bus.c +++ b/drivers/dax/bus.c @@ -1349,6 +1349,48 @@ static ssize_t numa_node_show(struct device *dev, } static DEVICE_ATTR_RO(numa_node); +static ssize_t memmap_on_memory_show(struct device *dev, + struct device_attribute *attr, char *buf) +{ + struct dev_dax *dev_dax = to_dev_dax(dev); + + return sysfs_emit(buf, "%d\n", dev_dax->memmap_on_memory); +} + +static ssize_t memmap_on_memory_store(struct device *dev, + struct device_attribute *attr, + const char *buf, size_t len) +{ + struct dev_dax *dev_dax = to_dev_dax(dev); + bool val; + int rc; + + rc = kstrtobool(buf, &val); + if (rc) + return rc; + + if (val == true && !mhp_supports_memmap_on_memory()) { + dev_dbg(dev, "memmap_on_memory is not available\n"); + return -EOPNOTSUPP; + } + + rc = down_write_killable(&dax_dev_rwsem); + if (rc) + return rc; + + if (dev_dax->memmap_on_memory != val && dev->driver && + to_dax_drv(dev->driver)->type == DAXDRV_KMEM_TYPE) { + up_write(&dax_dev_rwsem); + return -EBUSY; + } + + dev_dax->memmap_on_memory = val; + up_write(&dax_dev_rwsem); + + return len; +} +static DEVICE_ATTR_RW(memmap_on_memory); + static umode_t dev_dax_visible(struct kobject *kobj, struct attribute *a, int n) { struct device *dev = container_of(kobj, struct device, kobj); @@ -1375,6 +1417,7 @@ static struct attribute *dev_dax_attributes[] = { &dev_attr_align.attr, &dev_attr_resource.attr, &dev_attr_numa_node.attr, + &dev_attr_memmap_on_memory.attr, NULL, }; diff --git a/Documentation/ABI/testing/sysfs-bus-dax b/Documentation/ABI/testing/sysfs-bus-dax index 6359f7bc9bf4..b34266bfae49 100644 --- a/Documentation/ABI/testing/sysfs-bus-dax +++ b/Documentation/ABI/testing/sysfs-bus-dax @@ -134,3 +134,20 @@ KernelVersion: v5.1 Contact: nvdimm@lists.linux.dev Description: (RO) The id attribute indicates the region id of a dax region. + +What: /sys/bus/dax/devices/daxX.Y/memmap_on_memory +Date: January, 2024 +KernelVersion: v6.8 +Contact: nvdimm@lists.linux.dev +Description: + (RW) Control the memmap_on_memory setting if the dax device + were to be hotplugged as system memory. This determines whether + the 'altmap' for the hotplugged memory will be placed on the + device being hotplugged (memmap_on_memory=1) or if it will be + placed on regular memory (memmap_on_memory=0). This attribute + must be set before the device is handed over to the 'kmem' + driver (i.e. hotplugged into system-ram). Additionally, this + depends on CONFIG_MHP_MEMMAP_ON_MEMORY, and a globally enabled + memmap_on_memory parameter for memory_hotplug. This is + typically set on the kernel command line - + memory_hotplug.memmap_on_memory set to 'true' or 'force'."