From patchwork Mon Feb 17 01:43:27 2025 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Wupeng Ma X-Patchwork-Id: 13976984 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 46E26C02198 for ; Mon, 17 Feb 2025 01:52:55 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 7B341280026; Sun, 16 Feb 2025 20:52:54 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 7627428001A; Sun, 16 Feb 2025 20:52:54 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 60340280026; Sun, 16 Feb 2025 20:52:54 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0015.hostedemail.com [216.40.44.15]) by kanga.kvack.org (Postfix) with ESMTP id 40C39280024 for ; Sun, 16 Feb 2025 20:52:54 -0500 (EST) Received: from smtpin09.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay01.hostedemail.com (Postfix) with ESMTP id BA9111C8FA9 for ; Mon, 17 Feb 2025 01:52:53 +0000 (UTC) X-FDA: 83127763026.09.6B51DBD Received: from szxga05-in.huawei.com (szxga05-in.huawei.com [45.249.212.191]) by imf15.hostedemail.com (Postfix) with ESMTP id 40891A0009 for ; Mon, 17 Feb 2025 01:52:50 +0000 (UTC) Authentication-Results: imf15.hostedemail.com; dkim=none; spf=pass (imf15.hostedemail.com: domain of mawupeng1@huawei.com designates 45.249.212.191 as permitted sender) smtp.mailfrom=mawupeng1@huawei.com; dmarc=pass (policy=quarantine) header.from=huawei.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1739757172; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=C1Dqs96Z2lcIcEp69blI8r6Etb1dHW83gbnGOg0TDVE=; b=bu1lD/i0q9s+2itlS6Eef0ri4MJn+gE5gIMqZPBjO+hEfyTsDlN6a+MLUOU+PQoXxFmHXW VZd5H6aEayMQ4iKpommhrSnf1itTyr8+/PRtCo+AgRdnB4bcEzdkVjRn8iyBr6bfOILPtR Ni3ORM7fBa27FyXm8z3c9oNpBsMjtHM= ARC-Authentication-Results: i=1; imf15.hostedemail.com; dkim=none; spf=pass (imf15.hostedemail.com: domain of mawupeng1@huawei.com designates 45.249.212.191 as permitted sender) smtp.mailfrom=mawupeng1@huawei.com; dmarc=pass (policy=quarantine) header.from=huawei.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1739757172; a=rsa-sha256; cv=none; b=rD4zbT21L42rAfywBrMNdtcYsugA+CeMCUkGndscNER1nhwRR1NLwbT62IZIb+SAK4ohKs GVGFcF2fgyl+J79b64w+rtbTmLD+WCSXwXOGumrDze8OGDwdUimWaGwD/Er2g19dj0phzD qbF/Kxn+QgJseFPASvNLPQwnTTPCRD4= Received: from mail.maildlp.com (unknown [172.19.88.214]) by szxga05-in.huawei.com (SkyGuard) with ESMTP id 4Yx5Gv64C1z11P4m; Mon, 17 Feb 2025 09:51:19 +0800 (CST) Received: from kwepemg100017.china.huawei.com (unknown [7.202.181.58]) by mail.maildlp.com (Postfix) with ESMTPS id E67D51A0171; Mon, 17 Feb 2025 09:52:47 +0800 (CST) Received: from huawei.com (10.175.124.71) by kwepemg100017.china.huawei.com (7.202.181.58) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1544.11; Mon, 17 Feb 2025 09:52:47 +0800 From: Wupeng Ma To: , , , , , CC: , , Subject: [PATCH v3 1/3] mm: memory-failure: update ttu flag inside unmap_poisoned_folio Date: Mon, 17 Feb 2025 09:43:27 +0800 Message-ID: <20250217014329.3610326-2-mawupeng1@huawei.com> X-Mailer: git-send-email 2.43.0 In-Reply-To: <20250217014329.3610326-1-mawupeng1@huawei.com> References: <20250217014329.3610326-1-mawupeng1@huawei.com> MIME-Version: 1.0 X-Originating-IP: [10.175.124.71] X-ClientProxiedBy: dggems705-chm.china.huawei.com (10.3.19.182) To kwepemg100017.china.huawei.com (7.202.181.58) X-Rspamd-Server: rspam04 X-Rspamd-Queue-Id: 40891A0009 X-Stat-Signature: erzjypyw97b4b9nebtfdaz3fmny1uuxf X-Rspam-User: X-HE-Tag: 1739757170-730089 X-HE-Meta: U2FsdGVkX18jCHHkTIk6kG4oAobOx6qnb0E22V+uH4rwJ16Qdq03xhEoX621pAKHHS/899BUXxHD7xIEthABryuh+GorXmWUTV0YUQwCbEwjf9P09pseUBw4JhFHbn19Yp7kMaWmvQiIdyRpE9TbmHQZYqasrd9CVDUTtw8kVXUIWsLJbuYYOcOADByOWuC9sKnCbEqTrpagPiKIW4JYdQ+r15VjrZIFC2Ss8QlrvoB7bg+kbhScm3JinFbT2TQEd2gBhcjtYMhAjZK/n97Fw0PY11Jiws6CLr0/uIZYEWPIWhsoFFcOzOk8Pv1E8TP7i8ZQXVjUKI2f/ZAQ7eYUBbxkN5GpcksVRn+gaGCvsVme65srAKQVYIJW4Cp7MEYnQkjsvLcwDOTiQtu0OWvzUdqVN1w+PEHwojWvpKE7rpv2W5CQMG4nZ0E9E5YHI/uiL/oUYrdgNINOnJfBQ+Ki5hKDcnt0cLrA1Jx6VYJhqNcf4gfWPRXy1656oM7/KHwfCdlfbkWKmbrhalrBJ1ShCANO1njd8l+vREC7poB8FmZl+1LOaNN1nl0C9uqOPX04iTYf3ZaNxfu1vjxv+5/vvM1XIIi3URytvRQxp+k4CIV7qPLBzzke4HyMIq7+YirLf4nMY4ezolsH6P1LQCDzGsUXtKV/rPdmhVDHy3WrruGs0BkFay1ESxx/+x8WR+a3tcPhzVfj2JvZQC1U6CaS8fM4hEQjI0ZeyPWPsC9RoHZhPwuUllImnEGqwT3nemm3ct+EyydEjjGzCRziS6YMKvs+Ni8iiiJGXLxCCq38wfeI/OrzjwB9mTO4DrQX0rw/tIQ7OxtK8qlouwlYb40kRRQ3IB0OWlhsTZskjPRgm0MlqfynCGO1QpjK0oLy6/qZ8ZBjGsG42OdC3KLLM9hyBHTMO39J+z5U/5NOyQgzR49biSYWO31eK+tCQA5Qu1b98K9VqxLjxt4LIdyDGr+ air9hDXI Xjz8ZG45aSjoW3yDf2tOatW38rQgKgk0aPLDV8zdBC2MCvTS9KHn3/E5bS3IkomV9t+hwyhfpFWUMbnuaCZZNhVAGLS4PzFkfMO/Zau1H+yXRagzoGMUrMP08Vw7ZIt5Sabpo/cDRskkCUfHUkmhdzhRDn+5jZ4wcIR2Pi/K7xcqAaNTER9+KefHAD1LtDm5MlZcCevRqFNVr9FgnnIEkU9r/44ccX6cPfhdlygkf3rS9HisUXRtc55VOmH5WS/6XEmUMP5m3pKiLmhYQ9tEDG8iGPsVEHeB4Mygy X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: From: Ma Wupeng Commit 6da6b1d4a7df ("mm/hwpoison: convert TTU_IGNORE_HWPOISON to TTU_HWPOISON") introduce TTU_HWPOISON to replace TTU_IGNORE_HWPOISON in order to stop send SIGBUS signal when accessing an error page after a memory error on a clean folio. However during page migration, anon folio must be set with TTU_HWPOISON during unmap_*(). For pagecache we need some policy just like the one in hwpoison_user_mappings to set this flag. So move this policy from hwpoison_user_mappings to unmap_poisoned_folio to handle this warning properly. Warning will be produced during unamp poison folio with the following log: ------------[ cut here ]------------ WARNING: CPU: 1 PID: 365 at mm/rmap.c:1847 try_to_unmap_one+0x8fc/0xd3c Modules linked in: CPU: 1 UID: 0 PID: 365 Comm: bash Tainted: G W 6.13.0-rc1-00018-gacdb4bbda7ab #42 Tainted: [W]=WARN Hardware name: QEMU QEMU Virtual Machine, BIOS 0.0.0 02/06/2015 pstate: 20400005 (nzCv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--) pc : try_to_unmap_one+0x8fc/0xd3c lr : try_to_unmap_one+0x3dc/0xd3c Call trace: try_to_unmap_one+0x8fc/0xd3c (P) try_to_unmap_one+0x3dc/0xd3c (L) rmap_walk_anon+0xdc/0x1f8 rmap_walk+0x3c/0x58 try_to_unmap+0x88/0x90 unmap_poisoned_folio+0x30/0xa8 do_migrate_range+0x4a0/0x568 offline_pages+0x5a4/0x670 memory_block_action+0x17c/0x374 memory_subsys_offline+0x3c/0x78 device_offline+0xa4/0xd0 state_store+0x8c/0xf0 dev_attr_store+0x18/0x2c sysfs_kf_write+0x44/0x54 kernfs_fop_write_iter+0x118/0x1a8 vfs_write+0x3a8/0x4bc ksys_write+0x6c/0xf8 __arm64_sys_write+0x1c/0x28 invoke_syscall+0x44/0x100 el0_svc_common.constprop.0+0x40/0xe0 do_el0_svc+0x1c/0x28 el0_svc+0x30/0xd0 el0t_64_sync_handler+0xc8/0xcc el0t_64_sync+0x198/0x19c ---[ end trace 0000000000000000 ]--- Fixes: 6da6b1d4a7df ("mm/hwpoison: convert TTU_IGNORE_HWPOISON to TTU_HWPOISON") Suggested-by: David Hildenbrand Signed-off-by: Ma Wupeng Acked-by: David Hildenbrand Acked-by: Miaohe Lin --- mm/internal.h | 5 ++-- mm/memory-failure.c | 61 +++++++++++++++++++++++---------------------- mm/memory_hotplug.c | 3 ++- 3 files changed, 36 insertions(+), 33 deletions(-) diff --git a/mm/internal.h b/mm/internal.h index 9826f7dce607..c9186ca8d7c2 100644 --- a/mm/internal.h +++ b/mm/internal.h @@ -1102,7 +1102,7 @@ static inline int find_next_best_node(int node, nodemask_t *used_node_mask) * mm/memory-failure.c */ #ifdef CONFIG_MEMORY_FAILURE -void unmap_poisoned_folio(struct folio *folio, enum ttu_flags ttu); +int unmap_poisoned_folio(struct folio *folio, unsigned long pfn, bool must_kill); void shake_folio(struct folio *folio); extern int hwpoison_filter(struct page *p); @@ -1125,8 +1125,9 @@ unsigned long page_mapped_in_vma(const struct page *page, struct vm_area_struct *vma); #else -static inline void unmap_poisoned_folio(struct folio *folio, enum ttu_flags ttu) +static inline int unmap_poisoned_folio(struct folio *folio, unsigned long pfn, bool must_kill) { + return -EBUSY; } #endif diff --git a/mm/memory-failure.c b/mm/memory-failure.c index a7b8ccd29b6f..b5212b6e330a 100644 --- a/mm/memory-failure.c +++ b/mm/memory-failure.c @@ -1556,8 +1556,34 @@ static int get_hwpoison_page(struct page *p, unsigned long flags) return ret; } -void unmap_poisoned_folio(struct folio *folio, enum ttu_flags ttu) +int unmap_poisoned_folio(struct folio *folio, unsigned long pfn, bool must_kill) { + enum ttu_flags ttu = TTU_IGNORE_MLOCK | TTU_SYNC | TTU_HWPOISON; + struct address_space *mapping; + + if (folio_test_swapcache(folio)) { + pr_err("%#lx: keeping poisoned page in swap cache\n", pfn); + ttu &= ~TTU_HWPOISON; + } + + /* + * Propagate the dirty bit from PTEs to struct page first, because we + * need this to decide if we should kill or just drop the page. + * XXX: the dirty test could be racy: set_page_dirty() may not always + * be called inside page lock (it's recommended but not enforced). + */ + mapping = folio_mapping(folio); + if (!must_kill && !folio_test_dirty(folio) && mapping && + mapping_can_writeback(mapping)) { + if (folio_mkclean(folio)) { + folio_set_dirty(folio); + } else { + ttu &= ~TTU_HWPOISON; + pr_info("%#lx: corrupted page was clean: dropped without side effects\n", + pfn); + } + } + if (folio_test_hugetlb(folio) && !folio_test_anon(folio)) { struct address_space *mapping; @@ -1572,7 +1598,7 @@ void unmap_poisoned_folio(struct folio *folio, enum ttu_flags ttu) if (!mapping) { pr_info("%#lx: could not lock mapping for mapped hugetlb folio\n", folio_pfn(folio)); - return; + return -EBUSY; } try_to_unmap(folio, ttu|TTU_RMAP_LOCKED); @@ -1580,6 +1606,8 @@ void unmap_poisoned_folio(struct folio *folio, enum ttu_flags ttu) } else { try_to_unmap(folio, ttu); } + + return folio_mapped(folio) ? -EBUSY : 0; } /* @@ -1589,8 +1617,6 @@ void unmap_poisoned_folio(struct folio *folio, enum ttu_flags ttu) static bool hwpoison_user_mappings(struct folio *folio, struct page *p, unsigned long pfn, int flags) { - enum ttu_flags ttu = TTU_IGNORE_MLOCK | TTU_SYNC | TTU_HWPOISON; - struct address_space *mapping; LIST_HEAD(tokill); bool unmap_success; int forcekill; @@ -1613,29 +1639,6 @@ static bool hwpoison_user_mappings(struct folio *folio, struct page *p, if (!folio_mapped(folio)) return true; - if (folio_test_swapcache(folio)) { - pr_err("%#lx: keeping poisoned page in swap cache\n", pfn); - ttu &= ~TTU_HWPOISON; - } - - /* - * Propagate the dirty bit from PTEs to struct page first, because we - * need this to decide if we should kill or just drop the page. - * XXX: the dirty test could be racy: set_page_dirty() may not always - * be called inside page lock (it's recommended but not enforced). - */ - mapping = folio_mapping(folio); - if (!(flags & MF_MUST_KILL) && !folio_test_dirty(folio) && mapping && - mapping_can_writeback(mapping)) { - if (folio_mkclean(folio)) { - folio_set_dirty(folio); - } else { - ttu &= ~TTU_HWPOISON; - pr_info("%#lx: corrupted page was clean: dropped without side effects\n", - pfn); - } - } - /* * First collect all the processes that have the page * mapped in dirty form. This has to be done before try_to_unmap, @@ -1643,9 +1646,7 @@ static bool hwpoison_user_mappings(struct folio *folio, struct page *p, */ collect_procs(folio, p, &tokill, flags & MF_ACTION_REQUIRED); - unmap_poisoned_folio(folio, ttu); - - unmap_success = !folio_mapped(folio); + unmap_success = !unmap_poisoned_folio(folio, pfn, flags & MF_MUST_KILL); if (!unmap_success) pr_err("%#lx: failed to unmap page (folio mapcount=%d)\n", pfn, folio_mapcount(folio)); diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c index c43b4e7fb298..3de661e57e92 100644 --- a/mm/memory_hotplug.c +++ b/mm/memory_hotplug.c @@ -1806,7 +1806,8 @@ static void do_migrate_range(unsigned long start_pfn, unsigned long end_pfn) if (WARN_ON(folio_test_lru(folio))) folio_isolate_lru(folio); if (folio_mapped(folio)) - unmap_poisoned_folio(folio, TTU_IGNORE_MLOCK); + unmap_poisoned_folio(folio, pfn, false); + continue; } From patchwork Mon Feb 17 01:43:28 2025 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Wupeng Ma X-Patchwork-Id: 13976987 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 7C258C021A6 for ; Mon, 17 Feb 2025 01:52:58 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id DF53B280027; Sun, 16 Feb 2025 20:52:54 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id DCC62280024; Sun, 16 Feb 2025 20:52:54 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id C92DC280027; Sun, 16 Feb 2025 20:52:54 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0012.hostedemail.com [216.40.44.12]) by kanga.kvack.org (Postfix) with ESMTP id 9B75F280024 for ; Sun, 16 Feb 2025 20:52:54 -0500 (EST) Received: from smtpin04.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay01.hostedemail.com (Postfix) with ESMTP id 458FA1C8FE9 for ; Mon, 17 Feb 2025 01:52:54 +0000 (UTC) X-FDA: 83127763068.04.AE08F92 Received: from szxga05-in.huawei.com (szxga05-in.huawei.com [45.249.212.191]) by imf09.hostedemail.com (Postfix) with ESMTP id 2165A14000D for ; Mon, 17 Feb 2025 01:52:51 +0000 (UTC) Authentication-Results: imf09.hostedemail.com; dkim=none; spf=pass (imf09.hostedemail.com: domain of mawupeng1@huawei.com designates 45.249.212.191 as permitted sender) smtp.mailfrom=mawupeng1@huawei.com; dmarc=pass (policy=quarantine) header.from=huawei.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1739757172; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=0tpejWBCwcJBY2BnUuswEBPRJMqkN9qHU0n0a5fxbHo=; b=Qz1Sxt0eF/lo9ovlaisZYcXeEIgCvyq5YSUbSX1OgGOYs3fxclh0GpJ6OOGfR/UGKM9HGY tEGpQoGYQNA8AC1I5uIxpX/6q9yqHEkVm0TF5BPC1q1b4blP7JJWfDt/mAL9KBA59lytQj n/sJaXU9yDtXYDz55zgoMYOXaUBVEEU= ARC-Authentication-Results: i=1; imf09.hostedemail.com; dkim=none; spf=pass (imf09.hostedemail.com: domain of mawupeng1@huawei.com designates 45.249.212.191 as permitted sender) smtp.mailfrom=mawupeng1@huawei.com; dmarc=pass (policy=quarantine) header.from=huawei.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1739757172; a=rsa-sha256; cv=none; b=MVSw+0fbBx/M/qg/hYIGSweyLONDlJtgnTBdXAjNR4dVXbQiNsJybXJpM/KdGJ2vSDce2F DKs8EW4ij3nWVJ9GcLj1kT3YB57j2Dmdr3uigTP8qPBRD+ldb5XGuAOKl8WLaTbiBi1k2d bq4A67obvP8iuYQk7aDybdoRNO+KmV4= Received: from mail.maildlp.com (unknown [172.19.88.234]) by szxga05-in.huawei.com (SkyGuard) with ESMTP id 4Yx5Gw4VCWz11P4t; Mon, 17 Feb 2025 09:51:20 +0800 (CST) Received: from kwepemg100017.china.huawei.com (unknown [7.202.181.58]) by mail.maildlp.com (Postfix) with ESMTPS id AE0F8140155; Mon, 17 Feb 2025 09:52:48 +0800 (CST) Received: from huawei.com (10.175.124.71) by kwepemg100017.china.huawei.com (7.202.181.58) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1544.11; Mon, 17 Feb 2025 09:52:47 +0800 From: Wupeng Ma To: , , , , , CC: , , Subject: [PATCH v3 2/3] mm: memory-hotplug: check folio ref count first in do_migrate_range Date: Mon, 17 Feb 2025 09:43:28 +0800 Message-ID: <20250217014329.3610326-3-mawupeng1@huawei.com> X-Mailer: git-send-email 2.43.0 In-Reply-To: <20250217014329.3610326-1-mawupeng1@huawei.com> References: <20250217014329.3610326-1-mawupeng1@huawei.com> MIME-Version: 1.0 X-Originating-IP: [10.175.124.71] X-ClientProxiedBy: dggems705-chm.china.huawei.com (10.3.19.182) To kwepemg100017.china.huawei.com (7.202.181.58) X-Rspamd-Server: rspam04 X-Rspamd-Queue-Id: 2165A14000D X-Stat-Signature: dgo64sjj8wdkxhz6mh9egr4jyccrh3s8 X-Rspam-User: X-HE-Tag: 1739757171-614019 X-HE-Meta: U2FsdGVkX1+7Er61Xjmw0yG8rJ5fvTWIDLd7jvYSXEl6djR21pqwnpjcQf+VnKZCL7FrWcvo7+4xmXCnXFtRjVHL3lIic3QJn/MXeI0VCit0g/yXY0k9z7CqczHNwue/VCTKdAuxZWc7czT36Bcnxt1pX6Ym4xYfUhfjUmMgw32yTpzPRbquvC1fGQo1kxguEnzorgeIg+QoJne6quWL9rGOEYbK2y3LdEPynDcSDTQkfDha8D6+CPQ1K9Ci9dpHmURPsTLZUHbtOaDhgegg3fDBVZzILjGodCjR2SX/XxbLZzaikJgHJnvPv66/g5WKi+U8fYRouEkRvwG+FEJli5/xLk9C0qp6r+xW9SuZ/9cs97zXiJI+q/IH1brFufK7Fixw3AuGR4isZSBZ5wawb3xWv8EX3aQOOh/H67o+SoGiRCA1Z2W45nJ71/kzoar0l0pPjrrrabi8FHXfauR5kvobEuSy1mgOHQ2Am8MWtmqc/oUcuIDCwjDiMQd92UE/gqDAhSMonC4BlWiJ3LDrrL0+/oJ1d6whEnWdUf4Y+QjicT6g8ZO7hmg8Vdhdi7v/Px31pWE0NrMtKwgezd7tWaZJhSnCb+yjXLEenZo81p7qypfxw3oCswCRMSGPI6Sd/M/Cu99b9anMZXs7ehJgkwl9M4T3xjViVsVSgwTVzgdejLnHLocCDljHitNlpfHwWfArO4I9d5vCTmY/7R2AYsNkSxJ1XGuze/SZgDai2GxCW7Fujv69Fk9ryyNlc4LqAmPQGLiizY/A8tOzjJHfr25vNlKzu5ItoHePEjPoYH2J3hFgpoi74eU3umlUXC48NNpskmTVvl22LVubJzXo/mcAyX5Gw5G6xz1+EFqc+UQ0QCtR4k5vNUyag1Z2FY0bBCROl5ouY2JhUSGr/h8uAGUr9wVftyuibGx9WFN0Mc04c5w/KZjj2+BgIIy4Bu/Bmgyclo/u9lBp9wqR6ON IEzLOPTX p9jxjTMTU1bxdx48TEtPrgkC3y1qaeMXOL0mYCRHTwmMOBIW056JeI94iKbmE4VS1apR4fBghlXWZPbW4/hIcBnP+DCuwA/4fY5LJ0YsbEnhMhoNb/mxdb8UoGslK7xCRqAtW5K+r48BVcsjjd9kevd+1Z9DDkpZ4Ry5HzVM53w+MIQ1vaO4hGhPAFfOmLL/kgwt2OM8qdjhf5t651P6e5zxEv207JP+88xE4863mPqxSKZ0zo71XPwTA9+V9PazFKGPF2qTAR6/69ac= X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: From: Ma Wupeng If a folio has an increased reference count, folio_try_get() will acquire it, perform necessary operations, and then release it. In the case of a poisoned folio without an elevated reference count (which is unlikely for memory-failure), folio_try_get() will simply bypass it. Therefore, relocate the folio_try_get() function, responsible for checking and acquiring this reference count at first. Signed-off-by: Ma Wupeng Acked-by: David Hildenbrand Acked-by: Miaohe Lin --- mm/memory_hotplug.c | 20 +++++++------------- 1 file changed, 7 insertions(+), 13 deletions(-) diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c index 3de661e57e92..3669b3e2d962 100644 --- a/mm/memory_hotplug.c +++ b/mm/memory_hotplug.c @@ -1795,12 +1795,12 @@ static void do_migrate_range(unsigned long start_pfn, unsigned long end_pfn) if (folio_test_large(folio)) pfn = folio_pfn(folio) + folio_nr_pages(folio) - 1; - /* - * HWPoison pages have elevated reference counts so the migration would - * fail on them. It also doesn't make any sense to migrate them in the - * first place. Still try to unmap such a page in case it is still mapped - * (keep the unmap as the catch all safety net). - */ + if (!folio_try_get(folio)) + continue; + + if (unlikely(page_folio(page) != folio)) + goto put_folio; + if (folio_test_hwpoison(folio) || (folio_test_large(folio) && folio_test_has_hwpoisoned(folio))) { if (WARN_ON(folio_test_lru(folio))) @@ -1808,14 +1808,8 @@ static void do_migrate_range(unsigned long start_pfn, unsigned long end_pfn) if (folio_mapped(folio)) unmap_poisoned_folio(folio, pfn, false); - continue; - } - - if (!folio_try_get(folio)) - continue; - - if (unlikely(page_folio(page) != folio)) goto put_folio; + } if (!isolate_folio_to_list(folio, &source)) { if (__ratelimit(&migrate_rs)) { From patchwork Mon Feb 17 01:43:29 2025 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Wupeng Ma X-Patchwork-Id: 13976986 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id A4798C021A7 for ; Mon, 17 Feb 2025 01:53:00 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 98AFD280028; Sun, 16 Feb 2025 20:52:55 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 937AB280024; Sun, 16 Feb 2025 20:52:55 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 789F6280028; Sun, 16 Feb 2025 20:52:55 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0014.hostedemail.com [216.40.44.14]) by kanga.kvack.org (Postfix) with ESMTP id 5589E280024 for ; Sun, 16 Feb 2025 20:52:55 -0500 (EST) Received: from smtpin02.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay07.hostedemail.com (Postfix) with ESMTP id F0A8E1613E1 for ; Mon, 17 Feb 2025 01:52:54 +0000 (UTC) X-FDA: 83127763068.02.59BC005 Received: from szxga06-in.huawei.com (szxga06-in.huawei.com [45.249.212.32]) by imf22.hostedemail.com (Postfix) with ESMTP id 7DD68C0007 for ; Mon, 17 Feb 2025 01:52:52 +0000 (UTC) Authentication-Results: imf22.hostedemail.com; dkim=none; spf=pass (imf22.hostedemail.com: domain of mawupeng1@huawei.com designates 45.249.212.32 as permitted sender) smtp.mailfrom=mawupeng1@huawei.com; dmarc=pass (policy=quarantine) header.from=huawei.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1739757173; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=SnggH7JbAzJeBRjlLL4jUjK2wWRFo1Brbc67PdGwXsU=; b=roIoiBmUPpUmM5RGJwhPG8QDN1gtuM1T3ilIKrjHbW5cqbfLB7q/45VeQIFQvy7MuY5fit in7XBGeYwTQEyPqbrjDzDPeZEn2dwdDu5tw+y4SiTkU/QAjx7R5pcAF6K4C5qa1GY40uCI 1K1VX1uom1+r1uKh8tkofHGm235ifUY= ARC-Authentication-Results: i=1; imf22.hostedemail.com; dkim=none; spf=pass (imf22.hostedemail.com: domain of mawupeng1@huawei.com designates 45.249.212.32 as permitted sender) smtp.mailfrom=mawupeng1@huawei.com; dmarc=pass (policy=quarantine) header.from=huawei.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1739757173; a=rsa-sha256; cv=none; b=CAfSfpDbgkBJEy2UXbnEw1pIa/YbQwgXEAGZ+ABT0JpPh0bNS5Sp2RHI3/ooTOLWLr2CRC zLd9ioXVJ+GpHF90FzREH7kvyP41QXgPriNoroOeu+GM1eOagsC1virYL0ZBMFmntWAhNj kRdWctK27D8jUQe67UvKsCnx48iCfaA= Received: from mail.maildlp.com (unknown [172.19.163.17]) by szxga06-in.huawei.com (SkyGuard) with ESMTP id 4Yx5KC63C6z1xxGr; Mon, 17 Feb 2025 09:53:19 +0800 (CST) Received: from kwepemg100017.china.huawei.com (unknown [7.202.181.58]) by mail.maildlp.com (Postfix) with ESMTPS id 7773F1A0188; Mon, 17 Feb 2025 09:52:49 +0800 (CST) Received: from huawei.com (10.175.124.71) by kwepemg100017.china.huawei.com (7.202.181.58) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1544.11; Mon, 17 Feb 2025 09:52:48 +0800 From: Wupeng Ma To: , , , , , CC: , , Subject: [PATCH v3 3/3] hwpoison, memory_hotplug: lock folio before unmap hwpoisoned folio Date: Mon, 17 Feb 2025 09:43:29 +0800 Message-ID: <20250217014329.3610326-4-mawupeng1@huawei.com> X-Mailer: git-send-email 2.43.0 In-Reply-To: <20250217014329.3610326-1-mawupeng1@huawei.com> References: <20250217014329.3610326-1-mawupeng1@huawei.com> MIME-Version: 1.0 X-Originating-IP: [10.175.124.71] X-ClientProxiedBy: dggems705-chm.china.huawei.com (10.3.19.182) To kwepemg100017.china.huawei.com (7.202.181.58) X-Rspamd-Server: rspam04 X-Rspamd-Queue-Id: 7DD68C0007 X-Stat-Signature: uzifokm6e8616bkykoywmy3ge77u9ocj X-Rspam-User: X-HE-Tag: 1739757172-864451 X-HE-Meta: U2FsdGVkX1+PmDnUZtEAvvvJnwFqtCzvaoSE3qwlrrqXOrur9RiwE+Ch1jG5dy0qJiH9mpnPnX81TYAiUMcRoBkTt+Ru3POSCVCvyXT7bmsmeX+9GunZDlWbPsTbmqgQwBHcHXJygWL6MbRsvbINqQUhIA9tuAHQ9rtqxemXzfoLK+sUYZe7zNB20VyLFuFbfgxIZF63hqYfohuA4IS/XcwYjWQP3P/PJtQqcYlpk07ZowErsmFjP+9FGSoAZbNlp/GM4HIGG80JFT4ZHVLf19AUK7ItxCYYIY7Jai+zTFovrjsa+Hmg3hzwFdUCOEkwvvGe/rozTnU4DsrL5josbxICo5cwj6Syz+dCtnp/9/4clBXEMrDpWii2FARybtCfLI3n8MWBDuLiqgiF263x2KqKr6eTpsCe4yZfleWfS599LrZwvlmgfty429kWve2PNalgabXQXROy2EiSpjVg3mzCJ0sycpVo9K9xLpYCO/0Mh+zIsyVAXSmiXAdPZZ8reWPjDzqbJQjVWHuOX7H7NEz6KZAnGiX3CHrkP2BlT/upVjTBB3xMBQlVJM089f9bXMSsjocBDA/lHAd4NMd6VeG51AzpdDxGH4wouybI2QqmTic34aAdTaBdNW1S7rz+VTHNr5JpjCMafwLR4hY47X5zREBEN2i3GT/U4dNPEqXSKQORtyek/7aMTHhir6U6ivnK5EpRjmAaad3xlKKm2gEuuV1Uoej5Z8P3Sf6vU31JRGdbFh7FdWn9ljnlzx4hSUiE9cRfgpaMh9EkkGPy3vUsbuqIWngHu/x6ibT6qpP+eQ9nH5VwqlbgfJMNWAt1uV3BVOWxrMNGz+sIWLTTXl93/THgEk4AA87uJWKydhFX7KNDJHk6XZIqvPXu2y9iYpUYyF/zbATls0M614/h1JU+Hpv2b8IgAN5eBaIyZxrvlqRssF0SKux3CYm1ayh4wDqTEgPusyGJjBlR84D JnoNB+fz pebVQOBx0EFbD/rEtE84TvJ+NSqwm6u+i3uQ7epomuRZTjWw8m1cF1FWg0VxN/TorQzgWlJCIjMjEtLQVN8RfhLAftCjtywSDO3P/4Xe0E8qcpUbvuJx8oGpszqopP48Jr5oypOyIKiQ7pHUwU/b6pU2UwfcKweO76Tyvva5AJS7fwgil5BrNz1Idzhh+5LjorH74gLI92mvzPFlvtpwoGxQtnPXZedVMOKUbkEbxQm0AXFxAwnBHxQZjtZJxUwfnU+s/xXSDa0TcRqOR6ivGdcTIcRmFx/uFUzq3 X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: From: Ma Wupeng Commit b15c87263a69 ("hwpoison, memory_hotplug: allow hwpoisoned pages to be offlined) add page poison checks in do_migrate_range in order to make offline hwpoisoned page possible by introducing isolate_lru_page and try_to_unmap for hwpoisoned page. However folio lock must be held before calling try_to_unmap. Add it to fix this problem. Warning will be produced if folio is not locked during unmap: ------------[ cut here ]------------ kernel BUG at ./include/linux/swapops.h:400! Internal error: Oops - BUG: 00000000f2000800 [#1] PREEMPT SMP Modules linked in: CPU: 4 UID: 0 PID: 411 Comm: bash Tainted: G W 6.13.0-rc1-00016-g3c434c7ee82a-dirty #41 Tainted: [W]=WARN Hardware name: QEMU QEMU Virtual Machine, BIOS 0.0.0 02/06/2015 pstate: 40400005 (nZcv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--) pc : try_to_unmap_one+0xb08/0xd3c lr : try_to_unmap_one+0x3dc/0xd3c Call trace: try_to_unmap_one+0xb08/0xd3c (P) try_to_unmap_one+0x3dc/0xd3c (L) rmap_walk_anon+0xdc/0x1f8 rmap_walk+0x3c/0x58 try_to_unmap+0x88/0x90 unmap_poisoned_folio+0x30/0xa8 do_migrate_range+0x4a0/0x568 offline_pages+0x5a4/0x670 memory_block_action+0x17c/0x374 memory_subsys_offline+0x3c/0x78 device_offline+0xa4/0xd0 state_store+0x8c/0xf0 dev_attr_store+0x18/0x2c sysfs_kf_write+0x44/0x54 kernfs_fop_write_iter+0x118/0x1a8 vfs_write+0x3a8/0x4bc ksys_write+0x6c/0xf8 __arm64_sys_write+0x1c/0x28 invoke_syscall+0x44/0x100 el0_svc_common.constprop.0+0x40/0xe0 do_el0_svc+0x1c/0x28 el0_svc+0x30/0xd0 el0t_64_sync_handler+0xc8/0xcc el0t_64_sync+0x198/0x19c Code: f9407be0 b5fff320 d4210000 17ffff97 (d4210000) ---[ end trace 0000000000000000 ]--- Fixes: b15c87263a69 ("hwpoison, memory_hotplug: allow hwpoisoned pages to be offlined") Signed-off-by: Ma Wupeng Acked-by: David Hildenbrand Acked-by: Miaohe Lin --- mm/memory_hotplug.c | 5 ++++- 1 file changed, 4 insertions(+), 1 deletion(-) diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c index 3669b3e2d962..c3de35389269 100644 --- a/mm/memory_hotplug.c +++ b/mm/memory_hotplug.c @@ -1805,8 +1805,11 @@ static void do_migrate_range(unsigned long start_pfn, unsigned long end_pfn) (folio_test_large(folio) && folio_test_has_hwpoisoned(folio))) { if (WARN_ON(folio_test_lru(folio))) folio_isolate_lru(folio); - if (folio_mapped(folio)) + if (folio_mapped(folio)) { + folio_lock(folio); unmap_poisoned_folio(folio, pfn, false); + folio_unlock(folio); + } goto put_folio; }