From patchwork Thu Jun 27 00:39:52 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Gavin Shan X-Patchwork-Id: 13713566 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 6EC97C30653 for ; Thu, 27 Jun 2024 00:41:13 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id F38D86B0096; Wed, 26 Jun 2024 20:41:12 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id EE9436B0098; Wed, 26 Jun 2024 20:41:12 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id D89EB6B0099; Wed, 26 Jun 2024 20:41:12 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0016.hostedemail.com [216.40.44.16]) by kanga.kvack.org (Postfix) with ESMTP id BA3F06B0096 for ; Wed, 26 Jun 2024 20:41:12 -0400 (EDT) Received: from smtpin24.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay10.hostedemail.com (Postfix) with ESMTP id 32A41C179C for ; Thu, 27 Jun 2024 00:41:12 +0000 (UTC) X-FDA: 82274814384.24.AE050E6 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by imf02.hostedemail.com (Postfix) with ESMTP id 76E428000E for ; Thu, 27 Jun 2024 00:41:10 +0000 (UTC) Authentication-Results: imf02.hostedemail.com; dkim=pass header.d=redhat.com header.s=mimecast20190719 header.b=cOcblj25; spf=pass (imf02.hostedemail.com: domain of gshan@redhat.com designates 170.10.133.124 as permitted sender) smtp.mailfrom=gshan@redhat.com; dmarc=pass (policy=none) header.from=redhat.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1719448862; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=1sMjn8/iFlaf4yeh2Uq+knpx1ch2OQwfa4RLnK7BL0o=; b=P2ZcWA4t2CrBT6FImbmna+Wur3LM3tHPOtzQ3kcqvOOQLoDbVckJD3Bd20nKD2qY3Nwr28 QpCc0Z2xVvOctk7BjzAtseH7v3HK8l6GoJ5KI+t9bW9sGEumkM5ozWjoqf983AUfaUU8r0 9uPa2Fs7hQSFWhu9zH7Jl8QBSjQDNlw= ARC-Authentication-Results: i=1; imf02.hostedemail.com; dkim=pass header.d=redhat.com header.s=mimecast20190719 header.b=cOcblj25; spf=pass (imf02.hostedemail.com: domain of gshan@redhat.com designates 170.10.133.124 as permitted sender) smtp.mailfrom=gshan@redhat.com; dmarc=pass (policy=none) header.from=redhat.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1719448862; a=rsa-sha256; cv=none; b=WqZzEM03CfxuXkUxuBA+zZv8IkuLEoqollOrwu9kQXKKUoHwMTJB6XMymkzUsuCxa+LQsT kOB/OD0A1yS5GHR1/WC6zSLx+LOlSVtkiDpd2h4iqvQdotvg5aBEgbrQQS0rPTLzgCXNPE Jje84V6UnhyZhjGoCvTF3panWF71E0M= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1719448864; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=1sMjn8/iFlaf4yeh2Uq+knpx1ch2OQwfa4RLnK7BL0o=; b=cOcblj256+wKezNs7e+on2B9Z+zCFpokWtw5gfN3/b6iOPoGrG1V9aSugHUSOT6ADz0ppJ atlIEKLW/1ckPNyCEMi0+tbTBz/UmkfE8e7z6ZcafXU8orM9TfK5UfK9I7FXRmdjyTyefv 6taXjyeN3SMJrn7vlamfxsD5ugudVLg= Received: from mx-prod-mc-01.mail-002.prod.us-west-2.aws.redhat.com (ec2-54-186-198-63.us-west-2.compute.amazonaws.com [54.186.198.63]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id us-mta-438-IcPKpqm3NiyiViClxPbBgA-1; Wed, 26 Jun 2024 20:41:02 -0400 X-MC-Unique: IcPKpqm3NiyiViClxPbBgA-1 Received: from mx-prod-int-04.mail-002.prod.us-west-2.aws.redhat.com (mx-prod-int-04.mail-002.prod.us-west-2.aws.redhat.com [10.30.177.40]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by mx-prod-mc-01.mail-002.prod.us-west-2.aws.redhat.com (Postfix) with ESMTPS id 5E7F2195608E; Thu, 27 Jun 2024 00:40:51 +0000 (UTC) Received: from gshan-thinkpadx1nanogen2.remote.csb (unknown [10.64.136.58]) by mx-prod-int-04.mail-002.prod.us-west-2.aws.redhat.com (Postfix) with ESMTPS id 15F101956087; Thu, 27 Jun 2024 00:40:44 +0000 (UTC) From: Gavin Shan To: linux-mm@kvack.org Cc: linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, david@redhat.com, willy@infradead.org, akpm@linux-foundation.org, ryan.roberts@arm.com, hughd@google.com, william.kucharski@oracle.com, djwong@kernel.org, torvalds@linux-foundation.org, ddutile@redhat.com, zhenyzha@redhat.com, shan.gavin@gmail.com Subject: [PATCH v2 4/4] mm/shmem: Disable PMD-sized page cache if needed Date: Thu, 27 Jun 2024 10:39:52 +1000 Message-ID: <20240627003953.1262512-5-gshan@redhat.com> In-Reply-To: <20240627003953.1262512-1-gshan@redhat.com> References: <20240627003953.1262512-1-gshan@redhat.com> MIME-Version: 1.0 X-Scanned-By: MIMEDefang 3.0 on 10.30.177.40 X-Rspam-User: X-Rspamd-Server: rspam01 X-Rspamd-Queue-Id: 76E428000E X-Stat-Signature: no6pfwm4j8gy5u4aieirrb6x9r1oghqc X-HE-Tag: 1719448870-693593 X-HE-Meta: U2FsdGVkX1+d/mXYrOd6RAuQLxcxJvzWK85HIblt6m+sMtx82yC8QvmYKEPt78aMrlJZup7RMwBlLbxpgd6oteT2fKfWPR6qJenxOBswz8+w1mKkYaYXOuPsdYsYxJO6oFV8oY1ksF/2eftOwsXBH8bOThFuwfaz7PBeW3eXQfkmg5AUJh51T0eitifFTzStxd+M0rJWSOvS8GPzMGrHIO1w7OuXcgbWkgrCgyMLf26y5weJlAude13dKQVSoIbPSKtHMfgG0XXbWPWIx2IkCiQ1LFmDRQMwLj0YBZGiuWK32xwquoqAQKifW3par4WwIjCfs+L+22cUsoh4qx80FXWKaduN1FzY8o5JL3XC1Ec7nZqFanLl58roAewYwxwMV5cR5EDyENPzzSqgzp1Xr6zG7Hia5xAkV9Zt2RYhsXohOhI55Mj9vTsH45NIcAq8HCo69iyVRNup9UgEsR2NsaE0DkYG2tjkZYbcgKYw87BtD2HIRCl5x9vVQfwz/HTcE/Z0fMOROFKNM1jAZ0D+1o9s5W4syEHxoDHlNyPqxkHmzsYC2vxB9e3sCpmLHT32ZHk9NMVORst7tp8RCpI0q8h+Sx/11Rd/ufj6XMR+HKkphv5F4XLzPlUOy1qOA2oaQy3M202MGjoAbBDhbucYfKjfr/57yT2elLNJLbV98FcPuR0FRH7fFwMJwbyuTxs3630UpOwDD38NqnXc6ZuP/MkDqSXIJZ7MayoP8lBxwbPq1KcFueI6ihesbERXvJNkrX/PN60srdeNsrZrzRop18XPL8dt6QrG14XRNmYYe1PG1gq5AVnT9FAJtBnflC2EtU5RYSK1UmEKxaS+9wGmdu/qVLrU6P6NKkpIv6K5Zf+bCHxhi/sjNsS1afxHzzWWHOychxXHUwcSfQJXklcUfa9EBfKOPvIl9T33Q7gDlRIYgrfwOnA/+dWdMksYXcPXq2zfBCKtWh48dF2TwBh UMNqywbA j+TjggkAWFsDxA/RwHW/NkCDteOTZ87Um8MVS9bLDOf4FNiG1GyvonwyPhU4RL/LWNIHep0TiCoPcp0AQhBj+kdd0QW/rpOD1wevz+W33c9B8CiBGGepnNhZx8CrZp6gAcOGG8bKE94dSIwZVqHNwrta7T78dQCuSeVRJZISg8DSDFxb8WgZsn6Q3xPjh4KhGr1BRaqLRLmxt1b+awkEIEHicjmqNlcoYoYWWpMGZDxYuY6I1VXU009F+w9j1NHs2dpI4D7s7k60P0mFjU5L9g75ZLpm8kTmM657JviOcO7LoSn84oQ6BO1k1b8UGYnvW0l3ivKRTQ9/57b4l3p7KAPvnkLoUbAg6pnOyR3OO0tXSJPHG/E8AWeJ/sz4CSS1mAGVTOf9HIqOAuH8Rtj5h70saR1s0vhRJY6pSJccYup1kSPmXE4lOpApnbA== X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: For shmem files, it's possible that PMD-sized page cache can't be supported by xarray. For example, 512MB page cache on ARM64 when the base page size is 64KB can't be supported by xarray. It leads to errors as the following messages indicate when this sort of xarray entry is split. WARNING: CPU: 34 PID: 7578 at lib/xarray.c:1025 xas_split_alloc+0xf8/0x128 Modules linked in: binfmt_misc nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 \ nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject \ nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 \ ip_set rfkill nf_tables nfnetlink vfat fat virtio_balloon drm fuse xfs \ libcrc32c crct10dif_ce ghash_ce sha2_ce sha256_arm64 sha1_ce virtio_net \ net_failover virtio_console virtio_blk failover dimlib virtio_mmio CPU: 34 PID: 7578 Comm: test Kdump: loaded Tainted: G W 6.10.0-rc5-gavin+ #9 Hardware name: QEMU KVM Virtual Machine, BIOS edk2-20240524-1.el9 05/24/2024 pstate: 83400005 (Nzcv daif +PAN -UAO +TCO +DIT -SSBS BTYPE=--) pc : xas_split_alloc+0xf8/0x128 lr : split_huge_page_to_list_to_order+0x1c4/0x720 sp : ffff8000882af5f0 x29: ffff8000882af5f0 x28: ffff8000882af650 x27: ffff8000882af768 x26: 0000000000000cc0 x25: 000000000000000d x24: ffff00010625b858 x23: ffff8000882af650 x22: ffffffdfc0900000 x21: 0000000000000000 x20: 0000000000000000 x19: ffffffdfc0900000 x18: 0000000000000000 x17: 0000000000000000 x16: 0000018000000000 x15: 52f8004000000000 x14: 0000e00000000000 x13: 0000000000002000 x12: 0000000000000020 x11: 52f8000000000000 x10: 52f8e1c0ffff6000 x9 : ffffbeb9619a681c x8 : 0000000000000003 x7 : 0000000000000000 x6 : ffff00010b02ddb0 x5 : ffffbeb96395e378 x4 : 0000000000000000 x3 : 0000000000000cc0 x2 : 000000000000000d x1 : 000000000000000c x0 : 0000000000000000 Call trace: xas_split_alloc+0xf8/0x128 split_huge_page_to_list_to_order+0x1c4/0x720 truncate_inode_partial_folio+0xdc/0x160 shmem_undo_range+0x2bc/0x6a8 shmem_fallocate+0x134/0x430 vfs_fallocate+0x124/0x2e8 ksys_fallocate+0x4c/0xa0 __arm64_sys_fallocate+0x24/0x38 invoke_syscall.constprop.0+0x7c/0xd8 do_el0_svc+0xb4/0xd0 el0_svc+0x44/0x1d8 el0t_64_sync_handler+0x134/0x150 el0t_64_sync+0x17c/0x180 Fix it by disabling PMD-sized page cache when HPAGE_PMD_ORDER is larger than MAX_PAGECACHE_ORDER. As Matthew Wilcox pointed, the page cache in a shmem file isn't represented by a multi-index entry and doesn't have this limitation when the xarry entry is split until commit 6b24ca4a1a8d ("mm: Use multi-index entries in the page cache"). Fixes: 6b24ca4a1a8d ("mm: Use multi-index entries in the page cache") Cc: stable@kernel.org # v5.17+ Signed-off-by: Gavin Shan Acked-by: David Hildenbrand --- mm/shmem.c | 15 +++++++++++++-- 1 file changed, 13 insertions(+), 2 deletions(-) diff --git a/mm/shmem.c b/mm/shmem.c index a8b181a63402..c1befe046c7e 100644 --- a/mm/shmem.c +++ b/mm/shmem.c @@ -541,8 +541,9 @@ static bool shmem_confirm_swap(struct address_space *mapping, static int shmem_huge __read_mostly = SHMEM_HUGE_NEVER; -bool shmem_is_huge(struct inode *inode, pgoff_t index, bool shmem_huge_force, - struct mm_struct *mm, unsigned long vm_flags) +static bool __shmem_is_huge(struct inode *inode, pgoff_t index, + bool shmem_huge_force, struct mm_struct *mm, + unsigned long vm_flags) { loff_t i_size; @@ -573,6 +574,16 @@ bool shmem_is_huge(struct inode *inode, pgoff_t index, bool shmem_huge_force, } } +bool shmem_is_huge(struct inode *inode, pgoff_t index, + bool shmem_huge_force, struct mm_struct *mm, + unsigned long vm_flags) +{ + if (HPAGE_PMD_ORDER > MAX_PAGECACHE_ORDER) + return false; + + return __shmem_is_huge(inode, index, shmem_huge_force, mm, vm_flags); +} + #if defined(CONFIG_SYSFS) static int shmem_parse_huge(const char *str) {