From: Zi Yan
To: linux-mm@kvack.org, "Kirill A . Shutemov", "Matthew Wilcox (Oracle)"
Cc: Ryan Roberts, Hugh Dickins, David Hildenbrand, Yang Shi, Miaohe Lin, Kefeng Wang, Yu Zhao, John Hubbard, linux-kernel@vger.kernel.org, Zi Yan
Subject: [PATCH v2 6/6] mm/truncate: use folio_split() for truncate operation.
Date: Fri, 1 Nov 2024 11:03:57 -0400
Message-ID: <20241101150357.1752726-7-ziy@nvidia.com>
In-Reply-To: <20241101150357.1752726-1-ziy@nvidia.com>
References: <20241101150357.1752726-1-ziy@nvidia.com>
X-Mailer: git-send-email 2.45.2
X-Patchwork-Submitter: Zi Yan
X-Patchwork-Id: 13859514

Instead of splitting the large folio uniformly during truncation, use a
buddy-allocator-like split at the start of the truncation range to
minimize the number of resulting folios.

For example, to truncate an order-4 folio [0, 1, 2, 3, 4, 5, ..., 15]
between [3, 10] (inclusive), folio_split() splits the folio into
[0,1], [2], [3], [4..7], [8..15]. [3] and [4..7] can be dropped, and
[8..15] is kept with zeros in [8..10].

It is possible to do a further folio_split() at 10, so that more of the
resulting folios can be dropped, but that is left as a possible future
optimization if needed.

Another possible optimization is to make folio_split() split a folio
based on a given range, like [3..10] above. But that complicates
folio_split(), so it will be investigated when necessary.

Signed-off-by: Zi Yan
---
 include/linux/huge_mm.h | 12 ++++++++++++
 mm/truncate.c           |  5 ++++-
 2 files changed, 16 insertions(+), 1 deletion(-)

diff --git a/include/linux/huge_mm.h b/include/linux/huge_mm.h
index b94c2e8ee918..8048500e7bc2 100644
--- a/include/linux/huge_mm.h
+++ b/include/linux/huge_mm.h
@@ -339,6 +339,18 @@ int split_huge_page_to_list_to_order(struct page *page, struct list_head *list,
 		unsigned int new_order);
 int min_order_for_split(struct folio *folio);
 int split_folio_to_list(struct folio *folio, struct list_head *list);
+int folio_split(struct folio *folio, unsigned int new_order, struct page *page,
+		struct list_head *list);
+static inline int split_folio_at(struct folio *folio, struct page *page,
+		struct list_head *list)
+{
+	int ret = min_order_for_split(folio);
+
+	if (ret < 0)
+		return ret;
+
+	return folio_split(folio, ret, page, list);
+}
 static inline int split_huge_page(struct page *page)
 {
 	struct folio *folio = page_folio(page);
diff --git a/mm/truncate.c b/mm/truncate.c
index e5151703ba04..dbd81c21b460 100644
--- a/mm/truncate.c
+++ b/mm/truncate.c
@@ -179,6 +179,7 @@ bool truncate_inode_partial_folio(struct folio *folio, loff_t start, loff_t end)
 {
 	loff_t pos = folio_pos(folio);
 	unsigned int offset, length;
+	long in_folio_offset;
 
 	if (pos < start)
 		offset = start - pos;
@@ -208,7 +209,9 @@ bool truncate_inode_partial_folio(struct folio *folio, loff_t start, loff_t end)
 	folio_invalidate(folio, offset, length);
 	if (!folio_test_large(folio))
 		return true;
-	if (split_folio(folio) == 0)
+
+	in_folio_offset = PAGE_ALIGN_DOWN(offset) / PAGE_SIZE;
+	if (split_folio_at(folio, folio_page(folio, in_folio_offset), NULL) == 0)
 		return true;
 	if (folio_test_dirty(folio))
 		return false;
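
For readers who want to check the order-4 example from the commit message by hand, here is a rough user-space model of a buddy-allocator-like split. It is only a sketch, not the kernel implementation: struct range, buddy_like_split(), page_index_of() and MODEL_PAGE_SIZE are hypothetical names made up for this illustration, and a 4 KiB page size is assumed. The new_order argument plays the role of the value that split_folio_at() obtains from min_order_for_split().

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical user-space model; none of these names exist in the kernel. */
#define MODEL_PAGE_SIZE 4096UL	/* assumption: 4 KiB pages */

struct range {
	unsigned long start;	/* index of the first page in this piece */
	unsigned int order;	/* the piece covers 2^order pages */
};

/* Page index inside the folio for a byte offset into the folio. */
static unsigned long page_index_of(unsigned long byte_offset)
{
	return byte_offset / MODEL_PAGE_SIZE;
}

/*
 * Model a block of 2^order pages (indices 0 .. 2^order - 1). Repeatedly
 * halve the piece that contains split_at until that piece has new_order;
 * the half that does not contain split_at is left whole at each step.
 *
 * Pieces are written to out[] in ascending start order. out[] must have
 * room for (order - new_order + 1) entries, which is also the return value.
 */
static size_t buddy_like_split(unsigned int order, unsigned int new_order,
			       unsigned long split_at, struct range *out)
{
	size_t total, lo = 0, hi;
	unsigned long start = 0;

	assert(order < 8 * sizeof(unsigned long));
	assert(new_order <= order);
	assert(split_at < (1UL << order));

	total = order - new_order + 1;
	hi = total - 1;

	while (order > new_order) {
		unsigned long half;

		order--;
		half = 1UL << order;
		if (split_at < start + half) {
			/* Target is in the lower half: upper half stays whole. */
			out[hi].start = start + half;
			out[hi].order = order;
			hi--;
		} else {
			/* Target is in the upper half: lower half stays whole. */
			out[lo].start = start;
			out[lo].order = order;
			lo++;
			start += half;
		}
	}

	/* The one remaining slot holds the piece containing split_at. */
	out[lo].start = start;
	out[lo].order = new_order;
	return total;
}
```

Calling buddy_like_split(4, 0, 3, out) in this model yields five pieces, (start 0, order 1), (2, 0), (3, 0), (4, 2), (8, 3), which correspond to [0,1], [2], [3], [4..7], [8..15] above. For comparison, splitting the same 16 pages uniformly down to order 0 would leave 16 pieces.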