From patchwork Thu Aug 10 10:33:28 2023
X-Patchwork-Submitter: Ryan Roberts
X-Patchwork-Id: 13349211
From: Ryan Roberts
To: Will Deacon, "Aneesh Kumar K.V", Andrew Morton, Nick Piggin, Peter Zijlstra, Christian Borntraeger, Sven Schnelle, Arnd Bergmann, "Matthew Wilcox (Oracle)", David Hildenbrand, Yu Zhao, "Kirill A. Shutemov", Yin Fengwei, Yang Shi, "Huang, Ying", Zi Yan
Cc: Ryan Roberts, linux-mm@kvack.org, linux-kernel@vger.kernel.org
Subject: [PATCH v1 0/4] Optimize mmap_exit for large folios
Date: Thu, 10 Aug 2023 11:33:28 +0100
Message-Id: <20230810103332.3062143-1-ryan.roberts@arm.com>
X-Mailer: git-send-email 2.25.1

Hi All,

This is a series to improve performance of process teardown, taking
advantage of the fact that large folios are increasingly often pte-mapped in user space: supporting filesystems already use large folios for pagecache memory, and large folios for anonymous memory are (hopefully) on the horizon. See the last patch for performance numbers, including measurements showing that this approach doesn't regress (and actually improves slightly) when all folios are small.

The basic approach is to accumulate contiguous ranges of pages in the mmu_gather structure (instead of storing each individual page pointer), then take advantage of this internal format to efficiently batch rmap removal, swapcache removal and page release. See the commit messages for more details.

This series replaces the previous approach I took at [1], which was much smaller in scope, only attempting to batch rmap removal for anon pages. Feedback was that I should do something more general that would also batch-remove pagecache pages from the rmap. But while designing that, I found it was also possible to improve swapcache removal and page release. Hopefully I haven't gone too far the other way now! Note that patch 1 is unchanged from that original series.

This series is based on mm-unstable (ad3232df3e41).

I'm going to be out on holiday from the end of today, returning on 29th August, so responses will likely be patchy, as I'm terrified of posting to the list from my phone!
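
For readers who want a feel for the range-accumulation idea described above, here is a minimal userspace sketch. It is not the code from the patches: `struct pfn_range`, `struct gather`, `gather_add()` and `gather_flush()` are hypothetical names invented for illustration, and page frame numbers stand in for the kernel's page pointers. It assumes only that the pages of one large folio are contiguous, so that a pte-mapped folio added page by page collapses into a single range.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Hypothetical stand-in for a contiguous run of pages. This is NOT the
 * layout used by the kernel's mmu_gather; it only illustrates the idea.
 */
struct pfn_range {
	unsigned long start;	/* first page frame number in the run */
	unsigned long end;	/* one past the last page frame number */
};

#define MAX_RANGES 8

struct gather {
	struct pfn_range ranges[MAX_RANGES];
	size_t nr;		/* number of ranges currently in use */
};

/*
 * Record one page. If it directly follows the last recorded range, extend
 * that range; otherwise open a new one. Returns 0 on success, or -1 when
 * the batch is full and must be flushed before adding more.
 */
static int gather_add(struct gather *g, unsigned long pfn)
{
	assert(g != NULL);

	if (g->nr > 0 && g->ranges[g->nr - 1].end == pfn) {
		g->ranges[g->nr - 1].end++;
		return 0;
	}
	if (g->nr == MAX_RANGES)
		return -1;

	g->ranges[g->nr].start = pfn;
	g->ranges[g->nr].end = pfn + 1;
	g->nr++;
	return 0;
}

/*
 * Walk the batch one range at a time (where real code would do its
 * per-range work), return the total number of pages covered, and reset.
 */
static unsigned long gather_flush(struct gather *g)
{
	unsigned long pages = 0;

	assert(g != NULL);

	for (size_t i = 0; i < g->nr; i++)
		pages += g->ranges[i].end - g->ranges[i].start;

	g->nr = 0;
	return pages;
}
```

In this sketch, adding 16 consecutive page frames followed by one unrelated frame leaves just two ranges in the batch, so any per-batch work would loop twice rather than 17 times. That is the property the series relies on, though the real data structures and batching logic are in the patches themselves.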
[1] https://lore.kernel.org/linux-mm/20230727141837.3386072-1-ryan.roberts@arm.com/

Thanks,
Ryan

Ryan Roberts (4):
  mm: Implement folio_remove_rmap_range()
  mm/mmu_gather: generalize mmu_gather rmap removal mechanism
  mm/mmu_gather: Remove encoded_page infrastructure
  mm/mmu_gather: Store and process pages in contig ranges

 arch/s390/include/asm/tlb.h |   9 +--
 include/asm-generic/tlb.h   |  49 +++++++-------
 include/linux/mm.h          |  11 +++-
 include/linux/mm_types.h    |  34 +---------
 include/linux/rmap.h        |   2 +
 include/linux/swap.h        |   6 +-
 mm/memory.c                 |  24 ++++---
 mm/mmu_gather.c             | 112 +++++++++++++++++++++++---------
 mm/rmap.c                   | 125 +++++++++++++++++++++++++++---------
 mm/swap.c                   |  99 ++++++++++++++++++++++++++--
 mm/swap_state.c             |  11 ++--
 11 files changed, 333 insertions(+), 149 deletions(-)

---
2.25.1