From: Doug Berger
To: Andrew Morton
Cc: Jonathan Corbet, Mike Rapoport, Borislav Petkov, "Paul E. McKenney",
    Neeraj Upadhyay, Randy Dunlap, Damien Le Moal, Muchun Song,
    Vlastimil Babka, Johannes Weiner, Michal Hocko, KOSAKI Motohiro,
    Mel Gorman, Mike Kravetz, Florian Fainelli, David Hildenbrand,
    Oscar Salvador, Joonsoo Kim, linux-doc@vger.kernel.org,
    linux-kernel@vger.kernel.org, linux-mm@kvack.org, Doug Berger
Subject: [PATCH v3 0/9] mm: introduce Designated Movable Blocks
Date: Thu, 20 Oct 2022 14:53:09 -0700
Message-Id: <20221020215318.4193269-1-opendmb@gmail.com>
X-Patchwork-Id: 13014047

MOTIVATION:
Some Broadcom devices (e.g. 7445, 7278) contain multiple memory
controllers, each mapped in a different address range within a
Uniform Memory Architecture. Some users of these systems have
expressed the desire to locate ZONE_MOVABLE memory on each memory
controller to allow user space intensive processing to make better
use of the additional memory bandwidth.

Unfortunately, the historical monotonic layout of zones would mean
that if the lowest addressed memory controller contains ZONE_MOVABLE
memory then all of the memory available from memory controllers at
higher addresses must also be in the ZONE_MOVABLE zone. This would
force all kernel memory accesses onto the lowest addressed memory
controller and significantly reduce the amount of memory available
for non-movable allocations.

The main objective of this patch set is therefore to allow a block
of memory to be designated as part of the ZONE_MOVABLE zone, where
it will only ever be used by the kernel page allocator to satisfy
requests for movable pages. The term Designated Movable Block is
introduced here to represent such a block. The favored
implementation extends the 'movablecore' kernel parameter to allow
specification of a base address and to support multiple blocks. The
existing 'movablecore' mechanisms are retained.

BACKGROUND:
NUMA architectures support distributing movablecore memory across
each node, but it is undesirable to introduce the overhead and
complexities of NUMA on systems that don't have a Non-Uniform
Memory Architecture.

Commit 342332e6a925 ("mm/page_alloc.c: introduce kernelcore=mirror
option") also depends on zone overlap to support systems with
multiple mirrored ranges.

Commit c6f03e2903c9 ("mm, memory_hotplug: remove zone restrictions")
embraced overlapped zones for memory hotplug.

This commit set follows their lead to allow the ZONE_MOVABLE zone to
overlap other zones. Designated Movable Blocks are made absent from
overlapping zones and present within the ZONE_MOVABLE zone.

I initially investigated an implementation using a Designated
Movable migrate type in line with comments[1] made by Mel Gorman
regarding a "sticky" MIGRATE_MOVABLE type to avoid using
ZONE_MOVABLE. However, this approach was riskier since it was much
more intrusive on the allocation paths. Ultimately, the progress
made by the memory hotplug folks to expand the ZONE_MOVABLE
functionality convinced me to follow this approach.

Changes in v3:
  - removed OTHER OPPORTUNITIES and NOTES from this cover letter.
  - prevent the creation of empty zones instead of adding extra
    info to zoneinfo.
  - size the ZONE_MOVABLE span to the minimum necessary to cover
    pages within the zone to be more intuitive.
  - removed "real" from variable names that were consolidated.
  - rebased to akpm-mm/master (i.e. Linux 6.1-rc1).

Changes in v2:
  - first three commits upstreamed separately [3], [4], and [5].
  - commits 04-06 submitted separately [6].
  - Corrected errors "Reported-by: kernel test robot"
  - Deferred commits after 15 to simplify review of the base
    functionality.
  - minor reorganization of commit 13.

v2: https://lore.kernel.org/linux-mm/20220928223301.375229-1-opendmb@gmail.com/
v1: https://lore.kernel.org/linux-mm/20220913195508.3511038-1-opendmb@gmail.com/

[1] https://lore.kernel.org/lkml/20160428103927.GM2858@techsingularity.net/
[2] https://lore.kernel.org/lkml/1401260672-28339-1-git-send-email-iamjoonsoo.kim@lge.com
[3] https://lore.kernel.org/linux-mm/20220914023913.1855924-1-zi.yan@sent.com
[4] https://lore.kernel.org/linux-mm/20220823030209.57434-2-linmiaohe@huawei.com
[5] https://lore.kernel.org/linux-mm/20220914190917.3517663-1-opendmb@gmail.com
[6] https://lore.kernel.org/linux-mm/20220921223639.1152392-1-opendmb@gmail.com/

Doug Berger (9):
  lib/show_mem.c: display MovableOnly
  mm/page_alloc: calculate node_spanned_pages from pfns
  mm/page_alloc: prevent creation of empty zones
  mm/page_alloc.c: allow oversized movablecore
  mm/page_alloc: introduce init_reserved_pageblock()
  memblock: introduce MEMBLOCK_MOVABLE flag
  mm/dmb: Introduce Designated Movable Blocks
  mm/page_alloc: make alloc_contig_pages DMB aware
  mm/page_alloc: allow base for movablecore

 .../admin-guide/kernel-parameters.txt |  14 +-
 include/linux/dmb.h                   |  29 +++
 include/linux/gfp.h                   |   5 +-
 include/linux/memblock.h              |   8 +
 lib/show_mem.c                        |   2 +-
 mm/Kconfig                            |  12 ++
 mm/Makefile                           |   1 +
 mm/cma.c                              |  15 +-
 mm/dmb.c                              |  91 +++++++++
 mm/memblock.c                         |  30 ++-
 mm/page_alloc.c                       | 188 +++++++++++++-----
 11 files changed, 338 insertions(+), 57 deletions(-)
 create mode 100644 include/linux/dmb.h
 create mode 100644 mm/dmb.c
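
ILLUSTRATION:
The cost of the monotonic zone layout described under MOTIVATION can
be shown with some simple arithmetic. The following is only a
userspace sketch and not code from this series: struct memc_range
and both helper functions are hypothetical names invented for the
example, and it assumes the movable memory on a controller is taken
from the top of that controller's range.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical description of one memory controller (not a kernel type). */
struct memc_range {
	uint64_t base;     /* first byte mapped by this controller */
	uint64_t size;     /* bytes mapped by this controller */
	uint64_t movable;  /* bytes wanted as movable, assumed at the top */
};

/*
 * Monotonic layout: ZONE_MOVABLE begins at the lowest movable address
 * and every byte above that address is movable as well.
 * The ranges must be sorted by ascending base address.
 */
static uint64_t movable_bytes_monotonic(const struct memc_range *r, size_t n)
{
	uint64_t total = 0;
	int started = 0;

	for (size_t i = 0; i < n; i++) {
		assert(r[i].movable <= r[i].size);
		if (started) {
			/* Range lies wholly above the start of the zone. */
			total += r[i].size;
		} else if (r[i].movable) {
			/* The zone starts inside this range. */
			total += r[i].movable;
			started = 1;
		}
	}
	return total;
}

/* Designated blocks: only the requested bytes on each controller count. */
static uint64_t movable_bytes_designated(const struct memc_range *r, size_t n)
{
	uint64_t total = 0;

	for (size_t i = 0; i < n; i++) {
		assert(r[i].movable <= r[i].size);
		total += r[i].movable;
	}
	return total;
}
```

With two hypothetical 1 GiB controllers that each want 256 MiB of
movable memory, the monotonic layout makes 1280 MiB movable and
leaves only 768 MiB, all on the lower controller, for non-movable
allocations. Designating the blocks keeps the movable total at
512 MiB and leaves 768 MiB of non-movable memory on each controller.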