From patchwork Tue Feb 25 13:59:33 2025 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Chen Yu X-Patchwork-Id: 13990060 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 662DEC021BC for ; Tue, 25 Feb 2025 14:04:49 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id BE3346B0085; Tue, 25 Feb 2025 09:04:48 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id B6D126B0088; Tue, 25 Feb 2025 09:04:48 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 9BEF16B0089; Tue, 25 Feb 2025 09:04:48 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0017.hostedemail.com [216.40.44.17]) by kanga.kvack.org (Postfix) with ESMTP id 7A77C6B0085 for ; Tue, 25 Feb 2025 09:04:48 -0500 (EST) Received: from smtpin04.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay10.hostedemail.com (Postfix) with ESMTP id 24A7AC172B for ; Tue, 25 Feb 2025 14:04:48 +0000 (UTC) X-FDA: 83158637856.04.023E047 Received: from mgamail.intel.com (mgamail.intel.com [198.175.65.16]) by imf23.hostedemail.com (Postfix) with ESMTP id 066DC140011 for ; Tue, 25 Feb 2025 14:04:44 +0000 (UTC) Authentication-Results: imf23.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=TVnSXYry; spf=pass (imf23.hostedemail.com: domain of yu.c.chen@intel.com designates 198.175.65.16 as permitted sender) smtp.mailfrom=yu.c.chen@intel.com; dmarc=pass (policy=none) header.from=intel.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1740492286; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding:in-reply-to: references:dkim-signature; bh=t9Wttdeb/tQul9kMVtK65pTuO2ilpdhK7n8N7oiDAjA=; b=3JlJePyiKopOD36bqkG6HrAo4iIvMqlqYzx9CKJb4pb21mJVKN3SKoTbWS3qf+mCRoATTd oFpKQlo5wLidNmT9aITMcBErWVHb3OrzjFA0raZ8qsYVi98eUpjVJttc94o96+96e0HMEp oAPUuH5yk0MVWe09JDxZzXItSZN9Jjo= ARC-Authentication-Results: i=1; imf23.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=TVnSXYry; spf=pass (imf23.hostedemail.com: domain of yu.c.chen@intel.com designates 198.175.65.16 as permitted sender) smtp.mailfrom=yu.c.chen@intel.com; dmarc=pass (policy=none) header.from=intel.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1740492286; a=rsa-sha256; cv=none; b=tY/kWYL3D7eUKgzngpORRhjGVcOsv/4GtXbTwpim6VZKRIowWD+NusI5mgPGJwZ1FtoDh5 mGo9bDDJVUxWQNVu1z3p1f/MjJdt0g64yXW7lOORZSN2XGl236Rh2ssC4vCTTiSIZlmE5t s7URgpNE3zHTiZhesfwgHCWUayCsrbI= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1740492286; x=1772028286; h=from:to:cc:subject:date:message-id:mime-version: content-transfer-encoding; bh=sUXTHt6eh+QXI81EQBaYj0iPTuZrYTa5Od8UTyPzP8E=; b=TVnSXYryWhBbDyxrs39KooKhQEXreNgKN9Yn9aTToXR8Fadke7RGy/fp 5g/Hp6Vujg/aY77bNxVF5rM+7xoZSB/Eh4FjzQ9YW7n9DQY3Cpm5TOxxJ BC7QefXmwFmrfVOn8Nn8nSEBmkHrcqw442ATWAWwb+QdRLi2n55WqLH0o 0FIpNFt0mihShWYrNOofeiX0xoo6z9qs6k/2hPaBZvgK6gEB4luww5zMQ jiOhRsST+LTpqbvOmq8Q369RusoBDEkaHWS+hgS9S4lQ2YPjE2NmLYKm2 MfC+yO1axQgj5DKTpl2n6KuEz9IMXC/UP2bU/1az+ULUp6pkWbNTb2JAg w==; X-CSE-ConnectionGUID: 0cM2QoBKQ+ew7gE3L/mpEQ== X-CSE-MsgGUID: cFnfJBDST8mgHtHtjzeFLw== X-IronPort-AV: E=McAfee;i="6700,10204,11356"; a="41424563" X-IronPort-AV: E=Sophos;i="6.13,314,1732608000"; d="scan'208";a="41424563" Received: from fmviesa008.fm.intel.com ([10.60.135.148]) by orvoesa108.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 25 Feb 2025 06:04:44 -0800 X-CSE-ConnectionGUID: DLHCB871RPS+O8m+2dz66Q== X-CSE-MsgGUID: vS2aVbs+RjSh1G++J6j2IA== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.13,314,1732608000"; d="scan'208";a="116590673" Received: from chenyu-dev.sh.intel.com ([10.239.62.107]) by fmviesa008.fm.intel.com with ESMTP; 25 Feb 2025 06:04:38 -0800 From: Chen Yu To: Ingo Molnar , Peter Zijlstra , Juri Lelli , Vincent Guittot , Andrew Morton Cc: Rik van Riel , Mel Gorman , Johannes Weiner , Michal Hocko , Roman Gushchin , Shakeel Butt , Muchun Song , "Liam R. Howlett" , Lorenzo Stoakes , "Huang, Ying" , Tim Chen , Aubrey Li , Michael Wang , Kaiyang Zhao , David Rientjes , Raghavendra K T , cgroups@vger.kernel.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org, Chen Yu Subject: [RFC PATCH 0/3] sched/numa: Introduce per cgroup numa balance control Date: Tue, 25 Feb 2025 21:59:33 +0800 Message-Id: X-Mailer: git-send-email 2.25.1 MIME-Version: 1.0 X-Rspam-User: X-Rspamd-Server: rspam11 X-Rspamd-Queue-Id: 066DC140011 X-Stat-Signature: quzamuuqn3ujwoco3jrdje7bu3x4x345 X-HE-Tag: 1740492284-432936 X-HE-Meta: U2FsdGVkX18i13ZgpvgS9Unxn0WIJUb+3SvuX8icCGz931/HZg/d3ZfvoRDQrOgvuj4v4QmNM1rtwhBWuL7jJAE0lagSrZwPcTn2qoU3MmizCvep7N10h88pIZvL+NPpMGqn12wNynULdSXPM8Ucrbe98vDSk2MzP5Q/3952VSNrey2OF58OwXlsEz/hnRCbRfToGqULq5MeETPbLJsAAyzeNPvOTOSh2Ny/5dgmiw2RoGnNYKcHGmMYZ1BN9Va1GAlO04o3wx9eMeSQ6fvwrIWKiEBJTgt2tdzoMol5QDjNjLcx0dPZMAst6Br2ac2cO3dXrAnUQHRcoMqeQyjnrvNA+/cAPebJkBz+5XQaMtWrZx09m8VH5g6oN3benlvxHslzgSllhIVPwXwTxh+Rs4A/hKRXPv2bjVPoJuc3HSYOg7HVor5OochBaN86JQ96V+L/0Hr001rBjlTZcrZ9YQYS5OvLJWtfufhDitQA+P/I1FDKWTvUFlMCgRzshUJvEC8b98ILIqguYC22c7cp5ZjO26mH78Xs/GwjA9XJixE0Q+mlozwUlZCruirUtlBGG9E2mgNtZKcXVWSy1t0poFEaMVIda5Scs5O1Xa3q6gi2gpxvLTG5vYln1AK+56s6EfRd0xWPPMfPnHEO+ESD2qaItLx/ItZ4IVuDTsHn351c92l+stFJOx/SlXiQxT7sy1A7XFDIG3mEzeJDfjsveKZSjZQQ3Pu/OIr+1mplW93viNeA+z0REMRHOoKvvo5QpECJ4nnlZnwWzaDD8H6ClERN5WqFobymIuJlU820B9SmCg5B+p3XYZVNhSk/invhxKKeisZLa/vPhYgAIhET9pF5Oby16GfQcHwN+3h0izN1MX/VIC/Drzy7fpOGtxAoWnDO2ATfvsOLdd+XilVwAvYvaOpkWSqXSbRwzMAit9kDa3xARwaknchPtiW8Ym4JA1qMuih5MRO/qPXpzPj FZqzo5Gv lsq1gDFfB3L518PoOgzsybOcGeKkd5Tovu09X5mH0zXFG6a9OZI9WBtjDh/gLn0kDHsoWy1p/SienYpZReSp1Pt/dZJSReHdotmiUClyxGu5AhcGyCkStozm01NVbKDlE345mupw7bY/Qkdi6S5rONvc0m7TM8SimKr18pRjsRNXE4O+Uqr2QWnzBiLTO+i+XjnEws3vYVp+5BdmXdOfjWvb3dnwvxj6la/+KBl1n2YLYcYhVlzJsJaksv5hAoOmrghAJ X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: Introduce a per-cgroup interface to enable NUMA balancing for specific cgroups. The system administrator needs to set the NUMA balancing mode to NUMA_BALANCING_CGROUP=4 to enable this feature. When in the NUMA_BALANCING_CGROUP mode, all cgroups' NUMA balancing is disabled by default. After the administrator enables this feature for a specific cgroup, NUMA balancing for that cgroup is enabled. This per-cgroup NUMA balancing control was once proposed in 2019 by Yun Wang[1]. Then, in 2024, Kaiyang Zhao mentioned that he was working with Meta on per-cgroup NUMA control[2] during a discussion with David Rientjes. I could not find further discussion regarding per-cgroup NUMA balancing from that point on. This set of RFC patches is a rough and compile-passed version, and may have unhandled cases (for example, THP). It has not been thoroughly tested and is intended to initiate or resume the discussion on the topic of per-cgroup NUMA load balancing. The first patch is a NUMA load balancing statistics enhancement. The second patch introduces per-cgroup NUMA balancing. The third one enhances NUMA load balancing for the MPOL_INTERLEAVE policy. Any feedback would be appreciated. [1] https://lore.kernel.org/linux-fsdevel/60b59306-5e36-e587-9145-e90657daec41@linux.alibaba.com/ [2] https://lore.kernel.org/linux-mm/ZrukILyQhMAKWwTe@localhost.localhost/T/ Chen Yu (3): sched/numa: Introduce numa balance task migration and swap in schedstats sched/numa: Introduce per cgroup numa balance control sched/numa: Allow intervale memory allocation for numa balance include/linux/numa.h | 1 + include/linux/sched.h | 4 ++++ include/linux/sched/sysctl.h | 1 + include/linux/vm_event_item.h | 2 ++ include/uapi/linux/mempolicy.h | 1 + kernel/sched/core.c | 42 ++++++++++++++++++++++++++++++++-- kernel/sched/debug.c | 4 ++++ kernel/sched/fair.c | 18 +++++++++++++++ kernel/sched/sched.h | 3 +++ mm/memcontrol.c | 2 ++ mm/memory.c | 2 +- mm/mempolicy.c | 7 ++++++ mm/mprotect.c | 5 ++-- mm/vmstat.c | 2 ++ 14 files changed, 89 insertions(+), 5 deletions(-)