From patchwork Sun Dec 1 01:57:32 2019
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Andrew Morton
X-Patchwork-Id: 11268457
Date: Sat, 30 Nov 2019 17:57:32 -0800
From: akpm@linux-foundation.org
To: akpm@linux-foundation.org, dan.j.williams@intel.com, dave.hansen@linux.intel.com,
 fengguang.wu@intel.com, linux-mm@kvack.org, mgorman@suse.de, mhocko@suse.com, mingo@kernel.org, mm-commits@vger.kernel.org, peterz@infradead.org, riel@redhat.com, torvalds@linux-foundation.org, ying.huang@intel.com
Subject: [patch 140/158] autonuma: reduce cache footprint when scanning page tables
Message-ID: <20191201015732.Ecjld8a-U%akpm@linux-foundation.org>
User-Agent: s-nail v14.8.16
X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4
Sender: owner-linux-mm@kvack.org
Precedence: bulk
X-Loop: owner-majordomo@kvack.org
List-ID:

From: Huang Ying
Subject: autonuma: reduce cache footprint when scanning page tables

During automatic NUMA balancing page table scanning, if pte_protnone()
is true, the PTE does not need to be changed because it is already in
the target state, so the remaining checks on the corresponding struct
page are unnecessary too.

So if we check pte_protnone() first for each PTE, we can avoid
unnecessary struct page accesses and thereby reduce the cache footprint
of NUMA balancing page table scanning.

In a performance test of the pmbench memory accessing benchmark with an
80:20 read/write ratio and a normal access address distribution on a
2-socket Intel server with Optane DC Persistent Memory, perf profiling
shows that the autonuma page table scanning time drops from 1.23% to
0.97% (that is, a 21% reduction) with the patch.

Link: http://lkml.kernel.org/r/20191101075727.26683-3-ying.huang@intel.com
Signed-off-by: "Huang, Ying"
Acked-by: Mel Gorman
Cc: Michal Hocko
Cc: Rik van Riel
Cc: Peter Zijlstra
Cc: Ingo Molnar
Cc: Dave Hansen
Cc: Dan Williams
Cc: Fengguang Wu
Signed-off-by: Andrew Morton
---

 mm/mprotect.c |    8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

--- a/mm/mprotect.c~autonuma-reduce-cache-footprint-when-scanning-page-tables
+++ a/mm/mprotect.c
@@ -80,6 +80,10 @@ static unsigned long change_pte_range(st
 			if (prot_numa) {
 				struct page *page;
 
+				/* Avoid TLB flush if possible */
+				if (pte_protnone(oldpte))
+					continue;
+
 				page = vm_normal_page(vma, addr, oldpte);
 				if (!page || PageKsm(page))
 					continue;
@@ -97,10 +101,6 @@ static unsigned long change_pte_range(st
 				if (page_is_file_cache(page) && PageDirty(page))
 					continue;
 
-				/* Avoid TLB flush if possible */
-				if (pte_protnone(oldpte))
-					continue;
-
 				/*
 				 * Don't mess with PTEs if page is already on the node
 				 * a single-threaded process is running on.
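
For illustration only, the effect of the reordering in the diff above can be modelled in user space. The sketch below is not kernel code: struct fake_page, struct fake_pte, page_touches, page_should_skip() and scan_ptes() are hypothetical stand-ins invented for this example, and the counter merely approximates "struct page was dereferenced". It assumes that the protnone bit can be read from the PTE value alone, which is what the commit message relies on.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-ins for struct page and a PTE; not kernel types. */
struct fake_page {
	bool ksm;		/* models PageKsm() */
	bool dirty_file;	/* models a dirty file-cache page */
};

struct fake_pte {
	bool protnone;		/* models pte_protnone() */
	struct fake_page *page;	/* models the result of vm_normal_page() */
};

/* Counts how often the scan reads fields of a fake_page. */
static size_t page_touches;

/* Page-based skip checks; these are the accesses the patch tries to avoid. */
static bool page_should_skip(const struct fake_page *page)
{
	if (page == NULL)
		return true;
	page_touches++;
	return page->ksm || page->dirty_file;
}

/*
 * Walk n PTEs and return how many would be changed.
 * pte_check_first == false models the old check order,
 * pte_check_first == true models the order after the patch.
 */
static size_t scan_ptes(const struct fake_pte *ptes, size_t n,
			bool pte_check_first)
{
	size_t changed = 0;

	assert(ptes != NULL || n == 0);

	for (size_t i = 0; i < n; i++) {
		/* New order: the cheap PTE-only test runs first. */
		if (pte_check_first && ptes[i].protnone)
			continue;

		if (page_should_skip(ptes[i].page))
			continue;

		/* Old order: the PTE test runs after the page was touched. */
		if (!pte_check_first && ptes[i].protnone)
			continue;

		changed++;
	}
	return changed;
}
```

Both orders select the same PTEs for changing; only the number of page accesses differs. In this model, a range where most PTEs are already protnone touches far fewer pages with the new order, which is the cache footprint reduction described in the commit message.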