@@ -2232,6 +2232,48 @@ static struct page *alloc_page_interleave(gfp_t gfp, unsigned order,
return page;
}
+static struct page *alloc_pages_vma_thp(gfp_t gfp, struct mempolicy *pol,
+ int order, int node)
+{
+ nodemask_t *nmask;
+ struct page *page;
+ int hpage_node = node;
+
+ /*
+ * For hugepage allocation and non-interleave policy which allows the
+ * current node (or other explicitly preferred node) we only try to
+ * allocate from the current/preferred node and don't fall back to other
+ * nodes, as the cost of remote accesses would likely offset THP
+ * benefits.
+ *
+ * If the policy is interleave or multiple preferred nodes, or does not
+ * allow the current node in its nodemask, we allocate the standard way.
+ */
+ if (pol->mode == MPOL_PREFERRED && !(pol->flags & MPOL_F_LOCAL))
+ hpage_node = first_node(pol->v.preferred_nodes);
+
+ nmask = policy_nodemask(gfp, pol);
+
+ /*
+ * First, try to allocate THP only on local node, but don't reclaim
+ * unnecessarily, just compact.
+ */
+ page = __alloc_pages_nodemask(gfp | __GFP_THISNODE | __GFP_NORETRY,
+ order, hpage_node, nmask);
+
+ /*
+ * If hugepage allocations are configured to always synchronous compact
+ * or the vma has been madvised to prefer hugepage backing, retry
+ * allowing remote memory with both reclaim and compact as well.
+ */
+ if (!page && (gfp & __GFP_DIRECT_RECLAIM))
+ page = __alloc_pages_nodemask(gfp, order, hpage_node, nmask);
+
+ VM_BUG_ON(page && nmask && !node_isset(page_to_nid(page), *nmask));
+
+ return page;
+}
+
/**
* alloc_pages_vma - Allocate a page for a VMA.
*
@@ -2272,57 +2314,17 @@ alloc_pages_vma(gfp_t gfp, int order, struct vm_area_struct *vma,
nid = interleave_nid(pol, vma, addr, PAGE_SHIFT + order);
mpol_cond_put(pol);
page = alloc_page_interleave(gfp, order, nid);
- goto out;
- }
-
- if (unlikely(IS_ENABLED(CONFIG_TRANSPARENT_HUGEPAGE) && hugepage)) {
- int hpage_node = node;
-
- /*
- * For hugepage allocation and non-interleave policy which
- * allows the current node (or other explicitly preferred
- * node) we only try to allocate from the current/preferred
- * node and don't fall back to other nodes, as the cost of
- * remote accesses would likely offset THP benefits.
- *
- * If the policy is interleave or multiple preferred nodes, or
- * does not allow the current node in its nodemask, we allocate
- * the standard way.
- */
- if (pol->mode == MPOL_PREFERRED && !(pol->flags & MPOL_F_LOCAL))
- hpage_node = first_node(pol->v.preferred_nodes);
-
+ } else if (unlikely(IS_ENABLED(CONFIG_TRANSPARENT_HUGEPAGE) &&
+ hugepage)) {
+ page = alloc_pages_vma_thp(gfp, pol, order, node);
+ mpol_cond_put(pol);
+ } else {
nmask = policy_nodemask(gfp, pol);
+ preferred_nid = policy_node(gfp, pol, node);
+ page = __alloc_pages_nodemask(gfp, order, preferred_nid, nmask);
mpol_cond_put(pol);
-
- /*
- * First, try to allocate THP only on local node, but
- * don't reclaim unnecessarily, just compact.
- */
- page = __alloc_pages_nodemask(gfp | __GFP_THISNODE |
- __GFP_NORETRY,
- order, hpage_node, nmask);
-
- /*
- * If hugepage allocations are configured to always synchronous
- * compact or the vma has been madvised to prefer hugepage
- * backing, retry allowing remote memory with both reclaim and
- * compact as well.
- */
- if (!page && (gfp & __GFP_DIRECT_RECLAIM))
- page = __alloc_pages_nodemask(gfp, order, hpage_node,
- nmask);
-
- VM_BUG_ON(page && nmask &&
- !node_isset(page_to_nid(page), *nmask));
- goto out;
}
- nmask = policy_nodemask(gfp, pol);
- preferred_nid = policy_node(gfp, pol, node);
- page = __alloc_pages_nodemask(gfp, order, preferred_nid, nmask);
- mpol_cond_put(pol);
-out:
return page;
}
EXPORT_SYMBOL(alloc_pages_vma);
The next patch is going to rework this code to support MPOL_PREFERRED_MANY. This refactor makes that change much more readable. After the extraction, the resulting code makes it apparent that this can be converted to a simple if ladder and thus allows removing the goto. There are not meant to be any functional or behavioral changes. Note that, at this point, MPOL_PREFERRED_MANY still isn't specially handled for huge pages. Cc: Andrew Morton <akpm@linux-foundation.org> Cc: Dave Hansen <dave.hansen@linux.intel.com> Cc: Michal Hocko <mhocko@kernel.org> Signed-off-by: Ben Widawsky <ben.widawsky@intel.com> --- mm/mempolicy.c | 96 ++++++++++++++++++++++++++------------------------ 1 file changed, 49 insertions(+), 47 deletions(-)