@@ -690,18 +690,19 @@ static void arm_cmn_pmu_disable(struct pmu *pmu)
 static u64 arm_cmn_read_dtm(struct arm_cmn *cmn, struct arm_cmn_hw_event *hw,
 			    bool snapshot)
 {
+	struct arm_cmn_dtm *dtm = NULL;
 	struct arm_cmn_node *dn;
-	unsigned int i, offset;
-	u64 count = 0;
+	unsigned int i, offset, dtm_idx;
+	u64 reg, count = 0;
 
 	offset = snapshot ? CMN_DTM_PMEVCNTSR : CMN_DTM_PMEVCNT;
 	for_each_hw_dn(hw, dn, i) {
-		struct arm_cmn_dtm *dtm = &cmn->dtms[dn->dtm];
-		int dtm_idx = arm_cmn_get_index(hw->dtm_idx, i);
-		u64 reg = readq_relaxed(dtm->base + offset);
-		u16 dtm_count = reg >> (dtm_idx * 16);
-
-		count += dtm_count;
+		if (dtm != &cmn->dtms[dn->dtm]) {
+			dtm = &cmn->dtms[dn->dtm];
+			reg = readq_relaxed(dtm->base + offset);
+		}
+		dtm_idx = arm_cmn_get_index(hw->dtm_idx, i);
+		count += (u16)(reg >> (dtm_idx * 16));
 	}
 	return count;
 }
When multiple nodes of the same type are connected to the same XP
(particularly in CAL configurations), it seems that they are likely to be
consecutive in logical ID. Therefore, we're likely to gain a small benefit
from an easy tweak to optimise out consecutive reads of the same set of DTM
counters for an aggregated event.

Signed-off-by: Robin Murphy <robin.murphy@arm.com>
---
 drivers/perf/arm-cmn.c | 17 +++++++++--------
 1 file changed, 9 insertions(+), 8 deletions(-)
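For anyone wanting to poke at the arithmetic outside the driver, below is a
minimal userspace sketch of the same idea: each 64-bit DTM event-count
register packs four 16-bit counters, an aggregated event sums the relevant
16-bit lanes, and consecutive nodes that sit behind the same DTM only need
the register read once. Everything here (fake_dtm_read(), node_dtm[],
node_lane[]) is invented for the example; the real equivalents in the driver
are readq_relaxed(dtm->base + offset), dn->dtm and
arm_cmn_get_index(hw->dtm_idx, i).

/*
 * Standalone sketch (not kernel code) of the read-combining above.
 */
#include <stdint.h>
#include <stdio.h>

/* Pretend DTM registers: each 64-bit value packs four 16-bit counters. */
static const uint64_t fake_dtm_regs[2] = {
	0x0004000300020001ULL,	/* DTM 0: lanes 3..0 hold 4, 3, 2, 1 */
	0x0040003000200010ULL,	/* DTM 1: lanes 3..0 hold 0x40, 0x30, 0x20, 0x10 */
};

static uint64_t fake_dtm_read(unsigned int dtm)
{
	return fake_dtm_regs[dtm];	/* stands in for readq_relaxed() */
}

int main(void)
{
	/* An aggregated event spread over four nodes: (DTM, lane) pairs. */
	const unsigned int node_dtm[]  = { 0, 0, 1, 1 };
	const unsigned int node_lane[] = { 0, 1, 0, 2 };
	unsigned int last_dtm = -1u;
	uint64_t reg = 0, count = 0;

	for (unsigned int i = 0; i < 4; i++) {
		/* Only re-read the register when we move to a new DTM. */
		if (node_dtm[i] != last_dtm) {
			last_dtm = node_dtm[i];
			reg = fake_dtm_read(last_dtm);
		}
		/* Extract the node's 16-bit counter lane and accumulate it. */
		count += (uint16_t)(reg >> (node_lane[i] * 16));
	}

	printf("aggregate count = %llu\n", (unsigned long long)count);
	return 0;
}

Expected output is "aggregate count = 67" (1 + 2 + 0x10 + 0x30), with the
(uint16_t) cast doing the same lane masking as the driver's (u16) cast, and
only two register reads issued for the four nodes.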