From patchwork Thu Nov 1 10:04:00 2018 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "Wang, Wei W" X-Patchwork-Id: 10663837 Return-Path: Received: from mail.wl.linuxfoundation.org (pdx-wl-mail.web.codeaurora.org [172.30.200.125]) by pdx-korg-patchwork-2.web.codeaurora.org (Postfix) with ESMTP id E106714E2 for ; Thu, 1 Nov 2018 10:37:29 +0000 (UTC) Received: from mail.wl.linuxfoundation.org (localhost [127.0.0.1]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id CB22B28397 for ; Thu, 1 Nov 2018 10:37:29 +0000 (UTC) Received: by mail.wl.linuxfoundation.org (Postfix, from userid 486) id BF8292B7C3; Thu, 1 Nov 2018 10:37:29 +0000 (UTC) X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on pdx-wl-mail.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-7.9 required=2.0 tests=BAYES_00,MAILING_LIST_MULTI, RCVD_IN_DNSWL_HI autolearn=ham version=3.3.1 Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 5D25628397 for ; Thu, 1 Nov 2018 10:37:29 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727862AbeKATis (ORCPT ); Thu, 1 Nov 2018 15:38:48 -0400 Received: from mga06.intel.com ([134.134.136.31]:52047 "EHLO mga06.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726320AbeKATir (ORCPT ); Thu, 1 Nov 2018 15:38:47 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga002.fm.intel.com ([10.253.24.26]) by orsmga104.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 01 Nov 2018 03:36:25 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.54,451,1534834800"; d="scan'208";a="100601930" Received: from devel-ww.sh.intel.com ([10.239.48.119]) by fmsmga002.fm.intel.com with ESMTP; 01 Nov 2018 03:36:21 -0700 From: Wei Wang To: linux-kernel@vger.kernel.org, kvm@vger.kernel.org, pbonzini@redhat.com, ak@linux.intel.com, peterz@infradead.org Cc: mingo@redhat.com, rkrcmar@redhat.com, like.xu@intel.com, wei.w.wang@intel.com Subject: [PATCH v1 0/8] Intel Virtual PMU Optimization Date: Thu, 1 Nov 2018 18:04:00 +0800 Message-Id: <1541066648-40690-1-git-send-email-wei.w.wang@intel.com> X-Mailer: git-send-email 2.7.4 Sender: kvm-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: kvm@vger.kernel.org X-Virus-Scanned: ClamAV using ClamSMTP This patch series optimizes the Intel PMU virtualization by reducing the PMU virtualization overhead and providing guests with accurate PMU statistics. The differences of the traditional approach and the optimized apporach are depicted in the figures here: https://github.com/weiwangwork/vPMU/blob/master/vPMU%20Optimization.pdf The traditional approach of PMU virtualization is host perf event oriented. The KVM vPMU layer sits on top of the host perf subsystem and each guest's update to the vPMU is translated into a new host perf event, which needs to go through the host perf software stack (e.g. releasing the old perf event, re-creating a new one and getting it rescheduled to a hardware perf counter) for a reconfiguration. With this optimization, we intend to make the virtualization layer to be register oriented. The KVM vPMU layer is moved down to directly sit on the hardware perf counters. The guest accesses to the vPMU registers can be directly applied to the related hardware counter by the vPMU. It can reduce the virtualization overhead from around 2500000ns to 400ns. (Tested on Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz, and added host booting parameter "nowatchdog" to avoid the noise from watchdog_hld) We still need the vPMU to get the ownership of the physical perf counters from the host perf core. The guest used counters are taken from the host perf core via x86_perf_mask_perf_counters, which in most cases is a bit-setting of the guest mask. This series currently covers the basic perf counter virtualization. Other features, such as pebs and lbr, will come after this series. Wei Wang (8): perf/x86: add support to mask counters from host perf/x86/intel: add pmi callback support KVM/x86/vPMU: optimize intel vPMU KVM/x86/vPMU: support msr switch on vmx transitions KVM/x86/vPMU: intel_pmu_read_pmc KVM/x86/vPMU: remove some unused functions KVM/x86/vPMU: save/restore guest perf counters on vCPU switching KVM/x86/vPMU: return the counters to host if guest is torn down arch/x86/events/core.c | 47 ++++ arch/x86/events/intel/core.c | 65 ++---- arch/x86/events/perf_event.h | 10 +- arch/x86/include/asm/kvm_host.h | 13 ++ arch/x86/include/asm/perf_event.h | 16 +- arch/x86/kvm/pmu.c | 15 ++ arch/x86/kvm/pmu.h | 7 + arch/x86/kvm/pmu_intel.c | 448 +++++++++++++++++++++++++------------- arch/x86/kvm/vmx.c | 6 +- arch/x86/kvm/x86.c | 6 + include/linux/kvm_host.h | 1 + virt/kvm/kvm_main.c | 3 + 12 files changed, 416 insertions(+), 221 deletions(-)