From patchwork Wed Apr 21 15:08:29 2021 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Kenta Ishiguro X-Patchwork-Id: 12216353 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-13.8 required=3.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_PATCH, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED,USER_AGENT_GIT autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 49922C433ED for ; Wed, 21 Apr 2021 15:10:17 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id 0DF6461457 for ; Wed, 21 Apr 2021 15:10:17 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S243852AbhDUPKp (ORCPT ); Wed, 21 Apr 2021 11:10:45 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:38236 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S243950AbhDUPJO (ORCPT ); Wed, 21 Apr 2021 11:09:14 -0400 Received: from mail-pf1-x42a.google.com (mail-pf1-x42a.google.com [IPv6:2607:f8b0:4864:20::42a]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 891AEC06174A for ; Wed, 21 Apr 2021 08:08:39 -0700 (PDT) Received: by mail-pf1-x42a.google.com with SMTP id c17so29123715pfn.6 for ; Wed, 21 Apr 2021 08:08:39 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=sslab.ics.keio.ac.jp; s=google; h=from:to:cc:subject:date:message-id:mime-version :content-transfer-encoding; bh=6tJWCygQYXpyjmta5WwDxPI/i/SiGWVTOjHzOmEYKlE=; b=C1PVVj6GicJZ2pct0tdsal9RHi3mTrSgwF8Perq6B7IXO9c6YWeePyMrncWjx9dX5c 4wbHBbiVh9evi6zxlz+viHBjFYkU/JgnXxXMGXTCKx2V8pb3vawpkwRCVALFs8jiBRDJ rSffYnzLVqokcFKXrrza+WO2+Q15zJ+Ylckh4= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:mime-version :content-transfer-encoding; bh=6tJWCygQYXpyjmta5WwDxPI/i/SiGWVTOjHzOmEYKlE=; b=D8/GIzyUtsQjqWfhh4jphe7/G+x8BMqNsboCdZ8MJxY2S0xgQT8umhYK6qjo/geNYh k5l2itbY8R2IU78PVigXzWaWlbLwVyhEuCuE/va/EKjnTrpfHqtRbKTJC5As8XhbQE0k 0b/CJwRWVWf0ymaUOyuBf22EnrZNqMMBZLcd2vg68C2xsYBGIbUbNBGLRwTOd2B4hPwi d51wqBbvvP4G/HJHI9P5M2VsIm2v9Uxi+y3FS64MJju3DLz46s4bUCWPCDzIgiL8CP0G xI4y/nSxr6klHpds+KcoqoFLWcW55ZmI+6rf5+82tcZQxEu0rGNVs4WDBsM6AUS5gXph m3GA== X-Gm-Message-State: AOAM531l/XNRs8Q8RYtQkYxGGTz+WcbOZvA5izX2lPH3LQrkmhJbgOLY CO7gtiDhblzxlI2rAOJdJxz/gQ== X-Google-Smtp-Source: ABdhPJz1WE6GD3g5TmO4Eb0xaKhSpVZkOqP/gqcFkJCrrmxOYw5FpUQvy3ylfWfox/xr2V5sf2Zomg== X-Received: by 2002:a17:90a:a895:: with SMTP id h21mr11920590pjq.13.1619017718934; Wed, 21 Apr 2021 08:08:38 -0700 (PDT) Received: from haraichi.dnlocal (113x36x239x145.ap113.ftth.ucom.ne.jp. [113.36.239.145]) by smtp.googlemail.com with ESMTPSA id f3sm5432553pjo.3.2021.04.21.08.08.36 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 21 Apr 2021 08:08:38 -0700 (PDT) From: Kenta Ishiguro To: Paolo Bonzini Cc: Sean Christopherson , Vitaly Kuznetsov , Wanpeng Li , Jim Mattson , Joerg Roedel , David Hildenbrand , kvm@vger.kernel.org, linux-kernel@vger.kernel.org, pl@sslab.ics.keio.ac.jp, kono@sslab.ics.keio.ac.jp, Kenta Ishiguro Subject: [RFC PATCH 0/2] Mitigating Excessive Pause-Loop Exiting in VM-Agnostic KVM Date: Thu, 22 Apr 2021 00:08:29 +0900 Message-Id: <20210421150831.60133-1-kentaishiguro@sslab.ics.keio.ac.jp> X-Mailer: git-send-email 2.30.2 MIME-Version: 1.0 Precedence: bulk List-ID: X-Mailing-List: kvm@vger.kernel.org Dear KVM developers and maintainers, In our research work presented last week at the VEE 2021 conference [1], we found out that a lot of continuous Pause-Loop-Exiting (PLE) events occur due to three problems we have identified: 1) Linux CFS ignores hints from KVM; 2) IPI receiver vCPUs in user-mode are not boosted; 3) IPI-receiver that has halted is always a candidate for boost. We have intoduced two mitigations against the problems. To solve problem (1), patch 1 increases the vruntime of yielded vCPU to pass the check `if (cfs_rq->next && wakeup_preempt_entity(cfs_rq->next, left) < 1)` in `struct sched_entity * pick_next_entity()` if the cfs_rq's skip and next are both vCPUs in the same VM. To keep fairness it does not prioritize the guest VM which causes PLE, however it improves the performance by eliminating unnecessary PLE. Also we have confirmed `yield_to_task_fair` is called only from KVM. To solve problems (2) and (3), patch 2 monitors IPI communication between vCPUs and leverages the relationship between vCPUs to select boost candidates. The "[PATCH] KVM: Boost vCPU candidiate in user mode which is delivering interrupt" patch (https://lore.kernel.org/kvm/CANRm+Cy-78UnrkX8nh5WdHut2WW5NU=UL84FRJnUNjsAPK+Uww@mail.gmail.com/T/) seems to be effective for (2) while it only uses the IPI receiver information. Our approach reduces the total number of PLE events by up to 87.6 % in four 8-vCPU VMs in over-subscribed scenario with the Linux kernel 5.6.0. Please find the patch below. We would greatly appreciate your valuable feedback on our approach and patch. Thank you very much for your consideration Kenta Ishiguro [1] Kenta Ishiguro, Naoki Yasuno, Pierre-Louis Aublin, and Kenji Kono. "Mitigating excessive vCPU spinning in VM-agnostic KVM". In Proceedings of the 17th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE 2021). Association for Computing Machinery, New York, NY, USA, 139--152. https://dl.acm.org/doi/abs/10.1145/3453933.3454020 Kenta Ishiguro (2): Prevent CFS from ignoring boost requests from KVM Boost vCPUs based on IPI-sender and receiver information arch/x86/kvm/lapic.c | 14 ++++++++++++++ arch/x86/kvm/vmx/vmx.c | 2 ++ include/linux/kvm_host.h | 5 +++++ kernel/sched/fair.c | 31 +++++++++++++++++++++++++++++++ virt/kvm/kvm_main.c | 26 ++++++++++++++++++++++++-- 5 files changed, 76 insertions(+), 2 deletions(-)