Message ID | 20200506094948.76388-11-david@redhat.com (mailing list archive)
---|---
State | New, archived
Series | virtio-mem: Paravirtualized memory hot(un)plug
On 5/6/20 4:49 AM, David Hildenbrand wrote: > This is the very basic/initial version of virtio-mem. An introduction to > virtio-mem can be found in the Linux kernel driver [1]. While it can be > used in the current state for hotplug of a smaller amount of memory, it > will heavily benefit from resizeable memory regions in the future. > > Each virtio-mem device manages a memory region (provided via a memory > backend). After requested by the hypervisor ("requested-size"), the > guest can try to plug/unplug blocks of memory within that region, in order > to reach the requested size. Initially, and after a reboot, all memory is > unplugged (except in special cases - reboot during postcopy). > > The guest may only try to plug/unplug blocks of memory within the usable > region size. The usable region size is a little bigger than the > requested size, to give the device driver some flexibility. The usable > region size will only grow, except on reboots or when all memory is > requested to get unplugged. The guest can never plug more memory than > requested. Unplugged memory will get zapped/discarded, similar to in a > balloon device. > > The block size is variable, however, it is always chosen in a way such that > THP splits are avoided (e.g., 2MB). The state of each block > (plugged/unplugged) is tracked in a bitmap. > > As virtio-mem devices (e.g., virtio-mem-pci) will be memory devices, we now > expose "VirtioMEMDeviceInfo" via "query-memory-devices". > > +++ b/qapi/misc.json > @@ -1354,19 +1354,56 @@ > } > } > > +## > +# @VirtioMEMDeviceInfo: > +# > +# @memdev: memory backend linked with the region > +# > +# Since: 5.1 Here you claim 5.1, > +## > +{ 'struct': 'VirtioMEMDeviceInfo', > + 'data': { '*id': 'str', > + 'memaddr': 'size', > + 'requested-size': 'size', > + 'size': 'size', > + 'max-size': 'size', > + 'block-size': 'size', > + 'node': 'int', > + 'memdev': 'str' > + } > +} > + > ## > # @MemoryDeviceInfo: > # > # Union containing information about a memory device > # > # nvdimm is included since 2.12. virtio-pmem is included since 4.1. > +# virtio-mem is included since 5.2. but here 5.2. They should probably be the same :)
>> +## >> +{ 'struct': 'VirtioMEMDeviceInfo', >> + 'data': { '*id': 'str', >> + 'memaddr': 'size', >> + 'requested-size': 'size', >> + 'size': 'size', >> + 'max-size': 'size', >> + 'block-size': 'size', >> + 'node': 'int', >> + 'memdev': 'str' >> + } >> +} >> + >> ## >> # @MemoryDeviceInfo: >> # >> # Union containing information about a memory device >> # >> # nvdimm is included since 2.12. virtio-pmem is included since 4.1. >> +# virtio-mem is included since 5.2. > > but here 5.2. They should probably be the same :) > Thanks! I've been changing these numbers for a couple of releases already, it was meant to go wrong at one point :)
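For context on how this union surfaces over QMP: a simple QAPI union such as MemoryDeviceInfo is returned as a type/data pair, so once the series is merged a query-memory-devices reply for the first device from the usage example quoted later in this thread could look roughly like this (hypothetical output, values mirroring the "info memory-devices" sample in the patch description):

    { "execute": "query-memory-devices" }
    { "return": [
        { "type": "virtio-mem",
          "data": { "id": "vm0",
                    "memaddr": 5368709120,
                    "node": 0,
                    "requested-size": 0,
                    "size": 0,
                    "max-size": 8589934592,
                    "block-size": 2097152,
                    "memdev": "/objects/mem0" } } ] }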
I'm not sure if it's possible to split this up; it's a bit big. It could also do with a pile of trace_ entries to figure out what it's doing. * David Hildenbrand (david@redhat.com) wrote: > This is the very basic/initial version of virtio-mem. An introduction to > virtio-mem can be found in the Linux kernel driver [1]. While it can be > used in the current state for hotplug of a smaller amount of memory, it > will heavily benefit from resizeable memory regions in the future. > > Each virtio-mem device manages a memory region (provided via a memory > backend). After requested by the hypervisor ("requested-size"), the > guest can try to plug/unplug blocks of memory within that region, in order > to reach the requested size. Initially, and after a reboot, all memory is > unplugged (except in special cases - reboot during postcopy). > > The guest may only try to plug/unplug blocks of memory within the usable > region size. The usable region size is a little bigger than the > requested size, to give the device driver some flexibility. The usable > region size will only grow, except on reboots or when all memory is > requested to get unplugged. The guest can never plug more memory than > requested. Unplugged memory will get zapped/discarded, similar to in a > balloon device. > > The block size is variable, however, it is always chosen in a way such that > THP splits are avoided (e.g., 2MB). The state of each block > (plugged/unplugged) is tracked in a bitmap. > > As virtio-mem devices (e.g., virtio-mem-pci) will be memory devices, we now > expose "VirtioMEMDeviceInfo" via "query-memory-devices". > > -------------------------------------------------------------------------- > > There are two important follow-up items that are in the works: > 1. Resizeable memory regions: Use resizeable allocations/RAM blocks to > grow/shrink along with the usable region size. This avoids creating > initially very big VMAs, RAM blocks, and KVM slots. > 2. Protection of unplugged memory: Make sure the gust cannot actually > make use of unplugged memory. > > Other follow-up items that are in the works: > 1. Exclude unplugged memory during migration (via precopy notifier). > 2. Handle remapping of memory. > 3. Support for other architectures. > > -------------------------------------------------------------------------- > > Example usage (virtio-mem-pci is introduced in follow-up patches): > > Start QEMU with two virtio-mem devices (one per NUMA node): > $ qemu-system-x86_64 -m 4G,maxmem=20G \ > -smp sockets=2,cores=2 \ > -numa node,nodeid=0,cpus=0-1 -numa node,nodeid=1,cpus=2-3 \ > [...] 
> -object memory-backend-ram,id=mem0,size=8G \ > -device virtio-mem-pci,id=vm0,memdev=mem0,node=0,requested-size=0M \ > -object memory-backend-ram,id=mem1,size=8G \ > -device virtio-mem-pci,id=vm1,memdev=mem1,node=1,requested-size=1G > > Query the configuration: > (qemu) info memory-devices > Memory device [virtio-mem]: "vm0" > memaddr: 0x140000000 > node: 0 > requested-size: 0 > size: 0 > max-size: 8589934592 > block-size: 2097152 > memdev: /objects/mem0 > Memory device [virtio-mem]: "vm1" > memaddr: 0x340000000 > node: 1 > requested-size: 1073741824 > size: 1073741824 > max-size: 8589934592 > block-size: 2097152 > memdev: /objects/mem1 > > Add some memory to node 0: > (qemu) qom-set vm0 requested-size 500M > > Remove some memory from node 1: > (qemu) qom-set vm1 requested-size 200M > > Query the configuration again: > (qemu) info memory-devices > Memory device [virtio-mem]: "vm0" > memaddr: 0x140000000 > node: 0 > requested-size: 524288000 > size: 524288000 > max-size: 8589934592 > block-size: 2097152 > memdev: /objects/mem0 > Memory device [virtio-mem]: "vm1" > memaddr: 0x340000000 > node: 1 > requested-size: 209715200 > size: 209715200 > max-size: 8589934592 > block-size: 2097152 > memdev: /objects/mem1 > > [1] https://lkml.kernel.org/r/20200311171422.10484-1-david@redhat.com > > Cc: "Michael S. Tsirkin" <mst@redhat.com> > Cc: Eric Blake <eblake@redhat.com> > Cc: Markus Armbruster <armbru@redhat.com> > Cc: "Dr. David Alan Gilbert" <dgilbert@redhat.com> > Cc: Igor Mammedov <imammedo@redhat.com> > Signed-off-by: David Hildenbrand <david@redhat.com> > --- > hw/virtio/Kconfig | 11 + > hw/virtio/Makefile.objs | 1 + > hw/virtio/virtio-mem.c | 762 +++++++++++++++++++++++++++++++++ > include/hw/virtio/virtio-mem.h | 80 ++++ > qapi/misc.json | 39 +- > 5 files changed, 892 insertions(+), 1 deletion(-) > create mode 100644 hw/virtio/virtio-mem.c > create mode 100644 include/hw/virtio/virtio-mem.h > > diff --git a/hw/virtio/Kconfig b/hw/virtio/Kconfig > index 83122424fa..0eda25c4e1 100644 > --- a/hw/virtio/Kconfig > +++ b/hw/virtio/Kconfig > @@ -47,3 +47,14 @@ config VIRTIO_PMEM > depends on VIRTIO > depends on VIRTIO_PMEM_SUPPORTED > select MEM_DEVICE > + > +config VIRTIO_MEM_SUPPORTED > + bool > + > +config VIRTIO_MEM > + bool > + default y > + depends on VIRTIO > + depends on LINUX > + depends on VIRTIO_MEM_SUPPORTED > + select MEM_DEVICE > diff --git a/hw/virtio/Makefile.objs b/hw/virtio/Makefile.objs > index 4e4d39a0a4..7df70e977e 100644 > --- a/hw/virtio/Makefile.objs > +++ b/hw/virtio/Makefile.objs > @@ -18,6 +18,7 @@ common-obj-$(call land,$(CONFIG_VIRTIO_PMEM),$(CONFIG_VIRTIO_PCI)) += virtio-pme > obj-$(call land,$(CONFIG_VHOST_USER_FS),$(CONFIG_VIRTIO_PCI)) += vhost-user-fs-pci.o > obj-$(CONFIG_VIRTIO_IOMMU) += virtio-iommu.o > obj-$(CONFIG_VHOST_VSOCK) += vhost-vsock.o > +obj-$(CONFIG_VIRTIO_MEM) += virtio-mem.o > > ifeq ($(CONFIG_VIRTIO_PCI),y) > obj-$(CONFIG_VHOST_VSOCK) += vhost-vsock-pci.o > diff --git a/hw/virtio/virtio-mem.c b/hw/virtio/virtio-mem.c > new file mode 100644 > index 0000000000..e25b2c74f2 > --- /dev/null > +++ b/hw/virtio/virtio-mem.c > @@ -0,0 +1,762 @@ > +/* > + * Virtio MEM device > + * > + * Copyright (C) 2020 Red Hat, Inc. > + * > + * Authors: > + * David Hildenbrand <david@redhat.com> > + * > + * This work is licensed under the terms of the GNU GPL, version 2. > + * See the COPYING file in the top-level directory. 
> + */ > + > +#include "qemu/osdep.h" > +#include "qemu-common.h" > +#include "qemu/iov.h" > +#include "qemu/cutils.h" > +#include "qemu/error-report.h" > +#include "qemu/units.h" > +#include "sysemu/numa.h" > +#include "sysemu/sysemu.h" > +#include "sysemu/reset.h" > +#include "hw/virtio/virtio.h" > +#include "hw/virtio/virtio-bus.h" > +#include "hw/virtio/virtio-access.h" > +#include "hw/virtio/virtio-mem.h" > +#include "qapi/error.h" > +#include "qapi/visitor.h" > +#include "exec/ram_addr.h" > +#include "migration/misc.h" > +#include "migration/postcopy-ram.h" > +#include "hw/boards.h" > +#include "hw/qdev-properties.h" > +#include "config-devices.h" > + > +/* > + * Use QEMU_VMALLOC_ALIGN, so no THP will have to be split when unplugging > + * memory (e.g., 2MB on x86_64). > + */ > +#define VIRTIO_MEM_MIN_BLOCK_SIZE QEMU_VMALLOC_ALIGN > +/* > + * Size the usable region bigger than the requested size if possible. Esp. > + * Linux guests will only add (aligned) memory blocks in case they fully > + * fit into the usable region, but plug+online only a subset of the pages. > + * The memory block size corresponds mostly to the section size. > + * > + * This allows e.g., to add 20MB with a section size of 128MB on x86_64, and > + * a section size of 1GB on arm64 (as long as the start address is properly > + * aligned, similar to ordinary DIMMs). > + * > + * We can change this at any time and maybe even make it configurable if > + * necessary (as the section size can change). But it's more likely that the > + * section size will rather get smaller and not bigger over time. > + */ > +#if defined(__x86_64__) > +#define VIRTIO_MEM_USABLE_EXTENT (2 * (128 * MiB)) > +#else > +#error VIRTIO_MEM_USABLE_EXTENT not defined > +#endif > + > +static bool virtio_mem_discard_inhibited(void) > +{ > + PostcopyState ps = postcopy_state_get(); > + > + /* Postcopy cannot deal with concurrent discards (yet), so it's special. */ > + return ps >= POSTCOPY_INCOMING_DISCARD && ps < POSTCOPY_INCOMING_END; > +} > + > +static bool virtio_mem_test_bitmap(VirtIOMEM *vmem, uint64_t start_gpa, > + uint64_t size, bool plug) > +{ > + uint64_t bit = (start_gpa - vmem->addr) / vmem->block_size; > + > + g_assert(QEMU_IS_ALIGNED(start_gpa, vmem->block_size)); > + g_assert(QEMU_IS_ALIGNED(size, vmem->block_size)); > + g_assert(vmem->bitmap); > + > + while (size) { > + g_assert((bit / BITS_PER_BYTE) <= vmem->bitmap_size); > + > + if (plug && !test_bit(bit, vmem->bitmap)) { > + return false; > + } else if (!plug && test_bit(bit, vmem->bitmap)) { > + return false; > + } > + size -= vmem->block_size; > + bit++; > + } > + return true; > +} > + > +static void virtio_mem_set_bitmap(VirtIOMEM *vmem, uint64_t start_gpa, > + uint64_t size, bool plug) > +{ > + const uint64_t bit = (start_gpa - vmem->addr) / vmem->block_size; > + const uint64_t nbits = size / vmem->block_size; > + > + g_assert(QEMU_IS_ALIGNED(start_gpa, vmem->block_size)); > + g_assert(QEMU_IS_ALIGNED(size, vmem->block_size)); > + g_assert(vmem->bitmap); This bit/nbits/alignment checking could be split out and shared between these two functions. 
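Purely to illustrate the suggestion above (a sketch, not part of the patch; the helper name is made up), the duplicated assertions and bit/nbits arithmetic could be factored out roughly like this:

    /* Sketch: validate a GPA range and translate it into a bitmap bit/nbits
     * pair, shared by virtio_mem_test_bitmap() and virtio_mem_set_bitmap(). */
    static void virtio_mem_bitmap_range(const VirtIOMEM *vmem, uint64_t start_gpa,
                                        uint64_t size, uint64_t *bit,
                                        uint64_t *nbits)
    {
        g_assert(QEMU_IS_ALIGNED(start_gpa, vmem->block_size));
        g_assert(QEMU_IS_ALIGNED(size, vmem->block_size));
        g_assert(vmem->bitmap);

        *bit = (start_gpa - vmem->addr) / vmem->block_size;
        *nbits = size / vmem->block_size;
    }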
> + if (plug) { > + bitmap_set(vmem->bitmap, bit, nbits); > + } else { > + bitmap_clear(vmem->bitmap, bit, nbits); > + } > +} > + > +static void virtio_mem_send_response(VirtIOMEM *vmem, VirtQueueElement *elem, > + struct virtio_mem_resp *resp) > +{ > + VirtIODevice *vdev = VIRTIO_DEVICE(vmem); > + VirtQueue *vq = vmem->vq; > + > + iov_from_buf(elem->in_sg, elem->in_num, 0, resp, sizeof(*resp)); > + > + virtqueue_push(vq, elem, sizeof(*resp)); > + virtio_notify(vdev, vq); > +} > + > +static void virtio_mem_send_response_simple(VirtIOMEM *vmem, > + VirtQueueElement *elem, > + uint16_t type) > +{ > + VirtIODevice *vdev = VIRTIO_DEVICE(vmem); > + struct virtio_mem_resp resp = {}; > + > + virtio_stw_p(vdev, &resp.type, type); > + virtio_mem_send_response(vmem, elem, &resp); > +} > + > +static void virtio_mem_bad_request(VirtIOMEM *vmem, const char *msg) > +{ > + virtio_error(VIRTIO_DEVICE(vmem), "virtio-mem protocol violation: %s", msg); > +} > + > +static bool virtio_mem_valid_range(VirtIOMEM *vmem, uint64_t gpa, uint64_t size) > +{ > + if (!QEMU_IS_ALIGNED(gpa, vmem->block_size)) { > + return false; > + } > + if (gpa + size < gpa || size == 0) { > + return false; > + } > + if (gpa < vmem->addr || gpa >= vmem->addr + vmem->usable_region_size) { > + return false; > + } > + if (gpa + size > vmem->addr + vmem->usable_region_size) { > + return false; > + } > + return true; > +} > + > +static int virtio_mem_set_block_state(VirtIOMEM *vmem, uint64_t start_gpa, > + uint64_t size, bool plug) > +{ > + const uint64_t offset = start_gpa - vmem->addr; > + int ret; > + > + if (!plug) { > + if (virtio_mem_discard_inhibited()) { > + return -EBUSY; > + } > + /* Note: Discarding should never fail at this point. */ > + ret = ram_block_discard_range(vmem->memdev->mr.ram_block, offset, size); > + if (ret) { error_report ? 
> + return -EBUSY; > + } > + } > + virtio_mem_set_bitmap(vmem, start_gpa, size, plug); > + return 0; > +} > + > +static int virtio_mem_state_change_request(VirtIOMEM *vmem, uint64_t gpa, > + uint16_t nb_blocks, bool plug) > +{ > + const uint64_t size = nb_blocks * vmem->block_size; > + int ret; > + > + if (!virtio_mem_valid_range(vmem, gpa, size)) { > + return VIRTIO_MEM_RESP_ERROR; > + } > + > + if (plug && (vmem->size + size > vmem->requested_size)) { > + return VIRTIO_MEM_RESP_NACK; > + } > + > + /* test if really all blocks are in the opposite state */ > + if (!virtio_mem_test_bitmap(vmem, gpa, size, !plug)) { > + return VIRTIO_MEM_RESP_ERROR; > + } > + > + ret = virtio_mem_set_block_state(vmem, gpa, size, plug); > + if (ret) { > + return VIRTIO_MEM_RESP_BUSY; > + } > + if (plug) { > + vmem->size += size; > + } else { > + vmem->size -= size; > + } > + return VIRTIO_MEM_RESP_ACK; > +} > + > +static void virtio_mem_plug_request(VirtIOMEM *vmem, VirtQueueElement *elem, > + struct virtio_mem_req *req) > +{ > + const uint64_t gpa = le64_to_cpu(req->u.plug.addr); > + const uint16_t nb_blocks = le16_to_cpu(req->u.plug.nb_blocks); > + uint16_t type; > + > + type = virtio_mem_state_change_request(vmem, gpa, nb_blocks, true); > + virtio_mem_send_response_simple(vmem, elem, type); > +} > + > +static void virtio_mem_unplug_request(VirtIOMEM *vmem, VirtQueueElement *elem, > + struct virtio_mem_req *req) > +{ > + const uint64_t gpa = le64_to_cpu(req->u.unplug.addr); > + const uint16_t nb_blocks = le16_to_cpu(req->u.unplug.nb_blocks); > + uint16_t type; > + > + type = virtio_mem_state_change_request(vmem, gpa, nb_blocks, false); > + virtio_mem_send_response_simple(vmem, elem, type); > +} > + > +static void virtio_mem_resize_usable_region(VirtIOMEM *vmem, > + uint64_t requested_size, > + bool can_shrink) > +{ > + uint64_t newsize = MIN(memory_region_size(&vmem->memdev->mr), > + requested_size + VIRTIO_MEM_USABLE_EXTENT); > + > + /* We must only grow while the guest is running. */ > + if (newsize < vmem->usable_region_size && !can_shrink) { > + return; > + } > + > + vmem->usable_region_size = newsize; > +} > + > +static int virtio_mem_unplug_all(VirtIOMEM *vmem) > +{ > + RAMBlock *rb = vmem->memdev->mr.ram_block; > + int ret; > + > + if (virtio_mem_discard_inhibited()) { > + return -EBUSY; > + } > + > + ret = ram_block_discard_range(rb, 0, qemu_ram_get_used_length(rb)); > + if (ret) { > + /* Note: Discarding should never fail at this point. */ error_report? 
> + return -EBUSY; > + } > + bitmap_clear(vmem->bitmap, 0, vmem->bitmap_size); > + vmem->size = 0; > + > + virtio_mem_resize_usable_region(vmem, vmem->requested_size, true); > + return 0; > +} > + > +static void virtio_mem_unplug_all_request(VirtIOMEM *vmem, > + VirtQueueElement *elem) > +{ > + > + if (virtio_mem_unplug_all(vmem)) { > + virtio_mem_send_response_simple(vmem, elem, VIRTIO_MEM_RESP_BUSY); > + } else { > + virtio_mem_send_response_simple(vmem, elem, VIRTIO_MEM_RESP_ACK); > + } > +} > + > +static void virtio_mem_state_request(VirtIOMEM *vmem, VirtQueueElement *elem, > + struct virtio_mem_req *req) > +{ > + const uint64_t gpa = le64_to_cpu(req->u.state.addr); > + const uint16_t nb_blocks = le16_to_cpu(req->u.state.nb_blocks); > + const uint64_t size = nb_blocks * vmem->block_size; > + VirtIODevice *vdev = VIRTIO_DEVICE(vmem); > + struct virtio_mem_resp resp = {}; > + > + if (!virtio_mem_valid_range(vmem, gpa, size)) { > + virtio_mem_send_response_simple(vmem, elem, VIRTIO_MEM_RESP_ERROR); > + return; > + } > + > + virtio_stw_p(vdev, &resp.type, VIRTIO_MEM_RESP_ACK); > + if (virtio_mem_test_bitmap(vmem, gpa, size, true)) { > + virtio_stw_p(vdev, &resp.u.state.state, VIRTIO_MEM_STATE_PLUGGED); > + } else if (virtio_mem_test_bitmap(vmem, gpa, size, false)) { > + virtio_stw_p(vdev, &resp.u.state.state, VIRTIO_MEM_STATE_UNPLUGGED); > + } else { > + virtio_stw_p(vdev, &resp.u.state.state, VIRTIO_MEM_STATE_MIXED); > + } > + virtio_mem_send_response(vmem, elem, &resp); > +} > + > +static void virtio_mem_handle_request(VirtIODevice *vdev, VirtQueue *vq) > +{ > + const int len = sizeof(struct virtio_mem_req); > + VirtIOMEM *vmem = VIRTIO_MEM(vdev); > + VirtQueueElement *elem; > + struct virtio_mem_req req; > + uint64_t type; > + > + while (true) { > + elem = virtqueue_pop(vq, sizeof(VirtQueueElement)); > + if (!elem) { > + return; > + } > + > + if (iov_to_buf(elem->out_sg, elem->out_num, 0, &req, len) < len) { > + virtio_mem_bad_request(vmem, "invalid request size"); Print the size. > + g_free(elem); > + return; > + } > + > + if (iov_size(elem->in_sg, elem->in_num) < > + sizeof(struct virtio_mem_resp)) { > + virtio_mem_bad_request(vmem, "not enough space for response"); > + g_free(elem); > + return; > + } > + > + type = le16_to_cpu(req.type); > + switch (type) { > + case VIRTIO_MEM_REQ_PLUG: > + virtio_mem_plug_request(vmem, elem, &req); > + break; > + case VIRTIO_MEM_REQ_UNPLUG: > + virtio_mem_unplug_request(vmem, elem, &req); > + break; > + case VIRTIO_MEM_REQ_UNPLUG_ALL: > + virtio_mem_unplug_all_request(vmem, elem); > + break; > + case VIRTIO_MEM_REQ_STATE: > + virtio_mem_state_request(vmem, elem, &req); > + break; > + default: > + virtio_mem_bad_request(vmem, "unknown request type"); Could include the type . 
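As a rough sketch of how both comments above could be addressed by calling virtio_error() directly with the offending values (illustrative only; the final wording is up to the author):

    /* Sketch: include the actual request size in the error message ... */
    if (iov_to_buf(elem->out_sg, elem->out_num, 0, &req, len) < len) {
        virtio_error(vdev, "virtio-mem protocol violation: invalid request size: %zu",
                     iov_size(elem->out_sg, elem->out_num));
        g_free(elem);
        return;
    }

    /* ... and the unknown type in the default branch of the switch. */
    default:
        virtio_error(vdev, "virtio-mem protocol violation: unknown request type: %"
                     PRIu64, type);
        g_free(elem);
        return;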
> + g_free(elem); > + return; > + } > + > + g_free(elem); > + } > +} > + > +static void virtio_mem_get_config(VirtIODevice *vdev, uint8_t *config_data) > +{ > + VirtIOMEM *vmem = VIRTIO_MEM(vdev); > + struct virtio_mem_config *config = (void *) config_data; > + > + config->block_size = cpu_to_le32(vmem->block_size); > + config->node_id = cpu_to_le16(vmem->node); > + config->requested_size = cpu_to_le64(vmem->requested_size); > + config->plugged_size = cpu_to_le64(vmem->size); > + config->addr = cpu_to_le64(vmem->addr); > + config->region_size = cpu_to_le64(memory_region_size(&vmem->memdev->mr)); > + config->usable_region_size = cpu_to_le64(vmem->usable_region_size); > +} > + > +static uint64_t virtio_mem_get_features(VirtIODevice *vdev, uint64_t features, > + Error **errp) > +{ > + MachineState *ms = MACHINE(qdev_get_machine()); > + > + if (ms->numa_state) { > +#if defined(CONFIG_ACPI) > + virtio_add_feature(&features, VIRTIO_MEM_F_ACPI_PXM); > +#endif > + } > + return features; > +} > + > +static void virtio_mem_system_reset(void *opaque) > +{ > + VirtIOMEM *vmem = VIRTIO_MEM(opaque); > + > + /* > + * During usual resets, we will unplug all memory and shrink the usable > + * region size. This is, however, not possible in all scenarios. Then, > + * the guest has to deal with this manually (VIRTIO_MEM_REQ_UNPLUG_ALL). > + */ > + virtio_mem_unplug_all(vmem); > +} > + > +static void virtio_mem_device_realize(DeviceState *dev, Error **errp) > +{ > + MachineState *ms = MACHINE(qdev_get_machine()); > + int nb_numa_nodes = ms->numa_state ? ms->numa_state->num_nodes : 0; > + VirtIODevice *vdev = VIRTIO_DEVICE(dev); > + VirtIOMEM *vmem = VIRTIO_MEM(dev); > + uint64_t page_size; > + RAMBlock *rb; > + int ret; > + > + if (!vmem->memdev) { > + error_setg(errp, "'%s' property must be set", VIRTIO_MEM_MEMDEV_PROP); > + return; > + } else if (host_memory_backend_is_mapped(vmem->memdev)) { > + char *path = object_get_canonical_path_component(OBJECT(vmem->memdev)); > + > + error_setg(errp, "can't use already busy memdev: %s", path); > + g_free(path); > + return; > + } > + > + if ((nb_numa_nodes && vmem->node >= nb_numa_nodes) || > + (!nb_numa_nodes && vmem->node)) { > + error_setg(errp, "Property '%s' has value '%" PRIu32 > + "', which exceeds the number of numa nodes: %d", > + VIRTIO_MEM_NODE_PROP, vmem->node, > + nb_numa_nodes ? 
nb_numa_nodes : 1); > + return; > + } > + > + if (enable_mlock) { > + error_setg(errp, "not compatible with mlock yet"); > + return; > + } > + > + if (!memory_region_is_ram(&vmem->memdev->mr) || > + memory_region_is_rom(&vmem->memdev->mr) || > + !vmem->memdev->mr.ram_block) { > + error_setg(errp, "unsupported memdev"); > + return; > + } > + > + rb = vmem->memdev->mr.ram_block; > + page_size = qemu_ram_pagesize(rb); > + > + if (vmem->block_size < page_size) { > + error_setg(errp, "'%s' has to be at least the page size (0x%" > + PRIx64 ")", VIRTIO_MEM_BLOCK_SIZE_PROP, page_size); > + return; > + } else if (!QEMU_IS_ALIGNED(vmem->requested_size, vmem->block_size)) { > + error_setg(errp, "'%s' has to be multiples of '%s' (0x%" PRIx32 > + ")", VIRTIO_MEM_REQUESTED_SIZE_PROP, > + VIRTIO_MEM_BLOCK_SIZE_PROP, vmem->block_size); > + return; > + } else if (!QEMU_IS_ALIGNED(memory_region_size(&vmem->memdev->mr), > + vmem->block_size)) { > + error_setg(errp, "'%s' backend size has to be multiples of '%s' (0x%" > + PRIx32 ")", VIRTIO_MEM_MEMDEV_PROP, > + VIRTIO_MEM_BLOCK_SIZE_PROP, vmem->block_size); > + return; > + } > + > + if (ram_block_discard_set_required(true)) { > + error_setg(errp, "Discarding RAM is marked broken."); > + return; > + } > + > + ret = ram_block_discard_range(rb, 0, qemu_ram_get_used_length(rb)); > + if (ret) { > + /* Note: Discarding should never fail at this point. */ > + error_setg_errno(errp, -ret, "Discarding RAM failed."); > + ram_block_discard_set_required(false); > + return; > + } > + > + virtio_mem_resize_usable_region(vmem, vmem->requested_size, true); > + > + vmem->bitmap_size = memory_region_size(&vmem->memdev->mr) / > + vmem->block_size; > + vmem->bitmap = bitmap_new(vmem->bitmap_size); > + > + virtio_init(vdev, TYPE_VIRTIO_MEM, VIRTIO_ID_MEM, > + sizeof(struct virtio_mem_config)); > + vmem->vq = virtio_add_queue(vdev, 128, virtio_mem_handle_request); > + > + host_memory_backend_set_mapped(vmem->memdev, true); > + vmstate_register_ram(&vmem->memdev->mr, DEVICE(vmem)); > + qemu_register_reset(virtio_mem_system_reset, vmem); > + return; > +} > + > +static void virtio_mem_device_unrealize(DeviceState *dev, Error **errp) > +{ > + VirtIODevice *vdev = VIRTIO_DEVICE(dev); > + VirtIOMEM *vmem = VIRTIO_MEM(dev); > + > + qemu_unregister_reset(virtio_mem_system_reset, vmem); > + vmstate_unregister_ram(&vmem->memdev->mr, DEVICE(vmem)); > + host_memory_backend_set_mapped(vmem->memdev, false); > + virtio_del_queue(vdev, 0); > + virtio_cleanup(vdev); > + g_free(vmem->bitmap); > + ramblock_discard_set_required(false); > +} > + > +static int virtio_mem_pre_save(void *opaque) > +{ > + VirtIOMEM *vmem = VIRTIO_MEM(opaque); > + > + vmem->migration_addr = vmem->addr; > + vmem->migration_block_size = vmem->block_size; You might look at VMSTATE_WITH_TMP could avoid you having the dummy fields. > + return 0; > +} > + > +static int virtio_mem_restore_unplugged(VirtIOMEM *vmem) > +{ > + unsigned long bit; > + uint64_t offset; > + int ret; > + > + /* TODO: Better postcopy handling - defer to postcopy end. */ > + if (virtio_mem_discard_inhibited()) { > + return 0; > + } > + > + bit = find_first_zero_bit(vmem->bitmap, vmem->bitmap_size); > + while (bit < vmem->bitmap_size) { > + offset = bit * vmem->block_size; > + > + if (offset + vmem->block_size >= > + memory_region_size(&vmem->memdev->mr)) { > + break; > + } > + /* Note: Discarding should never fail at this point. 
*/ > + ret = ram_block_discard_range(vmem->memdev->mr.ram_block, offset, > + vmem->block_size); > + if (ret) { > + return -EINVAL; > + } > + bit = find_next_zero_bit(vmem->bitmap, vmem->bitmap_size, bit + 1); > + } > + return 0; > +} > + > +static int virtio_mem_post_load(void *opaque, int version_id) > +{ > + VirtIOMEM *vmem = VIRTIO_MEM(opaque); > + > + if (vmem->migration_block_size != vmem->block_size) { > + error_report("'%s' doesn't match", VIRTIO_MEM_BLOCK_SIZE_PROP); > + return -EINVAL; > + } > + if (vmem->migration_addr != vmem->addr) { > + error_report("'%s' doesn't match", VIRTIO_MEM_ADDR_PROP); > + return -EINVAL; > + } > + return virtio_mem_restore_unplugged(vmem); > +} > + > +static const VMStateDescription vmstate_virtio_mem_device = { > + .name = "virtio-mem-device", > + .minimum_version_id = 1, > + .version_id = 1, > + .pre_save = virtio_mem_pre_save, > + .post_load = virtio_mem_post_load, > + .fields = (VMStateField[]) { > + VMSTATE_UINT64(usable_region_size, VirtIOMEM), > + VMSTATE_UINT64(size, VirtIOMEM), > + VMSTATE_UINT64(requested_size, VirtIOMEM), > + VMSTATE_UINT64(migration_addr, VirtIOMEM), > + VMSTATE_UINT32(migration_block_size, VirtIOMEM), > + VMSTATE_BITMAP(bitmap, VirtIOMEM, 0, bitmap_size), > + VMSTATE_END_OF_LIST() > + }, > +}; > + > +static const VMStateDescription vmstate_virtio_mem = { > + .name = "virtio-mem", > + .minimum_version_id = 1, > + .version_id = 1, > + .fields = (VMStateField[]) { > + VMSTATE_VIRTIO_DEVICE, > + VMSTATE_END_OF_LIST() > + }, > +}; > + > +static void virtio_mem_fill_device_info(const VirtIOMEM *vmem, > + VirtioMEMDeviceInfo *vi) > +{ > + vi->memaddr = vmem->addr; > + vi->node = vmem->node; > + vi->requested_size = vmem->requested_size; > + vi->size = vmem->size; > + vi->max_size = memory_region_size(&vmem->memdev->mr); > + vi->block_size = vmem->block_size; > + vi->memdev = object_get_canonical_path(OBJECT(vmem->memdev)); > +} > + > +static MemoryRegion *virtio_mem_get_memory_region(VirtIOMEM *vmem, Error **errp) > +{ > + if (!vmem->memdev) { > + error_setg(errp, "'%s' property must be set", VIRTIO_MEM_MEMDEV_PROP); > + return NULL; > + } > + > + return &vmem->memdev->mr; > +} > + > +static void virtio_mem_get_size(Object *obj, Visitor *v, const char *name, > + void *opaque, Error **errp) > +{ > + const VirtIOMEM *vmem = VIRTIO_MEM(obj); > + uint64_t value = vmem->size; > + > + visit_type_size(v, name, &value, errp); > +} > + > +static void virtio_mem_get_requested_size(Object *obj, Visitor *v, > + const char *name, void *opaque, > + Error **errp) > +{ > + const VirtIOMEM *vmem = VIRTIO_MEM(obj); > + uint64_t value = vmem->requested_size; > + > + visit_type_size(v, name, &value, errp); > +} > + > +static void virtio_mem_set_requested_size(Object *obj, Visitor *v, > + const char *name, void *opaque, > + Error **errp) > +{ > + VirtIOMEM *vmem = VIRTIO_MEM(obj); > + Error *err = NULL; > + uint64_t value; > + > + visit_type_size(v, name, &value, &err); > + if (err) { > + error_propagate(errp, err); > + return; > + } > + > + /* > + * The block size and memory backend are not fixed until the device was > + * realized. realize() will verify these properties then. 
> + */ > + if (DEVICE(obj)->realized) { > + if (!QEMU_IS_ALIGNED(value, vmem->block_size)) { > + error_setg(errp, "'%s' has to be multiples of '%s' (0x%" PRIx32 > + ")", name, VIRTIO_MEM_BLOCK_SIZE_PROP, > + vmem->block_size); > + return; > + } else if (value > memory_region_size(&vmem->memdev->mr)) { > + error_setg(errp, "'%s' cannot exceed the memory backend size" > + "(0x%" PRIx64 ")", name, > + memory_region_size(&vmem->memdev->mr)); > + return; > + } > + > + if (value != vmem->requested_size) { > + virtio_mem_resize_usable_region(vmem, value, false); > + vmem->requested_size = value; > + } > + /* > + * Trigger a config update so the guest gets notified. We trigger > + * even if the size didn't change (especially helpful for debugging). > + */ > + virtio_notify_config(VIRTIO_DEVICE(vmem)); > + } else { > + vmem->requested_size = value; > + } > +} > + > +static void virtio_mem_get_block_size(Object *obj, Visitor *v, const char *name, > + void *opaque, Error **errp) > +{ > + const VirtIOMEM *vmem = VIRTIO_MEM(obj); > + uint64_t value = vmem->block_size; > + > + visit_type_size(v, name, &value, errp); > +} > + > +static void virtio_mem_set_block_size(Object *obj, Visitor *v, const char *name, > + void *opaque, Error **errp) > +{ > + VirtIOMEM *vmem = VIRTIO_MEM(obj); > + Error *err = NULL; > + uint64_t value; > + > + if (DEVICE(obj)->realized) { > + error_setg(errp, "'%s' cannot be changed", name); > + return; > + } > + > + visit_type_size(v, name, &value, &err); > + if (err) { > + error_propagate(errp, err); > + return; > + } > + > + if (value > UINT32_MAX) { > + error_setg(errp, "'%s' has to be smaller than 0x%" PRIx32, name, > + UINT32_MAX); > + return; > + } else if (value < VIRTIO_MEM_MIN_BLOCK_SIZE) { > + error_setg(errp, "'%s' has to be at least 0x%" PRIx32, name, > + VIRTIO_MEM_MIN_BLOCK_SIZE); > + return; > + } else if (!is_power_of_2(value)) { > + error_setg(errp, "'%s' has to be a power of two", name); > + return; > + } > + vmem->block_size = value; > +} > + > +static void virtio_mem_instance_init(Object *obj) > +{ > + VirtIOMEM *vmem = VIRTIO_MEM(obj); > + > + vmem->block_size = VIRTIO_MEM_MIN_BLOCK_SIZE; > + > + object_property_add(obj, VIRTIO_MEM_SIZE_PROP, "size", virtio_mem_get_size, > + NULL, NULL, NULL, &error_abort); > + object_property_add(obj, VIRTIO_MEM_REQUESTED_SIZE_PROP, "size", > + virtio_mem_get_requested_size, > + virtio_mem_set_requested_size, NULL, NULL, > + &error_abort); > + object_property_add(obj, VIRTIO_MEM_BLOCK_SIZE_PROP, "size", > + virtio_mem_get_block_size, virtio_mem_set_block_size, > + NULL, NULL, &error_abort); > +} > + > +static Property virtio_mem_properties[] = { > + DEFINE_PROP_UINT64(VIRTIO_MEM_ADDR_PROP, VirtIOMEM, addr, 0), > + DEFINE_PROP_UINT32(VIRTIO_MEM_NODE_PROP, VirtIOMEM, node, 0), > + DEFINE_PROP_LINK(VIRTIO_MEM_MEMDEV_PROP, VirtIOMEM, memdev, > + TYPE_MEMORY_BACKEND, HostMemoryBackend *), > + DEFINE_PROP_END_OF_LIST(), > +}; > + > +static void virtio_mem_class_init(ObjectClass *klass, void *data) > +{ > + DeviceClass *dc = DEVICE_CLASS(klass); > + VirtioDeviceClass *vdc = VIRTIO_DEVICE_CLASS(klass); > + VirtIOMEMClass *vmc = VIRTIO_MEM_CLASS(klass); > + > + device_class_set_props(dc, virtio_mem_properties); > + dc->vmsd = &vmstate_virtio_mem; > + > + set_bit(DEVICE_CATEGORY_MISC, dc->categories); > + vdc->realize = virtio_mem_device_realize; > + vdc->unrealize = virtio_mem_device_unrealize; > + vdc->get_config = virtio_mem_get_config; > + vdc->get_features = virtio_mem_get_features; > + vdc->vmsd = &vmstate_virtio_mem_device; > + 
> + vmc->fill_device_info = virtio_mem_fill_device_info; > + vmc->get_memory_region = virtio_mem_get_memory_region; > +} > + > +static const TypeInfo virtio_mem_info = { > + .name = TYPE_VIRTIO_MEM, > + .parent = TYPE_VIRTIO_DEVICE, > + .instance_size = sizeof(VirtIOMEM), > + .instance_init = virtio_mem_instance_init, > + .class_init = virtio_mem_class_init, > + .class_size = sizeof(VirtIOMEMClass), > +}; > + > +static void virtio_register_types(void) > +{ > + type_register_static(&virtio_mem_info); > +} > + > +type_init(virtio_register_types) > diff --git a/include/hw/virtio/virtio-mem.h b/include/hw/virtio/virtio-mem.h > new file mode 100644 > index 0000000000..27158cb611 > --- /dev/null > +++ b/include/hw/virtio/virtio-mem.h > @@ -0,0 +1,80 @@ > +/* > + * Virtio MEM device > + * > + * Copyright (C) 2020 Red Hat, Inc. > + * > + * Authors: > + * David Hildenbrand <david@redhat.com> > + * > + * This work is licensed under the terms of the GNU GPL, version 2. > + * See the COPYING file in the top-level directory. > + */ > + > +#ifndef HW_VIRTIO_MEM_H > +#define HW_VIRTIO_MEM_H > + > +#include "standard-headers/linux/virtio_mem.h" > +#include "hw/virtio/virtio.h" > +#include "qapi/qapi-types-misc.h" > +#include "sysemu/hostmem.h" > + > +#define TYPE_VIRTIO_MEM "virtio-mem" > + > +#define VIRTIO_MEM(obj) \ > + OBJECT_CHECK(VirtIOMEM, (obj), TYPE_VIRTIO_MEM) > +#define VIRTIO_MEM_CLASS(oc) \ > + OBJECT_CLASS_CHECK(VirtIOMEMClass, (oc), TYPE_VIRTIO_MEM) > +#define VIRTIO_MEM_GET_CLASS(obj) \ > + OBJECT_GET_CLASS(VirtIOMEMClass, (obj), TYPE_VIRTIO_MEM) > + > +#define VIRTIO_MEM_MEMDEV_PROP "memdev" > +#define VIRTIO_MEM_NODE_PROP "node" > +#define VIRTIO_MEM_SIZE_PROP "size" > +#define VIRTIO_MEM_REQUESTED_SIZE_PROP "requested-size" > +#define VIRTIO_MEM_BLOCK_SIZE_PROP "block-size" > +#define VIRTIO_MEM_ADDR_PROP "memaddr" > + > +typedef struct VirtIOMEM { > + VirtIODevice parent_obj; > + > + /* guest -> host request queue */ > + VirtQueue *vq; > + > + /* bitmap used to track unplugged memory */ > + int32_t bitmap_size; > + unsigned long *bitmap; > + > + /* assigned memory backend and memory region */ > + HostMemoryBackend *memdev; > + > + /* NUMA node */ > + uint32_t node; > + > + /* assigned address of the region in guest physical memory */ > + uint64_t addr; > + uint64_t migration_addr; > + > + /* usable region size (<= region_size) */ > + uint64_t usable_region_size; > + > + /* actual size (how much the guest plugged) */ > + uint64_t size; > + > + /* requested size */ > + uint64_t requested_size; > + > + /* block size and alignment */ > + uint32_t block_size; > + uint32_t migration_block_size; > +} VirtIOMEM; > + > +typedef struct VirtIOMEMClass { > + /* private */ > + VirtIODevice parent; > + > + /* public */ > + void (*fill_device_info)(const VirtIOMEM *vmen, VirtioMEMDeviceInfo *vi); > + MemoryRegion *(*get_memory_region)(VirtIOMEM *vmem, Error **errp); > +} VirtIOMEMClass; > + > +#endif > diff --git a/qapi/misc.json b/qapi/misc.json > index 99b90ac80b..feaeacec22 100644 > --- a/qapi/misc.json > +++ b/qapi/misc.json > @@ -1354,19 +1354,56 @@ > } > } > > +## > +# @VirtioMEMDeviceInfo: > +# > +# VirtioMEMDevice state information > +# > +# @id: device's ID > +# > +# @memaddr: physical address in memory, where device is mapped > +# > +# @requested-size: the user requested size of the device > +# > +# @size: the (current) size of memory that the device provides > +# > +# @max-size: the maximum size of memory that the device can provide > +# > +# @block-size: the block size of memory that the 
device provides > +# > +# @node: NUMA node number where device is assigned to > +# > +# @memdev: memory backend linked with the region > +# > +# Since: 5.1 > +## > +{ 'struct': 'VirtioMEMDeviceInfo', > + 'data': { '*id': 'str', > + 'memaddr': 'size', > + 'requested-size': 'size', > + 'size': 'size', > + 'max-size': 'size', > + 'block-size': 'size', > + 'node': 'int', > + 'memdev': 'str' > + } > +} > + > ## > # @MemoryDeviceInfo: > # > # Union containing information about a memory device > # > # nvdimm is included since 2.12. virtio-pmem is included since 4.1. > +# virtio-mem is included since 5.2. > # > # Since: 2.1 > ## > { 'union': 'MemoryDeviceInfo', > 'data': { 'dimm': 'PCDIMMDeviceInfo', > 'nvdimm': 'PCDIMMDeviceInfo', > - 'virtio-pmem': 'VirtioPMEMDeviceInfo' > + 'virtio-pmem': 'VirtioPMEMDeviceInfo', > + 'virtio-mem': 'VirtioMEMDeviceInfo' > } > } > > -- > 2.25.3 > -- Dr. David Alan Gilbert / dgilbert@redhat.com / Manchester, UK
On 15.05.20 17:37, Dr. David Alan Gilbert wrote: > I'm not sure if it's possible to split this up; it's a bit big. Functionality-wise, it's the bare minimum. I could split out handling of all 4 request types, but they are only ~150-200 LOC. Not sure if that is really worth the trouble. open for suggestions. > It could also do with a pile of trace_ entries to figure out what it's > doing. Good idea, will add that with a patch on top. [...] >> +static int virtio_mem_set_block_state(VirtIOMEM *vmem, uint64_t start_gpa, >> + uint64_t size, bool plug) >> +{ >> + const uint64_t offset = start_gpa - vmem->addr; >> + int ret; >> + >> + if (!plug) { >> + if (virtio_mem_discard_inhibited()) { >> + return -EBUSY; >> + } >> + /* Note: Discarding should never fail at this point. */ >> + ret = ram_block_discard_range(vmem->memdev->mr.ram_block, offset, size); >> + if (ret) { > > error_report ? error_report("Unexpected error discarding RAM: %s", strerror(-ret)); it is. [...] >> + ret = ram_block_discard_range(rb, 0, qemu_ram_get_used_length(rb)); >> + if (ret) { >> + /* Note: Discarding should never fail at this point. */ > > error_report? dito > >> + return -EBUSY; >> + } >> + bitmap_clear(vmem->bitmap, 0, vmem->bitmap_size); >> + vmem->size = 0; >> + >> + virtio_mem_resize_usable_region(vmem, vmem->requested_size, true); >> + return 0; >> +} [...] >> +static void virtio_mem_handle_request(VirtIODevice *vdev, VirtQueue *vq) >> +{ >> + const int len = sizeof(struct virtio_mem_req); >> + VirtIOMEM *vmem = VIRTIO_MEM(vdev); >> + VirtQueueElement *elem; >> + struct virtio_mem_req req; >> + uint64_t type; >> + >> + while (true) { >> + elem = virtqueue_pop(vq, sizeof(VirtQueueElement)); >> + if (!elem) { >> + return; >> + } >> + >> + if (iov_to_buf(elem->out_sg, elem->out_num, 0, &req, len) < len) { >> + virtio_mem_bad_request(vmem, "invalid request size"); > > Print the size. Make sense, I'll probably get rid of virtio_mem_bad_request() and just do the virtio_error() directly with additional paramaters. > >> + g_free(elem); >> + return; >> + } >> + >> + if (iov_size(elem->in_sg, elem->in_num) < >> + sizeof(struct virtio_mem_resp)) { >> + virtio_mem_bad_request(vmem, "not enough space for response"); >> + g_free(elem); >> + return; >> + } >> + >> + type = le16_to_cpu(req.type); >> + switch (type) { >> + case VIRTIO_MEM_REQ_PLUG: >> + virtio_mem_plug_request(vmem, elem, &req); >> + break; >> + case VIRTIO_MEM_REQ_UNPLUG: >> + virtio_mem_unplug_request(vmem, elem, &req); >> + break; >> + case VIRTIO_MEM_REQ_UNPLUG_ALL: >> + virtio_mem_unplug_all_request(vmem, elem); >> + break; >> + case VIRTIO_MEM_REQ_STATE: >> + virtio_mem_state_request(vmem, elem, &req); >> + break; >> + default: >> + virtio_mem_bad_request(vmem, "unknown request type"); > > Could include the type . Yes, will do! [...] >> + >> +static int virtio_mem_pre_save(void *opaque) >> +{ >> + VirtIOMEM *vmem = VIRTIO_MEM(opaque); >> + >> + vmem->migration_addr = vmem->addr; >> + vmem->migration_block_size = vmem->block_size; > > You might look at VMSTATE_WITH_TMP could avoid you having the dummy > fields. Thanks, will have a look.
>>> + >>> +static int virtio_mem_pre_save(void *opaque) >>> +{ >>> + VirtIOMEM *vmem = VIRTIO_MEM(opaque); >>> + >>> + vmem->migration_addr = vmem->addr; >>> + vmem->migration_block_size = vmem->block_size; >> >> You might look at VMSTATE_WITH_TMP could avoid you having the dummy >> fields. > > Thanks, will have a look. VMSTATE_WITH_TMP looks too complicated for this simple use case. I'll just drop these migration sanity checks for now. Thanks!
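For reference, the VMSTATE_WITH_TMP pattern mentioned above would look roughly like the following if it were pursued (hypothetical, untested adaptation; VirtIOMEMMigTmp and the vmsd name are made up for illustration):

    /* The temporary struct only exists during save/load; the migration core
     * fills in 'parent' before invoking the nested vmsd's callbacks. */
    typedef struct VirtIOMEMMigTmp {
        VirtIOMEM *parent;
        uint64_t addr;
        uint32_t block_size;
    } VirtIOMEMMigTmp;

    static int virtio_mem_mig_tmp_pre_save(void *opaque)
    {
        VirtIOMEMMigTmp *tmp = opaque;

        tmp->addr = tmp->parent->addr;
        tmp->block_size = tmp->parent->block_size;
        return 0;
    }

    static int virtio_mem_mig_tmp_post_load(void *opaque, int version_id)
    {
        VirtIOMEMMigTmp *tmp = opaque;

        if (tmp->addr != tmp->parent->addr ||
            tmp->block_size != tmp->parent->block_size) {
            error_report("virtio-mem: 'memaddr'/'block-size' don't match source");
            return -EINVAL;
        }
        return 0;
    }

    static const VMStateDescription vmstate_virtio_mem_sanity = {
        .name = "virtio-mem-device/sanity",
        .pre_save = virtio_mem_mig_tmp_pre_save,
        .post_load = virtio_mem_mig_tmp_post_load,
        .fields = (VMStateField[]) {
            VMSTATE_UINT64(addr, VirtIOMEMMigTmp),
            VMSTATE_UINT32(block_size, VirtIOMEMMigTmp),
            VMSTATE_END_OF_LIST()
        },
    };

    /* ...and in vmstate_virtio_mem_device's field list, replacing the
     * migration_addr/migration_block_size dummy fields:
     *     VMSTATE_WITH_TMP(VirtIOMEM, VirtIOMEMMigTmp, vmstate_virtio_mem_sanity),
     */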
diff --git a/hw/virtio/Kconfig b/hw/virtio/Kconfig index 83122424fa..0eda25c4e1 100644 --- a/hw/virtio/Kconfig +++ b/hw/virtio/Kconfig @@ -47,3 +47,14 @@ config VIRTIO_PMEM depends on VIRTIO depends on VIRTIO_PMEM_SUPPORTED select MEM_DEVICE + +config VIRTIO_MEM_SUPPORTED + bool + +config VIRTIO_MEM + bool + default y + depends on VIRTIO + depends on LINUX + depends on VIRTIO_MEM_SUPPORTED + select MEM_DEVICE diff --git a/hw/virtio/Makefile.objs b/hw/virtio/Makefile.objs index 4e4d39a0a4..7df70e977e 100644 --- a/hw/virtio/Makefile.objs +++ b/hw/virtio/Makefile.objs @@ -18,6 +18,7 @@ common-obj-$(call land,$(CONFIG_VIRTIO_PMEM),$(CONFIG_VIRTIO_PCI)) += virtio-pme obj-$(call land,$(CONFIG_VHOST_USER_FS),$(CONFIG_VIRTIO_PCI)) += vhost-user-fs-pci.o obj-$(CONFIG_VIRTIO_IOMMU) += virtio-iommu.o obj-$(CONFIG_VHOST_VSOCK) += vhost-vsock.o +obj-$(CONFIG_VIRTIO_MEM) += virtio-mem.o ifeq ($(CONFIG_VIRTIO_PCI),y) obj-$(CONFIG_VHOST_VSOCK) += vhost-vsock-pci.o diff --git a/hw/virtio/virtio-mem.c b/hw/virtio/virtio-mem.c new file mode 100644 index 0000000000..e25b2c74f2 --- /dev/null +++ b/hw/virtio/virtio-mem.c @@ -0,0 +1,762 @@ +/* + * Virtio MEM device + * + * Copyright (C) 2020 Red Hat, Inc. + * + * Authors: + * David Hildenbrand <david@redhat.com> + * + * This work is licensed under the terms of the GNU GPL, version 2. + * See the COPYING file in the top-level directory. + */ + +#include "qemu/osdep.h" +#include "qemu-common.h" +#include "qemu/iov.h" +#include "qemu/cutils.h" +#include "qemu/error-report.h" +#include "qemu/units.h" +#include "sysemu/numa.h" +#include "sysemu/sysemu.h" +#include "sysemu/reset.h" +#include "hw/virtio/virtio.h" +#include "hw/virtio/virtio-bus.h" +#include "hw/virtio/virtio-access.h" +#include "hw/virtio/virtio-mem.h" +#include "qapi/error.h" +#include "qapi/visitor.h" +#include "exec/ram_addr.h" +#include "migration/misc.h" +#include "migration/postcopy-ram.h" +#include "hw/boards.h" +#include "hw/qdev-properties.h" +#include "config-devices.h" + +/* + * Use QEMU_VMALLOC_ALIGN, so no THP will have to be split when unplugging + * memory (e.g., 2MB on x86_64). + */ +#define VIRTIO_MEM_MIN_BLOCK_SIZE QEMU_VMALLOC_ALIGN +/* + * Size the usable region bigger than the requested size if possible. Esp. + * Linux guests will only add (aligned) memory blocks in case they fully + * fit into the usable region, but plug+online only a subset of the pages. + * The memory block size corresponds mostly to the section size. + * + * This allows e.g., to add 20MB with a section size of 128MB on x86_64, and + * a section size of 1GB on arm64 (as long as the start address is properly + * aligned, similar to ordinary DIMMs). + * + * We can change this at any time and maybe even make it configurable if + * necessary (as the section size can change). But it's more likely that the + * section size will rather get smaller and not bigger over time. + */ +#if defined(__x86_64__) +#define VIRTIO_MEM_USABLE_EXTENT (2 * (128 * MiB)) +#else +#error VIRTIO_MEM_USABLE_EXTENT not defined +#endif + +static bool virtio_mem_discard_inhibited(void) +{ + PostcopyState ps = postcopy_state_get(); + + /* Postcopy cannot deal with concurrent discards (yet), so it's special. 
*/ + return ps >= POSTCOPY_INCOMING_DISCARD && ps < POSTCOPY_INCOMING_END; +} + +static bool virtio_mem_test_bitmap(VirtIOMEM *vmem, uint64_t start_gpa, + uint64_t size, bool plug) +{ + uint64_t bit = (start_gpa - vmem->addr) / vmem->block_size; + + g_assert(QEMU_IS_ALIGNED(start_gpa, vmem->block_size)); + g_assert(QEMU_IS_ALIGNED(size, vmem->block_size)); + g_assert(vmem->bitmap); + + while (size) { + g_assert((bit / BITS_PER_BYTE) <= vmem->bitmap_size); + + if (plug && !test_bit(bit, vmem->bitmap)) { + return false; + } else if (!plug && test_bit(bit, vmem->bitmap)) { + return false; + } + size -= vmem->block_size; + bit++; + } + return true; +} + +static void virtio_mem_set_bitmap(VirtIOMEM *vmem, uint64_t start_gpa, + uint64_t size, bool plug) +{ + const uint64_t bit = (start_gpa - vmem->addr) / vmem->block_size; + const uint64_t nbits = size / vmem->block_size; + + g_assert(QEMU_IS_ALIGNED(start_gpa, vmem->block_size)); + g_assert(QEMU_IS_ALIGNED(size, vmem->block_size)); + g_assert(vmem->bitmap); + + if (plug) { + bitmap_set(vmem->bitmap, bit, nbits); + } else { + bitmap_clear(vmem->bitmap, bit, nbits); + } +} + +static void virtio_mem_send_response(VirtIOMEM *vmem, VirtQueueElement *elem, + struct virtio_mem_resp *resp) +{ + VirtIODevice *vdev = VIRTIO_DEVICE(vmem); + VirtQueue *vq = vmem->vq; + + iov_from_buf(elem->in_sg, elem->in_num, 0, resp, sizeof(*resp)); + + virtqueue_push(vq, elem, sizeof(*resp)); + virtio_notify(vdev, vq); +} + +static void virtio_mem_send_response_simple(VirtIOMEM *vmem, + VirtQueueElement *elem, + uint16_t type) +{ + VirtIODevice *vdev = VIRTIO_DEVICE(vmem); + struct virtio_mem_resp resp = {}; + + virtio_stw_p(vdev, &resp.type, type); + virtio_mem_send_response(vmem, elem, &resp); +} + +static void virtio_mem_bad_request(VirtIOMEM *vmem, const char *msg) +{ + virtio_error(VIRTIO_DEVICE(vmem), "virtio-mem protocol violation: %s", msg); +} + +static bool virtio_mem_valid_range(VirtIOMEM *vmem, uint64_t gpa, uint64_t size) +{ + if (!QEMU_IS_ALIGNED(gpa, vmem->block_size)) { + return false; + } + if (gpa + size < gpa || size == 0) { + return false; + } + if (gpa < vmem->addr || gpa >= vmem->addr + vmem->usable_region_size) { + return false; + } + if (gpa + size > vmem->addr + vmem->usable_region_size) { + return false; + } + return true; +} + +static int virtio_mem_set_block_state(VirtIOMEM *vmem, uint64_t start_gpa, + uint64_t size, bool plug) +{ + const uint64_t offset = start_gpa - vmem->addr; + int ret; + + if (!plug) { + if (virtio_mem_discard_inhibited()) { + return -EBUSY; + } + /* Note: Discarding should never fail at this point. 
*/ + ret = ram_block_discard_range(vmem->memdev->mr.ram_block, offset, size); + if (ret) { + return -EBUSY; + } + } + virtio_mem_set_bitmap(vmem, start_gpa, size, plug); + return 0; +} + +static int virtio_mem_state_change_request(VirtIOMEM *vmem, uint64_t gpa, + uint16_t nb_blocks, bool plug) +{ + const uint64_t size = nb_blocks * vmem->block_size; + int ret; + + if (!virtio_mem_valid_range(vmem, gpa, size)) { + return VIRTIO_MEM_RESP_ERROR; + } + + if (plug && (vmem->size + size > vmem->requested_size)) { + return VIRTIO_MEM_RESP_NACK; + } + + /* test if really all blocks are in the opposite state */ + if (!virtio_mem_test_bitmap(vmem, gpa, size, !plug)) { + return VIRTIO_MEM_RESP_ERROR; + } + + ret = virtio_mem_set_block_state(vmem, gpa, size, plug); + if (ret) { + return VIRTIO_MEM_RESP_BUSY; + } + if (plug) { + vmem->size += size; + } else { + vmem->size -= size; + } + return VIRTIO_MEM_RESP_ACK; +} + +static void virtio_mem_plug_request(VirtIOMEM *vmem, VirtQueueElement *elem, + struct virtio_mem_req *req) +{ + const uint64_t gpa = le64_to_cpu(req->u.plug.addr); + const uint16_t nb_blocks = le16_to_cpu(req->u.plug.nb_blocks); + uint16_t type; + + type = virtio_mem_state_change_request(vmem, gpa, nb_blocks, true); + virtio_mem_send_response_simple(vmem, elem, type); +} + +static void virtio_mem_unplug_request(VirtIOMEM *vmem, VirtQueueElement *elem, + struct virtio_mem_req *req) +{ + const uint64_t gpa = le64_to_cpu(req->u.unplug.addr); + const uint16_t nb_blocks = le16_to_cpu(req->u.unplug.nb_blocks); + uint16_t type; + + type = virtio_mem_state_change_request(vmem, gpa, nb_blocks, false); + virtio_mem_send_response_simple(vmem, elem, type); +} + +static void virtio_mem_resize_usable_region(VirtIOMEM *vmem, + uint64_t requested_size, + bool can_shrink) +{ + uint64_t newsize = MIN(memory_region_size(&vmem->memdev->mr), + requested_size + VIRTIO_MEM_USABLE_EXTENT); + + /* We must only grow while the guest is running. */ + if (newsize < vmem->usable_region_size && !can_shrink) { + return; + } + + vmem->usable_region_size = newsize; +} + +static int virtio_mem_unplug_all(VirtIOMEM *vmem) +{ + RAMBlock *rb = vmem->memdev->mr.ram_block; + int ret; + + if (virtio_mem_discard_inhibited()) { + return -EBUSY; + } + + ret = ram_block_discard_range(rb, 0, qemu_ram_get_used_length(rb)); + if (ret) { + /* Note: Discarding should never fail at this point. 
*/ + return -EBUSY; + } + bitmap_clear(vmem->bitmap, 0, vmem->bitmap_size); + vmem->size = 0; + + virtio_mem_resize_usable_region(vmem, vmem->requested_size, true); + return 0; +} + +static void virtio_mem_unplug_all_request(VirtIOMEM *vmem, + VirtQueueElement *elem) +{ + + if (virtio_mem_unplug_all(vmem)) { + virtio_mem_send_response_simple(vmem, elem, VIRTIO_MEM_RESP_BUSY); + } else { + virtio_mem_send_response_simple(vmem, elem, VIRTIO_MEM_RESP_ACK); + } +} + +static void virtio_mem_state_request(VirtIOMEM *vmem, VirtQueueElement *elem, + struct virtio_mem_req *req) +{ + const uint64_t gpa = le64_to_cpu(req->u.state.addr); + const uint16_t nb_blocks = le16_to_cpu(req->u.state.nb_blocks); + const uint64_t size = nb_blocks * vmem->block_size; + VirtIODevice *vdev = VIRTIO_DEVICE(vmem); + struct virtio_mem_resp resp = {}; + + if (!virtio_mem_valid_range(vmem, gpa, size)) { + virtio_mem_send_response_simple(vmem, elem, VIRTIO_MEM_RESP_ERROR); + return; + } + + virtio_stw_p(vdev, &resp.type, VIRTIO_MEM_RESP_ACK); + if (virtio_mem_test_bitmap(vmem, gpa, size, true)) { + virtio_stw_p(vdev, &resp.u.state.state, VIRTIO_MEM_STATE_PLUGGED); + } else if (virtio_mem_test_bitmap(vmem, gpa, size, false)) { + virtio_stw_p(vdev, &resp.u.state.state, VIRTIO_MEM_STATE_UNPLUGGED); + } else { + virtio_stw_p(vdev, &resp.u.state.state, VIRTIO_MEM_STATE_MIXED); + } + virtio_mem_send_response(vmem, elem, &resp); +} + +static void virtio_mem_handle_request(VirtIODevice *vdev, VirtQueue *vq) +{ + const int len = sizeof(struct virtio_mem_req); + VirtIOMEM *vmem = VIRTIO_MEM(vdev); + VirtQueueElement *elem; + struct virtio_mem_req req; + uint64_t type; + + while (true) { + elem = virtqueue_pop(vq, sizeof(VirtQueueElement)); + if (!elem) { + return; + } + + if (iov_to_buf(elem->out_sg, elem->out_num, 0, &req, len) < len) { + virtio_mem_bad_request(vmem, "invalid request size"); + g_free(elem); + return; + } + + if (iov_size(elem->in_sg, elem->in_num) < + sizeof(struct virtio_mem_resp)) { + virtio_mem_bad_request(vmem, "not enough space for response"); + g_free(elem); + return; + } + + type = le16_to_cpu(req.type); + switch (type) { + case VIRTIO_MEM_REQ_PLUG: + virtio_mem_plug_request(vmem, elem, &req); + break; + case VIRTIO_MEM_REQ_UNPLUG: + virtio_mem_unplug_request(vmem, elem, &req); + break; + case VIRTIO_MEM_REQ_UNPLUG_ALL: + virtio_mem_unplug_all_request(vmem, elem); + break; + case VIRTIO_MEM_REQ_STATE: + virtio_mem_state_request(vmem, elem, &req); + break; + default: + virtio_mem_bad_request(vmem, "unknown request type"); + g_free(elem); + return; + } + + g_free(elem); + } +} + +static void virtio_mem_get_config(VirtIODevice *vdev, uint8_t *config_data) +{ + VirtIOMEM *vmem = VIRTIO_MEM(vdev); + struct virtio_mem_config *config = (void *) config_data; + + config->block_size = cpu_to_le32(vmem->block_size); + config->node_id = cpu_to_le16(vmem->node); + config->requested_size = cpu_to_le64(vmem->requested_size); + config->plugged_size = cpu_to_le64(vmem->size); + config->addr = cpu_to_le64(vmem->addr); + config->region_size = cpu_to_le64(memory_region_size(&vmem->memdev->mr)); + config->usable_region_size = cpu_to_le64(vmem->usable_region_size); +} + +static uint64_t virtio_mem_get_features(VirtIODevice *vdev, uint64_t features, + Error **errp) +{ + MachineState *ms = MACHINE(qdev_get_machine()); + + if (ms->numa_state) { +#if defined(CONFIG_ACPI) + virtio_add_feature(&features, VIRTIO_MEM_F_ACPI_PXM); +#endif + } + return features; +} + +static void virtio_mem_system_reset(void *opaque) +{ + VirtIOMEM *vmem 
= VIRTIO_MEM(opaque); + + /* + * During usual resets, we will unplug all memory and shrink the usable + * region size. This is, however, not possible in all scenarios. Then, + * the guest has to deal with this manually (VIRTIO_MEM_REQ_UNPLUG_ALL). + */ + virtio_mem_unplug_all(vmem); +} + +static void virtio_mem_device_realize(DeviceState *dev, Error **errp) +{ + MachineState *ms = MACHINE(qdev_get_machine()); + int nb_numa_nodes = ms->numa_state ? ms->numa_state->num_nodes : 0; + VirtIODevice *vdev = VIRTIO_DEVICE(dev); + VirtIOMEM *vmem = VIRTIO_MEM(dev); + uint64_t page_size; + RAMBlock *rb; + int ret; + + if (!vmem->memdev) { + error_setg(errp, "'%s' property must be set", VIRTIO_MEM_MEMDEV_PROP); + return; + } else if (host_memory_backend_is_mapped(vmem->memdev)) { + char *path = object_get_canonical_path_component(OBJECT(vmem->memdev)); + + error_setg(errp, "can't use already busy memdev: %s", path); + g_free(path); + return; + } + + if ((nb_numa_nodes && vmem->node >= nb_numa_nodes) || + (!nb_numa_nodes && vmem->node)) { + error_setg(errp, "Property '%s' has value '%" PRIu32 + "', which exceeds the number of numa nodes: %d", + VIRTIO_MEM_NODE_PROP, vmem->node, + nb_numa_nodes ? nb_numa_nodes : 1); + return; + } + + if (enable_mlock) { + error_setg(errp, "not compatible with mlock yet"); + return; + } + + if (!memory_region_is_ram(&vmem->memdev->mr) || + memory_region_is_rom(&vmem->memdev->mr) || + !vmem->memdev->mr.ram_block) { + error_setg(errp, "unsupported memdev"); + return; + } + + rb = vmem->memdev->mr.ram_block; + page_size = qemu_ram_pagesize(rb); + + if (vmem->block_size < page_size) { + error_setg(errp, "'%s' has to be at least the page size (0x%" + PRIx64 ")", VIRTIO_MEM_BLOCK_SIZE_PROP, page_size); + return; + } else if (!QEMU_IS_ALIGNED(vmem->requested_size, vmem->block_size)) { + error_setg(errp, "'%s' has to be multiples of '%s' (0x%" PRIx32 + ")", VIRTIO_MEM_REQUESTED_SIZE_PROP, + VIRTIO_MEM_BLOCK_SIZE_PROP, vmem->block_size); + return; + } else if (!QEMU_IS_ALIGNED(memory_region_size(&vmem->memdev->mr), + vmem->block_size)) { + error_setg(errp, "'%s' backend size has to be multiples of '%s' (0x%" + PRIx32 ")", VIRTIO_MEM_MEMDEV_PROP, + VIRTIO_MEM_BLOCK_SIZE_PROP, vmem->block_size); + return; + } + + if (ram_block_discard_set_required(true)) { + error_setg(errp, "Discarding RAM is marked broken."); + return; + } + + ret = ram_block_discard_range(rb, 0, qemu_ram_get_used_length(rb)); + if (ret) { + /* Note: Discarding should never fail at this point. 
*/ + error_setg_errno(errp, -ret, "Discarding RAM failed."); + ram_block_discard_set_required(false); + return; + } + + virtio_mem_resize_usable_region(vmem, vmem->requested_size, true); + + vmem->bitmap_size = memory_region_size(&vmem->memdev->mr) / + vmem->block_size; + vmem->bitmap = bitmap_new(vmem->bitmap_size); + + virtio_init(vdev, TYPE_VIRTIO_MEM, VIRTIO_ID_MEM, + sizeof(struct virtio_mem_config)); + vmem->vq = virtio_add_queue(vdev, 128, virtio_mem_handle_request); + + host_memory_backend_set_mapped(vmem->memdev, true); + vmstate_register_ram(&vmem->memdev->mr, DEVICE(vmem)); + qemu_register_reset(virtio_mem_system_reset, vmem); + return; +} + +static void virtio_mem_device_unrealize(DeviceState *dev, Error **errp) +{ + VirtIODevice *vdev = VIRTIO_DEVICE(dev); + VirtIOMEM *vmem = VIRTIO_MEM(dev); + + qemu_unregister_reset(virtio_mem_system_reset, vmem); + vmstate_unregister_ram(&vmem->memdev->mr, DEVICE(vmem)); + host_memory_backend_set_mapped(vmem->memdev, false); + virtio_del_queue(vdev, 0); + virtio_cleanup(vdev); + g_free(vmem->bitmap); + ramblock_discard_set_required(false); +} + +static int virtio_mem_pre_save(void *opaque) +{ + VirtIOMEM *vmem = VIRTIO_MEM(opaque); + + vmem->migration_addr = vmem->addr; + vmem->migration_block_size = vmem->block_size; + + return 0; +} + +static int virtio_mem_restore_unplugged(VirtIOMEM *vmem) +{ + unsigned long bit; + uint64_t offset; + int ret; + + /* TODO: Better postcopy handling - defer to postcopy end. */ + if (virtio_mem_discard_inhibited()) { + return 0; + } + + bit = find_first_zero_bit(vmem->bitmap, vmem->bitmap_size); + while (bit < vmem->bitmap_size) { + offset = bit * vmem->block_size; + + if (offset + vmem->block_size >= + memory_region_size(&vmem->memdev->mr)) { + break; + } + /* Note: Discarding should never fail at this point. 
*/ + ret = ram_block_discard_range(vmem->memdev->mr.ram_block, offset, + vmem->block_size); + if (ret) { + return -EINVAL; + } + bit = find_next_zero_bit(vmem->bitmap, vmem->bitmap_size, bit + 1); + } + return 0; +} + +static int virtio_mem_post_load(void *opaque, int version_id) +{ + VirtIOMEM *vmem = VIRTIO_MEM(opaque); + + if (vmem->migration_block_size != vmem->block_size) { + error_report("'%s' doesn't match", VIRTIO_MEM_BLOCK_SIZE_PROP); + return -EINVAL; + } + if (vmem->migration_addr != vmem->addr) { + error_report("'%s' doesn't match", VIRTIO_MEM_ADDR_PROP); + return -EINVAL; + } + return virtio_mem_restore_unplugged(vmem); +} + +static const VMStateDescription vmstate_virtio_mem_device = { + .name = "virtio-mem-device", + .minimum_version_id = 1, + .version_id = 1, + .pre_save = virtio_mem_pre_save, + .post_load = virtio_mem_post_load, + .fields = (VMStateField[]) { + VMSTATE_UINT64(usable_region_size, VirtIOMEM), + VMSTATE_UINT64(size, VirtIOMEM), + VMSTATE_UINT64(requested_size, VirtIOMEM), + VMSTATE_UINT64(migration_addr, VirtIOMEM), + VMSTATE_UINT32(migration_block_size, VirtIOMEM), + VMSTATE_BITMAP(bitmap, VirtIOMEM, 0, bitmap_size), + VMSTATE_END_OF_LIST() + }, +}; + +static const VMStateDescription vmstate_virtio_mem = { + .name = "virtio-mem", + .minimum_version_id = 1, + .version_id = 1, + .fields = (VMStateField[]) { + VMSTATE_VIRTIO_DEVICE, + VMSTATE_END_OF_LIST() + }, +}; + +static void virtio_mem_fill_device_info(const VirtIOMEM *vmem, + VirtioMEMDeviceInfo *vi) +{ + vi->memaddr = vmem->addr; + vi->node = vmem->node; + vi->requested_size = vmem->requested_size; + vi->size = vmem->size; + vi->max_size = memory_region_size(&vmem->memdev->mr); + vi->block_size = vmem->block_size; + vi->memdev = object_get_canonical_path(OBJECT(vmem->memdev)); +} + +static MemoryRegion *virtio_mem_get_memory_region(VirtIOMEM *vmem, Error **errp) +{ + if (!vmem->memdev) { + error_setg(errp, "'%s' property must be set", VIRTIO_MEM_MEMDEV_PROP); + return NULL; + } + + return &vmem->memdev->mr; +} + +static void virtio_mem_get_size(Object *obj, Visitor *v, const char *name, + void *opaque, Error **errp) +{ + const VirtIOMEM *vmem = VIRTIO_MEM(obj); + uint64_t value = vmem->size; + + visit_type_size(v, name, &value, errp); +} + +static void virtio_mem_get_requested_size(Object *obj, Visitor *v, + const char *name, void *opaque, + Error **errp) +{ + const VirtIOMEM *vmem = VIRTIO_MEM(obj); + uint64_t value = vmem->requested_size; + + visit_type_size(v, name, &value, errp); +} + +static void virtio_mem_set_requested_size(Object *obj, Visitor *v, + const char *name, void *opaque, + Error **errp) +{ + VirtIOMEM *vmem = VIRTIO_MEM(obj); + Error *err = NULL; + uint64_t value; + + visit_type_size(v, name, &value, &err); + if (err) { + error_propagate(errp, err); + return; + } + + /* + * The block size and memory backend are not fixed until the device was + * realized. realize() will verify these properties then. 
+ */ + if (DEVICE(obj)->realized) { + if (!QEMU_IS_ALIGNED(value, vmem->block_size)) { + error_setg(errp, "'%s' has to be a multiple of '%s' (0x%" PRIx32 + ")", name, VIRTIO_MEM_BLOCK_SIZE_PROP, + vmem->block_size); + return; + } else if (value > memory_region_size(&vmem->memdev->mr)) { + error_setg(errp, "'%s' cannot exceed the memory backend size " + "(0x%" PRIx64 ")", name, + memory_region_size(&vmem->memdev->mr)); + return; + } + + if (value != vmem->requested_size) { + virtio_mem_resize_usable_region(vmem, value, false); + vmem->requested_size = value; + } + /* + * Trigger a config update so the guest gets notified. We trigger + * even if the size didn't change (especially helpful for debugging). + */ + virtio_notify_config(VIRTIO_DEVICE(vmem)); + } else { + vmem->requested_size = value; + } +} + +static void virtio_mem_get_block_size(Object *obj, Visitor *v, const char *name, + void *opaque, Error **errp) +{ + const VirtIOMEM *vmem = VIRTIO_MEM(obj); + uint64_t value = vmem->block_size; + + visit_type_size(v, name, &value, errp); +} + +static void virtio_mem_set_block_size(Object *obj, Visitor *v, const char *name, + void *opaque, Error **errp) +{ + VirtIOMEM *vmem = VIRTIO_MEM(obj); + Error *err = NULL; + uint64_t value; + + if (DEVICE(obj)->realized) { + error_setg(errp, "'%s' cannot be changed", name); + return; + } + + visit_type_size(v, name, &value, &err); + if (err) { + error_propagate(errp, err); + return; + } + + if (value > UINT32_MAX) { + error_setg(errp, "'%s' has to be smaller than 0x%" PRIx32, name, + UINT32_MAX); + return; + } else if (value < VIRTIO_MEM_MIN_BLOCK_SIZE) { + error_setg(errp, "'%s' has to be at least 0x%" PRIx32, name, + VIRTIO_MEM_MIN_BLOCK_SIZE); + return; + } else if (!is_power_of_2(value)) { + error_setg(errp, "'%s' has to be a power of two", name); + return; + } + vmem->block_size = value; +} + +static void virtio_mem_instance_init(Object *obj) +{ + VirtIOMEM *vmem = VIRTIO_MEM(obj); + + vmem->block_size = VIRTIO_MEM_MIN_BLOCK_SIZE; + + object_property_add(obj, VIRTIO_MEM_SIZE_PROP, "size", virtio_mem_get_size, + NULL, NULL, NULL, &error_abort); + object_property_add(obj, VIRTIO_MEM_REQUESTED_SIZE_PROP, "size", + virtio_mem_get_requested_size, + virtio_mem_set_requested_size, NULL, NULL, + &error_abort); + object_property_add(obj, VIRTIO_MEM_BLOCK_SIZE_PROP, "size", + virtio_mem_get_block_size, virtio_mem_set_block_size, + NULL, NULL, &error_abort); +} + +static Property virtio_mem_properties[] = { + DEFINE_PROP_UINT64(VIRTIO_MEM_ADDR_PROP, VirtIOMEM, addr, 0), + DEFINE_PROP_UINT32(VIRTIO_MEM_NODE_PROP, VirtIOMEM, node, 0), + DEFINE_PROP_LINK(VIRTIO_MEM_MEMDEV_PROP, VirtIOMEM, memdev, + TYPE_MEMORY_BACKEND, HostMemoryBackend *), + DEFINE_PROP_END_OF_LIST(), +}; + +static void virtio_mem_class_init(ObjectClass *klass, void *data) +{ + DeviceClass *dc = DEVICE_CLASS(klass); + VirtioDeviceClass *vdc = VIRTIO_DEVICE_CLASS(klass); + VirtIOMEMClass *vmc = VIRTIO_MEM_CLASS(klass); + + device_class_set_props(dc, virtio_mem_properties); + dc->vmsd = &vmstate_virtio_mem; + + set_bit(DEVICE_CATEGORY_MISC, dc->categories); + vdc->realize = virtio_mem_device_realize; + vdc->unrealize = virtio_mem_device_unrealize; + vdc->get_config = virtio_mem_get_config; + vdc->get_features = virtio_mem_get_features; + vdc->vmsd = &vmstate_virtio_mem_device; + + vmc->fill_device_info = virtio_mem_fill_device_info; + vmc->get_memory_region = virtio_mem_get_memory_region; +} + +static const TypeInfo virtio_mem_info = { + .name = TYPE_VIRTIO_MEM, + .parent = TYPE_VIRTIO_DEVICE, + 
.instance_size = sizeof(VirtIOMEM), + .instance_init = virtio_mem_instance_init, + .class_init = virtio_mem_class_init, + .class_size = sizeof(VirtIOMEMClass), +}; + +static void virtio_register_types(void) +{ + type_register_static(&virtio_mem_info); +} + +type_init(virtio_register_types) diff --git a/include/hw/virtio/virtio-mem.h b/include/hw/virtio/virtio-mem.h new file mode 100644 index 0000000000..27158cb611 --- /dev/null +++ b/include/hw/virtio/virtio-mem.h @@ -0,0 +1,80 @@ +/* + * Virtio MEM device + * + * Copyright (C) 2020 Red Hat, Inc. + * + * Authors: + * David Hildenbrand <david@redhat.com> + * + * This work is licensed under the terms of the GNU GPL, version 2. + * See the COPYING file in the top-level directory. + */ + +#ifndef HW_VIRTIO_MEM_H +#define HW_VIRTIO_MEM_H + +#include "standard-headers/linux/virtio_mem.h" +#include "hw/virtio/virtio.h" +#include "qapi/qapi-types-misc.h" +#include "sysemu/hostmem.h" + +#define TYPE_VIRTIO_MEM "virtio-mem" + +#define VIRTIO_MEM(obj) \ + OBJECT_CHECK(VirtIOMEM, (obj), TYPE_VIRTIO_MEM) +#define VIRTIO_MEM_CLASS(oc) \ + OBJECT_CLASS_CHECK(VirtIOMEMClass, (oc), TYPE_VIRTIO_MEM) +#define VIRTIO_MEM_GET_CLASS(obj) \ + OBJECT_GET_CLASS(VirtIOMEMClass, (obj), TYPE_VIRTIO_MEM) + +#define VIRTIO_MEM_MEMDEV_PROP "memdev" +#define VIRTIO_MEM_NODE_PROP "node" +#define VIRTIO_MEM_SIZE_PROP "size" +#define VIRTIO_MEM_REQUESTED_SIZE_PROP "requested-size" +#define VIRTIO_MEM_BLOCK_SIZE_PROP "block-size" +#define VIRTIO_MEM_ADDR_PROP "memaddr" + +typedef struct VirtIOMEM { + VirtIODevice parent_obj; + + /* guest -> host request queue */ + VirtQueue *vq; + + /* bitmap used to track unplugged memory */ + int32_t bitmap_size; + unsigned long *bitmap; + + /* assigned memory backend and memory region */ + HostMemoryBackend *memdev; + + /* NUMA node */ + uint32_t node; + + /* assigned address of the region in guest physical memory */ + uint64_t addr; + uint64_t migration_addr; + + /* usable region size (<= region_size) */ + uint64_t usable_region_size; + + /* actual size (how much the guest plugged) */ + uint64_t size; + + /* requested size */ + uint64_t requested_size; + + /* block size and alignment */ + uint32_t block_size; + uint32_t migration_block_size; +} VirtIOMEM; + +typedef struct VirtIOMEMClass { + /* private */ + VirtIODevice parent; + + /* public */ + void (*fill_device_info)(const VirtIOMEM *vmen, VirtioMEMDeviceInfo *vi); + MemoryRegion *(*get_memory_region)(VirtIOMEM *vmem, Error **errp); +} VirtIOMEMClass; + +#endif diff --git a/qapi/misc.json b/qapi/misc.json index 99b90ac80b..feaeacec22 100644 --- a/qapi/misc.json +++ b/qapi/misc.json @@ -1354,19 +1354,56 @@ } } +## +# @VirtioMEMDeviceInfo: +# +# VirtioMEMDevice state information +# +# @id: device's ID +# +# @memaddr: physical address in memory, where device is mapped +# +# @requested-size: the user requested size of the device +# +# @size: the (current) size of memory that the device provides +# +# @max-size: the maximum size of memory that the device can provide +# +# @block-size: the block size of memory that the device provides +# +# @node: NUMA node number where device is assigned to +# +# @memdev: memory backend linked with the region +# +# Since: 5.1 +## +{ 'struct': 'VirtioMEMDeviceInfo', + 'data': { '*id': 'str', + 'memaddr': 'size', + 'requested-size': 'size', + 'size': 'size', + 'max-size': 'size', + 'block-size': 'size', + 'node': 'int', + 'memdev': 'str' + } +} + ## # @MemoryDeviceInfo: # # Union containing information about a memory device # # nvdimm is included since 
2.12. virtio-pmem is included since 4.1. +# virtio-mem is included since 5.2. # # Since: 2.1 ## { 'union': 'MemoryDeviceInfo', 'data': { 'dimm': 'PCDIMMDeviceInfo', 'nvdimm': 'PCDIMMDeviceInfo', - 'virtio-pmem': 'VirtioPMEMDeviceInfo' + 'virtio-pmem': 'VirtioPMEMDeviceInfo', + 'virtio-mem': 'VirtioMEMDeviceInfo' } }
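For illustration (not part of the patch): with the 'virtio-mem' branch added to MemoryDeviceInfo above, a QMP "query-memory-devices" reply for such a device should look roughly like the sketch below. The values are taken from the "info memory-devices" output in the commit message (0x140000000 == 5368709120); whether the optional "id" is present depends on whether the device was created with an id.

 -> { "execute": "query-memory-devices" }
 <- { "return": [
        { "type": "virtio-mem",
          "data": {
            "id": "vm0",
            "memaddr": 5368709120,
            "node": 0,
            "requested-size": 0,
            "size": 0,
            "max-size": 8589934592,
            "block-size": 2097152,
            "memdev": "/objects/mem0"
          }
        }
      ] }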
This is the very basic/initial version of virtio-mem. An introduction to
virtio-mem can be found in the Linux kernel driver [1]. While it can be
used in the current state for hotplug of a smaller amount of memory, it
will heavily benefit from resizeable memory regions in the future.

Each virtio-mem device manages a memory region (provided via a memory
backend). After requested by the hypervisor ("requested-size"), the
guest can try to plug/unplug blocks of memory within that region, in order
to reach the requested size. Initially, and after a reboot, all memory is
unplugged (except in special cases - reboot during postcopy).

The guest may only try to plug/unplug blocks of memory within the usable
region size. The usable region size is a little bigger than the
requested size, to give the device driver some flexibility. The usable
region size will only grow, except on reboots or when all memory is
requested to get unplugged. The guest can never plug more memory than
requested. Unplugged memory will get zapped/discarded, similar to in a
balloon device.

The block size is variable, however, it is always chosen in a way such that
THP splits are avoided (e.g., 2MB). The state of each block
(plugged/unplugged) is tracked in a bitmap.

As virtio-mem devices (e.g., virtio-mem-pci) will be memory devices, we now
expose "VirtioMEMDeviceInfo" via "query-memory-devices".

--------------------------------------------------------------------------

There are two important follow-up items that are in the works:
1. Resizeable memory regions: Use resizeable allocations/RAM blocks to
   grow/shrink along with the usable region size. This avoids creating
   initially very big VMAs, RAM blocks, and KVM slots.
2. Protection of unplugged memory: Make sure the guest cannot actually
   make use of unplugged memory.

Other follow-up items that are in the works:
1. Exclude unplugged memory during migration (via precopy notifier).
2. Handle remapping of memory.
3. Support for other architectures.

--------------------------------------------------------------------------

Example usage (virtio-mem-pci is introduced in follow-up patches):

Start QEMU with two virtio-mem devices (one per NUMA node):
 $ qemu-system-x86_64 -m 4G,maxmem=20G \
   -smp sockets=2,cores=2 \
   -numa node,nodeid=0,cpus=0-1 -numa node,nodeid=1,cpus=2-3 \
   [...]
   -object memory-backend-ram,id=mem0,size=8G \
   -device virtio-mem-pci,id=vm0,memdev=mem0,node=0,requested-size=0M \
   -object memory-backend-ram,id=mem1,size=8G \
   -device virtio-mem-pci,id=vm1,memdev=mem1,node=1,requested-size=1G

Query the configuration:
 (qemu) info memory-devices
 Memory device [virtio-mem]: "vm0"
   memaddr: 0x140000000
   node: 0
   requested-size: 0
   size: 0
   max-size: 8589934592
   block-size: 2097152
   memdev: /objects/mem0
 Memory device [virtio-mem]: "vm1"
   memaddr: 0x340000000
   node: 1
   requested-size: 1073741824
   size: 1073741824
   max-size: 8589934592
   block-size: 2097152
   memdev: /objects/mem1

Add some memory to node 0:
 (qemu) qom-set vm0 requested-size 500M

Remove some memory from node 1:
 (qemu) qom-set vm1 requested-size 200M

Query the configuration again:
 (qemu) info memory-devices
 Memory device [virtio-mem]: "vm0"
   memaddr: 0x140000000
   node: 0
   requested-size: 524288000
   size: 524288000
   max-size: 8589934592
   block-size: 2097152
   memdev: /objects/mem0
 Memory device [virtio-mem]: "vm1"
   memaddr: 0x340000000
   node: 1
   requested-size: 209715200
   size: 209715200
   max-size: 8589934592
   block-size: 2097152
   memdev: /objects/mem1

[1] https://lkml.kernel.org/r/20200311171422.10484-1-david@redhat.com

Cc: "Michael S. Tsirkin" <mst@redhat.com>
Cc: Eric Blake <eblake@redhat.com>
Cc: Markus Armbruster <armbru@redhat.com>
Cc: "Dr. David Alan Gilbert" <dgilbert@redhat.com>
Cc: Igor Mammedov <imammedo@redhat.com>
Signed-off-by: David Hildenbrand <david@redhat.com>
---
 hw/virtio/Kconfig              |  11 +
 hw/virtio/Makefile.objs        |   1 +
 hw/virtio/virtio-mem.c         | 762 +++++++++++++++++++++++++++++++++
 include/hw/virtio/virtio-mem.h |  80 ++++
 qapi/misc.json                 |  39 +-
 5 files changed, 892 insertions(+), 1 deletion(-)
 create mode 100644 hw/virtio/virtio-mem.c
 create mode 100644 include/hw/virtio/virtio-mem.h