From patchwork Fri Jan 19 13:44:34 2018 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Zhang Chen X-Patchwork-Id: 10175339 Return-Path: Received: from mail.wl.linuxfoundation.org (pdx-wl-mail.web.codeaurora.org [172.30.200.125]) by pdx-korg-patchwork.web.codeaurora.org (Postfix) with ESMTP id 7FEA060392 for ; Fri, 19 Jan 2018 13:51:08 +0000 (UTC) Received: from mail.wl.linuxfoundation.org (localhost [127.0.0.1]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 7125B28698 for ; Fri, 19 Jan 2018 13:51:08 +0000 (UTC) Received: by mail.wl.linuxfoundation.org (Postfix, from userid 486) id 6548E2869D; Fri, 19 Jan 2018 13:51:08 +0000 (UTC) X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on pdx-wl-mail.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-6.8 required=2.0 tests=BAYES_00, DKIM_ADSP_CUSTOM_MED, DKIM_SIGNED, FREEMAIL_FROM, RCVD_IN_DNSWL_HI, T_DKIM_INVALID autolearn=ham version=3.3.1 Received: from lists.gnu.org (lists.gnu.org [208.118.235.17]) (using TLSv1 with cipher AES256-SHA (256/256 bits)) (No client certificate requested) by mail.wl.linuxfoundation.org (Postfix) with ESMTPS id B4A4928698 for ; Fri, 19 Jan 2018 13:51:07 +0000 (UTC) Received: from localhost ([::1]:50360 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ecX4V-0007YX-1I for patchwork-qemu-devel@patchwork.kernel.org; Fri, 19 Jan 2018 08:51:07 -0500 Received: from eggs.gnu.org ([2001:4830:134:3::10]:46607) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ecWzx-0003gn-T5 for qemu-devel@nongnu.org; Fri, 19 Jan 2018 08:46:27 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ecWzw-00036n-H3 for qemu-devel@nongnu.org; Fri, 19 Jan 2018 08:46:25 -0500 Received: from mail-pf0-x244.google.com ([2607:f8b0:400e:c00::244]:38219) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1ecWzw-000342-8Q for qemu-devel@nongnu.org; Fri, 19 Jan 2018 08:46:24 -0500 Received: by mail-pf0-x244.google.com with SMTP id k19so1395640pfj.5 for ; Fri, 19 Jan 2018 05:46:23 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:cc:subject:date:message-id:in-reply-to:references; bh=AJoBEHA0u5SSBqFQSfh76BU8EwRpu3kZjAEqZLZZI/s=; b=RfnO0aK+Nzw4/c8V8bdQGbSwBphC0yVnPTxXMSlP1h696oNqzrqNdlIg9dGLY0Lz/S J60SCiuCebwkOAv7rBIBVDm7MxzD471C8Aqm84C+b8/9yaBC3+aPj2n3kIGj/N8szcEu 176OyyUkjyEH1woVvSlxybfwob5aJ6VbzyRTvqh/2TwFbGMS9Lu+mgyDds6pvpDfkC46 LonhWaOehLnlXw5BJNBT2bo9PfcZGRUboR5lAN9HUtaL3bh1pFCa/5muIzM9tpuH0KOe s1fdVL1JVLCpEkjddGHERfaFyJSCItO3ix8PPX4xNotw+Lnoxr1z+yhcEf61z+xsibRZ sBng== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references; bh=AJoBEHA0u5SSBqFQSfh76BU8EwRpu3kZjAEqZLZZI/s=; b=jxiFqB3Gb5UWJbX2xlnf6DTcO26+Jk71+Cv+ZzaswAav9WZNw9502aXOfTGWQBGzPc whkFlfXUIjyetNmkGBc77nCYn9wOSw/dat4+/jUMZoEMp2aABRR5cmzvWBGskPehLcSx AeAcVhpxeqLYYF7pQ5fiNFfbmKQ0fht9D5fHWcGt3hC5lBooSCRoU6w7gkJnBIj6p4bl aY4x+IcYQynuEcVcyTCfJTamvFK9GceKi+nHNHFHwhV261nT/E6H07JmHy13IlmxyJgi xXE/WzaCzCZy5Xw6moZbh3+oIWPSZT9h1m8QIQcRcdWyfZSGfB+ty5hLwW2Kv9FTA+aY NdAA== X-Gm-Message-State: AKwxyteABPORFfEKgBqVYNrNh5Uxxs2ojIHryTGazJPkoDwK6NPPcBA3 2MPzCfrssCtpbaNM7hqTkoYXFiaH X-Google-Smtp-Source: ACJfBos3nMXV/4dvDcfBqpSsE0YWc6rkw3/lV+jBQxtKm68eOXH2NoYrX7UwnbpqvY/WOUv0ZcKptQ== X-Received: by 2002:a17:902:20c8:: with SMTP id v8-v6mr1623026plg.226.1516369582672; Fri, 19 Jan 2018 05:46:22 -0800 (PST) Received: from localhost.localdomain (120.236.201.35.bc.googleusercontent.com. [35.201.236.120]) by smtp.gmail.com with ESMTPSA id r88sm14251865pfb.17.2018.01.19.05.46.19 (version=TLS1_2 cipher=ECDHE-RSA-AES128-SHA bits=128/128); Fri, 19 Jan 2018 05:46:21 -0800 (PST) From: Zhang Chen To: qemu-devel@nongnu.org Date: Fri, 19 Jan 2018 21:44:34 +0800 Message-Id: <1516369485-5374-6-git-send-email-zhangckid@gmail.com> X-Mailer: git-send-email 2.7.4 In-Reply-To: <1516369485-5374-1-git-send-email-zhangckid@gmail.com> References: <1516369485-5374-1-git-send-email-zhangckid@gmail.com> X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 2607:f8b0:400e:c00::244 Subject: [Qemu-devel] [PATCH V4 05/16] COLO: Add block replication into colo process X-BeenThere: qemu-devel@nongnu.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: zhanghailiang , Li Zhijian , Juan Quintela , Jason Wang , "Dr . David Alan Gilbert" , Markus Armbruster , Zhang Chen , Paolo Bonzini Errors-To: qemu-devel-bounces+patchwork-qemu-devel=patchwork.kernel.org@nongnu.org Sender: "Qemu-devel" X-Virus-Scanned: ClamAV using ClamSMTP Make sure master start block replication after slave's block replication started. Besides, we need to activate VM's blocks before goes into COLO state. Signed-off-by: zhanghailiang Signed-off-by: Li Zhijian Signed-off-by: Zhang Chen --- migration/colo.c | 46 ++++++++++++++++++++++++++++++++++++++++++++++ migration/migration.c | 9 +++++++++ 2 files changed, 55 insertions(+) diff --git a/migration/colo.c b/migration/colo.c index c513805..0e689df 100644 --- a/migration/colo.c +++ b/migration/colo.c @@ -26,6 +26,9 @@ #include "qmp-commands.h" #include "net/colo-compare.h" #include "net/colo.h" +#include "qapi-event.h" +#include "block/block.h" +#include "replication.h" static bool vmstate_loading; static Notifier packets_compare_notifier; @@ -55,6 +58,7 @@ static void secondary_vm_do_failover(void) { int old_state; MigrationIncomingState *mis = migration_incoming_get_current(); + Error *local_err = NULL; /* Can not do failover during the process of VM's loading VMstate, Or * it will break the secondary VM. @@ -72,6 +76,11 @@ static void secondary_vm_do_failover(void) migrate_set_state(&mis->state, MIGRATION_STATUS_COLO, MIGRATION_STATUS_COMPLETED); + replication_stop_all(true, &local_err); + if (local_err) { + error_report_err(local_err); + } + if (!autostart) { error_report("\"-S\" qemu option will be ignored in secondary side"); /* recover runstate to normal migration finish state */ @@ -109,6 +118,7 @@ static void primary_vm_do_failover(void) { MigrationState *s = migrate_get_current(); int old_state; + Error *local_err = NULL; migrate_set_state(&s->state, MIGRATION_STATUS_COLO, MIGRATION_STATUS_COMPLETED); @@ -132,6 +142,13 @@ static void primary_vm_do_failover(void) FailoverStatus_str(old_state)); return; } + + replication_stop_all(true, &local_err); + if (local_err) { + error_report_err(local_err); + local_err = NULL; + } + /* Notify COLO thread that failover work is finished */ qemu_sem_post(&s->colo_exit_sem); } @@ -355,6 +372,11 @@ static int colo_do_checkpoint_transaction(MigrationState *s, qemu_savevm_state_header(fb); qemu_savevm_state_setup(fb); qemu_mutex_lock_iothread(); + replication_do_checkpoint_all(&local_err); + if (local_err) { + qemu_mutex_unlock_iothread(); + goto out; + } qemu_savevm_state_complete_precopy(fb, false, false); qemu_mutex_unlock_iothread(); @@ -396,6 +418,7 @@ static int colo_do_checkpoint_transaction(MigrationState *s, ret = 0; qemu_mutex_lock_iothread(); + vm_start(); qemu_mutex_unlock_iothread(); trace_colo_vm_state_change("stop", "run"); @@ -445,6 +468,12 @@ static void colo_process_checkpoint(MigrationState *s) object_unref(OBJECT(bioc)); qemu_mutex_lock_iothread(); + replication_start_all(REPLICATION_MODE_PRIMARY, &local_err); + if (local_err) { + qemu_mutex_unlock_iothread(); + goto out; + } + vm_start(); qemu_mutex_unlock_iothread(); trace_colo_vm_state_change("stop", "run"); @@ -584,6 +613,11 @@ void *colo_process_incoming_thread(void *opaque) object_unref(OBJECT(bioc)); qemu_mutex_lock_iothread(); + replication_start_all(REPLICATION_MODE_SECONDARY, &local_err); + if (local_err) { + qemu_mutex_unlock_iothread(); + goto out; + } vm_start(); trace_colo_vm_state_change("stop", "run"); qemu_mutex_unlock_iothread(); @@ -664,6 +698,18 @@ void *colo_process_incoming_thread(void *opaque) goto out; } + replication_get_error_all(&local_err); + if (local_err) { + qemu_mutex_unlock_iothread(); + goto out; + } + /* discard colo disk buffer */ + replication_do_checkpoint_all(&local_err); + if (local_err) { + qemu_mutex_unlock_iothread(); + goto out; + } + vmstate_loading = false; vm_start(); trace_colo_vm_state_change("stop", "run"); diff --git a/migration/migration.c b/migration/migration.c index 5f8c2de..23b3cff 100644 --- a/migration/migration.c +++ b/migration/migration.c @@ -323,6 +323,7 @@ static void process_incoming_migration_co(void *opaque) MigrationIncomingState *mis = migration_incoming_get_current(); PostcopyState ps; int ret; + Error *local_err = NULL; assert(mis->from_src_file); mis->largest_page_size = qemu_ram_pagesize_largest(); @@ -354,6 +355,14 @@ static void process_incoming_migration_co(void *opaque) /* we get COLO info, and know if we are in COLO mode */ if (!ret && migration_incoming_enable_colo()) { + /* Make sure all file formats flush their mutable metadata */ + bdrv_invalidate_cache_all(&local_err); + if (local_err) { + migrate_set_state(&mis->state, MIGRATION_STATUS_ACTIVE, + MIGRATION_STATUS_FAILED); + error_report_err(local_err); + exit(EXIT_FAILURE); + } mis->migration_incoming_co = qemu_coroutine_self(); qemu_thread_create(&mis->colo_incoming_thread, "COLO incoming", colo_process_incoming_thread, mis, QEMU_THREAD_JOINABLE);