diff mbox series

[1/1] migration: improve migration_throttle tracepoint

Message ID 20240905165146.293586-1-den@openvz.org (mailing list archive)
State New, archived
Headers show
Series [1/1] migration: improve migration_throttle tracepoint | expand

Commit Message

Zhijian Li (Fujitsu)" via Sept. 5, 2024, 4:51 p.m. UTC
Right now this tracepoint is just saying that the guest has been
throttled, but this is not that good for debugging purposes. We should
also know how much the guest is throttled in order to understand
consequences for the guest behaviour.

The patch moves the tracepoint from migration_trigger_throttle() to
mig_throttle_guest_down() where this information is really available.
This is not a problem as mig_throttle_guest_down() is called in the
only one place.

Signed-off-by: Denis V. Lunev <den@openvz.org>
CC: Peter Xu <peterx@redhat.com>
CC: Fabiano Rosas <farosas@suse.de>
---
 migration/ram.c        | 9 ++++++---
 migration/trace-events | 2 +-
 2 files changed, 7 insertions(+), 4 deletions(-)

Comments

Peter Xu Sept. 5, 2024, 5:52 p.m. UTC | #1
On Thu, Sep 05, 2024 at 06:51:46PM +0200, Denis V. Lunev via wrote:
> Right now this tracepoint is just saying that the guest has been
> throttled, but this is not that good for debugging purposes. We should
> also know how much the guest is throttled in order to understand
> consequences for the guest behaviour.
> 
> The patch moves the tracepoint from migration_trigger_throttle() to
> mig_throttle_guest_down() where this information is really available.
> This is not a problem as mig_throttle_guest_down() is called in the
> only one place.
> 
> Signed-off-by: Denis V. Lunev <den@openvz.org>
> CC: Peter Xu <peterx@redhat.com>
> CC: Fabiano Rosas <farosas@suse.de>

Makes sense to me, but maybe we can further move it to cpu_throttle_set()?
Denis V. Lunev Sept. 5, 2024, 6 p.m. UTC | #2
On 9/5/24 19:52, Peter Xu wrote:
> On Thu, Sep 05, 2024 at 06:51:46PM +0200, Denis V. Lunev via wrote:
>> Right now this tracepoint is just saying that the guest has been
>> throttled, but this is not that good for debugging purposes. We should
>> also know how much the guest is throttled in order to understand
>> consequences for the guest behaviour.
>>
>> The patch moves the tracepoint from migration_trigger_throttle() to
>> mig_throttle_guest_down() where this information is really available.
>> This is not a problem as mig_throttle_guest_down() is called in the
>> only one place.
>>
>> Signed-off-by: Denis V. Lunev <den@openvz.org>
>> CC: Peter Xu <peterx@redhat.com>
>> CC: Fabiano Rosas <farosas@suse.de>
> Makes sense to me, but maybe we can further move it to cpu_throttle_set()?
>
in that case we should rename the tracepoint as in that case the
module would be different :)

    4     90  system/cpu-throttle.c <<cpu_throttle_set>>
              void cpu_throttle_set(int new_throttle_pct)

Will it be OK for you?

Den
Peter Xu Sept. 5, 2024, 6:21 p.m. UTC | #3
On Thu, Sep 05, 2024 at 08:00:54PM +0200, Denis V. Lunev wrote:
> On 9/5/24 19:52, Peter Xu wrote:
> > On Thu, Sep 05, 2024 at 06:51:46PM +0200, Denis V. Lunev via wrote:
> > > Right now this tracepoint is just saying that the guest has been
> > > throttled, but this is not that good for debugging purposes. We should
> > > also know how much the guest is throttled in order to understand
> > > consequences for the guest behaviour.
> > > 
> > > The patch moves the tracepoint from migration_trigger_throttle() to
> > > mig_throttle_guest_down() where this information is really available.
> > > This is not a problem as mig_throttle_guest_down() is called in the
> > > only one place.
> > > 
> > > Signed-off-by: Denis V. Lunev <den@openvz.org>
> > > CC: Peter Xu <peterx@redhat.com>
> > > CC: Fabiano Rosas <farosas@suse.de>
> > Makes sense to me, but maybe we can further move it to cpu_throttle_set()?
> > 
> in that case we should rename the tracepoint as in that case the
> module would be different :)
> 
>    4     90  system/cpu-throttle.c <<cpu_throttle_set>>
>              void cpu_throttle_set(int new_throttle_pct)
> 
> Will it be OK for you?

Yes. Thanks.
diff mbox series

Patch

diff --git a/migration/ram.c b/migration/ram.c
index edec1a2d07..e587f11120 100644
--- a/migration/ram.c
+++ b/migration/ram.c
@@ -548,9 +548,11 @@  static void mig_throttle_guest_down(uint64_t bytes_dirty_period,
     uint64_t throttle_now = cpu_throttle_get_percentage();
     uint64_t cpu_now, cpu_ideal, throttle_inc;
 
+    int new_throttle_pct;
+
     /* We have not started throttling yet. Let's start it. */
     if (!cpu_throttle_active()) {
-        cpu_throttle_set(pct_initial);
+        new_throttle_pct = pct_initial;
     } else {
         /* Throttling already on, just increase the rate */
         if (!pct_tailslow) {
@@ -563,8 +565,10 @@  static void mig_throttle_guest_down(uint64_t bytes_dirty_period,
                         bytes_dirty_period);
             throttle_inc = MIN(cpu_now - cpu_ideal, pct_increment);
         }
-        cpu_throttle_set(MIN(throttle_now + throttle_inc, pct_max));
+        new_throttle_pct = MIN(throttle_now + throttle_inc, pct_max);
     }
+    trace_migration_throttle(new_throttle_pct);
+    cpu_throttle_set(new_throttle_pct);
 }
 
 void mig_throttle_counter_reset(void)
@@ -1032,7 +1036,6 @@  static void migration_trigger_throttle(RAMState *rs)
         (++rs->dirty_rate_high_cnt >= 2)) {
         rs->dirty_rate_high_cnt = 0;
         if (migrate_auto_converge()) {
-            trace_migration_throttle();
             mig_throttle_guest_down(bytes_dirty_period,
                                     bytes_dirty_threshold);
         } else if (migrate_dirty_limit()) {
diff --git a/migration/trace-events b/migration/trace-events
index 0b7c3324fb..099c828535 100644
--- a/migration/trace-events
+++ b/migration/trace-events
@@ -95,7 +95,7 @@  get_queued_page_not_dirty(const char *block_name, uint64_t tmp_offset, unsigned
 migration_bitmap_sync_start(void) ""
 migration_bitmap_sync_end(uint64_t dirty_pages) "dirty_pages %" PRIu64
 migration_bitmap_clear_dirty(char *str, uint64_t start, uint64_t size, unsigned long page) "rb %s start 0x%"PRIx64" size 0x%"PRIx64" page 0x%lx"
-migration_throttle(void) ""
+migration_throttle(int new_throttle_pct)  "guest CPU throttled by %d%%"
 migration_dirty_limit_guest(int64_t dirtyrate) "guest dirty page rate limit %" PRIi64 " MB/s"
 ram_discard_range(const char *rbname, uint64_t start, size_t len) "%s: start: %" PRIx64 " %zx"
 ram_load_loop(const char *rbname, uint64_t addr, int flags, void *host) "%s: addr: 0x%" PRIx64 " flags: 0x%x host: %p"