From patchwork Thu Jan 16 12:58:57 2020
X-Patchwork-Submitter: Jinpu Wang
X-Patchwork-Id: 11336743
From: Jack Wang
To: linux-block@vger.kernel.org, linux-rdma@vger.kernel.org
Cc: axboe@kernel.dk, hch@infradead.org, sagi@grimberg.me, bvanassche@acm.org,
    leon@kernel.org, dledford@redhat.com, jgg@ziepe.ca,
    danil.kipnis@cloud.ionos.com, jinpu.wang@cloud.ionos.com, rpenyaev@suse.de
Subject: [PATCH v7 07/25] RDMA/rtrs: client: statistics functions
Date: Thu, 16 Jan 2020 13:58:57 +0100
Message-Id: <20200116125915.14815-8-jinpuwang@gmail.com>
X-Mailer: git-send-email 2.17.1
In-Reply-To: <20200116125915.14815-1-jinpuwang@gmail.com>
References: <20200116125915.14815-1-jinpuwang@gmail.com>
X-Mailing-List: linux-block@vger.kernel.org

From: Jack Wang

This introduces a set of functions used on the client side to account
statistics: RDMA data sent/received, number of IOs in flight, latency,
CPU migrations, etc. Almost all statistics are collected using per-cpu
variables.

Signed-off-by: Danil Kipnis
Signed-off-by: Jack Wang
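
For readers less familiar with the per-cpu pattern this file leans on: the
hot path only ever touches counters private to the CPU it runs on, and only
the (rare) reader walks all CPUs to build a sum. Below is a minimal,
illustrative module sketching that pattern; the struct and function names are
invented for the example and are not part of this patch.

#include <linux/module.h>
#include <linux/percpu.h>
#include <linux/cpumask.h>
#include <linux/errno.h>

/* Hypothetical type, analogous in spirit to struct rtrs_clt_stats_pcpu. */
struct example_pcpu {
	u64 cnt;
};

static struct example_pcpu __percpu *example_stats;

/* Hot path: touch only the counter of the CPU we are running on. */
static void example_inc(void)
{
	this_cpu_inc(example_stats->cnt);
}

/* Slow path (e.g. a sysfs read): walk all possible CPUs and sum. */
static u64 example_sum(void)
{
	u64 sum = 0;
	int cpu;

	for_each_possible_cpu(cpu)
		sum += per_cpu_ptr(example_stats, cpu)->cnt;

	return sum;
}

static int __init example_init(void)
{
	example_stats = alloc_percpu(struct example_pcpu);
	if (!example_stats)
		return -ENOMEM;

	example_inc();
	pr_info("sum across CPUs: %llu\n", (unsigned long long)example_sum());
	return 0;
}

static void __exit example_exit(void)
{
	free_percpu(example_stats);
}

module_init(example_init);
module_exit(example_exit);
MODULE_LICENSE("GPL");

The trade-off is deliberate: no shared atomics or cache-line bouncing on the
I/O path, at the cost of slightly stale and more expensive reads, which is
acceptable for statistics.
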
---
 drivers/infiniband/ulp/rtrs/rtrs-clt-stats.c | 346 +++++++++++++++++++
 1 file changed, 346 insertions(+)
 create mode 100644 drivers/infiniband/ulp/rtrs/rtrs-clt-stats.c

diff --git a/drivers/infiniband/ulp/rtrs/rtrs-clt-stats.c b/drivers/infiniband/ulp/rtrs/rtrs-clt-stats.c
new file mode 100644
index 000000000000..c8a7f2dc9479
--- /dev/null
+++ b/drivers/infiniband/ulp/rtrs/rtrs-clt-stats.c
@@ -0,0 +1,346 @@
+// SPDX-License-Identifier: GPL-2.0-or-later
+/*
+ * RDMA Transport Layer
+ *
+ * Copyright (c) 2014 - 2018 ProfitBricks GmbH. All rights reserved.
+ *
+ * Copyright (c) 2018 - 2019 1&1 IONOS Cloud GmbH. All rights reserved.
+ *
+ * Copyright (c) 2019 - 2020 1&1 IONOS SE. All rights reserved.
+ */
+#undef pr_fmt
+#define pr_fmt(fmt) KBUILD_MODNAME " L" __stringify(__LINE__) ": " fmt
+
+#include "rtrs-clt.h"
+
+static inline int rtrs_clt_ms_to_bin(unsigned long ms)
+{
+	int bin = ms ? ilog2(ms) - MIN_LOG_LAT + 1 : 0;
+
+	return clamp(bin, 0, LOG_LAT_SZ - 1);
+}
+
+void rtrs_clt_update_rdma_lat(struct rtrs_clt_stats *stats, bool read,
+			      unsigned long ms)
+{
+	struct rtrs_clt_stats_pcpu *s;
+	int bin;
+
+	bin = rtrs_clt_ms_to_bin(ms);
+	s = this_cpu_ptr(stats->pcpu_stats);
+	if (read) {
+		s->rdma_lat_distr[bin].read++;
+		if (s->rdma_lat_max.read < ms)
+			s->rdma_lat_max.read = ms;
+	} else {
+		s->rdma_lat_distr[bin].write++;
+		if (s->rdma_lat_max.write < ms)
+			s->rdma_lat_max.write = ms;
+	}
+}
+
+void rtrs_clt_decrease_inflight(struct rtrs_clt_stats *stats)
+{
+	atomic_dec(&stats->inflight);
+}
+
+void rtrs_clt_update_wc_stats(struct rtrs_clt_con *con)
+{
+	struct rtrs_clt_sess *sess = to_clt_sess(con->c.sess);
+	struct rtrs_clt_stats *stats = &sess->stats;
+	struct rtrs_clt_stats_pcpu *s;
+	int cpu;
+
+	cpu = raw_smp_processor_id();
+	s = this_cpu_ptr(stats->pcpu_stats);
+	s->wc_comp.cnt++;
+	s->wc_comp.total_cnt++;
+	if (unlikely(con->cpu != cpu)) {
+		s->cpu_migr.to++;
+
+		/* Careful here, override s pointer */
+		s = per_cpu_ptr(stats->pcpu_stats, con->cpu);
+		atomic_inc(&s->cpu_migr.from);
+	}
+}
+
+void rtrs_clt_inc_failover_cnt(struct rtrs_clt_stats *stats)
+{
+	struct rtrs_clt_stats_pcpu *s;
+
+	s = this_cpu_ptr(stats->pcpu_stats);
+	s->rdma.failover_cnt++;
+}
+
+static inline u32 rtrs_clt_stats_get_avg_wc_cnt(struct rtrs_clt_stats *stats)
+{
+	u32 cnt = 0;
+	u64 sum = 0;
+	int cpu;
+
+	for_each_possible_cpu(cpu) {
+		struct rtrs_clt_stats_pcpu *s;
+
+		s = per_cpu_ptr(stats->pcpu_stats, cpu);
+		sum += s->wc_comp.total_cnt;
+		cnt += s->wc_comp.cnt;
+	}
+
+	return cnt ? sum / cnt : 0;
+}
+
+int rtrs_clt_stats_wc_completion_to_str(struct rtrs_clt_stats *stats,
+					char *buf, size_t len)
+{
+	return scnprintf(buf, len, "%u\n",
+			 rtrs_clt_stats_get_avg_wc_cnt(stats));
+}
+
+ssize_t rtrs_clt_stats_rdma_lat_distr_to_str(struct rtrs_clt_stats *stats,
+					     char *page, size_t len)
+{
+	struct rtrs_clt_stats_rdma_lat res[LOG_LAT_SZ];
+	struct rtrs_clt_stats_rdma_lat max;
+	struct rtrs_clt_stats_pcpu *s;
+
+	ssize_t cnt = 0;
+	int i, cpu;
+
+	max.write = 0;
+	max.read = 0;
+	for_each_possible_cpu(cpu) {
+		s = per_cpu_ptr(stats->pcpu_stats, cpu);
+
+		if (max.write < s->rdma_lat_max.write)
+			max.write = s->rdma_lat_max.write;
+		if (max.read < s->rdma_lat_max.read)
+			max.read = s->rdma_lat_max.read;
+	}
+	for (i = 0; i < ARRAY_SIZE(res); i++) {
+		res[i].write = 0;
+		res[i].read = 0;
+		for_each_possible_cpu(cpu) {
+			s = per_cpu_ptr(stats->pcpu_stats, cpu);
+
+			res[i].write += s->rdma_lat_distr[i].write;
+			res[i].read += s->rdma_lat_distr[i].read;
+		}
+	}
+
+	for (i = 0; i < ARRAY_SIZE(res) - 1; i++)
+		cnt += scnprintf(page + cnt, len - cnt,
+				 "< %6d ms: %llu %llu\n",
+				 1 << (i + MIN_LOG_LAT), res[i].read,
+				 res[i].write);
+	cnt += scnprintf(page + cnt, len - cnt, ">= %5d ms: %llu %llu\n",
+			 1 << (i - 1 + MIN_LOG_LAT), res[i].read,
+			 res[i].write);
+	cnt += scnprintf(page + cnt, len - cnt, " maximum ms: %llu %llu\n",
+			 max.read, max.write);
+
+	return cnt;
+}
+
+int rtrs_clt_stats_migration_cnt_to_str(struct rtrs_clt_stats *stats,
+					char *buf, size_t len)
+{
+	struct rtrs_clt_stats_pcpu *s;
+
+	size_t used;
+	int cpu;
+
+	used = scnprintf(buf, len, " ");
+	for_each_possible_cpu(cpu)
+		used += scnprintf(buf + used, len - used, " CPU%u", cpu);
+
+	used += scnprintf(buf + used, len - used, "\nfrom:");
+	for_each_possible_cpu(cpu) {
+		s = per_cpu_ptr(stats->pcpu_stats, cpu);
+		used += scnprintf(buf + used, len - used, " %d",
+				  atomic_read(&s->cpu_migr.from));
+	}
+
+	used += scnprintf(buf + used, len - used, "\nto :");
+	for_each_possible_cpu(cpu) {
+		s = per_cpu_ptr(stats->pcpu_stats, cpu);
+		used += scnprintf(buf + used, len - used, " %d",
+				  s->cpu_migr.to);
+	}
+	used += scnprintf(buf + used, len - used, "\n");
+
+	return used;
+}
+
+int rtrs_clt_stats_reconnects_to_str(struct rtrs_clt_stats *stats, char *buf,
+				     size_t len)
+{
+	return scnprintf(buf, len, "%d %d\n",
+			 stats->reconnects.successful_cnt,
+			 stats->reconnects.fail_cnt);
+}
+
+ssize_t rtrs_clt_stats_rdma_to_str(struct rtrs_clt_stats *stats,
+				   char *page, size_t len)
+{
+	struct rtrs_clt_stats_rdma sum;
+	struct rtrs_clt_stats_rdma *r;
+	int cpu;
+
+	memset(&sum, 0, sizeof(sum));
+
+	for_each_possible_cpu(cpu) {
+		r = &per_cpu_ptr(stats->pcpu_stats, cpu)->rdma;
+
+		sum.dir[READ].cnt += r->dir[READ].cnt;
+		sum.dir[READ].size_total += r->dir[READ].size_total;
+		sum.dir[WRITE].cnt += r->dir[WRITE].cnt;
+		sum.dir[WRITE].size_total += r->dir[WRITE].size_total;
+		sum.failover_cnt += r->failover_cnt;
+	}
+
+	return scnprintf(page, len, "%llu %llu %llu %llu %u %llu\n",
+			 sum.dir[READ].cnt, sum.dir[READ].size_total,
+			 sum.dir[WRITE].cnt, sum.dir[WRITE].size_total,
+			 atomic_read(&stats->inflight), sum.failover_cnt);
+}
+
+ssize_t rtrs_clt_reset_all_help(struct rtrs_clt_stats *s,
+				char *page, size_t len)
+{
+	return scnprintf(page, len, "echo 1 to reset all statistics\n");
+}
+
+int rtrs_clt_reset_rdma_stats(struct rtrs_clt_stats *stats, bool enable)
+{
+	struct rtrs_clt_stats_pcpu *s;
+	int cpu;
+
+	if (unlikely(!enable))
+		return -EINVAL;
+
+	for_each_possible_cpu(cpu) {
+		s = per_cpu_ptr(stats->pcpu_stats, cpu);
+		memset(&s->rdma, 0, sizeof(s->rdma));
+	}
+
+	return 0;
+}
+
+int rtrs_clt_reset_rdma_lat_distr_stats(struct rtrs_clt_stats *stats,
+					bool enable)
+{
+	struct rtrs_clt_stats_pcpu *s;
+	int cpu;
+
+	if (enable) {
+		for_each_possible_cpu(cpu) {
+			s = per_cpu_ptr(stats->pcpu_stats, cpu);
+			memset(&s->rdma_lat_max, 0, sizeof(s->rdma_lat_max));
+			memset(&s->rdma_lat_distr, 0,
+			       sizeof(s->rdma_lat_distr));
+		}
+	}
+	stats->enable_rdma_lat = enable;
+
+	return 0;
+}
+
+int rtrs_clt_reset_cpu_migr_stats(struct rtrs_clt_stats *stats, bool enable)
+{
+	struct rtrs_clt_stats_pcpu *s;
+	int cpu;
+
+	if (unlikely(!enable))
+		return -EINVAL;
+
+	for_each_possible_cpu(cpu) {
+		s = per_cpu_ptr(stats->pcpu_stats, cpu);
+		memset(&s->cpu_migr, 0, sizeof(s->cpu_migr));
+	}
+
+	return 0;
+}
+
+int rtrs_clt_reset_reconnects_stat(struct rtrs_clt_stats *stats, bool enable)
+{
+	if (unlikely(!enable))
+		return -EINVAL;
+
+	memset(&stats->reconnects, 0, sizeof(stats->reconnects));
+
+	return 0;
+}
+
+int rtrs_clt_reset_wc_comp_stats(struct rtrs_clt_stats *stats, bool enable)
+{
+	struct rtrs_clt_stats_pcpu *s;
+	int cpu;
+
+	if (unlikely(!enable))
+		return -EINVAL;
+
+	for_each_possible_cpu(cpu) {
+		s = per_cpu_ptr(stats->pcpu_stats, cpu);
+		memset(&s->wc_comp, 0, sizeof(s->wc_comp));
+	}
+
+	return 0;
+}
+
+int rtrs_clt_reset_all_stats(struct rtrs_clt_stats *s, bool enable)
+{
+	if (enable) {
+		rtrs_clt_reset_rdma_stats(s, enable);
+		rtrs_clt_reset_rdma_lat_distr_stats(s, enable);
+		rtrs_clt_reset_cpu_migr_stats(s, enable);
+		rtrs_clt_reset_reconnects_stat(s, enable);
+		rtrs_clt_reset_wc_comp_stats(s, enable);
+		atomic_set(&s->inflight, 0);
+		return 0;
+	}
+
+	return -EINVAL;
+}
+
+static inline void rtrs_clt_update_rdma_stats(struct rtrs_clt_stats *stats,
+					      size_t size, int d)
+{
+	struct rtrs_clt_stats_pcpu *s;
+
+	s = this_cpu_ptr(stats->pcpu_stats);
+	s->rdma.dir[d].cnt++;
+	s->rdma.dir[d].size_total += size;
+}
+
+void rtrs_clt_update_all_stats(struct rtrs_clt_io_req *req, int dir)
+{
+	struct rtrs_clt_con *con = req->con;
+	struct rtrs_clt_sess *sess = to_clt_sess(con->c.sess);
+	struct rtrs_clt_stats *stats = &sess->stats;
+	unsigned int len;
+
+	len = req->usr_len + req->data_len;
+	rtrs_clt_update_rdma_stats(stats, len, dir);
+	atomic_inc(&stats->inflight);
+}
+
+int rtrs_clt_init_stats(struct rtrs_clt_stats *stats)
+{
+	stats->enable_rdma_lat = false;
+	stats->pcpu_stats = alloc_percpu(typeof(*stats->pcpu_stats));
+	if (unlikely(!stats->pcpu_stats))
+		return -ENOMEM;
+
+	/*
+	 * successful_cnt will be set to 0 after session
+	 * is established for the first time
+	 */
+	stats->reconnects.successful_cnt = -1;
+
+	return 0;
+}
+
+void rtrs_clt_free_stats(struct rtrs_clt_stats *stats)
+{
+	free_percpu(stats->pcpu_stats);
+}
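
A closing note on the latency histogram: rtrs_clt_ms_to_bin() above places a
latency sample into a log2-sized bucket, and rtrs_clt_stats_rdma_lat_distr_to_str()
prints one "< N ms" row per bucket. The standalone program below replays that
binning in userspace. MIN_LOG_LAT and LOG_LAT_SZ live in rtrs-clt.h, which is
not part of this patch, so the values used here are assumptions for
illustration only, and ilog2_ul() is a stand-in for the kernel's ilog2().

/*
 * Illustration only: MIN_LOG_LAT and LOG_LAT_SZ are assumed values,
 * not the ones defined in rtrs-clt.h.
 */
#include <stdio.h>

#define MIN_LOG_LAT 0
#define LOG_LAT_SZ 16

/* Userspace stand-in for the kernel's ilog2(). */
static int ilog2_ul(unsigned long v)
{
	int l = 0;

	while (v >>= 1)
		l++;
	return l;
}

/* Mirrors rtrs_clt_ms_to_bin(): bin 0 for 0 ms, then log2-sized buckets. */
static int ms_to_bin(unsigned long ms)
{
	int bin = ms ? ilog2_ul(ms) - MIN_LOG_LAT + 1 : 0;

	if (bin < 0)
		bin = 0;
	if (bin > LOG_LAT_SZ - 1)
		bin = LOG_LAT_SZ - 1;
	return bin;
}

int main(void)
{
	unsigned long samples[] = { 0, 1, 3, 8, 100, 5000 };
	unsigned int i;

	for (i = 0; i < sizeof(samples) / sizeof(samples[0]); i++)
		printf("%5lu ms -> bin %d\n", samples[i], ms_to_bin(samples[i]));
	return 0;
}

With these assumed values, bin 0 collects 0 ms samples and bin i (i >= 1)
collects latencies in [2^(i-1), 2^i) ms, which lines up with the "< %6d ms"
rows produced by rtrs_clt_stats_rdma_lat_distr_to_str().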