From patchwork Sat Jun 2 11:58:54 2018 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: =?utf-8?b?56a56Iif6ZSu?= X-Patchwork-Id: 10444785 Return-Path: Received: from mail.wl.linuxfoundation.org (pdx-wl-mail.web.codeaurora.org [172.30.200.125]) by pdx-korg-patchwork.web.codeaurora.org (Postfix) with ESMTP id 73916602BD for ; Sat, 2 Jun 2018 11:59:49 +0000 (UTC) Received: from mail.wl.linuxfoundation.org (localhost [127.0.0.1]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 64B9228A67 for ; Sat, 2 Jun 2018 11:59:49 +0000 (UTC) Received: by mail.wl.linuxfoundation.org (Postfix, from userid 486) id 582ED28A6A; Sat, 2 Jun 2018 11:59:49 +0000 (UTC) X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on pdx-wl-mail.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-3.0 required=2.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, FREEMAIL_FROM, MAILING_LIST_MULTI, RCVD_IN_DNSWL_NONE autolearn=ham version=3.3.1 Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 8668E28A67 for ; Sat, 2 Jun 2018 11:59:48 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 7A1426B0008; Sat, 2 Jun 2018 07:59:47 -0400 (EDT) Delivered-To: linux-mm-outgoing@kvack.org Received: by kanga.kvack.org (Postfix, from userid 40) id 750E66B000A; Sat, 2 Jun 2018 07:59:47 -0400 (EDT) X-Original-To: int-list-linux-mm@kvack.org X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 61ECF6B000C; Sat, 2 Jun 2018 07:59:47 -0400 (EDT) X-Original-To: linux-mm@kvack.org X-Delivered-To: linux-mm@kvack.org Received: from mail-pf0-f197.google.com (mail-pf0-f197.google.com [209.85.192.197]) by kanga.kvack.org (Postfix) with ESMTP id 1A16B6B0008 for ; Sat, 2 Jun 2018 07:59:47 -0400 (EDT) Received: by mail-pf0-f197.google.com with SMTP id j17-v6so611437pfi.21 for ; Sat, 02 Jun 2018 04:59:47 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:dkim-signature:from:to:cc:subject:date :message-id:in-reply-to:references; bh=B3mr7q7JlMWx6b65/X/ihdrj8dCcIGeQt/tFTgp6uag=; b=kXRjOyWsIIkQg5KPxZYp76mA7S3V+owxQ95e2qiwxcA7bBCQSU/FoxOUuJ93gnArX/ o7Qp9xd+Fl33DIk+bpy7EWBAx7lpsMuG9MjyEb25uGv4ZcIhkwr0z63tUBSYihEbbn0P w851dtRd+tn+ih8IX+FvTMwsuEkGVa8dVec7vxB787V6Qubz7OBGX5uqzoMkslLRP4ly yXqv7B66qxQQjv/u3QnHtJVSlT0KGvo0i5rqmugmOXSjrSutti1KHWYRrXHTROWkRAi7 lRDYHcmoxLXA5mNT9Ufr2QuCY02e1yw9tTpVGIkX68XNy9Swfd83gEq+SkrBGSWZz3NM uCfQ== X-Gm-Message-State: ALKqPwfhqbUP7e/aOFeNpW40dFOYGuzGAj221kWtZ/TSaCrwLDzCaIiG CLzJmA8e/Zr6OReVDAt2CDa9DV40PBvXeAKxnCXgMkPLYCyJFZeNv5O0TyJ1+moLBaJucEyruIJ NtEg8B+BfGgnpIho4AY8yLsp0PBKdo/z1mPnxnGtyx/a7maJPkvD8QfEWLjYy5mfvGLD/ojJnmj HZidOZU76bvpjyo+D4T+dixuE5gSWez3XGUO7jAEI8MNz4dueJiaOtg2ctQ31a3WnJtDmuPt6uQ bwVfylS860bwzi/njh5b16kY+qt+zLpH/2AvZthyCr1Sc/TG4c7lBoCuCII9pvTZPlBAPGxoHEK AXILK+Lbl6FTmX12Hw8hdH/bI8bERIUjyXoMgrTc8VqZ8p2FUwbnoxN0tb4G1EyIqFD59pYzmtT T X-Received: by 2002:a65:4dca:: with SMTP id q10-v6mr11805067pgt.22.1527940786775; Sat, 02 Jun 2018 04:59:46 -0700 (PDT) X-Received: by 2002:a65:4dca:: with SMTP id q10-v6mr11805029pgt.22.1527940785806; Sat, 02 Jun 2018 04:59:45 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1527940785; cv=none; d=google.com; s=arc-20160816; b=uA+q3NUPhT6HnQ3UEhnCmRjp+UhfyPYGNai1E/dWTaU4GEJp1W797ApHsASACkausy 9hsXMJdLxg9McvJpk5bGbEgRoXjVT0c7f0ddwY5JosvuhgrTDnFrl7v0cSXgc/NqNxUN +QO5qtbvTV08kpFNMaZsroDyaT728fmA99VAcff4gKsTliGU+sXXT3myLZSDqHjseT3E DhW/RZw/+zihC0AFLhwks0XYOvbIRI5Nu9FU7HyWOXwtCFzp/K6oTYirNGXJYQ2FMUCO 3monlTEh5j7siCYl+PRd4oYsthSRTsPu9KSsTB8xa4Va+FkLB3OK/uA+Sw3h576LIyFh urkg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=references:in-reply-to:message-id:date:subject:cc:to:from :dkim-signature:arc-authentication-results; bh=B3mr7q7JlMWx6b65/X/ihdrj8dCcIGeQt/tFTgp6uag=; b=DVxiXzedeVz8O5gE+OoNsp+xvJzORNkUJyDRmqerbbIn/Hb4a1j/xHgk9zmDDVq2RX QUjUK0/7Bwa108L6v57SBKaNeL7E+VsDCgkx4DpOLN2aIfz8pcD1ULOgKBQclDF75YHu 2QxK4huxOFR9noKerfaDVYb3Eb3qqbmebEHX7X9g/HfSdYG6uWrKzHoKK0ybbvcbMLOZ 9L/Nai/qtnBdzVnF0b5H4MpVP21WxlWraPd3SkfBRDAqYXMqFkWgtop0Ol06ausSnH1m Eo8sao7jHn9/wwAzFElC+lvRRzYygHulXyTurLaoHX1fjKc2JSWZOHz/N8Dq3wLnbkqw gnVA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=ba7QFvxO; spf=pass (google.com: domain of ufo19890607@gmail.com designates 209.85.220.65 as permitted sender) smtp.mailfrom=ufo19890607@gmail.com; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: from mail-sor-f65.google.com (mail-sor-f65.google.com. [209.85.220.65]) by mx.google.com with SMTPS id c129-v6sor13372313pfa.130.2018.06.02.04.59.45 for (Google Transport Security); Sat, 02 Jun 2018 04:59:45 -0700 (PDT) Received-SPF: pass (google.com: domain of ufo19890607@gmail.com designates 209.85.220.65 as permitted sender) client-ip=209.85.220.65; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=ba7QFvxO; spf=pass (google.com: domain of ufo19890607@gmail.com designates 209.85.220.65 as permitted sender) smtp.mailfrom=ufo19890607@gmail.com; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:cc:subject:date:message-id:in-reply-to:references; bh=B3mr7q7JlMWx6b65/X/ihdrj8dCcIGeQt/tFTgp6uag=; b=ba7QFvxO6oY7ros2qz/CoadcQWkISpxpqLhbFJ2PtFidfc2NRGgIdrDcyC3yApfPjA 2xZGMicVbbczl+AzkbyHr1lZNZq0dJcIKXq75KaZlh83jIvD3yIubPfVCgqCzs1bCCyW RIIPOcc58a5sY8KRjiQPRQSVMTm81t9s1G3aSvqlyIoxJVJZpfnC8C17O6MyesztidZr kQ9eb/QzhPB/ZIhhJtTvcUTF8bMwjdmqjeoBunzlPMjxHgPWq/0U1jv2VwEsLVlctTm4 wypHmQfgagB+465gRw6gKlxEAlw2vD+cjiB6ow6ZTA5zHW4B4uFi7g+JySrrgHuhA7m1 uoVA== X-Google-Smtp-Source: ADUXVKKTmvUo8hhtzgQLEu+vzl/YBme2V1OcigBqP7ssAfKP0sTu/CmUNKNF8IzYgQIpdHMxjPZzwA== X-Received: by 2002:a62:a21b:: with SMTP id m27-v6mr6100667pff.248.1527940785431; Sat, 02 Jun 2018 04:59:45 -0700 (PDT) Received: from dest.didichuxing.com ([52.229.160.192]) by smtp.gmail.com with ESMTPSA id c83-v6sm84723543pfc.111.2018.06.02.04.59.41 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Sat, 02 Jun 2018 04:59:44 -0700 (PDT) From: ufo19890607@gmail.com To: akpm@linux-foundation.org, mhocko@suse.com, rientjes@google.com, kirill.shutemov@linux.intel.com, aarcange@redhat.com, penguin-kernel@I-love.SAKURA.ne.jp, guro@fb.com, yang.s@alibaba-inc.com Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org, yuzhoujian Subject: [PATCH v7 2/2] Refactor part of the oom report in dump_header Date: Sat, 2 Jun 2018 19:58:54 +0800 Message-Id: <1527940734-35161-4-git-send-email-ufo19890607@gmail.com> X-Mailer: git-send-email 1.8.3.1 In-Reply-To: <1527940734-35161-1-git-send-email-ufo19890607@gmail.com> References: <1527940734-35161-1-git-send-email-ufo19890607@gmail.com> X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: X-Virus-Scanned: ClamAV using ClamSMTP From: yuzhoujian The dump_header does not print the memcg's name when the system oom happened, so users cannot locate the certain container which contains the task that has been killed by the oom killer. I follow the advices of David Rientjes and Michal Hocko, and refactor part of the oom report in a backwards compatible way. After this patch, users can get the memcg's path from the oom report and check the certain container more quickly. Below is the part of the oom report in the dmesg ... [ 142.158316] panic cpuset=/ mems_allowed=0-1 [ 142.158983] CPU: 15 PID: 8682 Comm: panic Not tainted 4.17.0-rc6+ #13 [ 142.159659] Hardware name: Inspur SA5212M4/YZMB-00370-107, BIOS 4.1.10 11/14/2016 [ 142.160342] Call Trace: [ 142.161037] dump_stack+0x78/0xb3 [ 142.161734] dump_header+0x7d/0x334 [ 142.162433] oom_kill_process+0x228/0x490 [ 142.163126] ? oom_badness+0x2a/0x130 [ 142.163821] out_of_memory+0xf0/0x280 [ 142.164532] __alloc_pages_slowpath+0x711/0xa07 [ 142.165241] __alloc_pages_nodemask+0x23f/0x260 [ 142.165947] alloc_pages_vma+0x73/0x180 [ 142.166665] do_anonymous_page+0xed/0x4e0 [ 142.167388] __handle_mm_fault+0xbd2/0xe00 [ 142.168114] handle_mm_fault+0x116/0x250 [ 142.168841] __do_page_fault+0x233/0x4d0 [ 142.169567] do_page_fault+0x32/0x130 [ 142.170303] ? page_fault+0x8/0x30 [ 142.171036] page_fault+0x1e/0x30 [ 142.171764] RIP: 0033:0x7f403000a860 [ 142.172517] RSP: 002b:00007ffc9f745c28 EFLAGS: 00010206 [ 142.173268] RAX: 00007f3f6fd7d000 RBX: 0000000000000000 RCX: 00007f3f7f5cd000 [ 142.174040] RDX: 00007f3fafd7d000 RSI: 0000000000000000 RDI: 00007f3f6fd7d000 [ 142.174806] RBP: 00007ffc9f745c50 R08: ffffffffffffffff R09: 0000000000000000 [ 142.175623] R10: 0000000000000022 R11: 0000000000000246 R12: 0000000000400490 [ 142.176542] R13: 00007ffc9f745d30 R14: 0000000000000000 R15: 0000000000000000 [ 142.177709] oom-kill:constraint=CONSTRAINT_NONE,nodemask=(null),origin_memcg=(null),kill_memcg=/test/test1/test2,task=panic,pid= 8622,uid= 0 ... Changes since v6: - divide the patch v5 into two parts. One part is to add an array of const char and put enum oom_constraint into the memcontrol.h; the other is will refactor the output in the dump_header. - limit the memory usage for the static char array by using NAME_MAX in the mem_cgroup_print_oom_context. - eliminate the spurious spaces in the oom's output and fix the spelling of "constrain". Changes since v5: - add an array of const char for each constraint. - replace all of the pr_cont with a single line print of the pr_info. - put enum oom_constraint into the memcontrol.c file for printing oom constraint. Changes since v4: - rename the helper's name to mem_cgroup_print_oom_context. - rename the mem_cgroup_print_oom_info to mem_cgroup_print_oom_meminfo. - add the constrain info in the dump_header. Changes since v3: - rename the helper's name to mem_cgroup_print_oom_memcg_name. - add the rcu lock held to the helper. - remove the print info of memcg's name in mem_cgroup_print_oom_info. Changes since v2: - add the mem_cgroup_print_memcg_name helper to print the memcg's name which contains the task that will be killed by the oom-killer. Changes since v1: - replace adding mem_cgroup_print_oom_info with printing the memcg's name only. Signed-off-by: yuzhoujian --- include/linux/memcontrol.h | 15 ++++++++++--- mm/memcontrol.c | 55 ++++++++++++++++++++++++++++++++-------------- mm/oom_kill.c | 5 +++-- 3 files changed, 53 insertions(+), 22 deletions(-) diff --git a/include/linux/memcontrol.h b/include/linux/memcontrol.h index 57311b6c4d67..1c7d5da1c827 100644 --- a/include/linux/memcontrol.h +++ b/include/linux/memcontrol.h @@ -478,8 +478,11 @@ void mem_cgroup_handle_over_high(void); unsigned long mem_cgroup_get_limit(struct mem_cgroup *memcg); -void mem_cgroup_print_oom_info(struct mem_cgroup *memcg, - struct task_struct *p); +void mem_cgroup_print_oom_context(struct mem_cgroup *memcg, + struct task_struct *p, enum oom_constraint constraint, + nodemask_t *nodemask); + +void mem_cgroup_print_oom_meminfo(struct mem_cgroup *memcg); static inline void mem_cgroup_oom_enable(void) { @@ -873,7 +876,13 @@ static inline unsigned long mem_cgroup_get_limit(struct mem_cgroup *memcg) } static inline void -mem_cgroup_print_oom_info(struct mem_cgroup *memcg, struct task_struct *p) +mem_cgroup_print_oom_context(struct mem_cgroup *memcg, struct task_struct *p, + enum oom_constraint constraint, nodemask_t *nodemask) +{ +} + +static inline void +mem_cgroup_print_oom_meminfo(struct mem_cgroup *memcg) { } diff --git a/mm/memcontrol.c b/mm/memcontrol.c index 2bd3df3d101a..fd1172938c8e 100644 --- a/mm/memcontrol.c +++ b/mm/memcontrol.c @@ -1118,33 +1118,54 @@ static const char *const memcg1_stat_names[] = { }; #define K(x) ((x) << (PAGE_SHIFT-10)) -/** - * mem_cgroup_print_oom_info: Print OOM information relevant to memory controller. - * @memcg: The memory cgroup that went over limit +/* + * mem_cgroup_print_oom_context: Print OOM context information relevant to + * memory controller, which includes allocation constraint, nodemask, origin + * memcg that has reached its limit, kill memcg that contains the killed + * process, killed process's command, pid and uid. + * @memcg: The origin memory cgroup that went over limit * @p: Task that is going to be killed + * @constraint: The allocation constraint + * @nodemask: The allocation nodemask * * NOTE: @memcg and @p's mem_cgroup can be different when hierarchy is * enabled */ -void mem_cgroup_print_oom_info(struct mem_cgroup *memcg, struct task_struct *p) +void mem_cgroup_print_oom_context(struct mem_cgroup *memcg, struct task_struct *p, + enum oom_constraint constraint, nodemask_t *nodemask) { - struct mem_cgroup *iter; - unsigned int i; + static char origin_memcg_name[NAME_MAX], kill_memcg_name[NAME_MAX]; + struct cgroup *origin_cgrp, *kill_cgrp; rcu_read_lock(); - - if (p) { - pr_info("Task in "); - pr_cont_cgroup_path(task_cgroup(p, memory_cgrp_id)); - pr_cont(" killed as a result of limit of "); - } else { - pr_info("Memory limit reached of cgroup "); + if (memcg) { + origin_cgrp = memcg->css.cgroup; + cgroup_path(origin_cgrp, origin_memcg_name, NAME_MAX); } - - pr_cont_cgroup_path(memcg->css.cgroup); - pr_cont("\n"); - + kill_cgrp = task_cgroup(p, memory_cgrp_id); + cgroup_path(kill_cgrp, kill_memcg_name, NAME_MAX); + + if (p) + pr_info("oom-kill:constraint=%s,nodemask=%*pbl,origin_memcg=%s,kill_memcg=%s,task=%s,pid=%5d,uid=%5d\n", + oom_constraint_text[constraint], nodemask_pr_args(nodemask), + strlen(origin_memcg_name) ? origin_memcg_name : "(null)", + kill_memcg_name, p->comm, p->pid, + from_kuid(&init_user_ns, task_uid(p))); + else + pr_info("oom-kill:constraint=%s,nodemask=%*pbl,origin_memcg=%s,kill_memcg=%s\n", + oom_constraint_text[constraint], nodemask_pr_args(nodemask), + strlen(origin_memcg_name) ? origin_memcg_name : "(null)", kill_memcg_name); rcu_read_unlock(); +} + +/** + * mem_cgroup_print_oom_info: Print OOM memory information relevant to memory controller. + * @memcg: The memory cgroup that went over limit + */ +void mem_cgroup_print_oom_meminfo(struct mem_cgroup *memcg) +{ + struct mem_cgroup *iter; + unsigned int i; pr_info("memory: usage %llukB, limit %llukB, failcnt %lu\n", K((u64)page_counter_read(&memcg->memory)), diff --git a/mm/oom_kill.c b/mm/oom_kill.c index c806cd656af6..af0efab8a9e5 100644 --- a/mm/oom_kill.c +++ b/mm/oom_kill.c @@ -16,7 +16,6 @@ * for newbie kernel hackers. It features several pointers to major * kernel subsystems and hints as to where to find out what things do. */ - #include #include #include @@ -414,6 +413,7 @@ static void dump_tasks(struct mem_cgroup *memcg, const nodemask_t *nodemask) static void dump_header(struct oom_control *oc, struct task_struct *p) { + enum oom_constraint constraint = constrained_alloc(oc); pr_warn("%s invoked oom-killer: gfp_mask=%#x(%pGg), nodemask=%*pbl, order=%d, oom_score_adj=%hd\n", current->comm, oc->gfp_mask, &oc->gfp_mask, nodemask_pr_args(oc->nodemask), oc->order, @@ -423,8 +423,9 @@ static void dump_header(struct oom_control *oc, struct task_struct *p) cpuset_print_current_mems_allowed(); dump_stack(); + mem_cgroup_print_oom_context(oc->memcg, p, constraint, oc->nodemask); if (is_memcg_oom(oc)) - mem_cgroup_print_oom_info(oc->memcg, p); + mem_cgroup_print_oom_meminfo(oc->memcg); else { show_mem(SHOW_MEM_FILTER_NODES, oc->nodemask); if (is_dump_unreclaim_slabs())