From patchwork Fri Sep 27 18:41:31 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Namhyung Kim X-Patchwork-Id: 13814485 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id B3A8BCDD1D2 for ; Fri, 27 Sep 2024 18:41:43 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 6E7176B0128; Fri, 27 Sep 2024 14:41:41 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id 696606B012A; Fri, 27 Sep 2024 14:41:41 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 536A26B012B; Fri, 27 Sep 2024 14:41:41 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0015.hostedemail.com [216.40.44.15]) by kanga.kvack.org (Postfix) with ESMTP id 348E56B0128 for ; Fri, 27 Sep 2024 14:41:41 -0400 (EDT) Received: from smtpin11.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay04.hostedemail.com (Postfix) with ESMTP id D64061A13F1 for ; Fri, 27 Sep 2024 18:41:40 +0000 (UTC) X-FDA: 82611386760.11.A9CE6A9 Received: from dfw.source.kernel.org (dfw.source.kernel.org [139.178.84.217]) by imf14.hostedemail.com (Postfix) with ESMTP id 2F5FE100008 for ; Fri, 27 Sep 2024 18:41:38 +0000 (UTC) Authentication-Results: imf14.hostedemail.com; dkim=pass header.d=kernel.org header.s=k20201202 header.b="mpN/lLFl"; dmarc=pass (policy=quarantine) header.from=kernel.org; spf=pass (imf14.hostedemail.com: domain of namhyung@kernel.org designates 139.178.84.217 as permitted sender) smtp.mailfrom=namhyung@kernel.org ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1727462480; a=rsa-sha256; cv=none; b=EHO7OcZdNsQ2UzMF9IOCL0edG9QSaC2SW44HtydFrG5a1j0zwBefvSYPs8QOSBHXWHELlX M+VgFUlpWgNrbpIT6RiPVt4EZFhMEuefMD7R2Bi6Z1DDsu928N4HOjmEnBbJIVPpWTC9Ot 6IF6qwNaNvB8/0Xekfa6EhP7qFC/3lA= ARC-Authentication-Results: i=1; imf14.hostedemail.com; dkim=pass header.d=kernel.org header.s=k20201202 header.b="mpN/lLFl"; dmarc=pass (policy=quarantine) header.from=kernel.org; spf=pass (imf14.hostedemail.com: domain of namhyung@kernel.org designates 139.178.84.217 as permitted sender) smtp.mailfrom=namhyung@kernel.org ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1727462480; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=/fudKJsChFaU2ONbY5+4NGPqNDMKtJmMIBOKIHSnlRY=; b=pBKaZHBnKt4f8XDWRD47dBH3shrLWdYZ5WEbylinM3o3R0WbkAu5dPvqgAvGq0JvG1rUSe hxQsz/NRvuRkJaCw7UT9wGbmlytf+UF3xUPYPHIRJwzlglCpMYEwf9faS7PpKhNj7UZpGw 46NLeBcdg9+i9KbsZ15LJdzducU0LDk= Received: from smtp.kernel.org (transwarp.subspace.kernel.org [100.75.92.58]) by dfw.source.kernel.org (Postfix) with ESMTP id 295BE5C5A25; Fri, 27 Sep 2024 18:41:33 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id C852AC4CED0; Fri, 27 Sep 2024 18:41:35 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1727462496; bh=kkigyrBYeiWFl3tgn2/scYgLndEN9wtat/LKpQKHAbM=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=mpN/lLFlB1Vb1cprdBgZpMgvY7xKvVW4ERpyob0A4VD5ETjBfYgXi5tR6/8EAF+Jb 1EcA81jZSJ7KmCVBP9+6vc4aJqOy0eKwdD33fngUqmVFPrTLgpSkQBX2RIByycMYrN 5RgnokD2ZNhl5Xm/6BhI1KIMqHOa2Uwa7EAyGs1C6KwXVpJRC0tK4z+gDiUrguKvv2 iu1kuFDjSJ3ZhR8zEhurS21hnyht5jvxNxGc0Xy5KrRNGCJ38g8ll1FATQLjDFHUt+ r/SzBZ81vAKbFt3VbIB1dg77XaUGQrKBFfP6uXPYmU7m+/B5ecq6z7mVufBXloKQuj nnTrTYVsUKbWw== From: Namhyung Kim To: Alexei Starovoitov , Daniel Borkmann , Andrii Nakryiko Cc: Martin KaFai Lau , Eduard Zingerman , Song Liu , Yonghong Song , John Fastabend , KP Singh , Stanislav Fomichev , Hao Luo , Jiri Olsa , LKML , bpf@vger.kernel.org, Andrew Morton , Christoph Lameter , Pekka Enberg , David Rientjes , Joonsoo Kim , Vlastimil Babka , Roman Gushchin , Hyeonggon Yoo <42.hyeyoo@gmail.com>, linux-mm@kvack.org, Arnaldo Carvalho de Melo Subject: [RFC/PATCH bpf-next 1/3] bpf: Add kmem_cache iterator Date: Fri, 27 Sep 2024 11:41:31 -0700 Message-ID: <20240927184133.968283-2-namhyung@kernel.org> X-Mailer: git-send-email 2.46.1.824.gd892dcdcdd-goog In-Reply-To: <20240927184133.968283-1-namhyung@kernel.org> References: <20240927184133.968283-1-namhyung@kernel.org> MIME-Version: 1.0 X-Rspam-User: X-Stat-Signature: 94h8te6gnfoff6unfsw31mu9fg6384yi X-Rspamd-Queue-Id: 2F5FE100008 X-Rspamd-Server: rspam02 X-HE-Tag: 1727462498-741067 X-HE-Meta: U2FsdGVkX18EDIdY7Fp4VWZX57Jl5QZG5bxP5y5G6a1YhcbZJcZ0oUqDoID4RA4HRFRU/gOpzjUdyJEQIxhWRMH8UCjg103RZxiYVY9aK30nbkcscE+tMWfsHgrBW5NxpHpYQAXqb9Z71IMbzvaaz24hOnljDmIZMF31a8Azair3rvmy6XCQqj+vwoP5gAzsdeYvMQ0SuBjsJiQMWg+lsxkrBrLPsacnWXtBiSZosP3KeQakvzybOwXtHLe4imPXlgFLZ8dFZ/mfUcy4U3VLHzY4L4MfZvEgvlcSyFXRxDOR1ATnF1879EltTGxxPcTjsfj5KJZepGKplQpKoH8iC1g/Yv+Xu2DjJ2DHA2QjjZsJHpU12EAQoqCMgLscwAPemrc1R3xVG4Z80zeeRZPUMQomHtZvx0P3i7uuTq4dRaaQ7DT+0IbJSmVcanMJINFlSRF8XxukpvWfFY7JDZRWJ/66SKIIHrid8I9HTgbaIIapjCxLffWQRnQCPknEQ7UI8o8MaX4zbDgHklNV1aJ9Jk+SG9B7tMh5dQlWrU2Hlwf4R8NnDL0AZq0aIer9I7VYh7JpsP7de5ahcDnV8GmexLmPHPuUd8E/gyZS3vVrABFHCEPdGM3RTSqro1ZDp2N5jveGaIsdbqL0M8OuJGcEUkR+TKoKgWtsPdiXvzou5FGDN89uKS3JW+kuTQU63+PIXhTN9LZvnIG3Jd+yzVqpHEma97mjUtqBzquVSZUCESJksY7JDloiMu0yDfAtIfQUjMoUPjjwi/HGsbIZug6/RE17i7qlhhKcM5GrV55kA35SSFaam+Q07l566ZBG6pafsZ+lAinRCV77JAx0XXJtTrU+wO4JJM/0BrCNdGaZZHswCWAuLXPeezA7Z5fwY71IzRCsuTiFeVsDvDdiKvjiFw/6c0ovzvzQFSJJK2GBQlNLTs7r0yzgGu44ao0SWZFTRVq7EngaB3j0/qoVwDR gjcHjHjf 6fejG+MRDmKZHaa2ZwolsqreRz2/aaDCTFCAFTP0jccfYA/LTTreAeVNA2BrUMJfG81c5NxjgEq1vLquTRBQ5nXUxXxGchShj++r4rOc1sus2zRKD2ojn7LisfGNu4isHMdaGt2KkFOUBdoM5fiSJNs30+gU6Np1MV5xO/+caOzo4s2gKKgRaK3wywbF8edG7TjqiGi4IOEyZaQ29R3+jhJ/ITSbZ6ypFaxeKojPbg9RGk8ddnaGTrppeqzbkq09Ksi9LK/UfhSMjyN+Q9DN6Y769Nfu+xZvAeEzFvbMmxu/uxrYNRSf0Mb8aNtdWixj/ZEqHBD/PckFd/FHCQHJBisucw5QApxxlnaw5PNTHOywRQQlZsbef+hYTJZqASmHqWWJ7RaELmywM+rY= X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: The new "kmem_cache" iterator will traverse the list of slab caches and call attached BPF programs for each entry. It should check the argument (ctx.s) if it's NULL before using it. The iteration will be done with slab_mutex held but it'd break and return to user if the BPF program emits data to seq buffer more than the buffer size given by the user. IOW the whole iteration would be protected by the slab_mutex as long as it won't emit anything. It includes the internal "mm/slab.h" header to access kmem_cache, slab_caches and slab_mutex. Hope it's ok to mm folks. Acked-by: Roman Gushchin (mm/*) Acked-by: Vlastimil Babka #mm/slab Signed-off-by: Namhyung Kim --- include/linux/btf_ids.h | 1 + kernel/bpf/Makefile | 1 + kernel/bpf/kmem_cache_iter.c | 131 +++++++++++++++++++++++++++++++++++ 3 files changed, 133 insertions(+) create mode 100644 kernel/bpf/kmem_cache_iter.c diff --git a/include/linux/btf_ids.h b/include/linux/btf_ids.h index c0e3e1426a82f5c4..139bdececdcfaefb 100644 --- a/include/linux/btf_ids.h +++ b/include/linux/btf_ids.h @@ -283,5 +283,6 @@ extern u32 btf_tracing_ids[]; extern u32 bpf_cgroup_btf_id[]; extern u32 bpf_local_storage_map_btf_id[]; extern u32 btf_bpf_map_id[]; +extern u32 bpf_kmem_cache_btf_id[]; #endif diff --git a/kernel/bpf/Makefile b/kernel/bpf/Makefile index 9b9c151b5c826b31..105328f0b9c04e37 100644 --- a/kernel/bpf/Makefile +++ b/kernel/bpf/Makefile @@ -52,3 +52,4 @@ obj-$(CONFIG_BPF_PRELOAD) += preload/ obj-$(CONFIG_BPF_SYSCALL) += relo_core.o obj-$(CONFIG_BPF_SYSCALL) += btf_iter.o obj-$(CONFIG_BPF_SYSCALL) += btf_relocate.o +obj-$(CONFIG_BPF_SYSCALL) += kmem_cache_iter.o diff --git a/kernel/bpf/kmem_cache_iter.c b/kernel/bpf/kmem_cache_iter.c new file mode 100644 index 0000000000000000..5f7436b52f2e6b06 --- /dev/null +++ b/kernel/bpf/kmem_cache_iter.c @@ -0,0 +1,131 @@ +// SPDX-License-Identifier: GPL-2.0-only +/* Copyright (c) 2024 Google */ +#include +#include +#include +#include +#include + +#include "../../mm/slab.h" /* kmem_cache, slab_caches and slab_mutex */ + +struct bpf_iter__kmem_cache { + __bpf_md_ptr(struct bpf_iter_meta *, meta); + __bpf_md_ptr(struct kmem_cache *, s); +}; + +static void *kmem_cache_iter_seq_start(struct seq_file *seq, loff_t *pos) +{ + loff_t cnt = 0; + struct kmem_cache *s = NULL; + + mutex_lock(&slab_mutex); + + /* + * Find an entry at the given position in the slab_caches list instead + * of keeping a reference (of the last visited entry, if any) out of + * slab_mutex. It might miss something if one is deleted in the middle + * while it releases the lock. But it should be rare and there's not + * much we can do about it. + */ + list_for_each_entry(s, &slab_caches, list) { + if (cnt == *pos) + break; + + cnt++; + } + + if (cnt != *pos) + return NULL; + + ++*pos; + return s; +} + +static void kmem_cache_iter_seq_stop(struct seq_file *seq, void *v) +{ + struct bpf_iter_meta meta; + struct bpf_iter__kmem_cache ctx = { + .meta = &meta, + .s = v, + }; + struct bpf_prog *prog; + + meta.seq = seq; + prog = bpf_iter_get_info(&meta, true); + if (prog) + bpf_iter_run_prog(prog, &ctx); + + mutex_unlock(&slab_mutex); +} + +static void *kmem_cache_iter_seq_next(struct seq_file *seq, void *v, loff_t *pos) +{ + struct kmem_cache *s = v; + + ++*pos; + + if (list_last_entry(&slab_caches, struct kmem_cache, list) == s) + return NULL; + + return list_next_entry(s, list); +} + +static int kmem_cache_iter_seq_show(struct seq_file *seq, void *v) +{ + struct bpf_iter_meta meta; + struct bpf_iter__kmem_cache ctx = { + .meta = &meta, + .s = v, + }; + struct bpf_prog *prog; + int ret = 0; + + meta.seq = seq; + prog = bpf_iter_get_info(&meta, false); + if (prog) + ret = bpf_iter_run_prog(prog, &ctx); + + return ret; +} + +static const struct seq_operations kmem_cache_iter_seq_ops = { + .start = kmem_cache_iter_seq_start, + .next = kmem_cache_iter_seq_next, + .stop = kmem_cache_iter_seq_stop, + .show = kmem_cache_iter_seq_show, +}; + +BTF_ID_LIST_GLOBAL_SINGLE(bpf_kmem_cache_btf_id, struct, kmem_cache) + +static const struct bpf_iter_seq_info kmem_cache_iter_seq_info = { + .seq_ops = &kmem_cache_iter_seq_ops, +}; + +static void bpf_iter_kmem_cache_show_fdinfo(const struct bpf_iter_aux_info *aux, + struct seq_file *seq) +{ + seq_puts(seq, "kmem_cache iter\n"); +} + +DEFINE_BPF_ITER_FUNC(kmem_cache, struct bpf_iter_meta *meta, + struct kmem_cache *s) + +static struct bpf_iter_reg bpf_kmem_cache_reg_info = { + .target = "kmem_cache", + .feature = BPF_ITER_RESCHED, + .show_fdinfo = bpf_iter_kmem_cache_show_fdinfo, + .ctx_arg_info_size = 1, + .ctx_arg_info = { + { offsetof(struct bpf_iter__kmem_cache, s), + PTR_TO_BTF_ID_OR_NULL | PTR_TRUSTED }, + }, + .seq_info = &kmem_cache_iter_seq_info, +}; + +static int __init bpf_kmem_cache_iter_init(void) +{ + bpf_kmem_cache_reg_info.ctx_arg_info[0].btf_id = bpf_kmem_cache_btf_id[0]; + return bpf_iter_reg_target(&bpf_kmem_cache_reg_info); +} + +late_initcall(bpf_kmem_cache_iter_init);