From patchwork Thu Apr 18 11:36:06 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Alexander Lobakin X-Patchwork-Id: 13634534 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 0A52DC4345F for ; Thu, 18 Apr 2024 11:36:42 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 077446B0098; Thu, 18 Apr 2024 07:36:42 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id 027136B0099; Thu, 18 Apr 2024 07:36:41 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id E307A6B009B; Thu, 18 Apr 2024 07:36:41 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0016.hostedemail.com [216.40.44.16]) by kanga.kvack.org (Postfix) with ESMTP id C5BC66B0098 for ; Thu, 18 Apr 2024 07:36:41 -0400 (EDT) Received: from smtpin23.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay07.hostedemail.com (Postfix) with ESMTP id 345471611A4 for ; Thu, 18 Apr 2024 11:36:41 +0000 (UTC) X-FDA: 82022450202.23.5F54FF5 Received: from mgamail.intel.com (mgamail.intel.com [192.198.163.11]) by imf25.hostedemail.com (Postfix) with ESMTP id E850FA0010 for ; Thu, 18 Apr 2024 11:36:37 +0000 (UTC) Authentication-Results: imf25.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=K4K77bns; spf=pass (imf25.hostedemail.com: domain of aleksander.lobakin@intel.com designates 192.198.163.11 as permitted sender) smtp.mailfrom=aleksander.lobakin@intel.com; dmarc=pass (policy=none) header.from=intel.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1713440198; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-transfer-encoding:content-transfer-encoding: in-reply-to:references:dkim-signature; bh=g9rrUDfTVzBALjTBZqu/qEUaCpd4C0NT5+Cirj8caDE=; b=nVsudaQX8rgStpNxK63BytUlZ5wwqD6YLcr9xal0ek43YReb3O34WKsZByQ26UhOpTIpoG i1nMZgil7IkXx68OOWzUpA1ONvtVMNGdVxe8mARGKR3/1/binPRlP3SE+SJKca1oD75NQy j95ewMeegirsdmUDXZhcSthFiFMBxrs= ARC-Authentication-Results: i=1; imf25.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=K4K77bns; spf=pass (imf25.hostedemail.com: domain of aleksander.lobakin@intel.com designates 192.198.163.11 as permitted sender) smtp.mailfrom=aleksander.lobakin@intel.com; dmarc=pass (policy=none) header.from=intel.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1713440198; a=rsa-sha256; cv=none; b=UiVbHQT8c/mQy6kV8nNCpZXuKBx/jUCupcIdlTt/Q4RIyAObnFj/7QZIaOLVaPvT5ZCnfz 9uPxzBTFC/8MxKI/tZ53H1tsFFvLXSjobu39OX3ESsjs37/0x0+VvwkLdyuPuHerD6gEQT OpCawAmj6M7U+reVnE6AbcQnGclBb2c= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1713440198; x=1744976198; h=from:to:cc:subject:date:message-id:mime-version: content-transfer-encoding; bh=OmUbjHBP4g7trrsWmNC2vJP7ZeDRKs3EuLB4aUbD8Ds=; b=K4K77bnsw9f1yN31oIj5nPxpuSrn/rugCZaE0Ow0D/KyEuOqwnyk6bh4 1NhWbzeBgmhXFi26UUOySWWsUteZ2KSqwprmOr9v79+ks+4nzMuUrWBNK MVBwCqiFDF4z2n5w3foPbQWNypIMyGbO+zhUhK4c659h12JZhVz6Naj3u nkhOEFOmvDiLbivcj/tr3GjH4wKeYJoEjAy6c9nvLyMAsncL0xpGValz7 baMyaO676K0LqvPsIc4PCkn2uWoqfxYeTyT79kS0yJY2AwMqZhScYhdGY QCJLi/FGNXms4WOuZVX2zHSPDg7bfupuejaYvC4u2SqaRk+MvlTsVx/Tg Q==; X-CSE-ConnectionGUID: MTqpObbkSU6MSyD8QDyGkg== X-CSE-MsgGUID: 2FfFv32ySF2eap3yJMh3Gw== X-IronPort-AV: E=McAfee;i="6600,9927,11047"; a="19587684" X-IronPort-AV: E=Sophos;i="6.07,212,1708416000"; d="scan'208";a="19587684" Received: from fmviesa004.fm.intel.com ([10.60.135.144]) by fmvoesa105.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 18 Apr 2024 04:36:35 -0700 X-CSE-ConnectionGUID: xiqf2ZhTRxW958u4IVf8rA== X-CSE-MsgGUID: qKipbVrMTdO+OMeDoK4GMg== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.07,212,1708416000"; d="scan'208";a="27586095" Received: from newjersey.igk.intel.com ([10.102.20.203]) by fmviesa004.fm.intel.com with ESMTP; 18 Apr 2024 04:36:31 -0700 From: Alexander Lobakin To: "David S. Miller" , Eric Dumazet , Jakub Kicinski , Paolo Abeni Cc: Alexander Lobakin , Alexander Duyck , Yunsheng Lin , Jesper Dangaard Brouer , Ilias Apalodimas , Christoph Lameter , Vlastimil Babka , Andrew Morton , Przemek Kitszel , nex.sw.ncis.osdt.itp.upstreaming@intel.com, netdev@vger.kernel.org, intel-wired-lan@lists.osuosl.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org Subject: [PATCH net-next v10 00/10] net: intel: start The Great Code Dedup + Page Pool for iavf Date: Thu, 18 Apr 2024 13:36:06 +0200 Message-ID: <20240418113616.1108566-1-aleksander.lobakin@intel.com> X-Mailer: git-send-email 2.44.0 MIME-Version: 1.0 X-Rspamd-Queue-Id: E850FA0010 X-Stat-Signature: y3gtukkdchn8yeyoutnycuyy8nbu6x9b X-Rspam-User: X-Rspamd-Server: rspam07 X-HE-Tag: 1713440197-909179 X-HE-Meta: U2FsdGVkX1/JlTDG8rl44+qn9x0ZqQf9BVG8x9H+A7AC45sZx4zOysgLfBFfnZZXdwHdTF+E/6naoVuz+AVsNkwAVttedpAJvYEb0CU31viFvGy6hhooSxhvo03JqW2d+12FD4BhDkBop1MrlEY0ecq1BXkyXAekDLNPU/4iO/wVIYXIMBizMrf+ckI4mpLPJRC9LQ7NBojZgaYf20Etx4butyO/TnVj3xe774Rod0bQSOcY6NfCGpvc4l+a/as0IzhSPMKffDAtUY+MlomYSIchQlZjmLbDhSLRCsfkjFUn5NjMGg+dmexOjtPWBXDF2Q/fC/TOg4B0dQl7HE/LdERAueFxXIiSfHLT7njY9ujBXv02CwycEWFjSLMfEWgklCWLLexFhrp/cvPnDYH4ulobnWUy9NY1pOtHEEpRdvS+lRvw0LO02Z+ee8gxLVN2LoPg3GbDTlcPnDwbLovnQqqOHAR1yH5Sp+Ee5x7XMH9x0agnI/HUuRIzZQD75drZdr3j0tcApw4zKHc0p8n2bPCcogqxBcC0WLQ7G7u5wUj+E8H53S3JRxNmAj63PC+qCJBH4GlZBTxBGDX0Or0b8xBm/xHLOjZAXUyyaD5Od3hcfreb5e8Z7JP1kOIrN2lN1+KQNgcl5GamRW6m44kDYveCbg3WriAfiZM8ziUp8d0SpMpiCeTVZrnO41BZZl7v8wwnW8j5wM1KahIaUcQLWkq/kYC5gT4khTVhzXIJmybmhmvBZtXPmj+WMlfw8bPqaM6tNzCJGsTthBp9WuE7sE0nRuR1BwHLzaRyHNym2VWB5CS6nB97j9YhXWPtNz80SZ18J60wjIHc3hGeOeoOFjr1BKf/+Iz0EPaThurCMW2PrdhMeGZBmMY7nvbu265M19GoJlES1g9ZN3W29DFsBF8hW0pUyTFiTX+aB740Kg8BbGk2JJHOm8hJ0caM1kz/wfzcr9IYpXQ2xgnT9xO 9kox/jZP mf49HFHatqSzrwm4pQ7nVmsbGMc3GIiW0AMP0B/5119k13XzPLIJiFEUUSykn4vcVNizNpoMHBpiQCcJ2VQwqjlTS7af3btpktEjcipP2yayaJx6TZCXEG+KOMDyGn6RG/vHmyBAe2QVMPCQxxGXoZLiesNBBEmnRtg4YA1lYd0QzEKuNieAWdmmBXpVNuM04y7Jit9MEoVXVrptS75siwXFW8uWHAlt6RuEmiOICupa1P4Y3FpOUgszRNMBPQOZ0PboUiWK/4V2j0tpa/S0YT9FeOT3zm4Xoiy28 X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: Here's a two-shot: introduce {,Intel} Ethernet common library (libeth and libie) and switch iavf to Page Pool. Details are in the commit messages; here's a summary: Not a secret there's a ton of code duplication between two and more Intel ethernet modules. Before introducing new changes, which would need to be copied over again, start decoupling the already existing duplicate functionality into a new module, which will be shared between several Intel Ethernet drivers. The first name that came to my mind was "libie" -- "Intel Ethernet common library". Also this sounds like "lovelie" (-> one word, no "lib I E" pls) and can be expanded as "lib Internet Explorer" :P The "generic", pure-software part is placed separately, so that it can be easily reused in any driver by any vendor without linking to the Intel pre-200G guts. In a few words, it's something any modern driver does the same way, but nobody moved it level up (yet). The series is only the beginning. From now on, adding every new feature or doing any good driver refactoring will remove much more lines than add for quite some time. There's a basic roadmap with some deduplications planned already, not speaking of that touching every line now asks: "can I share this?". The final destination is very ambitious: have only one unified driver for at least i40e, ice, iavf, and idpf with a struct ops for each generation. That's never gonna happen, right? But you still can at least try. PP conversion for iavf lands within the same series as these two are tied closely. libie will support Page Pool model only, so that a driver can't use much of the lib until it's converted. iavf is only the example, the rest will eventually be converted soon on a per-driver basis. That is when it gets really interesting. Stay tech. Alexander Lobakin (10): net: intel: introduce {,Intel} Ethernet common library iavf: kill "legacy-rx" for good iavf: drop page splitting and recycling slab: introduce kvmalloc_array_node() and kvcalloc_node() page_pool: constify some read-only function arguments page_pool: add DMA-sync-for-CPU inline helper libeth: add Rx buffer management iavf: pack iavf_ring more efficiently iavf: switch to Page Pool MAINTAINERS: add entry for libeth and libie MAINTAINERS | 20 + drivers/net/ethernet/intel/Kconfig | 7 + drivers/net/ethernet/intel/libeth/Kconfig | 9 + drivers/net/ethernet/intel/libie/Kconfig | 10 + drivers/net/ethernet/intel/Makefile | 3 + drivers/net/ethernet/intel/libeth/Makefile | 6 + drivers/net/ethernet/intel/libie/Makefile | 6 + include/net/page_pool/types.h | 4 +- .../net/ethernet/intel/i40e/i40e_prototype.h | 7 - drivers/net/ethernet/intel/i40e/i40e_type.h | 88 --- drivers/net/ethernet/intel/iavf/iavf.h | 2 +- .../net/ethernet/intel/iavf/iavf_prototype.h | 7 - drivers/net/ethernet/intel/iavf/iavf_txrx.h | 146 +---- drivers/net/ethernet/intel/iavf/iavf_type.h | 90 --- .../net/ethernet/intel/ice/ice_lan_tx_rx.h | 320 ---------- include/linux/net/intel/libie/rx.h | 50 ++ include/linux/slab.h | 17 +- include/net/libeth/rx.h | 242 ++++++++ include/net/page_pool/helpers.h | 34 +- drivers/net/ethernet/intel/i40e/i40e_common.c | 253 -------- drivers/net/ethernet/intel/i40e/i40e_main.c | 1 + drivers/net/ethernet/intel/i40e/i40e_txrx.c | 72 +-- drivers/net/ethernet/intel/iavf/iavf_common.c | 253 -------- .../net/ethernet/intel/iavf/iavf_ethtool.c | 140 ----- drivers/net/ethernet/intel/iavf/iavf_main.c | 40 +- drivers/net/ethernet/intel/iavf/iavf_txrx.c | 551 +++--------------- .../net/ethernet/intel/iavf/iavf_virtchnl.c | 17 +- drivers/net/ethernet/intel/ice/ice_main.c | 1 + drivers/net/ethernet/intel/ice/ice_txrx_lib.c | 111 +--- drivers/net/ethernet/intel/libeth/rx.c | 150 +++++ drivers/net/ethernet/intel/libie/rx.c | 124 ++++ net/core/page_pool.c | 10 +- 32 files changed, 836 insertions(+), 1955 deletions(-) create mode 100644 drivers/net/ethernet/intel/libeth/Kconfig create mode 100644 drivers/net/ethernet/intel/libie/Kconfig create mode 100644 drivers/net/ethernet/intel/libeth/Makefile create mode 100644 drivers/net/ethernet/intel/libie/Makefile create mode 100644 include/linux/net/intel/libie/rx.h create mode 100644 include/net/libeth/rx.h create mode 100644 drivers/net/ethernet/intel/libeth/rx.c create mode 100644 drivers/net/ethernet/intel/libie/rx.c Reviewed-by: Przemek Kitszel --- libeth has way more generic functionality and code in the idpf XDP tree[0], take a look if you want to have more complete picture of what this really is about. From v9[1]: * pick Acked-by from Vlastimil and a couple Reviewed-by from Przemek; * mention that the libeth_fq::fp kernel-doc generates a warning and the fix for that is pending on the linux-doc ML (Jakub); * no functional changes. From v8[2]: * rebase on top of net-next (6.9-rc1+); * introduce kvmalloc_array_node() and kvcalloc_node(); * make Rx buffer management NUMA-aware; * resolve kdoc issues (Jakub, me). From v7[3]: * drop Page Pool optimization prereqs; * drop generic stats part: will redo to the new per-queue stats later; * split libie into "generic" and "fnic" (i40e, ice, iavf) parts; * use shorter and modern struct names; * #1: allow to compile-out hotpath IPv6 code when !CONFIG_IPV6; * #1: generate XDP RSS hash directly in the lookup table; * #8: fix rare skb nullptr deref bug. From v6[4]: * #04: resolve ethtool_puts() Git conflict (Jakub); * #06: pick RB from Ilias; * no functional changes. From v5[5]: * drop Page Pool DMA shortcut: will pick up Eric's more global DMA sync optimization[6] and expand it to cover both IOMMU and direct DMA a bit later (Yunsheng); * drop per-queue Page Pool Ethtool stats: they are now exported via generic Netlink interface (Jakub); * #01: leave a comment why exactly this alignment (Jakub, Yunsheng); * #08: make use of page_pool_params::netdev when calculating PP params; * #08: rename ``libie_rx_queue`` -> ``libie_buf_queue``. From v4[7]: * make use of Jakub's &page_pool_params split; * #01: prevent frag fields from spanning into 2 cachelines after splitting &page_pool_params into fast and slow; * #02-03: bring back the DMA sync shortcut, now as a per-page flag (me, Yunsheng); * #04: let libie have its own Kconfig to stop further bloating of poor intel/Kconfig; * #06: merge page split-reuse-recycle drop into one commit (Alex); * #07: decouple constifying of several Page Pool function arguments into a separate commit, constify some more; * #09: stop abusing internal PP fields in the driver code (Yunsheng); * #09: calculate DMA sync size (::max_len) correctly: within one page, not one buffer (Yunsheng); * #10: decouple rearranging &iavf_ring into separate commit, optimize it even more; * #11: let the driver get back to the last descriptor to process after an skb allocation fail, don't drop it (Alex); * #11: stop touching unrelated stuff like watchdog timeout etc. (Alex); * fix "Return:" in the kdoc (now `W=12 C=1` is clean), misc typos. From v3[8]: * base on the latest net-next, update bloat-o-meter and perf stats; * split generic PP optimizations into a separate series; * drop "optimize hotpath a bunch" commit: a lot of [controversial] changes in one place, worth own series (Alex); * 02: pick Rev-by (Alex); * 03: move in-place recycling removal here from the dropped patch; * 05: new, add libie Rx buffer API separatelly from IAVF changes; * 05-06: use new "hybrid" allocation API from[8] to reduce memory usage when a page can fit more than 1 truesize (also asked by David); * 06: merge with "always use order-0 page" commit to reduce diffs and simplify things (Alex); * 09: fix page_alloc_fail counter. From v2[10]: * 0006: fix page_pool.h include in OcteonTX2 files (Jakub, Patchwork); * no functional changes. From v1[11]: * 0006: new (me, Jakub); * 0008: give the helpers more intuitive names (Jakub, Ilias); * -^-: also expand their kdoc a bit for the same reason; * -^-: fix kdoc copy-paste issue (Patchwork, Jakub); * 0011: drop `inline` from C file (Patchwork, Jakub). [0] https://github.com/alobakin/linux/commits/idpf-libie-new [1] https://lore.kernel.org/netdev/20240404154402.3581254-1-aleksander.lobakin@intel.com [2] https://lore.kernel.org/netdev/20240308141833.2966600-1-aleksander.lobakin@intel.com [3] https://lore.kernel.org/netdev/20231213112835.2262651-1-aleksander.lobakin@intel.com [4] https://lore.kernel.org/netdev/20231207172010.1441468-1-aleksander.lobakin@intel.com [5] https://lore.kernel.org/netdev/20231124154732.1623518-1-aleksander.lobakin@intel.com [6] https://lore.kernel.org/netdev/20221115182841.2640176-1-edumazet@google.com [7] https://lore.kernel.org/netdev/20230705155551.1317583-1-aleksander.lobakin@intel.com [8] https://lore.kernel.org/netdev/20230530150035.1943669-1-aleksander.lobakin@intel.com [9] https://lore.kernel.org/netdev/20230629120226.14854-1-linyunsheng@huawei.com [10] https://lore.kernel.org/netdev/20230525125746.553874-1-aleksander.lobakin@intel.com [11] https://lore.kernel.org/netdev/20230516161841.37138-1-aleksander.lobakin@intel.com