mbox

[pull,request,net-next,00/15] mlx5 updates 2023-12-20

Message ID 20231221005721.186607-1-saeed@kernel.org (mailing list archive)
State Accepted
Delegated to: Netdev Maintainers
Headers show

Pull-request

git://git.kernel.org/pub/scm/linux/kernel/git/saeed/linux.git tags/mlx5-updates-2023-12-20

Message

Saeed Mahameed Dec. 21, 2023, 12:57 a.m. UTC
From: Saeed Mahameed <saeedm@nvidia.com>

This series adds support for Socket Direct and Embedded management PF
ethernet netdev support.

For more information please see tag log below.

Please pull and let me know if there is any problem.

Happy holidays.

Thanks,
Saeed.


The following changes since commit bee9705c679d0df8ee099e3c5312ac76f447848a:

  Merge branch 'net-sched-tc-drop-reason' (2023-12-20 11:50:13 +0000)

are available in the Git repository at:

  git://git.kernel.org/pub/scm/linux/kernel/git/saeed/linux.git tags/mlx5-updates-2023-12-20

for you to fetch changes up to 22c4640698a1d47606b5a4264a584e8046641784:

  net/mlx5: Implement management PF Ethernet profile (2023-12-20 16:54:27 -0800)

----------------------------------------------------------------
mlx5-updates-2023-12-20

mlx5 Socket direct support and management PF profile.

Tariq Says:
===========
Support Socket-Direct multi-dev netdev

This series adds support for combining multiple devices (PFs) of the
same port under one netdev instance. Passing traffic through different
devices belonging to different NUMA sockets saves cross-numa traffic and
allows apps running on the same netdev from different numas to still
feel a sense of proximity to the device and achieve improved
performance.

We achieve this by grouping PFs together, and creating the netdev only
once all group members are probed. Symmetrically, we destroy the netdev
once any of the PFs is removed.

The channels are distributed between all devices, a proper configuration
would utilize the correct close numa when working on a certain app/cpu.

We pick one device to be a primary (leader), and it fills a special
role.  The other devices (secondaries) are disconnected from the network
in the chip level (set to silent mode). All RX/TX traffic is steered
through the primary to/from the secondaries.

Currently, we limit the support to PFs only, and up to two devices
(sockets).

===========

Armen Says:
===========
Management PF support and module integration

This patch rolls out comprehensive support for the Management Physical
Function (MGMT PF) within the mlx5 driver. It involves updating the
mlx5 interface header to introduce necessary definitions for MGMT PF
and adding a new management PF netdev profile, which will allow the host
side to communicate with the embedded linux on Blue-field devices.

===========

----------------------------------------------------------------
Armen Ratner (1):
      net/mlx5: Implement management PF Ethernet profile

Saeed Mahameed (1):
      net/mlx5e: Use the correct lag ports number when creating TISes

Tariq Toukan (13):
      net/mlx5: Fix query of sd_group field
      net/mlx5: SD, Introduce SD lib
      net/mlx5: SD, Implement basic query and instantiation
      net/mlx5: SD, Implement devcom communication and primary election
      net/mlx5: SD, Implement steering for primary and secondaries
      net/mlx5: SD, Add informative prints in kernel log
      net/mlx5e: Create single netdev per SD group
      net/mlx5e: Create EN core HW resources for all secondary devices
      net/mlx5e: Let channels be SD-aware
      net/mlx5e: Support cross-vhca RSS
      net/mlx5e: Support per-mdev queue counter
      net/mlx5e: Block TLS device offload on combined SD netdev
      net/mlx5: Enable SD feature

 drivers/net/ethernet/mellanox/mlx5/core/Makefile   |   2 +-
 drivers/net/ethernet/mellanox/mlx5/core/dev.c      |   3 +
 drivers/net/ethernet/mellanox/mlx5/core/ecpf.c     |   6 +
 drivers/net/ethernet/mellanox/mlx5/core/en.h       |  15 +-
 .../net/ethernet/mellanox/mlx5/core/en/channels.c  |  10 +-
 .../net/ethernet/mellanox/mlx5/core/en/channels.h  |   6 +-
 .../net/ethernet/mellanox/mlx5/core/en/mgmt_pf.c   | 268 ++++++++++++
 .../ethernet/mellanox/mlx5/core/en/monitor_stats.c |  48 +-
 .../net/ethernet/mellanox/mlx5/core/en/params.c    |   9 +-
 .../net/ethernet/mellanox/mlx5/core/en/params.h    |   3 -
 drivers/net/ethernet/mellanox/mlx5/core/en/ptp.c   |  12 +-
 drivers/net/ethernet/mellanox/mlx5/core/en/qos.c   |   8 +-
 .../ethernet/mellanox/mlx5/core/en/reporter_rx.c   |   4 +-
 .../ethernet/mellanox/mlx5/core/en/reporter_tx.c   |   3 +-
 drivers/net/ethernet/mellanox/mlx5/core/en/rqt.c   | 123 +++++-
 drivers/net/ethernet/mellanox/mlx5/core/en/rqt.h   |   9 +-
 drivers/net/ethernet/mellanox/mlx5/core/en/rss.c   |  17 +-
 drivers/net/ethernet/mellanox/mlx5/core/en/rss.h   |   4 +-
 .../net/ethernet/mellanox/mlx5/core/en/rx_res.c    |  62 ++-
 .../net/ethernet/mellanox/mlx5/core/en/rx_res.h    |   1 +
 drivers/net/ethernet/mellanox/mlx5/core/en/trap.c  |  11 +-
 .../net/ethernet/mellanox/mlx5/core/en/xsk/pool.c  |   6 +-
 .../net/ethernet/mellanox/mlx5/core/en/xsk/setup.c |   8 +-
 .../ethernet/mellanox/mlx5/core/en_accel/ktls.c    |   2 +-
 .../ethernet/mellanox/mlx5/core/en_accel/ktls.h    |   4 +-
 .../ethernet/mellanox/mlx5/core/en_accel/ktls_rx.c |   6 +-
 .../net/ethernet/mellanox/mlx5/core/en_common.c    |  21 +-
 drivers/net/ethernet/mellanox/mlx5/core/en_main.c  | 200 +++++++--
 drivers/net/ethernet/mellanox/mlx5/core/en_stats.c |  39 +-
 drivers/net/ethernet/mellanox/mlx5/core/en_tc.c    |   4 +-
 drivers/net/ethernet/mellanox/mlx5/core/eswitch.c  |   2 +-
 .../net/ethernet/mellanox/mlx5/core/ipoib/ipoib.c  |   2 +-
 .../net/ethernet/mellanox/mlx5/core/lib/devcom.h   |   1 +
 drivers/net/ethernet/mellanox/mlx5/core/lib/mlx5.h |  12 +
 drivers/net/ethernet/mellanox/mlx5/core/lib/sd.c   | 487 +++++++++++++++++++++
 drivers/net/ethernet/mellanox/mlx5/core/lib/sd.h   |  38 ++
 drivers/net/ethernet/mellanox/mlx5/core/vport.c    |  21 +
 include/linux/mlx5/driver.h                        |  10 +
 include/linux/mlx5/mlx5_ifc.h                      |  24 +-
 include/linux/mlx5/vport.h                         |   1 +
 40 files changed, 1320 insertions(+), 192 deletions(-)
 create mode 100644 drivers/net/ethernet/mellanox/mlx5/core/en/mgmt_pf.c
 create mode 100644 drivers/net/ethernet/mellanox/mlx5/core/lib/sd.c
 create mode 100644 drivers/net/ethernet/mellanox/mlx5/core/lib/sd.h

Comments

Jakub Kicinski Jan. 4, 2024, 10:47 p.m. UTC | #1
On Wed, 20 Dec 2023 16:57:06 -0800 Saeed Mahameed wrote:
> Support Socket-Direct multi-dev netdev

There's no documentation for any of it?

$ git grep -i 'socket.direct' -- Documentation/
$

it's a feature many people have talked about for ever.
I'm pretty sure there are at least 2 vendors who have
HW support to do the same thing. Without docs everyone
will implement is slightly differently :(
Jakub Kicinski Jan. 8, 2024, 1:19 a.m. UTC | #2
On Thu, 4 Jan 2024 14:47:21 -0800 Jakub Kicinski wrote:
> On Wed, 20 Dec 2023 16:57:06 -0800 Saeed Mahameed wrote:
> > Support Socket-Direct multi-dev netdev  
> 
> There's no documentation for any of it?
> 
> $ git grep -i 'socket.direct' -- Documentation/
> $
> 
> it's a feature many people have talked about for ever.
> I'm pretty sure there are at least 2 vendors who have
> HW support to do the same thing. Without docs everyone
> will implement is slightly differently :(

No replies so far, and v6.8 merge window has just begun,
so let me drop this from -next for now.
Saeed Mahameed Jan. 8, 2024, 11:14 p.m. UTC | #3
On 07 Jan 17:19, Jakub Kicinski wrote:
>On Thu, 4 Jan 2024 14:47:21 -0800 Jakub Kicinski wrote:
>> On Wed, 20 Dec 2023 16:57:06 -0800 Saeed Mahameed wrote:
>> > Support Socket-Direct multi-dev netdev
>>
>> There's no documentation for any of it?
>>
>> $ git grep -i 'socket.direct' -- Documentation/
>> $
>>
>> it's a feature many people have talked about for ever.
>> I'm pretty sure there are at least 2 vendors who have
>> HW support to do the same thing. Without docs everyone
>> will implement is slightly differently :(
>
>No replies so far, and v6.8 merge window has just begun,
>so let me drop this from -next for now.
>

But why revert ? what was wrong with the code or the current design? 
The current comments aren't that critical and I am sure you understand
that people are on holiday vacation.

We will provide the docs, but IMHO, docs could have been easily a follow
up.

What's the point of the upstream process if a surprise
revert can be done at any point by a maintainer? This is is not the first
instance, This has happened before with the management PF first iteration,
at least that time you asked for a revert and we approved, but this revert
came as a complete surprise.. 

Can we not do these reverts in such a stealthy way, this makes the whole
acceptance criteria unreliable, many teams rely on things getting accepted
so they plan the next steps, we have an upstream first open source policy
at nVidia networking and predictability is very important to us,
uncertainty especially when things are already accepted is something that
is very hard for us to work with.

Thanks,
Saeed.