[RFC,net-next,1/8] ethtool: Add ability to control transceiver modules' low power mode

From: Ido Schimmel <idosch@nvidia.com>

From: Ido Schimmel <idosch@nvidia.com>

Add a pair of new ethtool messages, 'ETHTOOL_MSG_MODULE_SET' and
'ETHTOOL_MSG_MODULE_GET', that can be used to control transceiver
modules parameters and retrieve their status.

The first parameter to control is the low power mode of the module. It
is only relevant for paged memory modules, as flat memory modules always
operate in low power mode.

When a paged memory module is in low power mode, its power consumption
is reduced to the minimum, the management interface towards the host is
available and the data path is deactivated.

User space can choose to put modules that are not currently in use in
low power mode and transition them to high power mode before putting the
associated ports administratively up.

Transitioning into low power mode means loss of carrier, so error is
returned when the netdev is administratively up.

The user API is designed to be generic enough so that it could be used
for modules with different memory maps (e.g., SFF-8636, CMIS).

The only implementation of the device driver API in this series is for a
MAC driver (mlxsw) where the module is controlled by the device's
firmware, but it is designed to be generic enough so that it could also
be used by implementations where the module is controlled by the CPU.

CMIS testing
============

 # ethtool -m swp11
 Identifier                                : 0x18 (QSFP-DD Double Density 8X Pluggable Transceiver (INF-8628))
 ...
 Module State                              : 0x03 (ModuleReady)
 LowPwrAllowRequestHW                      : Off
 LowPwrRequestSW                           : Off

The module is not in low power mode, as it is not forced by hardware
(LowPwrAllowRequestHW is off) or by software (LowPwrRequestSW is off).

The low power mode can be queried from the kernel. In case
LowPwrAllowRequestHW was on, the kernel would need to take into account
the state of the LowPwrRequestHW signal, which is not visible to user
space.

 $ ethtool --show-module swp11
 Module parameters for swp11:
 low-power false

Turn on low power mode:

 # ethtool --set-module swp11 low-power on

Query low power mode again:

 $ ethtool --show-module swp11
 Module parameters for swp11:
 low-power true

Verify with the data read from the EEPROM:

 # ethtool -m swp11
 Identifier                                : 0x18 (QSFP-DD Double Density 8X Pluggable Transceiver (INF-8628))
 ...
 Module State                              : 0x01 (ModuleLowPwr)
 LowPwrAllowRequestHW                      : Off
 LowPwrRequestSW                           : On

Allow the module to transition out of low power mode:

 # ethtool --set-module swp11 low-power off

Query low power mode again:

 $ ethtool --show-module swp11
 Module parameters for swp11:
 low-power false

Verify with the data read from the EEPROM:

 # ethtool -m swp11
 Identifier                                : 0x18 (QSFP-DD Double Density 8X Pluggable Transceiver (INF-8628))
 ...
 Module State                              : 0x03 (ModuleReady)
 LowPwrAllowRequestHW                      : Off
 LowPwrRequestSW                           : Off

SFF-8636 testing
================

 # ethtool -m swp13
 Identifier                                : 0x11 (QSFP28)
 ...
 Extended identifier description           : 5.0W max. Power consumption,  High Power Class (> 3.5 W) enabled
 Power set                                 : Off
 Power override                            : On
 ...
 Transmit avg optical power (Channel 1)    : 0.7733 mW / -1.12 dBm
 Transmit avg optical power (Channel 2)    : 0.7649 mW / -1.16 dBm
 Transmit avg optical power (Channel 3)    : 0.7743 mW / -1.11 dBm
 Transmit avg optical power (Channel 4)    : 0.7837 mW / -1.06 dBm
 Rcvr signal avg optical power(Channel 1)  : 0.9186 mW / -0.37 dBm
 Rcvr signal avg optical power(Channel 2)  : 0.9136 mW / -0.39 dBm
 Rcvr signal avg optical power(Channel 3)  : 0.8986 mW / -0.46 dBm
 Rcvr signal avg optical power(Channel 4)  : 0.8701 mW / -0.60 dBm

The module is not in low power mode, as it is not forced by hardware
(Power override is on) or by software (Power set is off).

The low power mode can be queried from the kernel. In case Power
override was off, the kernel would need to take into account the state
of the LPMode signal, which is not visible to user space.

 $ ethtool --show-module swp13
 Module parameters for swp13:
 low-power false

Turn on low power mode:

 # ethtool --set-module swp13 low-power on

Query low power mode again:

 $ ethtool --show-module swp13
 Module parameters for swp13:
 low-power true

Verify with the data read from the EEPROM:

 # ethtool -m swp13
 Identifier                                : 0x11 (QSFP28)
 ...
 Extended identifier description           : 5.0W max. Power consumption,  High Power Class (> 3.5 W) not enabled
 Power set                                 : On
 Power override                            : On
 ...
 Transmit avg optical power (Channel 1)    : 0.0000 mW / -inf dBm
 Transmit avg optical power (Channel 2)    : 0.0000 mW / -inf dBm
 Transmit avg optical power (Channel 3)    : 0.0000 mW / -inf dBm
 Transmit avg optical power (Channel 4)    : 0.0000 mW / -inf dBm
 Rcvr signal avg optical power(Channel 1)  : 0.0000 mW / -inf dBm
 Rcvr signal avg optical power(Channel 2)  : 0.0000 mW / -inf dBm
 Rcvr signal avg optical power(Channel 3)  : 0.0000 mW / -inf dBm
 Rcvr signal avg optical power(Channel 4)  : 0.0000 mW / -inf dBm

Allow the module to transition out of low power mode:

 # ethtool --set-module swp13 low-power off

Query low power mode again:

 $ ethtool --show-module swp13
 Module parameters for swp13:
 low-power false

Verify with the data read from the EEPROM:

 # ethtool -m swp13
 Identifier                                : 0x11 (QSFP28)
 ...
 Extended identifier description           : 5.0W max. Power consumption,  High Power Class (> 3.5 W) enabled
 Power set                                 : Off
 Power override                            : On
 ...
 Transmit avg optical power (Channel 1)    : 0.7783 mW / -1.09 dBm
 Transmit avg optical power (Channel 2)    : 0.7806 mW / -1.08 dBm
 Transmit avg optical power (Channel 3)    : 0.7885 mW / -1.03 dBm
 Transmit avg optical power (Channel 4)    : 0.7985 mW / -0.98 dBm
 Rcvr signal avg optical power(Channel 1)  : 0.9124 mW / -0.40 dBm
 Rcvr signal avg optical power(Channel 2)  : 0.9071 mW / -0.42 dBm
 Rcvr signal avg optical power(Channel 3)  : 0.8993 mW / -0.46 dBm
 Rcvr signal avg optical power(Channel 4)  : 0.8644 mW / -0.63 dBm

Signed-off-by: Ido Schimmel <idosch@nvidia.com>
---
 Documentation/networking/ethtool-netlink.rst |  55 +++++-
 include/linux/ethtool.h                      |   9 +
 include/uapi/linux/ethtool_netlink.h         |  16 ++
 net/ethtool/Makefile                         |   2 +-
 net/ethtool/module.c                         | 184 +++++++++++++++++++
 net/ethtool/netlink.c                        |  19 ++
 net/ethtool/netlink.h                        |   4 +
 7 files changed, 286 insertions(+), 3 deletions(-)
 create mode 100644 net/ethtool/module.c

Message ID	20210809102152.719961-2-idosch@idosch.org (mailing list archive)
State	Superseded
Delegated to:	Netdev Maintainers
Headers	show Return-Path: <netdev-owner@kernel.org> X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-16.8 required=3.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_CR_TRAILER,INCLUDES_PATCH, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED,USER_AGENT_GIT autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id F1508C4338F for <netdev@archiver.kernel.org>; Mon, 9 Aug 2021 10:22:19 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id D71AA610A7 for <netdev@archiver.kernel.org>; Mon, 9 Aug 2021 10:22:19 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234328AbhHIKWj (ORCPT <rfc822;netdev@archiver.kernel.org>); Mon, 9 Aug 2021 06:22:39 -0400 Received: from out3-smtp.messagingengine.com ([66.111.4.27]:53507 "EHLO out3-smtp.messagingengine.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233212AbhHIKWi (ORCPT <rfc822;netdev@vger.kernel.org>); Mon, 9 Aug 2021 06:22:38 -0400 Received: from compute1.internal (compute1.nyi.internal [10.202.2.41]) by mailout.nyi.internal (Postfix) with ESMTP id 1184D5C00DB; Mon, 9 Aug 2021 06:22:18 -0400 (EDT) Received: from mailfrontend1 ([10.202.2.162]) by compute1.internal (MEProxy); Mon, 09 Aug 2021 06:22:18 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d= messagingengine.com; h=cc:content-transfer-encoding:date:from :in-reply-to:message-id:mime-version:references:subject:to :x-me-proxy:x-me-proxy:x-me-sender:x-me-sender:x-sasl-enc; s= fm3; bh=uPz4aoxh3Qb/1Wdqcpnf3PJu0/wbvAPeh4adwV4N0EM=; b=oSTKdiXy MKyL/0PZ1j/RsXeQe7RAsNZ+htqy/pzjdAtwuxuKfx2S9GX1UKrBtSDCg1J5U/Fo NZZOrABPnDPIYKuMLoXrB5BhuCIJd+y51bhRlcJcHRoKca6mBpkJGImQxelfn7s1 QAwIw2PKknGfCFVZCiofq6OKhWAgPGD/WbljTCeo3uCMkp1fgD0P6kYYohRrg5Ja xWM8zDQ8lqqGe+SWV0h9Rw0m1oJ4FBs/0J6QG6EyFPb05nKjYtiV1dASIL0Ihqa3 +5P84yy7BjTSoLWEZL7N10zPAwrYPKR6W9sskHT5Opjhu7LDETflT3y4jijmbp2Q 4oGHfiTQK4q8IQ== X-ME-Sender: <xms:2QERYfAbUoSm1duL3tLigTTK_S1nYxsamd__3_5KeMAZDcbHsposiw> <xme:2QERYVihRUVwdZH0868q3aNU88NnBJkwSvNWvfHyeR2vVFC5K9E-SVQBUBPYliT-D xRR2bJbm9FKapM> X-ME-Received: <xmr:2QERYakaGxIb-T8zBdoNkW9aJTZq6Lalevo_jNYSX6kZaubdWTlSa99S72Ntnr-XCeanOhAzoequ-qHs7HKGGH4Osyo7mAuWt8bO2kp4fYgKgg> X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgedvtddrjeejgddvkecutefuodetggdotefrodftvf curfhrohhfihhlvgemucfhrghsthforghilhdpqfgfvfdpuffrtefokffrpgfnqfghnecu uegrihhlohhuthemuceftddtnecunecujfgurhephffvufffkffojghfggfgsedtkeertd ertddtnecuhfhrohhmpefkughoucfutghhihhmmhgvlhcuoehiughoshgthhesihguohhs tghhrdhorhhgqeenucggtffrrghtthgvrhhnpeduteeiveffffevleekleejffekhfekhe fgtdfftefhledvjefggfehgfevjeekhfenucevlhhushhtvghrufhiiigvpedtnecurfgr rhgrmhepmhgrihhlfhhrohhmpehiughoshgthhesihguohhstghhrdhorhhg X-ME-Proxy: <xmx:2QERYRycLXbN_3QTGkHgENpJR0vZvAmCkpYXkmEsV-GKcE9MJo_N4A> <xmx:2QERYURkwLkR3LTqAiL0YIza-7tt5lTL7d0ccgnp6r8sTXWNzsvrbw> <xmx:2QERYUZlltjRzNTEekTPrj2U_mDPws3_UT9LRKIrJj87Rl1uqpD9ig> <xmx:2gERYTFdPpHx-g3ULZLIFQ_qXenokV-a0MN7VISBkehU70dPwkuYwg> Received: by mail.messagingengine.com (Postfix) with ESMTPA; Mon, 9 Aug 2021 06:22:15 -0400 (EDT) From: Ido Schimmel <idosch@idosch.org> To: netdev@vger.kernel.org Cc: davem@davemloft.net, kuba@kernel.org, andrew@lunn.ch, mkubecek@suse.cz, pali@kernel.org, vadimp@nvidia.com, mlxsw@nvidia.com, Ido Schimmel <idosch@nvidia.com> Subject: [RFC PATCH net-next 1/8] ethtool: Add ability to control transceiver modules' low power mode Date: Mon, 9 Aug 2021 13:21:45 +0300 Message-Id: <20210809102152.719961-2-idosch@idosch.org> X-Mailer: git-send-email 2.31.1 In-Reply-To: <20210809102152.719961-1-idosch@idosch.org> References: <20210809102152.719961-1-idosch@idosch.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: <netdev.vger.kernel.org> X-Mailing-List: netdev@vger.kernel.org X-Patchwork-Delegate: kuba@kernel.org X-Patchwork-State: RFC
Series	ethtool: Add ability to control transceiver modules \| expand [RFC,net-next,0/8] ethtool: Add ability to control transceiver modules [RFC,net-next,1/8] ethtool: Add ability to control transceiver modules' low power mode [RFC,net-next,2/8] ethtool: Add ability to reset transceiver modules [RFC,net-next,3/8] mlxsw: reg: Add fields to PMAOS register [RFC,net-next,4/8] mlxsw: Make PMAOS pack function more generic [RFC,net-next,5/8] mlxsw: reg: Add Port Module Memory Map Properties register [RFC,net-next,6/8] mlxsw: reg: Add Management Cable IO and Notifications register [RFC,net-next,7/8] mlxsw: Add ability to control transceiver modules' low power mode [RFC,net-next,8/8] mlxsw: Add ability to reset transceiver modules

Context	Check	Description
netdev/cover_letter	success	Link
netdev/fixes_present	success	Link
netdev/patch_count	success	Link
netdev/tree_selection	success	Clearly marked for net-next
netdev/subject_prefix	success	Link
netdev/cc_maintainers	warning	12 maintainers not CCed: corbet@lwn.net arnd@arndb.de ffmancera@riseup.net yangbo.lu@nxp.com danieller@nvidia.com zhengyongjun3@huawei.com saeedm@nvidia.com yajun.deng@linux.dev hkallweit1@gmail.com linux-doc@vger.kernel.org vladyslavt@nvidia.com johannes.berg@intel.com
netdev/source_inline	success	Was 0 now: 0
netdev/verify_signedoff	success	Link
netdev/module_param	success	Was 0 now: 0
netdev/build_32bit	success	Errors and warnings before: 2069 this patch: 2069
netdev/kdoc	success	Errors and warnings before: 0 this patch: 0
netdev/verify_fixes	success	Link
netdev/checkpatch	warning	WARNING: added, moved or deleted file(s), does MAINTAINERS need updating? WARNING: line length of 82 exceeds 80 columns WARNING: line length of 84 exceeds 80 columns WARNING: line length of 91 exceeds 80 columns WARNING: line length of 95 exceeds 80 columns
netdev/build_allmodconfig_warn	success	Errors and warnings before: 2061 this patch: 2061
netdev/header_inline	success	Link

[RFC,net-next,1/8] ethtool: Add ability to control transceiver modules' low power mode

Checks

Commit Message

Comments

Patch