[RFC,v4,24/27] qedn: Add support of NVME ICReq & ICResp

Message ID	20210429190926.5086-25-smalin@marvell.com (mailing list archive)
State	RFC
Headers	show Return-Path: <netdev-owner@kernel.org> From: Shai Malin <smalin@marvell.com> To: <netdev@vger.kernel.org>, <linux-nvme@lists.infradead.org>, <sagi@grimberg.me>, <hch@lst.de>, <axboe@fb.com>, <kbusch@kernel.org> CC: =?utf-8?q?David_S_=2E_Miller_davem_=40_davemloft_=2E_net_=C2=A0--cc=3DJa?= =?utf-8?q?kub_Kicinski?= <kuba@kernel.org>, <smalin@marvell.com>, <aelior@marvell.com>, <mkalderon@marvell.com>, <okulkarni@marvell.com>, <pkushwaha@marvell.com>, <malin1024@gmail.com> Subject: [RFC PATCH v4 24/27] qedn: Add support of NVME ICReq & ICResp Date: Thu, 29 Apr 2021 22:09:23 +0300 Message-ID: <20210429190926.5086-25-smalin@marvell.com> In-Reply-To: <20210429190926.5086-1-smalin@marvell.com> References: <20210429190926.5086-1-smalin@marvell.com> MIME-Version: 1.0 Content-Type: text/plain Precedence: bulk
Series	NVMeTCP Offload ULP and QEDN Device Driver \| expand [RFC,v4,00/27] NVMeTCP Offload ULP and QEDN Device Driver [RFC,v4,01/27] qed: Add NVMeTCP Offload PF Level FW and HW HSI [RFC,v4,02/27] qed: Add NVMeTCP Offload Connection Level FW and HW HSI [RFC,v4,03/27] qed: Add qed-NVMeTCP personality [RFC,v4,04/27] qed: Add support of HW filter block [RFC,v4,05/27] qed: Add NVMeTCP Offload IO Level FW and HW HSI [RFC,v4,06/27] qed: Add NVMeTCP Offload IO Level FW Initializations [RFC,v4,07/27] qed: Add IP services APIs support [RFC,v4,08/27] nvme-tcp-offload: Add nvme-tcp-offload - NVMeTCP HW offload ULP [RFC,v4,09/27] nvme-fabrics: Move NVMF_ALLOWED_OPTS and NVMF_REQUIRED_OPTS definitions [RFC,v4,10/27] nvme-tcp-offload: Add device scan implementation [RFC,v4,11/27] nvme-tcp-offload: Add controller level implementation [RFC,v4,12/27] nvme-tcp-offload: Add controller level error recovery implementation [RFC,v4,13/27] nvme-tcp-offload: Add queue level implementation [RFC,v4,14/27] nvme-tcp-offload: Add IO level implementation [RFC,v4,15/27] nvme-tcp-offload: Add Timeout and ASYNC Support [RFC,v4,16/27] qedn: Add qedn - Marvell's NVMeTCP HW offload vendor driver [RFC,v4,17/27] qedn: Add qedn probe [RFC,v4,18/27] qedn: Add qedn_claim_dev API support [RFC,v4,19/27] qedn: Add IRQ and fast-path resources initializations [RFC,v4,20/27] qedn: Add connection-level slowpath functionality [RFC,v4,21/27] qedn: Add support of configuring HW filter block [RFC,v4,22/27] qedn: Add IO level nvme_req and fw_cq workqueues [RFC,v4,23/27] qedn: Add support of Task and SGL [RFC,v4,24/27] qedn: Add support of NVME ICReq & ICResp [RFC,v4,25/27] qedn: Add IO level fastpath functionality [RFC,v4,26/27] qedn: Add Connection and IO level recovery flows [RFC,v4,27/27] qedn: Add support of ASYNC

Message ID

20210429190926.5086-25-smalin@marvell.com (mailing list archive)

State

RFC

Headers

From: Shai Malin <smalin@marvell.com>
To: <netdev@vger.kernel.org>, <linux-nvme@lists.infradead.org>,
        <sagi@grimberg.me>, <hch@lst.de>, <axboe@fb.com>,
        <kbusch@kernel.org>
CC: =?utf-8?q?David_S_=2E_Miller_davem_=40_davemloft_=2E_net_=C2=A0--cc=3DJa?=
	=?utf-8?q?kub_Kicinski?=  <kuba@kernel.org>, <smalin@marvell.com>,
 <aelior@marvell.com>, <mkalderon@marvell.com>, <okulkarni@marvell.com>,
 <pkushwaha@marvell.com>, <malin1024@gmail.com>
Subject: [RFC PATCH v4 24/27] qedn: Add support of NVME ICReq & ICResp
Date: Thu, 29 Apr 2021 22:09:23 +0300
Message-ID: <20210429190926.5086-25-smalin@marvell.com>
In-Reply-To: <20210429190926.5086-1-smalin@marvell.com>
References: <20210429190926.5086-1-smalin@marvell.com>
MIME-Version: 1.0
Content-Type: text/plain
Precedence: bulk

Series

NVMeTCP Offload ULP and QEDN Device Driver | expand

Context	Check	Description
netdev/cover_letter	success	Link
netdev/fixes_present	success	Link
netdev/patch_count	fail	Series longer than 15 patches
netdev/tree_selection	success	Guessed tree name to be net-next
netdev/subject_prefix	success	Link
netdev/cc_maintainers	success	CCed 3 of 3 maintainers
netdev/source_inline	success	Was 0 now: 0
netdev/verify_signedoff	success	Link
netdev/module_param	success	Was 0 now: 0
netdev/build_32bit	fail	Errors and warnings before: 37 this patch: 41
netdev/kdoc	success	Errors and warnings before: 0 this patch: 0
netdev/verify_fixes	success	Link
netdev/checkpatch	warning	WARNING: line length of 81 exceeds 80 columns WARNING: line length of 82 exceeds 80 columns WARNING: line length of 86 exceeds 80 columns WARNING: line length of 88 exceeds 80 columns WARNING: line length of 90 exceeds 80 columns WARNING: line length of 93 exceeds 80 columns
netdev/build_allmodconfig_warn	success	Errors and warnings before: 35 this patch: 35
netdev/header_inline	success	Link

Context

Check

Description

netdev/cover_letter

success

Link

netdev/fixes_present

success

Link

netdev/patch_count

fail

Series longer than 15 patches

netdev/tree_selection

success

Guessed tree name to be net-next

netdev/subject_prefix

success

Link

netdev/cc_maintainers

success

CCed 3 of 3 maintainers

netdev/source_inline

success

Was 0 now: 0

netdev/verify_signedoff

success

Link

netdev/module_param

success

Was 0 now: 0

netdev/build_32bit

fail

Errors and warnings before: 37 this patch: 41

netdev/kdoc

success

Errors and warnings before: 0 this patch: 0

netdev/verify_fixes

success

Link

netdev/checkpatch

warning

WARNING: line length of 81 exceeds 80 columns WARNING: line length of 82 exceeds 80 columns WARNING: line length of 86 exceeds 80 columns WARNING: line length of 88 exceeds 80 columns WARNING: line length of 90 exceeds 80 columns WARNING: line length of 93 exceeds 80 columns

netdev/build_allmodconfig_warn

success

Errors and warnings before: 35 this patch: 35

netdev/header_inline

success

Link

Commit Message

Shai Malin April 29, 2021, 7:09 p.m. UTC

From: Prabhakar Kushwaha <pkushwaha@marvell.com>

Once a TCP connection established, the host sends an Initialize
Connection Request (ICReq) PDU to the controller.
Further Initialize Connection Response (ICResp) PDU received from
controller is processed by host to establish a connection and
exchange connection configuration parameters.

This patch present support of generation of ICReq and processing of
ICResp. It also update host configuration based on exchanged parameters.

Acked-by: Igor Russkikh <irusskikh@marvell.com>
Signed-off-by: Prabhakar Kushwaha <pkushwaha@marvell.com>
Signed-off-by: Omkar Kulkarni <okulkarni@marvell.com>
Signed-off-by: Michal Kalderon <mkalderon@marvell.com>
Signed-off-by: Ariel Elior <aelior@marvell.com>
Signed-off-by: Shai Malin <smalin@marvell.com>
---
 drivers/nvme/hw/qedn/qedn.h      |  36 ++++
 drivers/nvme/hw/qedn/qedn_conn.c | 317 ++++++++++++++++++++++++++++++-
 drivers/nvme/hw/qedn/qedn_main.c |  22 +++
 drivers/nvme/hw/qedn/qedn_task.c |   8 +-
 4 files changed, 379 insertions(+), 4 deletions(-)

Comments

Hannes Reinecke May 2, 2021, 11:53 a.m. UTC | #1

On 4/29/21 9:09 PM, Shai Malin wrote:
> From: Prabhakar Kushwaha <pkushwaha@marvell.com>
> 
> Once a TCP connection established, the host sends an Initialize
> Connection Request (ICReq) PDU to the controller.
> Further Initialize Connection Response (ICResp) PDU received from
> controller is processed by host to establish a connection and
> exchange connection configuration parameters.
> 
> This patch present support of generation of ICReq and processing of
> ICResp. It also update host configuration based on exchanged parameters.
> 
> Acked-by: Igor Russkikh <irusskikh@marvell.com>
> Signed-off-by: Prabhakar Kushwaha <pkushwaha@marvell.com>
> Signed-off-by: Omkar Kulkarni <okulkarni@marvell.com>
> Signed-off-by: Michal Kalderon <mkalderon@marvell.com>
> Signed-off-by: Ariel Elior <aelior@marvell.com>
> Signed-off-by: Shai Malin <smalin@marvell.com>
> ---
>   drivers/nvme/hw/qedn/qedn.h      |  36 ++++
>   drivers/nvme/hw/qedn/qedn_conn.c | 317 ++++++++++++++++++++++++++++++-
>   drivers/nvme/hw/qedn/qedn_main.c |  22 +++
>   drivers/nvme/hw/qedn/qedn_task.c |   8 +-
>   4 files changed, 379 insertions(+), 4 deletions(-)
> 
> diff --git a/drivers/nvme/hw/qedn/qedn.h b/drivers/nvme/hw/qedn/qedn.h
> index 880ca245b02c..773a57994148 100644
> --- a/drivers/nvme/hw/qedn/qedn.h
> +++ b/drivers/nvme/hw/qedn/qedn.h
> @@ -16,6 +16,7 @@
>   
>   /* Driver includes */
>   #include "../../host/tcp-offload.h"
> +#include <linux/nvme-tcp.h>
>   
>   #define QEDN_MAJOR_VERSION		8
>   #define QEDN_MINOR_VERSION		62
> @@ -52,6 +53,8 @@
>   
>   /* Protocol defines */
>   #define QEDN_MAX_IO_SIZE QED_NVMETCP_MAX_IO_SIZE
> +#define QEDN_MAX_PDU_SIZE 0x80000 /* 512KB */
> +#define QEDN_MAX_OUTSTANDING_R2T_PDUS 0 /* 0 Based == 1 max R2T */
>   
>   #define QEDN_SGE_BUFF_SIZE 4096
>   #define QEDN_MAX_SGES_PER_TASK DIV_ROUND_UP(QEDN_MAX_IO_SIZE, QEDN_SGE_BUFF_SIZE)
> @@ -65,6 +68,11 @@
>   #define QEDN_TASK_INSIST_TMO 1000 /* 1 sec */
>   #define QEDN_INVALID_ITID 0xFFFF
>   
> +#define QEDN_ICREQ_FW_PAYLOAD (sizeof(struct nvme_tcp_icreq_pdu) - \
> +			       sizeof(struct nvmetcp_init_conn_req_hdr))
> +/* The FW will handle the ICReq as CCCID 0 (FW internal design) */
> +#define QEDN_ICREQ_CCCID 0
> +
>   /*
>    * TCP offload stack default configurations and defines.
>    * Future enhancements will allow controlling the configurable
> @@ -136,6 +144,16 @@ struct qedn_fp_queue {
>   	char irqname[QEDN_IRQ_NAME_LEN];
>   };
>   
> +struct qedn_negotiation_params {
> +	u32 maxh2cdata; /* Negotiation */
> +	u32 maxr2t; /* Validation */
> +	u16 pfv; /* Validation */
> +	bool hdr_digest; /* Negotiation */
> +	bool data_digest; /* Negotiation */
> +	u8 cpda; /* Negotiation */
> +	u8 hpda; /* Validation */
> +};
> +
>   struct qedn_ctx {
>   	struct pci_dev *pdev;
>   	struct qed_dev *cdev;
> @@ -195,6 +213,9 @@ struct qedn_endpoint {
>   	struct qed_chain fw_sq_chain;
>   	void __iomem *p_doorbell;
>   
> +	/* Spinlock for accessing FW queue */
> +	spinlock_t doorbell_lock;
> +
>   	/* TCP Params */
>   	__be32 dst_addr[4]; /* In network order */
>   	__be32 src_addr[4]; /* In network order */
> @@ -268,6 +289,12 @@ struct qedn_ctrl {
>   	atomic_t host_num_active_conns;
>   };
>   
> +struct qedn_icreq_padding {
> +	u32 *buffer;
> +	dma_addr_t pa;
> +	struct nvmetcp_sge sge;
> +};
> +
>   /* Connection level struct */
>   struct qedn_conn_ctx {
>   	/* IO path */
> @@ -329,6 +356,11 @@ struct qedn_conn_ctx {
>   
>   	size_t sq_depth;
>   
> +	struct qedn_negotiation_params required_params;
> +	struct qedn_negotiation_params pdu_params;
> +	struct nvmetcp_icresp_hdr_psh icresp;
> +	struct qedn_icreq_padding *icreq_pad;
> +
>   	/* "dummy" socket */
>   	struct socket *sock;
>   };
> @@ -337,6 +369,7 @@ enum qedn_conn_resources_state {
>   	QEDN_CONN_RESRC_FW_SQ,
>   	QEDN_CONN_RESRC_ACQUIRE_CONN,
>   	QEDN_CONN_RESRC_TASKS,
> +	QEDN_CONN_RESRC_ICREQ_PAD,
>   	QEDN_CONN_RESRC_CCCID_ITID_MAP,
>   	QEDN_CONN_RESRC_TCP_PORT,
>   	QEDN_CONN_RESRC_MAX = 64
> @@ -375,5 +408,8 @@ void qedn_common_clear_fw_sgl(struct storage_sgl_task_params *sgl_task_params);
>   void qedn_return_active_tasks(struct qedn_conn_ctx *conn_ctx);
>   void qedn_destroy_free_tasks(struct qedn_fp_queue *fp_q,
>   			     struct qedn_io_resources *io_resrc);
> +void qedn_swap_bytes(u32 *p, int size);
> +void qedn_prep_icresp(struct qedn_conn_ctx *conn_ctx, struct nvmetcp_fw_cqe *cqe);
> +void qedn_ring_doorbell(struct qedn_conn_ctx *conn_ctx);
>   
>   #endif /* _QEDN_H_ */
> diff --git a/drivers/nvme/hw/qedn/qedn_conn.c b/drivers/nvme/hw/qedn/qedn_conn.c
> index 10a80fbeac43..5679354aa0e0 100644
> --- a/drivers/nvme/hw/qedn/qedn_conn.c
> +++ b/drivers/nvme/hw/qedn/qedn_conn.c
> @@ -34,6 +34,25 @@ inline int qedn_qid(struct nvme_tcp_ofld_queue *queue)
>   	return queue - queue->ctrl->queues;
>   }
>   
> +void qedn_ring_doorbell(struct qedn_conn_ctx *conn_ctx)
> +{
> +	struct nvmetcp_db_data dbell = { 0 };
> +	u16 prod_idx;
> +
> +	dbell.agg_flags = 0;
> +	dbell.params |= DB_DEST_XCM << NVMETCP_DB_DATA_DEST_SHIFT;
> +	dbell.params |= DB_AGG_CMD_SET << NVMETCP_DB_DATA_AGG_CMD_SHIFT;
> +	dbell.params |=
> +		DQ_XCM_ISCSI_SQ_PROD_CMD << NVMETCP_DB_DATA_AGG_VAL_SEL_SHIFT;
> +	dbell.params |= 1 << NVMETCP_DB_DATA_BYPASS_EN_SHIFT;
> +	prod_idx = qed_chain_get_prod_idx(&conn_ctx->ep.fw_sq_chain);
> +	dbell.sq_prod = cpu_to_le16(prod_idx);
> +
> +	/* wmb - Make sure fw idx is coherent */
> +	wmb();
> +	writel(*(u32 *)&dbell, conn_ctx->ep.p_doorbell);
> +}
> +
>   int qedn_set_con_state(struct qedn_conn_ctx *conn_ctx, enum qedn_conn_state new_state)
>   {
>   	spin_lock_bh(&conn_ctx->conn_state_lock);
> @@ -130,6 +149,71 @@ int qedn_initialize_endpoint(struct qedn_endpoint *ep, u8 *local_mac_addr,
>   	return -1;
>   }
>   
> +static int qedn_alloc_icreq_pad(struct qedn_conn_ctx *conn_ctx)
> +{
> +	struct qedn_ctx *qedn = conn_ctx->qedn;
> +	struct qedn_icreq_padding *icreq_pad;
> +	u32 *buffer;
> +	int rc = 0;
> +
> +	icreq_pad = kzalloc(sizeof(*icreq_pad), GFP_KERNEL);
> +	if (!icreq_pad)
> +		return -ENOMEM;
> +
> +	conn_ctx->icreq_pad = icreq_pad;
> +	memset(&icreq_pad->sge, 0, sizeof(icreq_pad->sge));
> +	buffer = dma_alloc_coherent(&qedn->pdev->dev,
> +				    QEDN_ICREQ_FW_PAYLOAD,
> +				    &icreq_pad->pa,
> +				    GFP_KERNEL);
> +	if (!buffer) {
> +		pr_err("Could not allocate icreq_padding SGE buffer.\n");
> +		rc =  -ENOMEM;
> +		goto release_icreq_pad;
> +	}
> +
> +	DMA_REGPAIR_LE(icreq_pad->sge.sge_addr, icreq_pad->pa);
> +	icreq_pad->sge.sge_len = cpu_to_le32(QEDN_ICREQ_FW_PAYLOAD);
> +	icreq_pad->buffer = buffer;
> +	set_bit(QEDN_CONN_RESRC_ICREQ_PAD, &conn_ctx->resrc_state);
> +
> +	return 0;
> +
> +release_icreq_pad:
> +	kfree(icreq_pad);
> +	conn_ctx->icreq_pad = NULL;
> +
> +	return rc;
> +}
> +
> +static void qedn_free_icreq_pad(struct qedn_conn_ctx *conn_ctx)
> +{
> +	struct qedn_ctx *qedn = conn_ctx->qedn;
> +	struct qedn_icreq_padding *icreq_pad;
> +	u32 *buffer;
> +
> +	icreq_pad = conn_ctx->icreq_pad;
> +	if (unlikely(!icreq_pad)) {
> +		pr_err("null ptr in icreq_pad in conn_ctx\n");
> +		goto finally;
> +	}
> +
> +	buffer = icreq_pad->buffer;
> +	if (buffer) {
> +		dma_free_coherent(&qedn->pdev->dev,
> +				  QEDN_ICREQ_FW_PAYLOAD,
> +				  (void *)buffer,
> +				  icreq_pad->pa);
> +		icreq_pad->buffer = NULL;
> +	}
> +
> +	kfree(icreq_pad);
> +	conn_ctx->icreq_pad = NULL;
> +
> +finally:
> +	clear_bit(QEDN_CONN_RESRC_ICREQ_PAD, &conn_ctx->resrc_state);
> +}
> +
>   static void qedn_release_conn_ctx(struct qedn_conn_ctx *conn_ctx)
>   {
>   	struct qedn_ctx *qedn = conn_ctx->qedn;
> @@ -151,6 +235,9 @@ static void qedn_release_conn_ctx(struct qedn_conn_ctx *conn_ctx)
>   		clear_bit(QEDN_CONN_RESRC_ACQUIRE_CONN, &conn_ctx->resrc_state);
>   	}
>   
> +	if (test_bit(QEDN_CONN_RESRC_ICREQ_PAD, &conn_ctx->resrc_state))
> +		qedn_free_icreq_pad(conn_ctx);
> +
>   	if (test_bit(QEDN_CONN_RESRC_TASKS, &conn_ctx->resrc_state)) {
>   		clear_bit(QEDN_CONN_RESRC_TASKS, &conn_ctx->resrc_state);
>   			qedn_return_active_tasks(conn_ctx);
> @@ -309,6 +396,194 @@ void qedn_terminate_connection(struct qedn_conn_ctx *conn_ctx, int abrt_flag)
>   	queue_work(qctrl->sp_wq, &conn_ctx->sp_wq_entry);
>   }
>   
> +static int qedn_nvmetcp_update_conn(struct qedn_ctx *qedn, struct qedn_conn_ctx *conn_ctx)
> +{
> +	struct qedn_negotiation_params *pdu_params = &conn_ctx->pdu_params;
> +	struct qed_nvmetcp_params_update *conn_info;
> +	int rc;
> +
> +	conn_info = kzalloc(sizeof(*conn_info), GFP_KERNEL);
> +	if (!conn_info)
> +		return -ENOMEM;
> +
> +	conn_info->hdr_digest_en = pdu_params->hdr_digest;
> +	conn_info->data_digest_en = pdu_params->data_digest;
> +	conn_info->max_recv_pdu_length = QEDN_MAX_PDU_SIZE;
> +	conn_info->max_io_size = QEDN_MAX_IO_SIZE;
> +	conn_info->max_send_pdu_length = pdu_params->maxh2cdata;
> +
> +	rc = qed_ops->update_conn(qedn->cdev, conn_ctx->conn_handle, conn_info);
> +	if (rc) {
> +		pr_err("Could not update connection\n");
> +		rc = -ENXIO;
> +	}
> +
> +	kfree(conn_info);
> +
> +	return rc;
> +}
> +
> +static int qedn_update_ramrod(struct qedn_conn_ctx *conn_ctx)
> +{
> +	struct qedn_ctx *qedn = conn_ctx->qedn;
> +	int rc = 0;
> +
> +	rc = qedn_set_con_state(conn_ctx, CONN_STATE_WAIT_FOR_UPDATE_EQE);
> +	if (rc)
> +		return rc;
> +
> +	rc = qedn_nvmetcp_update_conn(qedn, conn_ctx);
> +	if (rc)
> +		return rc;
> +
> +	if (conn_ctx->state != CONN_STATE_WAIT_FOR_UPDATE_EQE) {
> +		pr_err("cid 0x%x: Unexpected state 0x%x after update ramrod\n",
> +		       conn_ctx->fw_cid, conn_ctx->state);
> +
> +		return -EINVAL;
> +	}
> +
> +	return rc;
> +}
> +
> +static int qedn_send_icreq(struct qedn_conn_ctx *conn_ctx)
> +{
> +	struct nvmetcp_init_conn_req_hdr *icreq_ptr = NULL;
> +	struct storage_sgl_task_params *sgl_task_params;
> +	struct nvmetcp_task_params task_params;
> +	struct qedn_task_ctx *qedn_task = NULL;
> +	struct nvme_tcp_icreq_pdu icreq;
> +	struct nvmetcp_wqe *chain_sqe;
> +	struct nvmetcp_wqe local_sqe;
> +
> +	qedn_task = qedn_get_task_from_pool_insist(conn_ctx, QEDN_ICREQ_CCCID);
> +	if (!qedn_task)
> +		return -EINVAL;
> +
> +	memset(&icreq, 0, sizeof(icreq));
> +	memset(&local_sqe, 0, sizeof(local_sqe));
> +
> +	/* Initialize ICReq */
> +	icreq.hdr.type = nvme_tcp_icreq;
> +	icreq.hdr.hlen = sizeof(icreq);
> +	icreq.hdr.pdo = 0;
> +	icreq.hdr.plen = cpu_to_le32(icreq.hdr.hlen);
> +	icreq.pfv = cpu_to_le16(conn_ctx->required_params.pfv);
> +	icreq.maxr2t = cpu_to_le32(conn_ctx->required_params.maxr2t);
> +	icreq.hpda = conn_ctx->required_params.hpda;
> +	if (conn_ctx->required_params.hdr_digest)
> +		icreq.digest |= NVME_TCP_HDR_DIGEST_ENABLE;
> +	if (conn_ctx->required_params.data_digest)
> +		icreq.digest |= NVME_TCP_DATA_DIGEST_ENABLE;
> +
> +	qedn_swap_bytes((u32 *)&icreq,
> +			(sizeof(icreq) - QEDN_ICREQ_FW_PAYLOAD) /
> +			 sizeof(u32));
> +
> +	/* Initialize task params */
> +	task_params.opq.lo = cpu_to_le32(((u64)(qedn_task)) & 0xffffffff);
> +	task_params.opq.hi = cpu_to_le32(((u64)(qedn_task)) >> 32);
> +	task_params.context = qedn_task->fw_task_ctx;
> +	task_params.sqe = &local_sqe;
> +	task_params.conn_icid = (u16)conn_ctx->conn_handle;
> +	task_params.itid = qedn_task->itid;
> +	task_params.cq_rss_number = conn_ctx->default_cq;
> +	task_params.tx_io_size = QEDN_ICREQ_FW_PAYLOAD;
> +	task_params.rx_io_size = 0; /* Rx doesn't use SGL for icresp */
> +
> +	/* Init SGE for ICReq padding */
> +	sgl_task_params = &qedn_task->sgl_task_params;
> +	sgl_task_params->total_buffer_size = task_params.tx_io_size;
> +	sgl_task_params->small_mid_sge = false;
> +	sgl_task_params->num_sges = 1;
> +	memcpy(sgl_task_params->sgl, &conn_ctx->icreq_pad->sge,
> +	       sizeof(conn_ctx->icreq_pad->sge));
> +	icreq_ptr = (struct nvmetcp_init_conn_req_hdr *)&icreq;
> +
> +	qed_ops->init_icreq_exchange(&task_params, icreq_ptr, sgl_task_params,  NULL);
> +
> +	qedn_set_con_state(conn_ctx, CONN_STATE_WAIT_FOR_IC_COMP);
> +	atomic_inc(&conn_ctx->num_active_fw_tasks);
> +
> +	/* spin_lock - doorbell is accessed  both Rx flow and response flow */
> +	spin_lock(&conn_ctx->ep.doorbell_lock);
> +	chain_sqe = qed_chain_produce(&conn_ctx->ep.fw_sq_chain);
> +	memcpy(chain_sqe, &local_sqe, sizeof(local_sqe));
> +	qedn_ring_doorbell(conn_ctx);
> +	spin_unlock(&conn_ctx->ep.doorbell_lock);
> +
> +	return 0;
> +}
> +

And this is what I meant. You _do_ swab bytes before sending it off to 
the HW, _and_ you use the standard nvme-tcp PDU definitions.
So why do you have your own, byte-swapped versions of the PDUs?

> +void qedn_prep_icresp(struct qedn_conn_ctx *conn_ctx, struct nvmetcp_fw_cqe *cqe)
> +{
> +	struct nvmetcp_icresp_hdr_psh *icresp_from_cqe =
> +		(struct nvmetcp_icresp_hdr_psh *)&cqe->nvme_cqe;
> +	struct nvme_tcp_ofld_ctrl *ctrl = conn_ctx->ctrl;
> +	struct qedn_ctrl *qctrl = NULL;
> +
> +	qctrl = (struct qedn_ctrl *)ctrl->private_data;
> +
> +	memcpy(&conn_ctx->icresp, icresp_from_cqe, sizeof(conn_ctx->icresp));
> +	qedn_set_sp_wa(conn_ctx, HANDLE_ICRESP);
> +	queue_work(qctrl->sp_wq, &conn_ctx->sp_wq_entry);
> +}
> +
> +static int qedn_handle_icresp(struct qedn_conn_ctx *conn_ctx)
> +{
> +	struct nvmetcp_icresp_hdr_psh *icresp = &conn_ctx->icresp;
> +	u16 pfv = __swab16(le16_to_cpu(icresp->pfv_swapped));
> +	int rc = 0;
> +

Again here; you could treat the received icresp as a binaray blob. 
byteswap it, and then cast is to the standard icresp structure.
Hmm?

> +	qedn_free_icreq_pad(conn_ctx);
> +
> +	/* Validate ICResp */
> +	if (pfv != conn_ctx->required_params.pfv) {
> +		pr_err("cid %u: unsupported pfv %u\n", conn_ctx->fw_cid, pfv);
> +
> +		return -EINVAL;
> +	}
> +
> +	if (icresp->cpda > conn_ctx->required_params.cpda) {
> +		pr_err("cid %u: unsupported cpda %u\n", conn_ctx->fw_cid, icresp->cpda);
> +
> +		return -EINVAL;
> +	}
> +
> +	if ((NVME_TCP_HDR_DIGEST_ENABLE & icresp->digest) !=
> +	    conn_ctx->required_params.hdr_digest) {
> +		if ((NVME_TCP_HDR_DIGEST_ENABLE & icresp->digest) >
> +		    conn_ctx->required_params.hdr_digest) {
> +			pr_err("cid 0x%x: invalid header digest bit\n", conn_ctx->fw_cid);
> +		}
> +	}
> +
> +	if ((NVME_TCP_DATA_DIGEST_ENABLE & icresp->digest) !=
> +	    conn_ctx->required_params.data_digest) {
> +		if ((NVME_TCP_DATA_DIGEST_ENABLE & icresp->digest) >
> +		    conn_ctx->required_params.data_digest) {
> +			pr_err("cid 0x%x: invalid data digest bit\n", conn_ctx->fw_cid);
> +	}
> +	}
> +
> +	memset(&conn_ctx->pdu_params, 0, sizeof(conn_ctx->pdu_params));
> +	conn_ctx->pdu_params.maxh2cdata =
> +		__swab32(le32_to_cpu(icresp->maxdata_swapped));
> +	conn_ctx->pdu_params.maxh2cdata = QEDN_MAX_PDU_SIZE;
> +	if (conn_ctx->pdu_params.maxh2cdata > QEDN_MAX_PDU_SIZE)
> +		conn_ctx->pdu_params.maxh2cdata = QEDN_MAX_PDU_SIZE;
> +
> +	conn_ctx->pdu_params.pfv = pfv;
> +	conn_ctx->pdu_params.cpda = icresp->cpda;
> +	conn_ctx->pdu_params.hpda = conn_ctx->required_params.hpda;
> +	conn_ctx->pdu_params.hdr_digest = NVME_TCP_HDR_DIGEST_ENABLE & icresp->digest;
> +	conn_ctx->pdu_params.data_digest = NVME_TCP_DATA_DIGEST_ENABLE & icresp->digest;
> +	conn_ctx->pdu_params.maxr2t = conn_ctx->required_params.maxr2t;
> +	rc = qedn_update_ramrod(conn_ctx);
> +
> +	return rc;
> +}
> +
>   /* Slowpath EQ Callback */
>   int qedn_event_cb(void *context, u8 fw_event_code, void *event_ring_data)
>   {
> @@ -363,7 +638,8 @@ int qedn_event_cb(void *context, u8 fw_event_code, void *event_ring_data)
>   			if (rc)
>   				return rc;
>   
> -			/* Placeholder - for ICReq flow */
> +			qedn_set_sp_wa(conn_ctx, SEND_ICREQ);
> +			queue_work(qctrl->sp_wq, &conn_ctx->sp_wq_entry);
>   		}
>   
>   		break;
> @@ -399,6 +675,7 @@ static int qedn_prep_and_offload_queue(struct qedn_conn_ctx *conn_ctx)
>   	}
>   
>   	set_bit(QEDN_CONN_RESRC_FW_SQ, &conn_ctx->resrc_state);
> +	spin_lock_init(&conn_ctx->ep.doorbell_lock);
>   	INIT_LIST_HEAD(&conn_ctx->host_pend_req_list);
>   	spin_lock_init(&conn_ctx->nvme_req_lock);
>   	atomic_set(&conn_ctx->num_active_tasks, 0);
> @@ -463,6 +740,11 @@ static int qedn_prep_and_offload_queue(struct qedn_conn_ctx *conn_ctx)
>   
>   	memset(conn_ctx->host_cccid_itid, 0xFF, dma_size);
>   	set_bit(QEDN_CONN_RESRC_CCCID_ITID_MAP, &conn_ctx->resrc_state);
> +
> +	rc = qedn_alloc_icreq_pad(conn_ctx);
> +		if (rc)
> +			goto rel_conn;
> +
>   	rc = qedn_set_con_state(conn_ctx, CONN_STATE_WAIT_FOR_CONNECT_DONE);
>   	if (rc)
>   		goto rel_conn;
> @@ -523,6 +805,9 @@ void qedn_sp_wq_handler(struct work_struct *work)
>   
>   	qedn = conn_ctx->qedn;
>   	if (test_bit(DESTROY_CONNECTION, &conn_ctx->agg_work_action)) {
> +		if (test_bit(HANDLE_ICRESP, &conn_ctx->agg_work_action))
> +			qedn_clr_sp_wa(conn_ctx, HANDLE_ICRESP);
> +
>   		qedn_destroy_connection(conn_ctx);
>   
>   		return;
> @@ -537,6 +822,36 @@ void qedn_sp_wq_handler(struct work_struct *work)
>   			return;
>   		}
>   	}
> +
> +	if (test_bit(SEND_ICREQ, &conn_ctx->agg_work_action)) {
> +		qedn_clr_sp_wa(conn_ctx, SEND_ICREQ);
> +		rc = qedn_send_icreq(conn_ctx);
> +		if (rc)
> +			return;
> +
> +		return;
> +	}
> +
> +	if (test_bit(HANDLE_ICRESP, &conn_ctx->agg_work_action)) {
> +		rc = qedn_handle_icresp(conn_ctx);
> +
> +		qedn_clr_sp_wa(conn_ctx, HANDLE_ICRESP);
> +		if (rc) {
> +			pr_err("IC handling returned with 0x%x\n", rc);
> +			if (test_and_set_bit(DESTROY_CONNECTION, &conn_ctx->agg_work_action))
> +				return;
> +
> +			qedn_destroy_connection(conn_ctx);
> +
> +			return;
> +		}
> +
> +		atomic_inc(&conn_ctx->est_conn_indicator);
> +		qedn_set_con_state(conn_ctx, CONN_STATE_NVMETCP_CONN_ESTABLISHED);
> +		wake_up_interruptible(&conn_ctx->conn_waitq);
> +
> +		return;
> +	}
>   }
>   
>   /* Clear connection aggregative slowpath work action */
> diff --git a/drivers/nvme/hw/qedn/qedn_main.c b/drivers/nvme/hw/qedn/qedn_main.c
> index 8d9c19d63480..a6756d7250b7 100644
> --- a/drivers/nvme/hw/qedn/qedn_main.c
> +++ b/drivers/nvme/hw/qedn/qedn_main.c
> @@ -285,6 +285,19 @@ static void qedn_set_ctrl_io_cpus(struct qedn_conn_ctx *conn_ctx, int qid)
>   	conn_ctx->cpu = fp_q->cpu;
>   }
>   
> +static void qedn_set_pdu_params(struct qedn_conn_ctx *conn_ctx)
> +{
> +	/* Enable digest once supported */
> +	conn_ctx->required_params.hdr_digest = 0;
> +	conn_ctx->required_params.data_digest = 0;
> +
> +	conn_ctx->required_params.maxr2t = QEDN_MAX_OUTSTANDING_R2T_PDUS;
> +	conn_ctx->required_params.pfv = NVME_TCP_PFV_1_0;
> +	conn_ctx->required_params.cpda = 0;
> +	conn_ctx->required_params.hpda = 0;
> +	conn_ctx->required_params.maxh2cdata = QEDN_MAX_PDU_SIZE;
> +}
> +
>   static int qedn_create_queue(struct nvme_tcp_ofld_queue *queue, int qid, size_t q_size)
>   {
>   	struct nvme_tcp_ofld_ctrl *ctrl = queue->ctrl;
> @@ -307,6 +320,7 @@ static int qedn_create_queue(struct nvme_tcp_ofld_queue *queue, int qid, size_t
>   	conn_ctx->ctrl = ctrl;
>   	conn_ctx->sq_depth = q_size;
>   	qedn_set_ctrl_io_cpus(conn_ctx, qid);
> +	qedn_set_pdu_params(conn_ctx);
>   
>   	init_waitqueue_head(&conn_ctx->conn_waitq);
>   	atomic_set(&conn_ctx->est_conn_indicator, 0);
> @@ -1073,6 +1087,14 @@ static int qedn_probe(struct pci_dev *pdev, const struct pci_device_id *id)
>   	return __qedn_probe(pdev);
>   }
>   
> +void qedn_swap_bytes(u32 *p, int size)
> +{
> +	int i;
> +
> +	for (i = 0; i < size; ++i, ++p)
> +		*p = __swab32(*p);
> +}
> +
>   static struct pci_driver qedn_pci_driver = {
>   	.name     = QEDN_MODULE_NAME,
>   	.id_table = qedn_pci_tbl,
> diff --git a/drivers/nvme/hw/qedn/qedn_task.c b/drivers/nvme/hw/qedn/qedn_task.c
> index 54f2f4cba6ea..9cb84883e95e 100644
> --- a/drivers/nvme/hw/qedn/qedn_task.c
> +++ b/drivers/nvme/hw/qedn/qedn_task.c
> @@ -536,9 +536,11 @@ void qedn_io_work_cq(struct qedn_ctx *qedn, struct nvmetcp_fw_cqe *cqe)
>   			break;
>   
>   		case NVMETCP_TASK_TYPE_INIT_CONN_REQUEST:
> -
> -			/* Placeholder - ICReq flow */
> -
> +			/* Clear ICReq-padding SGE from SGL */
> +			qedn_common_clear_fw_sgl(&qedn_task->sgl_task_params);
> +			/* Task is not required for icresp processing */
> +			qedn_return_task_to_pool(conn_ctx, qedn_task);
> +			qedn_prep_icresp(conn_ctx, cqe);
>   			break;
>   		default:
>   			pr_info("Could not identify task type\n");
> 
Cheers,

Hannes

Shai Malin May 5, 2021, 6:01 p.m. UTC | #2

On 5/2/21 2:53 PM, Hannes Reinecke wrote:
> On 4/29/21 9:09 PM, Shai Malin wrote:
> > From: Prabhakar Kushwaha <pkushwaha@marvell.com>
> >
> > Once a TCP connection established, the host sends an Initialize
> > Connection Request (ICReq) PDU to the controller.
> > Further Initialize Connection Response (ICResp) PDU received from
> > controller is processed by host to establish a connection and
> > exchange connection configuration parameters.
> >
> > This patch present support of generation of ICReq and processing of
> > ICResp. It also update host configuration based on exchanged parameters.
> >
> > Acked-by: Igor Russkikh <irusskikh@marvell.com>
> > Signed-off-by: Prabhakar Kushwaha <pkushwaha@marvell.com>
> > Signed-off-by: Omkar Kulkarni <okulkarni@marvell.com>
> > Signed-off-by: Michal Kalderon <mkalderon@marvell.com>
> > Signed-off-by: Ariel Elior <aelior@marvell.com>
> > Signed-off-by: Shai Malin <smalin@marvell.com>
> > ---
> >   drivers/nvme/hw/qedn/qedn.h      |  36 ++++
> >   drivers/nvme/hw/qedn/qedn_conn.c | 317 ++++++++++++++++++++++++++++++-
> >   drivers/nvme/hw/qedn/qedn_main.c |  22 +++
> >   drivers/nvme/hw/qedn/qedn_task.c |   8 +-
> >   4 files changed, 379 insertions(+), 4 deletions(-)
> >
> > diff --git a/drivers/nvme/hw/qedn/qedn.h b/drivers/nvme/hw/qedn/qedn.h
> > index 880ca245b02c..773a57994148 100644
> > --- a/drivers/nvme/hw/qedn/qedn.h
> > +++ b/drivers/nvme/hw/qedn/qedn.h
> > @@ -16,6 +16,7 @@
> >
> >   /* Driver includes */
> >   #include "../../host/tcp-offload.h"
> > +#include <linux/nvme-tcp.h>
> >
> >   #define QEDN_MAJOR_VERSION          8
> >   #define QEDN_MINOR_VERSION          62
> > @@ -52,6 +53,8 @@
> >
> >   /* Protocol defines */
> >   #define QEDN_MAX_IO_SIZE QED_NVMETCP_MAX_IO_SIZE
> > +#define QEDN_MAX_PDU_SIZE 0x80000 /* 512KB */
> > +#define QEDN_MAX_OUTSTANDING_R2T_PDUS 0 /* 0 Based == 1 max R2T */
> >
> >   #define QEDN_SGE_BUFF_SIZE 4096
> >   #define QEDN_MAX_SGES_PER_TASK DIV_ROUND_UP(QEDN_MAX_IO_SIZE, QEDN_SGE_BUFF_SIZE)
> > @@ -65,6 +68,11 @@
> >   #define QEDN_TASK_INSIST_TMO 1000 /* 1 sec */
> >   #define QEDN_INVALID_ITID 0xFFFF
> >
> > +#define QEDN_ICREQ_FW_PAYLOAD (sizeof(struct nvme_tcp_icreq_pdu) - \
> > +                            sizeof(struct nvmetcp_init_conn_req_hdr))
> > +/* The FW will handle the ICReq as CCCID 0 (FW internal design) */
> > +#define QEDN_ICREQ_CCCID 0
> > +
> >   /*
> >    * TCP offload stack default configurations and defines.
> >    * Future enhancements will allow controlling the configurable
> > @@ -136,6 +144,16 @@ struct qedn_fp_queue {
> >       char irqname[QEDN_IRQ_NAME_LEN];
> >   };
> >
> > +struct qedn_negotiation_params {
> > +     u32 maxh2cdata; /* Negotiation */
> > +     u32 maxr2t; /* Validation */
> > +     u16 pfv; /* Validation */
> > +     bool hdr_digest; /* Negotiation */
> > +     bool data_digest; /* Negotiation */
> > +     u8 cpda; /* Negotiation */
> > +     u8 hpda; /* Validation */
> > +};
> > +
> >   struct qedn_ctx {
> >       struct pci_dev *pdev;
> >       struct qed_dev *cdev;
> > @@ -195,6 +213,9 @@ struct qedn_endpoint {
> >       struct qed_chain fw_sq_chain;
> >       void __iomem *p_doorbell;
> >
> > +     /* Spinlock for accessing FW queue */
> > +     spinlock_t doorbell_lock;
> > +
> >       /* TCP Params */
> >       __be32 dst_addr[4]; /* In network order */
> >       __be32 src_addr[4]; /* In network order */
> > @@ -268,6 +289,12 @@ struct qedn_ctrl {
> >       atomic_t host_num_active_conns;
> >   };
> >
> > +struct qedn_icreq_padding {
> > +     u32 *buffer;
> > +     dma_addr_t pa;
> > +     struct nvmetcp_sge sge;
> > +};
> > +
> >   /* Connection level struct */
> >   struct qedn_conn_ctx {
> >       /* IO path */
> > @@ -329,6 +356,11 @@ struct qedn_conn_ctx {
> >
> >       size_t sq_depth;
> >
> > +     struct qedn_negotiation_params required_params;
> > +     struct qedn_negotiation_params pdu_params;
> > +     struct nvmetcp_icresp_hdr_psh icresp;
> > +     struct qedn_icreq_padding *icreq_pad;
> > +
> >       /* "dummy" socket */
> >       struct socket *sock;
> >   };
> > @@ -337,6 +369,7 @@ enum qedn_conn_resources_state {
> >       QEDN_CONN_RESRC_FW_SQ,
> >       QEDN_CONN_RESRC_ACQUIRE_CONN,
> >       QEDN_CONN_RESRC_TASKS,
> > +     QEDN_CONN_RESRC_ICREQ_PAD,
> >       QEDN_CONN_RESRC_CCCID_ITID_MAP,
> >       QEDN_CONN_RESRC_TCP_PORT,
> >       QEDN_CONN_RESRC_MAX = 64
> > @@ -375,5 +408,8 @@ void qedn_common_clear_fw_sgl(struct storage_sgl_task_params *sgl_task_params);
> >   void qedn_return_active_tasks(struct qedn_conn_ctx *conn_ctx);
> >   void qedn_destroy_free_tasks(struct qedn_fp_queue *fp_q,
> >                            struct qedn_io_resources *io_resrc);
> > +void qedn_swap_bytes(u32 *p, int size);
> > +void qedn_prep_icresp(struct qedn_conn_ctx *conn_ctx, struct nvmetcp_fw_cqe *cqe);
> > +void qedn_ring_doorbell(struct qedn_conn_ctx *conn_ctx);
> >
> >   #endif /* _QEDN_H_ */
> > diff --git a/drivers/nvme/hw/qedn/qedn_conn.c b/drivers/nvme/hw/qedn/qedn_conn.c
> > index 10a80fbeac43..5679354aa0e0 100644
> > --- a/drivers/nvme/hw/qedn/qedn_conn.c
> > +++ b/drivers/nvme/hw/qedn/qedn_conn.c
> > @@ -34,6 +34,25 @@ inline int qedn_qid(struct nvme_tcp_ofld_queue *queue)
> >       return queue - queue->ctrl->queues;
> >   }
> >
> > +void qedn_ring_doorbell(struct qedn_conn_ctx *conn_ctx)
> > +{
> > +     struct nvmetcp_db_data dbell = { 0 };
> > +     u16 prod_idx;
> > +
> > +     dbell.agg_flags = 0;
> > +     dbell.params |= DB_DEST_XCM << NVMETCP_DB_DATA_DEST_SHIFT;
> > +     dbell.params |= DB_AGG_CMD_SET << NVMETCP_DB_DATA_AGG_CMD_SHIFT;
> > +     dbell.params |=
> > +             DQ_XCM_ISCSI_SQ_PROD_CMD << NVMETCP_DB_DATA_AGG_VAL_SEL_SHIFT;
> > +     dbell.params |= 1 << NVMETCP_DB_DATA_BYPASS_EN_SHIFT;
> > +     prod_idx = qed_chain_get_prod_idx(&conn_ctx->ep.fw_sq_chain);
> > +     dbell.sq_prod = cpu_to_le16(prod_idx);
> > +
> > +     /* wmb - Make sure fw idx is coherent */
> > +     wmb();
> > +     writel(*(u32 *)&dbell, conn_ctx->ep.p_doorbell);
> > +}
> > +
> >   int qedn_set_con_state(struct qedn_conn_ctx *conn_ctx, enum qedn_conn_state new_state)
> >   {
> >       spin_lock_bh(&conn_ctx->conn_state_lock);
> > @@ -130,6 +149,71 @@ int qedn_initialize_endpoint(struct qedn_endpoint *ep, u8 *local_mac_addr,
> >       return -1;
> >   }
> >
> > +static int qedn_alloc_icreq_pad(struct qedn_conn_ctx *conn_ctx)
> > +{
> > +     struct qedn_ctx *qedn = conn_ctx->qedn;
> > +     struct qedn_icreq_padding *icreq_pad;
> > +     u32 *buffer;
> > +     int rc = 0;
> > +
> > +     icreq_pad = kzalloc(sizeof(*icreq_pad), GFP_KERNEL);
> > +     if (!icreq_pad)
> > +             return -ENOMEM;
> > +
> > +     conn_ctx->icreq_pad = icreq_pad;
> > +     memset(&icreq_pad->sge, 0, sizeof(icreq_pad->sge));
> > +     buffer = dma_alloc_coherent(&qedn->pdev->dev,
> > +                                 QEDN_ICREQ_FW_PAYLOAD,
> > +                                 &icreq_pad->pa,
> > +                                 GFP_KERNEL);
> > +     if (!buffer) {
> > +             pr_err("Could not allocate icreq_padding SGE buffer.\n");
> > +             rc =  -ENOMEM;
> > +             goto release_icreq_pad;
> > +     }
> > +
> > +     DMA_REGPAIR_LE(icreq_pad->sge.sge_addr, icreq_pad->pa);
> > +     icreq_pad->sge.sge_len = cpu_to_le32(QEDN_ICREQ_FW_PAYLOAD);
> > +     icreq_pad->buffer = buffer;
> > +     set_bit(QEDN_CONN_RESRC_ICREQ_PAD, &conn_ctx->resrc_state);
> > +
> > +     return 0;
> > +
> > +release_icreq_pad:
> > +     kfree(icreq_pad);
> > +     conn_ctx->icreq_pad = NULL;
> > +
> > +     return rc;
> > +}
> > +
> > +static void qedn_free_icreq_pad(struct qedn_conn_ctx *conn_ctx)
> > +{
> > +     struct qedn_ctx *qedn = conn_ctx->qedn;
> > +     struct qedn_icreq_padding *icreq_pad;
> > +     u32 *buffer;
> > +
> > +     icreq_pad = conn_ctx->icreq_pad;
> > +     if (unlikely(!icreq_pad)) {
> > +             pr_err("null ptr in icreq_pad in conn_ctx\n");
> > +             goto finally;
> > +     }
> > +
> > +     buffer = icreq_pad->buffer;
> > +     if (buffer) {
> > +             dma_free_coherent(&qedn->pdev->dev,
> > +                               QEDN_ICREQ_FW_PAYLOAD,
> > +                               (void *)buffer,
> > +                               icreq_pad->pa);
> > +             icreq_pad->buffer = NULL;
> > +     }
> > +
> > +     kfree(icreq_pad);
> > +     conn_ctx->icreq_pad = NULL;
> > +
> > +finally:
> > +     clear_bit(QEDN_CONN_RESRC_ICREQ_PAD, &conn_ctx->resrc_state);
> > +}
> > +
> >   static void qedn_release_conn_ctx(struct qedn_conn_ctx *conn_ctx)
> >   {
> >       struct qedn_ctx *qedn = conn_ctx->qedn;
> > @@ -151,6 +235,9 @@ static void qedn_release_conn_ctx(struct qedn_conn_ctx *conn_ctx)
> >               clear_bit(QEDN_CONN_RESRC_ACQUIRE_CONN, &conn_ctx->resrc_state);
> >       }
> >
> > +     if (test_bit(QEDN_CONN_RESRC_ICREQ_PAD, &conn_ctx->resrc_state))
> > +             qedn_free_icreq_pad(conn_ctx);
> > +
> >       if (test_bit(QEDN_CONN_RESRC_TASKS, &conn_ctx->resrc_state)) {
> >               clear_bit(QEDN_CONN_RESRC_TASKS, &conn_ctx->resrc_state);
> >                       qedn_return_active_tasks(conn_ctx);
> > @@ -309,6 +396,194 @@ void qedn_terminate_connection(struct qedn_conn_ctx *conn_ctx, int abrt_flag)
> >       queue_work(qctrl->sp_wq, &conn_ctx->sp_wq_entry);
> >   }
> >
> > +static int qedn_nvmetcp_update_conn(struct qedn_ctx *qedn, struct qedn_conn_ctx *conn_ctx)
> > +{
> > +     struct qedn_negotiation_params *pdu_params = &conn_ctx->pdu_params;
> > +     struct qed_nvmetcp_params_update *conn_info;
> > +     int rc;
> > +
> > +     conn_info = kzalloc(sizeof(*conn_info), GFP_KERNEL);
> > +     if (!conn_info)
> > +             return -ENOMEM;
> > +
> > +     conn_info->hdr_digest_en = pdu_params->hdr_digest;
> > +     conn_info->data_digest_en = pdu_params->data_digest;
> > +     conn_info->max_recv_pdu_length = QEDN_MAX_PDU_SIZE;
> > +     conn_info->max_io_size = QEDN_MAX_IO_SIZE;
> > +     conn_info->max_send_pdu_length = pdu_params->maxh2cdata;
> > +
> > +     rc = qed_ops->update_conn(qedn->cdev, conn_ctx->conn_handle, conn_info);
> > +     if (rc) {
> > +             pr_err("Could not update connection\n");
> > +             rc = -ENXIO;
> > +     }
> > +
> > +     kfree(conn_info);
> > +
> > +     return rc;
> > +}
> > +
> > +static int qedn_update_ramrod(struct qedn_conn_ctx *conn_ctx)
> > +{
> > +     struct qedn_ctx *qedn = conn_ctx->qedn;
> > +     int rc = 0;
> > +
> > +     rc = qedn_set_con_state(conn_ctx, CONN_STATE_WAIT_FOR_UPDATE_EQE);
> > +     if (rc)
> > +             return rc;
> > +
> > +     rc = qedn_nvmetcp_update_conn(qedn, conn_ctx);
> > +     if (rc)
> > +             return rc;
> > +
> > +     if (conn_ctx->state != CONN_STATE_WAIT_FOR_UPDATE_EQE) {
> > +             pr_err("cid 0x%x: Unexpected state 0x%x after update ramrod\n",
> > +                    conn_ctx->fw_cid, conn_ctx->state);
> > +
> > +             return -EINVAL;
> > +     }
> > +
> > +     return rc;
> > +}
> > +
> > +static int qedn_send_icreq(struct qedn_conn_ctx *conn_ctx)
> > +{
> > +     struct nvmetcp_init_conn_req_hdr *icreq_ptr = NULL;
> > +     struct storage_sgl_task_params *sgl_task_params;
> > +     struct nvmetcp_task_params task_params;
> > +     struct qedn_task_ctx *qedn_task = NULL;
> > +     struct nvme_tcp_icreq_pdu icreq;
> > +     struct nvmetcp_wqe *chain_sqe;
> > +     struct nvmetcp_wqe local_sqe;
> > +
> > +     qedn_task = qedn_get_task_from_pool_insist(conn_ctx, QEDN_ICREQ_CCCID);
> > +     if (!qedn_task)
> > +             return -EINVAL;
> > +
> > +     memset(&icreq, 0, sizeof(icreq));
> > +     memset(&local_sqe, 0, sizeof(local_sqe));
> > +
> > +     /* Initialize ICReq */
> > +     icreq.hdr.type = nvme_tcp_icreq;
> > +     icreq.hdr.hlen = sizeof(icreq);
> > +     icreq.hdr.pdo = 0;
> > +     icreq.hdr.plen = cpu_to_le32(icreq.hdr.hlen);
> > +     icreq.pfv = cpu_to_le16(conn_ctx->required_params.pfv);
> > +     icreq.maxr2t = cpu_to_le32(conn_ctx->required_params.maxr2t);
> > +     icreq.hpda = conn_ctx->required_params.hpda;
> > +     if (conn_ctx->required_params.hdr_digest)
> > +             icreq.digest |= NVME_TCP_HDR_DIGEST_ENABLE;
> > +     if (conn_ctx->required_params.data_digest)
> > +             icreq.digest |= NVME_TCP_DATA_DIGEST_ENABLE;
> > +
> > +     qedn_swap_bytes((u32 *)&icreq,
> > +                     (sizeof(icreq) - QEDN_ICREQ_FW_PAYLOAD) /
> > +                      sizeof(u32));
> > +
> > +     /* Initialize task params */
> > +     task_params.opq.lo = cpu_to_le32(((u64)(qedn_task)) & 0xffffffff);
> > +     task_params.opq.hi = cpu_to_le32(((u64)(qedn_task)) >> 32);
> > +     task_params.context = qedn_task->fw_task_ctx;
> > +     task_params.sqe = &local_sqe;
> > +     task_params.conn_icid = (u16)conn_ctx->conn_handle;
> > +     task_params.itid = qedn_task->itid;
> > +     task_params.cq_rss_number = conn_ctx->default_cq;
> > +     task_params.tx_io_size = QEDN_ICREQ_FW_PAYLOAD;
> > +     task_params.rx_io_size = 0; /* Rx doesn't use SGL for icresp */
> > +
> > +     /* Init SGE for ICReq padding */
> > +     sgl_task_params = &qedn_task->sgl_task_params;
> > +     sgl_task_params->total_buffer_size = task_params.tx_io_size;
> > +     sgl_task_params->small_mid_sge = false;
> > +     sgl_task_params->num_sges = 1;
> > +     memcpy(sgl_task_params->sgl, &conn_ctx->icreq_pad->sge,
> > +            sizeof(conn_ctx->icreq_pad->sge));
> > +     icreq_ptr = (struct nvmetcp_init_conn_req_hdr *)&icreq;
> > +
> > +     qed_ops->init_icreq_exchange(&task_params, icreq_ptr, sgl_task_params,  NULL);
> > +
> > +     qedn_set_con_state(conn_ctx, CONN_STATE_WAIT_FOR_IC_COMP);
> > +     atomic_inc(&conn_ctx->num_active_fw_tasks);
> > +
> > +     /* spin_lock - doorbell is accessed  both Rx flow and response flow */
> > +     spin_lock(&conn_ctx->ep.doorbell_lock);
> > +     chain_sqe = qed_chain_produce(&conn_ctx->ep.fw_sq_chain);
> > +     memcpy(chain_sqe, &local_sqe, sizeof(local_sqe));
> > +     qedn_ring_doorbell(conn_ctx);
> > +     spin_unlock(&conn_ctx->ep.doorbell_lock);
> > +
> > +     return 0;
> > +}
> > +
>
> And this is what I meant. You _do_ swab bytes before sending it off to
> the HW, _and_ you use the standard nvme-tcp PDU definitions.
> So why do you have your own, byte-swapped versions of the PDUs?

We will change it and use the standard struct 'nvme_tcp_icreq_pdu'.

>
> > +void qedn_prep_icresp(struct qedn_conn_ctx *conn_ctx, struct nvmetcp_fw_cqe *cqe)
> > +{
> > +     struct nvmetcp_icresp_hdr_psh *icresp_from_cqe =
> > +             (struct nvmetcp_icresp_hdr_psh *)&cqe->nvme_cqe;
> > +     struct nvme_tcp_ofld_ctrl *ctrl = conn_ctx->ctrl;
> > +     struct qedn_ctrl *qctrl = NULL;
> > +
> > +     qctrl = (struct qedn_ctrl *)ctrl->private_data;
> > +
> > +     memcpy(&conn_ctx->icresp, icresp_from_cqe, sizeof(conn_ctx->icresp));
> > +     qedn_set_sp_wa(conn_ctx, HANDLE_ICRESP);
> > +     queue_work(qctrl->sp_wq, &conn_ctx->sp_wq_entry);
> > +}
> > +
> > +static int qedn_handle_icresp(struct qedn_conn_ctx *conn_ctx)
> > +{
> > +     struct nvmetcp_icresp_hdr_psh *icresp = &conn_ctx->icresp;
> > +     u16 pfv = __swab16(le16_to_cpu(icresp->pfv_swapped));
> > +     int rc = 0;
> > +
>
> Again here; you could treat the received icresp as a binaray blob.
> byteswap it, and then cast is to the standard icresp structure.
> Hmm?

With the existing FW, the driver receives only the meaningful fields from
the icresp. As a future enhancement, we will consider modifying the FW
to pass the entire icresp to the driver.

>
> > +     qedn_free_icreq_pad(conn_ctx);
> > +
> > +     /* Validate ICResp */
> > +     if (pfv != conn_ctx->required_params.pfv) {
> > +             pr_err("cid %u: unsupported pfv %u\n", conn_ctx->fw_cid, pfv);
> > +
> > +             return -EINVAL;
> > +     }
> > +
> > +     if (icresp->cpda > conn_ctx->required_params.cpda) {
> > +             pr_err("cid %u: unsupported cpda %u\n", conn_ctx->fw_cid, icresp->cpda);
> > +
> > +             return -EINVAL;
> > +     }
> > +
> > +     if ((NVME_TCP_HDR_DIGEST_ENABLE & icresp->digest) !=
> > +         conn_ctx->required_params.hdr_digest) {
> > +             if ((NVME_TCP_HDR_DIGEST_ENABLE & icresp->digest) >
> > +                 conn_ctx->required_params.hdr_digest) {
> > +                     pr_err("cid 0x%x: invalid header digest bit\n", conn_ctx->fw_cid);
> > +             }
> > +     }
> > +
> > +     if ((NVME_TCP_DATA_DIGEST_ENABLE & icresp->digest) !=
> > +         conn_ctx->required_params.data_digest) {
> > +             if ((NVME_TCP_DATA_DIGEST_ENABLE & icresp->digest) >
> > +                 conn_ctx->required_params.data_digest) {
> > +                     pr_err("cid 0x%x: invalid data digest bit\n", conn_ctx->fw_cid);
> > +     }
> > +     }
> > +
> > +     memset(&conn_ctx->pdu_params, 0, sizeof(conn_ctx->pdu_params));
> > +     conn_ctx->pdu_params.maxh2cdata =
> > +             __swab32(le32_to_cpu(icresp->maxdata_swapped));
> > +     conn_ctx->pdu_params.maxh2cdata = QEDN_MAX_PDU_SIZE;
> > +     if (conn_ctx->pdu_params.maxh2cdata > QEDN_MAX_PDU_SIZE)
> > +             conn_ctx->pdu_params.maxh2cdata = QEDN_MAX_PDU_SIZE;
> > +
> > +     conn_ctx->pdu_params.pfv = pfv;
> > +     conn_ctx->pdu_params.cpda = icresp->cpda;
> > +     conn_ctx->pdu_params.hpda = conn_ctx->required_params.hpda;
> > +     conn_ctx->pdu_params.hdr_digest = NVME_TCP_HDR_DIGEST_ENABLE & icresp->digest;
> > +     conn_ctx->pdu_params.data_digest = NVME_TCP_DATA_DIGEST_ENABLE & icresp->digest;
> > +     conn_ctx->pdu_params.maxr2t = conn_ctx->required_params.maxr2t;
> > +     rc = qedn_update_ramrod(conn_ctx);
> > +
> > +     return rc;
> > +}
> > +
> >   /* Slowpath EQ Callback */
> >   int qedn_event_cb(void *context, u8 fw_event_code, void *event_ring_data)
> >   {
> > @@ -363,7 +638,8 @@ int qedn_event_cb(void *context, u8 fw_event_code, void *event_ring_data)
> >                       if (rc)
> >                               return rc;
> >
> > -                     /* Placeholder - for ICReq flow */
> > +                     qedn_set_sp_wa(conn_ctx, SEND_ICREQ);
> > +                     queue_work(qctrl->sp_wq, &conn_ctx->sp_wq_entry);
> >               }
> >
> >               break;
> > @@ -399,6 +675,7 @@ static int qedn_prep_and_offload_queue(struct qedn_conn_ctx *conn_ctx)
> >       }
> >
> >       set_bit(QEDN_CONN_RESRC_FW_SQ, &conn_ctx->resrc_state);
> > +     spin_lock_init(&conn_ctx->ep.doorbell_lock);
> >       INIT_LIST_HEAD(&conn_ctx->host_pend_req_list);
> >       spin_lock_init(&conn_ctx->nvme_req_lock);
> >       atomic_set(&conn_ctx->num_active_tasks, 0);
> > @@ -463,6 +740,11 @@ static int qedn_prep_and_offload_queue(struct qedn_conn_ctx *conn_ctx)
> >
> >       memset(conn_ctx->host_cccid_itid, 0xFF, dma_size);
> >       set_bit(QEDN_CONN_RESRC_CCCID_ITID_MAP, &conn_ctx->resrc_state);
> > +
> > +     rc = qedn_alloc_icreq_pad(conn_ctx);
> > +             if (rc)
> > +                     goto rel_conn;
> > +
> >       rc = qedn_set_con_state(conn_ctx, CONN_STATE_WAIT_FOR_CONNECT_DONE);
> >       if (rc)
> >               goto rel_conn;
> > @@ -523,6 +805,9 @@ void qedn_sp_wq_handler(struct work_struct *work)
> >
> >       qedn = conn_ctx->qedn;
> >       if (test_bit(DESTROY_CONNECTION, &conn_ctx->agg_work_action)) {
> > +             if (test_bit(HANDLE_ICRESP, &conn_ctx->agg_work_action))
> > +                     qedn_clr_sp_wa(conn_ctx, HANDLE_ICRESP);
> > +
> >               qedn_destroy_connection(conn_ctx);
> >
> >               return;
> > @@ -537,6 +822,36 @@ void qedn_sp_wq_handler(struct work_struct *work)
> >                       return;
> >               }
> >       }
> > +
> > +     if (test_bit(SEND_ICREQ, &conn_ctx->agg_work_action)) {
> > +             qedn_clr_sp_wa(conn_ctx, SEND_ICREQ);
> > +             rc = qedn_send_icreq(conn_ctx);
> > +             if (rc)
> > +                     return;
> > +
> > +             return;
> > +     }
> > +
> > +     if (test_bit(HANDLE_ICRESP, &conn_ctx->agg_work_action)) {
> > +             rc = qedn_handle_icresp(conn_ctx);
> > +
> > +             qedn_clr_sp_wa(conn_ctx, HANDLE_ICRESP);
> > +             if (rc) {
> > +                     pr_err("IC handling returned with 0x%x\n", rc);
> > +                     if (test_and_set_bit(DESTROY_CONNECTION, &conn_ctx->agg_work_action))
> > +                             return;
> > +
> > +                     qedn_destroy_connection(conn_ctx);
> > +
> > +                     return;
> > +             }
> > +
> > +             atomic_inc(&conn_ctx->est_conn_indicator);
> > +             qedn_set_con_state(conn_ctx, CONN_STATE_NVMETCP_CONN_ESTABLISHED);
> > +             wake_up_interruptible(&conn_ctx->conn_waitq);
> > +
> > +             return;
> > +     }
> >   }
> >
> >   /* Clear connection aggregative slowpath work action */
> > diff --git a/drivers/nvme/hw/qedn/qedn_main.c b/drivers/nvme/hw/qedn/qedn_main.c
> > index 8d9c19d63480..a6756d7250b7 100644
> > --- a/drivers/nvme/hw/qedn/qedn_main.c
> > +++ b/drivers/nvme/hw/qedn/qedn_main.c
> > @@ -285,6 +285,19 @@ static void qedn_set_ctrl_io_cpus(struct qedn_conn_ctx *conn_ctx, int qid)
> >       conn_ctx->cpu = fp_q->cpu;
> >   }
> >
> > +static void qedn_set_pdu_params(struct qedn_conn_ctx *conn_ctx)
> > +{
> > +     /* Enable digest once supported */
> > +     conn_ctx->required_params.hdr_digest = 0;
> > +     conn_ctx->required_params.data_digest = 0;
> > +
> > +     conn_ctx->required_params.maxr2t = QEDN_MAX_OUTSTANDING_R2T_PDUS;
> > +     conn_ctx->required_params.pfv = NVME_TCP_PFV_1_0;
> > +     conn_ctx->required_params.cpda = 0;
> > +     conn_ctx->required_params.hpda = 0;
> > +     conn_ctx->required_params.maxh2cdata = QEDN_MAX_PDU_SIZE;
> > +}
> > +
> >   static int qedn_create_queue(struct nvme_tcp_ofld_queue *queue, int qid, size_t q_size)
> >   {
> >       struct nvme_tcp_ofld_ctrl *ctrl = queue->ctrl;
> > @@ -307,6 +320,7 @@ static int qedn_create_queue(struct nvme_tcp_ofld_queue *queue, int qid, size_t
> >       conn_ctx->ctrl = ctrl;
> >       conn_ctx->sq_depth = q_size;
> >       qedn_set_ctrl_io_cpus(conn_ctx, qid);
> > +     qedn_set_pdu_params(conn_ctx);
> >
> >       init_waitqueue_head(&conn_ctx->conn_waitq);
> >       atomic_set(&conn_ctx->est_conn_indicator, 0);
> > @@ -1073,6 +1087,14 @@ static int qedn_probe(struct pci_dev *pdev, const struct pci_device_id *id)
> >       return __qedn_probe(pdev);
> >   }
> >
> > +void qedn_swap_bytes(u32 *p, int size)
> > +{
> > +     int i;
> > +
> > +     for (i = 0; i < size; ++i, ++p)
> > +             *p = __swab32(*p);
> > +}
> > +
> >   static struct pci_driver qedn_pci_driver = {
> >       .name     = QEDN_MODULE_NAME,
> >       .id_table = qedn_pci_tbl,
> > diff --git a/drivers/nvme/hw/qedn/qedn_task.c b/drivers/nvme/hw/qedn/qedn_task.c
> > index 54f2f4cba6ea..9cb84883e95e 100644
> > --- a/drivers/nvme/hw/qedn/qedn_task.c
> > +++ b/drivers/nvme/hw/qedn/qedn_task.c
> > @@ -536,9 +536,11 @@ void qedn_io_work_cq(struct qedn_ctx *qedn, struct nvmetcp_fw_cqe *cqe)
> >                       break;
> >
> >               case NVMETCP_TASK_TYPE_INIT_CONN_REQUEST:
> > -
> > -                     /* Placeholder - ICReq flow */
> > -
> > +                     /* Clear ICReq-padding SGE from SGL */
> > +                     qedn_common_clear_fw_sgl(&qedn_task->sgl_task_params);
> > +                     /* Task is not required for icresp processing */
> > +                     qedn_return_task_to_pool(conn_ctx, qedn_task);
> > +                     qedn_prep_icresp(conn_ctx, cqe);
> >                       break;
> >               default:
> >                       pr_info("Could not identify task type\n");
> >
> Cheers,
>
> Hannes
> --
> Dr. Hannes Reinecke                Kernel Storage Architect
> hare@suse.de                              +49 911 74053 688
> SUSE Software Solutions GmbH, Maxfeldstr. 5, 90409 Nürnberg
> HRB 36809 (AG Nürnberg), Geschäftsführer: Felix Imendörffer

diff --git a/drivers/nvme/hw/qedn/qedn.h b/drivers/nvme/hw/qedn/qedn.h
index 880ca245b02c..773a57994148 100644
--- a/drivers/nvme/hw/qedn/qedn.h
+++ b/drivers/nvme/hw/qedn/qedn.h
@@ -16,6 +16,7 @@ 
 
 /* Driver includes */
 #include "../../host/tcp-offload.h"
+#include <linux/nvme-tcp.h>
 
 #define QEDN_MAJOR_VERSION		8
 #define QEDN_MINOR_VERSION		62
@@ -52,6 +53,8 @@ 
 
 /* Protocol defines */
 #define QEDN_MAX_IO_SIZE QED_NVMETCP_MAX_IO_SIZE
+#define QEDN_MAX_PDU_SIZE 0x80000 /* 512KB */
+#define QEDN_MAX_OUTSTANDING_R2T_PDUS 0 /* 0 Based == 1 max R2T */
 
 #define QEDN_SGE_BUFF_SIZE 4096
 #define QEDN_MAX_SGES_PER_TASK DIV_ROUND_UP(QEDN_MAX_IO_SIZE, QEDN_SGE_BUFF_SIZE)
@@ -65,6 +68,11 @@ 
 #define QEDN_TASK_INSIST_TMO 1000 /* 1 sec */
 #define QEDN_INVALID_ITID 0xFFFF
 
+#define QEDN_ICREQ_FW_PAYLOAD (sizeof(struct nvme_tcp_icreq_pdu) - \
+			       sizeof(struct nvmetcp_init_conn_req_hdr))
+/* The FW will handle the ICReq as CCCID 0 (FW internal design) */
+#define QEDN_ICREQ_CCCID 0
+
 /*
  * TCP offload stack default configurations and defines.
  * Future enhancements will allow controlling the configurable
@@ -136,6 +144,16 @@  struct qedn_fp_queue {
 	char irqname[QEDN_IRQ_NAME_LEN];
 };
 
+struct qedn_negotiation_params {
+	u32 maxh2cdata; /* Negotiation */
+	u32 maxr2t; /* Validation */
+	u16 pfv; /* Validation */
+	bool hdr_digest; /* Negotiation */
+	bool data_digest; /* Negotiation */
+	u8 cpda; /* Negotiation */
+	u8 hpda; /* Validation */
+};
+
 struct qedn_ctx {
 	struct pci_dev *pdev;
 	struct qed_dev *cdev;
@@ -195,6 +213,9 @@  struct qedn_endpoint {
 	struct qed_chain fw_sq_chain;
 	void __iomem *p_doorbell;
 
+	/* Spinlock for accessing FW queue */
+	spinlock_t doorbell_lock;
+
 	/* TCP Params */
 	__be32 dst_addr[4]; /* In network order */
 	__be32 src_addr[4]; /* In network order */
@@ -268,6 +289,12 @@  struct qedn_ctrl {
 	atomic_t host_num_active_conns;
 };
 
+struct qedn_icreq_padding {
+	u32 *buffer;
+	dma_addr_t pa;
+	struct nvmetcp_sge sge;
+};
+
 /* Connection level struct */
 struct qedn_conn_ctx {
 	/* IO path */
@@ -329,6 +356,11 @@  struct qedn_conn_ctx {
 
 	size_t sq_depth;
 
+	struct qedn_negotiation_params required_params;
+	struct qedn_negotiation_params pdu_params;
+	struct nvmetcp_icresp_hdr_psh icresp;
+	struct qedn_icreq_padding *icreq_pad;
+
 	/* "dummy" socket */
 	struct socket *sock;
 };
@@ -337,6 +369,7 @@  enum qedn_conn_resources_state {
 	QEDN_CONN_RESRC_FW_SQ,
 	QEDN_CONN_RESRC_ACQUIRE_CONN,
 	QEDN_CONN_RESRC_TASKS,
+	QEDN_CONN_RESRC_ICREQ_PAD,
 	QEDN_CONN_RESRC_CCCID_ITID_MAP,
 	QEDN_CONN_RESRC_TCP_PORT,
 	QEDN_CONN_RESRC_MAX = 64
@@ -375,5 +408,8 @@  void qedn_common_clear_fw_sgl(struct storage_sgl_task_params *sgl_task_params);
 void qedn_return_active_tasks(struct qedn_conn_ctx *conn_ctx);
 void qedn_destroy_free_tasks(struct qedn_fp_queue *fp_q,
 			     struct qedn_io_resources *io_resrc);
+void qedn_swap_bytes(u32 *p, int size);
+void qedn_prep_icresp(struct qedn_conn_ctx *conn_ctx, struct nvmetcp_fw_cqe *cqe);
+void qedn_ring_doorbell(struct qedn_conn_ctx *conn_ctx);
 
 #endif /* _QEDN_H_ */
diff --git a/drivers/nvme/hw/qedn/qedn_conn.c b/drivers/nvme/hw/qedn/qedn_conn.c
index 10a80fbeac43..5679354aa0e0 100644
--- a/drivers/nvme/hw/qedn/qedn_conn.c
+++ b/drivers/nvme/hw/qedn/qedn_conn.c
@@ -34,6 +34,25 @@  inline int qedn_qid(struct nvme_tcp_ofld_queue *queue)
 	return queue - queue->ctrl->queues;
 }
 
+void qedn_ring_doorbell(struct qedn_conn_ctx *conn_ctx)
+{
+	struct nvmetcp_db_data dbell = { 0 };
+	u16 prod_idx;
+
+	dbell.agg_flags = 0;
+	dbell.params |= DB_DEST_XCM << NVMETCP_DB_DATA_DEST_SHIFT;
+	dbell.params |= DB_AGG_CMD_SET << NVMETCP_DB_DATA_AGG_CMD_SHIFT;
+	dbell.params |=
+		DQ_XCM_ISCSI_SQ_PROD_CMD << NVMETCP_DB_DATA_AGG_VAL_SEL_SHIFT;
+	dbell.params |= 1 << NVMETCP_DB_DATA_BYPASS_EN_SHIFT;
+	prod_idx = qed_chain_get_prod_idx(&conn_ctx->ep.fw_sq_chain);
+	dbell.sq_prod = cpu_to_le16(prod_idx);
+
+	/* wmb - Make sure fw idx is coherent */
+	wmb();
+	writel(*(u32 *)&dbell, conn_ctx->ep.p_doorbell);
+}
+
 int qedn_set_con_state(struct qedn_conn_ctx *conn_ctx, enum qedn_conn_state new_state)
 {
 	spin_lock_bh(&conn_ctx->conn_state_lock);
@@ -130,6 +149,71 @@  int qedn_initialize_endpoint(struct qedn_endpoint *ep, u8 *local_mac_addr,
 	return -1;
 }
 
+static int qedn_alloc_icreq_pad(struct qedn_conn_ctx *conn_ctx)
+{
+	struct qedn_ctx *qedn = conn_ctx->qedn;
+	struct qedn_icreq_padding *icreq_pad;
+	u32 *buffer;
+	int rc = 0;
+
+	icreq_pad = kzalloc(sizeof(*icreq_pad), GFP_KERNEL);
+	if (!icreq_pad)
+		return -ENOMEM;
+
+	conn_ctx->icreq_pad = icreq_pad;
+	memset(&icreq_pad->sge, 0, sizeof(icreq_pad->sge));
+	buffer = dma_alloc_coherent(&qedn->pdev->dev,
+				    QEDN_ICREQ_FW_PAYLOAD,
+				    &icreq_pad->pa,
+				    GFP_KERNEL);
+	if (!buffer) {
+		pr_err("Could not allocate icreq_padding SGE buffer.\n");
+		rc =  -ENOMEM;
+		goto release_icreq_pad;
+	}
+
+	DMA_REGPAIR_LE(icreq_pad->sge.sge_addr, icreq_pad->pa);
+	icreq_pad->sge.sge_len = cpu_to_le32(QEDN_ICREQ_FW_PAYLOAD);
+	icreq_pad->buffer = buffer;
+	set_bit(QEDN_CONN_RESRC_ICREQ_PAD, &conn_ctx->resrc_state);
+
+	return 0;
+
+release_icreq_pad:
+	kfree(icreq_pad);
+	conn_ctx->icreq_pad = NULL;
+
+	return rc;
+}
+
+static void qedn_free_icreq_pad(struct qedn_conn_ctx *conn_ctx)
+{
+	struct qedn_ctx *qedn = conn_ctx->qedn;
+	struct qedn_icreq_padding *icreq_pad;
+	u32 *buffer;
+
+	icreq_pad = conn_ctx->icreq_pad;
+	if (unlikely(!icreq_pad)) {
+		pr_err("null ptr in icreq_pad in conn_ctx\n");
+		goto finally;
+	}
+
+	buffer = icreq_pad->buffer;
+	if (buffer) {
+		dma_free_coherent(&qedn->pdev->dev,
+				  QEDN_ICREQ_FW_PAYLOAD,
+				  (void *)buffer,
+				  icreq_pad->pa);
+		icreq_pad->buffer = NULL;
+	}
+
+	kfree(icreq_pad);
+	conn_ctx->icreq_pad = NULL;
+
+finally:
+	clear_bit(QEDN_CONN_RESRC_ICREQ_PAD, &conn_ctx->resrc_state);
+}
+
 static void qedn_release_conn_ctx(struct qedn_conn_ctx *conn_ctx)
 {
 	struct qedn_ctx *qedn = conn_ctx->qedn;
@@ -151,6 +235,9 @@  static void qedn_release_conn_ctx(struct qedn_conn_ctx *conn_ctx)
 		clear_bit(QEDN_CONN_RESRC_ACQUIRE_CONN, &conn_ctx->resrc_state);
 	}
 
+	if (test_bit(QEDN_CONN_RESRC_ICREQ_PAD, &conn_ctx->resrc_state))
+		qedn_free_icreq_pad(conn_ctx);
+
 	if (test_bit(QEDN_CONN_RESRC_TASKS, &conn_ctx->resrc_state)) {
 		clear_bit(QEDN_CONN_RESRC_TASKS, &conn_ctx->resrc_state);
 			qedn_return_active_tasks(conn_ctx);
@@ -309,6 +396,194 @@  void qedn_terminate_connection(struct qedn_conn_ctx *conn_ctx, int abrt_flag)
 	queue_work(qctrl->sp_wq, &conn_ctx->sp_wq_entry);
 }
 
+static int qedn_nvmetcp_update_conn(struct qedn_ctx *qedn, struct qedn_conn_ctx *conn_ctx)
+{
+	struct qedn_negotiation_params *pdu_params = &conn_ctx->pdu_params;
+	struct qed_nvmetcp_params_update *conn_info;
+	int rc;
+
+	conn_info = kzalloc(sizeof(*conn_info), GFP_KERNEL);
+	if (!conn_info)
+		return -ENOMEM;
+
+	conn_info->hdr_digest_en = pdu_params->hdr_digest;
+	conn_info->data_digest_en = pdu_params->data_digest;
+	conn_info->max_recv_pdu_length = QEDN_MAX_PDU_SIZE;
+	conn_info->max_io_size = QEDN_MAX_IO_SIZE;
+	conn_info->max_send_pdu_length = pdu_params->maxh2cdata;
+
+	rc = qed_ops->update_conn(qedn->cdev, conn_ctx->conn_handle, conn_info);
+	if (rc) {
+		pr_err("Could not update connection\n");
+		rc = -ENXIO;
+	}
+
+	kfree(conn_info);
+
+	return rc;
+}
+
+static int qedn_update_ramrod(struct qedn_conn_ctx *conn_ctx)
+{
+	struct qedn_ctx *qedn = conn_ctx->qedn;
+	int rc = 0;
+
+	rc = qedn_set_con_state(conn_ctx, CONN_STATE_WAIT_FOR_UPDATE_EQE);
+	if (rc)
+		return rc;
+
+	rc = qedn_nvmetcp_update_conn(qedn, conn_ctx);
+	if (rc)
+		return rc;
+
+	if (conn_ctx->state != CONN_STATE_WAIT_FOR_UPDATE_EQE) {
+		pr_err("cid 0x%x: Unexpected state 0x%x after update ramrod\n",
+		       conn_ctx->fw_cid, conn_ctx->state);
+
+		return -EINVAL;
+	}
+
+	return rc;
+}
+
+static int qedn_send_icreq(struct qedn_conn_ctx *conn_ctx)
+{
+	struct nvmetcp_init_conn_req_hdr *icreq_ptr = NULL;
+	struct storage_sgl_task_params *sgl_task_params;
+	struct nvmetcp_task_params task_params;
+	struct qedn_task_ctx *qedn_task = NULL;
+	struct nvme_tcp_icreq_pdu icreq;
+	struct nvmetcp_wqe *chain_sqe;
+	struct nvmetcp_wqe local_sqe;
+
+	qedn_task = qedn_get_task_from_pool_insist(conn_ctx, QEDN_ICREQ_CCCID);
+	if (!qedn_task)
+		return -EINVAL;
+
+	memset(&icreq, 0, sizeof(icreq));
+	memset(&local_sqe, 0, sizeof(local_sqe));
+
+	/* Initialize ICReq */
+	icreq.hdr.type = nvme_tcp_icreq;
+	icreq.hdr.hlen = sizeof(icreq);
+	icreq.hdr.pdo = 0;
+	icreq.hdr.plen = cpu_to_le32(icreq.hdr.hlen);
+	icreq.pfv = cpu_to_le16(conn_ctx->required_params.pfv);
+	icreq.maxr2t = cpu_to_le32(conn_ctx->required_params.maxr2t);
+	icreq.hpda = conn_ctx->required_params.hpda;
+	if (conn_ctx->required_params.hdr_digest)
+		icreq.digest |= NVME_TCP_HDR_DIGEST_ENABLE;
+	if (conn_ctx->required_params.data_digest)
+		icreq.digest |= NVME_TCP_DATA_DIGEST_ENABLE;
+
+	qedn_swap_bytes((u32 *)&icreq,
+			(sizeof(icreq) - QEDN_ICREQ_FW_PAYLOAD) /
+			 sizeof(u32));
+
+	/* Initialize task params */
+	task_params.opq.lo = cpu_to_le32(((u64)(qedn_task)) & 0xffffffff);
+	task_params.opq.hi = cpu_to_le32(((u64)(qedn_task)) >> 32);
+	task_params.context = qedn_task->fw_task_ctx;
+	task_params.sqe = &local_sqe;
+	task_params.conn_icid = (u16)conn_ctx->conn_handle;
+	task_params.itid = qedn_task->itid;
+	task_params.cq_rss_number = conn_ctx->default_cq;
+	task_params.tx_io_size = QEDN_ICREQ_FW_PAYLOAD;
+	task_params.rx_io_size = 0; /* Rx doesn't use SGL for icresp */
+
+	/* Init SGE for ICReq padding */
+	sgl_task_params = &qedn_task->sgl_task_params;
+	sgl_task_params->total_buffer_size = task_params.tx_io_size;
+	sgl_task_params->small_mid_sge = false;
+	sgl_task_params->num_sges = 1;
+	memcpy(sgl_task_params->sgl, &conn_ctx->icreq_pad->sge,
+	       sizeof(conn_ctx->icreq_pad->sge));
+	icreq_ptr = (struct nvmetcp_init_conn_req_hdr *)&icreq;
+
+	qed_ops->init_icreq_exchange(&task_params, icreq_ptr, sgl_task_params,  NULL);
+
+	qedn_set_con_state(conn_ctx, CONN_STATE_WAIT_FOR_IC_COMP);
+	atomic_inc(&conn_ctx->num_active_fw_tasks);
+
+	/* spin_lock - doorbell is accessed  both Rx flow and response flow */
+	spin_lock(&conn_ctx->ep.doorbell_lock);
+	chain_sqe = qed_chain_produce(&conn_ctx->ep.fw_sq_chain);
+	memcpy(chain_sqe, &local_sqe, sizeof(local_sqe));
+	qedn_ring_doorbell(conn_ctx);
+	spin_unlock(&conn_ctx->ep.doorbell_lock);
+
+	return 0;
+}
+
+void qedn_prep_icresp(struct qedn_conn_ctx *conn_ctx, struct nvmetcp_fw_cqe *cqe)
+{
+	struct nvmetcp_icresp_hdr_psh *icresp_from_cqe =
+		(struct nvmetcp_icresp_hdr_psh *)&cqe->nvme_cqe;
+	struct nvme_tcp_ofld_ctrl *ctrl = conn_ctx->ctrl;
+	struct qedn_ctrl *qctrl = NULL;
+
+	qctrl = (struct qedn_ctrl *)ctrl->private_data;
+
+	memcpy(&conn_ctx->icresp, icresp_from_cqe, sizeof(conn_ctx->icresp));
+	qedn_set_sp_wa(conn_ctx, HANDLE_ICRESP);
+	queue_work(qctrl->sp_wq, &conn_ctx->sp_wq_entry);
+}
+
+static int qedn_handle_icresp(struct qedn_conn_ctx *conn_ctx)
+{
+	struct nvmetcp_icresp_hdr_psh *icresp = &conn_ctx->icresp;
+	u16 pfv = __swab16(le16_to_cpu(icresp->pfv_swapped));
+	int rc = 0;
+
+	qedn_free_icreq_pad(conn_ctx);
+
+	/* Validate ICResp */
+	if (pfv != conn_ctx->required_params.pfv) {
+		pr_err("cid %u: unsupported pfv %u\n", conn_ctx->fw_cid, pfv);
+
+		return -EINVAL;
+	}
+
+	if (icresp->cpda > conn_ctx->required_params.cpda) {
+		pr_err("cid %u: unsupported cpda %u\n", conn_ctx->fw_cid, icresp->cpda);
+
+		return -EINVAL;
+	}
+
+	if ((NVME_TCP_HDR_DIGEST_ENABLE & icresp->digest) !=
+	    conn_ctx->required_params.hdr_digest) {
+		if ((NVME_TCP_HDR_DIGEST_ENABLE & icresp->digest) >
+		    conn_ctx->required_params.hdr_digest) {
+			pr_err("cid 0x%x: invalid header digest bit\n", conn_ctx->fw_cid);
+		}
+	}
+
+	if ((NVME_TCP_DATA_DIGEST_ENABLE & icresp->digest) !=
+	    conn_ctx->required_params.data_digest) {
+		if ((NVME_TCP_DATA_DIGEST_ENABLE & icresp->digest) >
+		    conn_ctx->required_params.data_digest) {
+			pr_err("cid 0x%x: invalid data digest bit\n", conn_ctx->fw_cid);
+	}
+	}
+
+	memset(&conn_ctx->pdu_params, 0, sizeof(conn_ctx->pdu_params));
+	conn_ctx->pdu_params.maxh2cdata =
+		__swab32(le32_to_cpu(icresp->maxdata_swapped));
+	conn_ctx->pdu_params.maxh2cdata = QEDN_MAX_PDU_SIZE;
+	if (conn_ctx->pdu_params.maxh2cdata > QEDN_MAX_PDU_SIZE)
+		conn_ctx->pdu_params.maxh2cdata = QEDN_MAX_PDU_SIZE;
+
+	conn_ctx->pdu_params.pfv = pfv;
+	conn_ctx->pdu_params.cpda = icresp->cpda;
+	conn_ctx->pdu_params.hpda = conn_ctx->required_params.hpda;
+	conn_ctx->pdu_params.hdr_digest = NVME_TCP_HDR_DIGEST_ENABLE & icresp->digest;
+	conn_ctx->pdu_params.data_digest = NVME_TCP_DATA_DIGEST_ENABLE & icresp->digest;
+	conn_ctx->pdu_params.maxr2t = conn_ctx->required_params.maxr2t;
+	rc = qedn_update_ramrod(conn_ctx);
+
+	return rc;
+}
+
 /* Slowpath EQ Callback */
 int qedn_event_cb(void *context, u8 fw_event_code, void *event_ring_data)
 {
@@ -363,7 +638,8 @@  int qedn_event_cb(void *context, u8 fw_event_code, void *event_ring_data)
 			if (rc)
 				return rc;
 
-			/* Placeholder - for ICReq flow */
+			qedn_set_sp_wa(conn_ctx, SEND_ICREQ);
+			queue_work(qctrl->sp_wq, &conn_ctx->sp_wq_entry);
 		}
 
 		break;
@@ -399,6 +675,7 @@  static int qedn_prep_and_offload_queue(struct qedn_conn_ctx *conn_ctx)
 	}
 
 	set_bit(QEDN_CONN_RESRC_FW_SQ, &conn_ctx->resrc_state);
+	spin_lock_init(&conn_ctx->ep.doorbell_lock);
 	INIT_LIST_HEAD(&conn_ctx->host_pend_req_list);
 	spin_lock_init(&conn_ctx->nvme_req_lock);
 	atomic_set(&conn_ctx->num_active_tasks, 0);
@@ -463,6 +740,11 @@  static int qedn_prep_and_offload_queue(struct qedn_conn_ctx *conn_ctx)
 
 	memset(conn_ctx->host_cccid_itid, 0xFF, dma_size);
 	set_bit(QEDN_CONN_RESRC_CCCID_ITID_MAP, &conn_ctx->resrc_state);
+
+	rc = qedn_alloc_icreq_pad(conn_ctx);
+		if (rc)
+			goto rel_conn;
+
 	rc = qedn_set_con_state(conn_ctx, CONN_STATE_WAIT_FOR_CONNECT_DONE);
 	if (rc)
 		goto rel_conn;
@@ -523,6 +805,9 @@  void qedn_sp_wq_handler(struct work_struct *work)
 
 	qedn = conn_ctx->qedn;
 	if (test_bit(DESTROY_CONNECTION, &conn_ctx->agg_work_action)) {
+		if (test_bit(HANDLE_ICRESP, &conn_ctx->agg_work_action))
+			qedn_clr_sp_wa(conn_ctx, HANDLE_ICRESP);
+
 		qedn_destroy_connection(conn_ctx);
 
 		return;
@@ -537,6 +822,36 @@  void qedn_sp_wq_handler(struct work_struct *work)
 			return;
 		}
 	}
+
+	if (test_bit(SEND_ICREQ, &conn_ctx->agg_work_action)) {
+		qedn_clr_sp_wa(conn_ctx, SEND_ICREQ);
+		rc = qedn_send_icreq(conn_ctx);
+		if (rc)
+			return;
+
+		return;
+	}
+
+	if (test_bit(HANDLE_ICRESP, &conn_ctx->agg_work_action)) {
+		rc = qedn_handle_icresp(conn_ctx);
+
+		qedn_clr_sp_wa(conn_ctx, HANDLE_ICRESP);
+		if (rc) {
+			pr_err("IC handling returned with 0x%x\n", rc);
+			if (test_and_set_bit(DESTROY_CONNECTION, &conn_ctx->agg_work_action))
+				return;
+
+			qedn_destroy_connection(conn_ctx);
+
+			return;
+		}
+
+		atomic_inc(&conn_ctx->est_conn_indicator);
+		qedn_set_con_state(conn_ctx, CONN_STATE_NVMETCP_CONN_ESTABLISHED);
+		wake_up_interruptible(&conn_ctx->conn_waitq);
+
+		return;
+	}
 }
 
 /* Clear connection aggregative slowpath work action */
diff --git a/drivers/nvme/hw/qedn/qedn_main.c b/drivers/nvme/hw/qedn/qedn_main.c
index 8d9c19d63480..a6756d7250b7 100644
--- a/drivers/nvme/hw/qedn/qedn_main.c
+++ b/drivers/nvme/hw/qedn/qedn_main.c
@@ -285,6 +285,19 @@  static void qedn_set_ctrl_io_cpus(struct qedn_conn_ctx *conn_ctx, int qid)
 	conn_ctx->cpu = fp_q->cpu;
 }
 
+static void qedn_set_pdu_params(struct qedn_conn_ctx *conn_ctx)
+{
+	/* Enable digest once supported */
+	conn_ctx->required_params.hdr_digest = 0;
+	conn_ctx->required_params.data_digest = 0;
+
+	conn_ctx->required_params.maxr2t = QEDN_MAX_OUTSTANDING_R2T_PDUS;
+	conn_ctx->required_params.pfv = NVME_TCP_PFV_1_0;
+	conn_ctx->required_params.cpda = 0;
+	conn_ctx->required_params.hpda = 0;
+	conn_ctx->required_params.maxh2cdata = QEDN_MAX_PDU_SIZE;
+}
+
 static int qedn_create_queue(struct nvme_tcp_ofld_queue *queue, int qid, size_t q_size)
 {
 	struct nvme_tcp_ofld_ctrl *ctrl = queue->ctrl;
@@ -307,6 +320,7 @@  static int qedn_create_queue(struct nvme_tcp_ofld_queue *queue, int qid, size_t
 	conn_ctx->ctrl = ctrl;
 	conn_ctx->sq_depth = q_size;
 	qedn_set_ctrl_io_cpus(conn_ctx, qid);
+	qedn_set_pdu_params(conn_ctx);
 
 	init_waitqueue_head(&conn_ctx->conn_waitq);
 	atomic_set(&conn_ctx->est_conn_indicator, 0);
@@ -1073,6 +1087,14 @@  static int qedn_probe(struct pci_dev *pdev, const struct pci_device_id *id)
 	return __qedn_probe(pdev);
 }
 
+void qedn_swap_bytes(u32 *p, int size)
+{
+	int i;
+
+	for (i = 0; i < size; ++i, ++p)
+		*p = __swab32(*p);
+}
+
 static struct pci_driver qedn_pci_driver = {
 	.name     = QEDN_MODULE_NAME,
 	.id_table = qedn_pci_tbl,
diff --git a/drivers/nvme/hw/qedn/qedn_task.c b/drivers/nvme/hw/qedn/qedn_task.c
index 54f2f4cba6ea..9cb84883e95e 100644
--- a/drivers/nvme/hw/qedn/qedn_task.c
+++ b/drivers/nvme/hw/qedn/qedn_task.c
@@ -536,9 +536,11 @@  void qedn_io_work_cq(struct qedn_ctx *qedn, struct nvmetcp_fw_cqe *cqe)
 			break;
 
 		case NVMETCP_TASK_TYPE_INIT_CONN_REQUEST:
-
-			/* Placeholder - ICReq flow */
-
+			/* Clear ICReq-padding SGE from SGL */
+			qedn_common_clear_fw_sgl(&qedn_task->sgl_task_params);
+			/* Task is not required for icresp processing */
+			qedn_return_task_to_pool(conn_ctx, qedn_task);
+			qedn_prep_icresp(conn_ctx, cqe);
 			break;
 		default:
 			pr_info("Could not identify task type\n");

[RFC,v4,24/27] qedn: Add support of NVME ICReq & ICResp

Checks

Commit Message

Comments

Patch