[08/51] zfcp: open-code fc_block_scsi_eh() for host reset

Message ID	20210817091456.73342-9-hare@suse.de (mailing list archive)
State	Changes Requested
Headers	Return-Path: <linux-scsi-owner@kernel.org>
	From: Hannes Reinecke <hare@suse.de>
	To: "Martin K. Petersen" <martin.petersen@oracle.com>
	Cc: Christoph Hellwig <hch@lst.de>, James Bottomley <james.bottomley@hansenpartnership.com>, linux-scsi@vger.kernel.org, Hannes Reinecke <hare@suse.de>, Hannes Reinecke <hare@suse.com>, Steffen Maier <maier@linux.ibm.com>, Benjamin Block <bblock@linux.ibm.com>
	Subject: [PATCH 08/51] zfcp: open-code fc_block_scsi_eh() for host reset
	Date: Tue, 17 Aug 2021 11:14:13 +0200
	Message-Id: <20210817091456.73342-9-hare@suse.de>
	In-Reply-To: <20210817091456.73342-1-hare@suse.de>
	References: <20210817091456.73342-1-hare@suse.de>
	MIME-Version: 1.0
	Content-Transfer-Encoding: 8bit
	Precedence: bulk
Series	[PATCHv2,00/51] SCSI EH argument reshuffle part II

Hannes Reinecke Aug. 17, 2021, 9:14 a.m. UTC

When issuing a host reset we should wait for all ports
to become unblocked; waiting for just one might result
in the host reset returning too early.

Signed-off-by: Hannes Reinecke <hare@suse.com>
Cc: Steffen Maier <maier@linux.ibm.com>
Cc: Benjamin Block <bblock@linux.ibm.com>
---
 drivers/s390/scsi/zfcp_scsi.c | 29 +++++++++++++++++++++++------
 1 file changed, 23 insertions(+), 6 deletions(-)

Benjamin Block Aug. 17, 2021, 11:53 a.m. UTC | #1

On Tue, Aug 17, 2021 at 11:14:13AM +0200, Hannes Reinecke wrote:
> @@ -383,9 +385,24 @@ static int zfcp_scsi_eh_host_reset_handler(struct scsi_cmnd *scpnt)
>  	}
>  	zfcp_erp_adapter_reopen(adapter, 0, "schrh_1");
>  	zfcp_erp_wait(adapter);
> -	fc_ret = fc_block_scsi_eh(scpnt);
> -	if (fc_ret)
> -		ret = fc_ret;
> +retry_rport_blocked:
> +	spin_lock_irqsave(host->host_lock, flags);
> +	list_for_each_entry(port, &adapter->port_list, list) {

You need to take the `adapter->port_list_lock` to iterate over the `port_list`.

i.e.: read_lock_irqsave(&adapter->port_list_lock, flags);

> +		struct fc_rport *rport = port->rport;
> +
> +		if (rport->port_state == FC_PORTSTATE_BLOCKED) {
> +			if (rport->flags & FC_RPORT_FAST_FAIL_TIMEDOUT)
> +				ret = FAST_IO_FAIL;
> +			else
> +				ret = NEEDS_RETRY;
> +			break;
> +		}
> +	}
> +	spin_unlock_irqrestore(host->host_lock, flags);
> +	if (ret == NEEDS_RETRY) {
> +		msleep(1000);
> +		goto retry_rport_blocked;
> +	}

I really can't say I like this open coded FC code in the driver at all.

Is there a reason we can't use `fc_block_rport()` for all the rports of
the adapter?

We already use it for other EH callbacks in the same file, and you
already look up the rports in the adapter's rport-list; so using that on
the rports in the loop, instead of open-coding it, doesn't seem bad? Or
is there a locking problem?

We might waste a few cycles with that, but frankly, this is all in EH
and after adapter reset... all performance concerns went out of the
window with that already.
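
For reference, the wait that the quoted hunk open-codes can be modelled in userspace. The sketch below is not the kernel's fc_block_rport(); it only mirrors the condition visible in the hunk, where a port that is blocked and has not hit its fast-fail timeout is polled again after a delay. All `mock_` names, the `polls_left` and `unblocks` fields, the simulated sleep, and the nonzero return value are assumptions made for illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical userspace stand-in for a remote port; not a kernel struct. */
struct mock_rport {
	bool blocked;            /* models port_state == FC_PORTSTATE_BLOCKED */
	bool fast_fail_timedout; /* models FC_RPORT_FAST_FAIL_TIMEDOUT */
	int polls_left;          /* simulation only: polls until something happens */
	bool unblocks;           /* simulation only: true = unblock, false = time out */
};

/* Stand-in for msleep(1000): advances the simulated port by one poll. */
static void mock_sleep_1s(struct mock_rport *rport)
{
	if (rport->polls_left > 0)
		rport->polls_left--;
	if (rport->polls_left <= 0) {
		if (rport->unblocks)
			rport->blocked = false;
		else
			rport->fast_fail_timedout = true;
	}
}

/*
 * Poll until the port is no longer blocked or its fast-fail timeout fired.
 * Returns 0 if the port is usable, nonzero if it timed out.
 */
static int mock_block_rport(struct mock_rport *rport)
{
	assert(rport != NULL);

	while (rport->blocked && !rport->fast_fail_timedout)
		mock_sleep_1s(rport);

	return rport->fast_fail_timedout ? -1 : 0;
}
```

The loop has no iteration limit of its own; in this model it terminates only because the simulated port always either unblocks or times out.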

Hannes Reinecke Aug. 17, 2021, 12:54 p.m. UTC | #2

On 8/17/21 1:53 PM, Benjamin Block wrote:
> On Tue, Aug 17, 2021 at 11:14:13AM +0200, Hannes Reinecke wrote:
>> @@ -383,9 +385,24 @@ static int zfcp_scsi_eh_host_reset_handler(struct scsi_cmnd *scpnt)
>>  	}
>>  	zfcp_erp_adapter_reopen(adapter, 0, "schrh_1");
>>  	zfcp_erp_wait(adapter);
>> -	fc_ret = fc_block_scsi_eh(scpnt);
>> -	if (fc_ret)
>> -		ret = fc_ret;
>> +retry_rport_blocked:
>> +	spin_lock_irqsave(host->host_lock, flags);
>> +	list_for_each_entry(port, &adapter->port_list, list) {
> 
> You need to take the `adapter->port_list_lock` to iterate over the `port_list`.
> 
> i.e.: read_lock_irqsave(&adapter->port_list_lock, flags);
> 
>> +		struct fc_rport *rport = port->rport;
>> +
>> +		if (rport->port_state == FC_PORTSTATE_BLOCKED) {
>> +			if (rport->flags & FC_RPORT_FAST_FAIL_TIMEDOUT)
>> +				ret = FAST_IO_FAIL;
>> +			else
>> +				ret = NEEDS_RETRY;
>> +			break;
>> +		}
>> +	}
>> +	spin_unlock_irqrestore(host->host_lock, flags);
>> +	if (ret == NEEDS_RETRY) {
>> +		msleep(1000);
>> +		goto retry_rport_blocked;
>> +	}
> 
> I really can't say I like this open coded FC code in the driver at all.
> 
> Is there a reason we can't use `fc_block_rport()` for all the rports of
> the adapter?
> 
> We already use it for other EH callbacks in the same file, and you
> already look up the rports in the adapter's rport-list; so using that on
> the rports in the loop, instead of open-coding it, doesn't seem bad? Or
> is there a locking problem?
> 
> We might waste a few cycles with that, but frankly, this is all in EH
> and after adapter reset... all performance concerns went out of the
> window with that already.
> 

Question would be why we need to call fc_block_rport() at all in host reset.
To my understanding a host reset is expected to do a full resync of the
SAN topology, so the expectation is that after zfcp_erp_wait() the port
list is stable (ie the HBA has finished processing all RSCNs related to
the SAN resync).
So can't we just drop the fc_block_rport() call here?
All the other FC drivers do fine without that ...

Cheers,

Hannes

Steffen Maier Aug. 17, 2021, 2:03 p.m. UTC | #3

On 8/17/21 2:54 PM, Hannes Reinecke wrote:
> On 8/17/21 1:53 PM, Benjamin Block wrote:
>> On Tue, Aug 17, 2021 at 11:14:13AM +0200, Hannes Reinecke wrote:
>>> @@ -383,9 +385,24 @@ static int zfcp_scsi_eh_host_reset_handler(struct scsi_cmnd *scpnt)
>>>   	}
>>>   	zfcp_erp_adapter_reopen(adapter, 0, "schrh_1");
>>>   	zfcp_erp_wait(adapter);
>>> -	fc_ret = fc_block_scsi_eh(scpnt);
>>> -	if (fc_ret)
>>> -		ret = fc_ret;
>>> +retry_rport_blocked:
>>> +	spin_lock_irqsave(host->host_lock, flags);
>>> +	list_for_each_entry(port, &adapter->port_list, list) {
>>
>> You need to take the `adapter->port_list_lock` to iterate over the `port_list`.
>>
>> i.e.: read_lock_irqsave(&adapter->port_list_lock, flags);
>>
>>> +		struct fc_rport *rport = port->rport;
>>> +
>>> +		if (rport->port_state == FC_PORTSTATE_BLOCKED) {
>>> +			if (rport->flags & FC_RPORT_FAST_FAIL_TIMEDOUT)
>>> +				ret = FAST_IO_FAIL;
>>> +			else
>>> +				ret = NEEDS_RETRY;
>>> +			break;
>>> +		}
>>> +	}
>>> +	spin_unlock_irqrestore(host->host_lock, flags);
>>> +	if (ret == NEEDS_RETRY) {
>>> +		msleep(1000);
>>> +		goto retry_rport_blocked;
>>> +	}
>>
>> I really can't say I like this open coded FC code in the driver at all.
>>
>> Is there a reason we can't use `fc_block_rport()` for all the rports of
>> the adapter?

Waiting for all rports to unblock in host_reset has been on my todo list since 
we prepared the eh callbacks to get rid of scsi_cmnd with v4.18 commits:
674595d8519f ("scsi: zfcp: decouple our scsi_eh callbacks from scsi_cmnd")
42afc6527d43 ("scsi: zfcp: decouple TMFs from scsi_cmnd by using fc_block_rport")
26f5fa9d47c1 ("scsi: zfcp: decouple SCSI setup of TMF from scsi_cmnd")
39abb11aca00 ("scsi: zfcp: decouple FSF request setup of TMF from scsi_cmnd")
e0116c91c7d8 ("scsi: zfcp: split FCP_CMND IU setup between SCSI I/O and TMF again")
266883f2f7d5 ("scsi: zfcp: decouple TMF response handler from scsi_cmnd")
822121186375 ("scsi: zfcp: decouple SCSI traces for scsi_eh / TMF from scsi_cmnd")

But the synchronization is non-trivial as Benjamin's question shows. There are 
also considerations about lock order, etc.

I'm busy with other things, so don't hold your breath until I can review and 
test the code; I don't want any regression in that recovery code.

>> We already use it for other EH callbacks in the same file, and you
>> already look up the rports in the adapter's rport-list; so using that on
>> the rports in the loop, instead of open-coding it, doesn't seem bad? Or
>> is there a locking problem?
>>
>> We might waste a few cycles with that, but frankly, this is all in EH
>> and after adapter reset... all performance concerns went out of the
>> window with that already.
>>
> 
> Question would be why we need to call fc_block_rport() at all in host reset.
> To my understanding a host reset is expected to do a full resync of the
> SAN topology, so the expectation is that after zfcp_erp_wait() the port
> list is stable (ie the HBA has finished processing all RSCNs related to
> the SAN resync).

There is more to do in zfcp than in other FC HBA drivers, e.g. LUN open
recoveries and how they relate to rport unblock:
v4.10 6f2ce1c6af37 ("scsi: zfcp: fix rport unblock race with LUN recovery").
The rport unblock is async to our internal recovery. zfcp_erp_wait() only waits 
for the latter by design.

> So can't we just drop the fc_block_rport() call here?

I don't think so.

> All the other FC drivers do fine without that ...

It would have been nice to have a common interface for all scsi_eh scopes. I.e. 
fc_block_host(struct Scsi_Host*) like we already have for 
fc_block_scsi_eh(struct scsi_cmnd*) and fc_block_rport(struct fc_rport*) [the 
latter having been introduced at the time of above eh callback preparations].
But if zfcp is the only one needing it for host_reset, having the code only in 
zfcp seems fine to me.

Hannes Reinecke Aug. 17, 2021, 2:10 p.m. UTC | #4

On 8/17/21 4:03 PM, Steffen Maier wrote:
> On 8/17/21 2:54 PM, Hannes Reinecke wrote:
>> On 8/17/21 1:53 PM, Benjamin Block wrote:
>>> On Tue, Aug 17, 2021 at 11:14:13AM +0200, Hannes Reinecke wrote:
>>>> @@ -383,9 +385,24 @@ static int
>>>> zfcp_scsi_eh_host_reset_handler(struct scsi_cmnd *scpnt)
>>>>       }
>>>>       zfcp_erp_adapter_reopen(adapter, 0, "schrh_1");
>>>>       zfcp_erp_wait(adapter);
>>>> -    fc_ret = fc_block_scsi_eh(scpnt);
>>>> -    if (fc_ret)
>>>> -        ret = fc_ret;
>>>> +retry_rport_blocked:
>>>> +    spin_lock_irqsave(host->host_lock, flags);
>>>> +    list_for_each_entry(port, &adapter->port_list, list) {
>>>
>>> You need to take the `adapter->port_list_lock` to iterate over the
>>> `port_list`.
>>>
>>> i.e.: read_lock_irqsave(&adapter->port_list_lock, flags);
>>>
>>>> +        struct fc_rport *rport = port->rport;
>>>> +
>>>> +        if (rport->port_state == FC_PORTSTATE_BLOCKED) {
>>>> +            if (rport->flags & FC_RPORT_FAST_FAIL_TIMEDOUT)
>>>> +                ret = FAST_IO_FAIL;
>>>> +            else
>>>> +                ret = NEEDS_RETRY;
>>>> +            break;
>>>> +        }
>>>> +    }
>>>> +    spin_unlock_irqrestore(host->host_lock, flags);
>>>> +    if (ret == NEEDS_RETRY) {
>>>> +        msleep(1000);
>>>> +        goto retry_rport_blocked;
>>>> +    }
>>>
>>> I really can't say I like this open coded FC code in the driver at all.
>>>
>>> Is there a reason we can't use `fc_block_rport()` for all the rports of
>>> the adapter?
> 
> Waiting for all rports to unblock in host_reset has been on my todo list
> since we prepared the eh callbacks to get rid of scsi_cmnd with v4.18
> commits:
> 674595d8519f ("scsi: zfcp: decouple our scsi_eh callbacks from scsi_cmnd")
> 42afc6527d43 ("scsi: zfcp: decouple TMFs from scsi_cmnd by using
> fc_block_rport")
> 26f5fa9d47c1 ("scsi: zfcp: decouple SCSI setup of TMF from scsi_cmnd")
> 39abb11aca00 ("scsi: zfcp: decouple FSF request setup of TMF from
> scsi_cmnd")
> e0116c91c7d8 ("scsi: zfcp: split FCP_CMND IU setup between SCSI I/O and
> TMF again")
> 266883f2f7d5 ("scsi: zfcp: decouple TMF response handler from scsi_cmnd")
> 822121186375 ("scsi: zfcp: decouple SCSI traces for scsi_eh / TMF from
> scsi_cmnd")
> 
> But the synchronization is non-trivial as Benjamin's question shows.
> There are also considerations about lock order, etc.
> 
> I'm busy with other things, so don't hold your breath until I can review
> and test the code; I don't want any regression in that recovery code.
> 
>>> We already use it for other EH callbacks in the same file, and you
>>> already look up the rports in the adapter's rport-list; so using that on
>>> the rports in the loop, instead of open-coding it, doesn't seem bad? Or
>>> is there a locking problem?
>>>
>>> We might waste a few cycles with that, but frankly, this is all in EH
>>> and after adapter reset... all performance concerns went out of the
>>> window with that already.
>>>
>>
>> Question would be why we need to call fc_block_rport() at all in host
>> reset.
>> To my understanding a host reset is expected to do a full resync of the
>> SAN topology, so the expectation is that after zfcp_erp_wait() the port
>> list is stable (ie the HBA has finished processing all RSCNs related to
>> the SAN resync).
> 
> There is more to do in zfcp than in other FC HBA drivers, e.g. LUN open
> recoveries and how they relate to rport unblock:
> v4.10 6f2ce1c6af37 ("scsi: zfcp: fix rport unblock race with LUN
> recovery").
> The rport unblock is async to our internal recovery. zfcp_erp_wait()
> only waits for the latter by design.
> 
>> So can't we just drop the fc_block_rport() call here?
> 
> I don't think so.
> 
>> All the other FC drivers do fine without that ...
> 
> It would have been nice to have a common interface for all scsi_eh
> scopes. I.e. fc_block_host(struct Scsi_Host*) like we already have for
> fc_block_scsi_eh(struct scsi_cmnd*) and fc_block_rport(struct fc_rport*)
> [the latter having been introduced at the time of above eh callback
> preparations].
> But if zfcp is the only one needing it for host_reset, having the code
> only in zfcp seems fine to me.
> 
> 
Right. Just wanted to clarify that.
If we need to use fc_block_rport() in host reset so be it; just wanted
to clarify if this _really_ is the case (and not just some copy'n'paste
stuff).
I'll be reworking the patch to call fc_block_rport().

Cheers,

Hannes

Steffen Maier Aug. 18, 2021, 11 a.m. UTC | #5

On 8/17/21 4:10 PM, Hannes Reinecke wrote:
> On 8/17/21 4:03 PM, Steffen Maier wrote:
>> On 8/17/21 2:54 PM, Hannes Reinecke wrote:
>>> On 8/17/21 1:53 PM, Benjamin Block wrote:
>>>> On Tue, Aug 17, 2021 at 11:14:13AM +0200, Hannes Reinecke wrote:
>>>>> @@ -383,9 +385,24 @@ static int
>>>>> zfcp_scsi_eh_host_reset_handler(struct scsi_cmnd *scpnt)
>>>>>        }
>>>>>        zfcp_erp_adapter_reopen(adapter, 0, "schrh_1");
>>>>>        zfcp_erp_wait(adapter);
>>>>> -    fc_ret = fc_block_scsi_eh(scpnt);
>>>>> -    if (fc_ret)
>>>>> -        ret = fc_ret;
>>>>> +retry_rport_blocked:
>>>>> +    spin_lock_irqsave(host->host_lock, flags);
>>>>> +    list_for_each_entry(port, &adapter->port_list, list) {
>>>>
>>>> You need to take the `adapter->port_list_lock` to iterate over the
>>>> `port_list`.
>>>>
>>>> i.e.: read_lock_irqsave(&adapter->port_list_lock, flags);
>>>>
>>>>> +        struct fc_rport *rport = port->rport;
>>>>> +
>>>>> +        if (rport->port_state == FC_PORTSTATE_BLOCKED) {
>>>>> +            if (rport->flags & FC_RPORT_FAST_FAIL_TIMEDOUT)
>>>>> +                ret = FAST_IO_FAIL;
>>>>> +            else
>>>>> +                ret = NEEDS_RETRY;
>>>>> +            break;
>>>>> +        }
>>>>> +    }
>>>>> +    spin_unlock_irqrestore(host->host_lock, flags);
>>>>> +    if (ret == NEEDS_RETRY) {
>>>>> +        msleep(1000);
>>>>> +        goto retry_rport_blocked;
>>>>> +    }
>>>>
>>>> I really can't say I like this open coded FC code in the driver at all.
>>>>
>>>> Is there a reason we can't use `fc_block_rport()` for all the rports of
>>>> the adapter?
>>
>> Waiting for all rports to unblock in host_reset has been on my todo list
>> since we prepared the eh callbacks to get rid of scsi_cmnd with v4.18
>> commits:
>> 674595d8519f ("scsi: zfcp: decouple our scsi_eh callbacks from scsi_cmnd")
>> 42afc6527d43 ("scsi: zfcp: decouple TMFs from scsi_cmnd by using
>> fc_block_rport")
>> 26f5fa9d47c1 ("scsi: zfcp: decouple SCSI setup of TMF from scsi_cmnd")
>> 39abb11aca00 ("scsi: zfcp: decouple FSF request setup of TMF from
>> scsi_cmnd")
>> e0116c91c7d8 ("scsi: zfcp: split FCP_CMND IU setup between SCSI I/O and
>> TMF again")
>> 266883f2f7d5 ("scsi: zfcp: decouple TMF response handler from scsi_cmnd")
>> 822121186375 ("scsi: zfcp: decouple SCSI traces for scsi_eh / TMF from
>> scsi_cmnd")
>>
>> But the synchronization is non-trivial as Benjamin's question shows.
>> There are also considerations about lock order, etc.
>>
>> I'm busy with other things, so don't hold your breath until I can review
>> and test the code; I don't want any regression in that recovery code.
>>
>>>> We already use it for other EH callbacks in the same file, and you
>>>> already look up the rports in the adapter's rport-list; so using that on
>>>> the rports in the loop, instead of open-coding it, doesn't seem bad? Or
>>>> is there a locking problem?
>>>>
>>>> We might waste a few cycles with that, but frankly, this is all in EH
>>>> and after adapter reset... all performance concerns went out of the
>>>> window with that already.
>>>>
>>>
>>> Question would be why we need to call fc_block_rport() at all in host
>>> reset.
>>> To my understanding a host reset is expected to do a full resync of the
>>> SAN topology, so the expectation is that after zfcp_erp_wait() the port
>>> list is stable (ie the HBA has finished processing all RSCNs related to
>>> the SAN resync).
>>
>> There is more to do in zfcp than in other FC HBA drivers, e.g. LUN open
>> recoveries and how they relate to rport unblock:
>> v4.10 6f2ce1c6af37 ("scsi: zfcp: fix rport unblock race with LUN
>> recovery").
>> The rport unblock is async to our internal recovery. zfcp_erp_wait()
>> only waits for the latter by design.
>>
>>> So can't we just drop the fc_block_rport() call here?
>>
>> I don't think so.
>>
>>> All the other FC drivers do fine without that ...
>>
>> It would have been nice to have a common interface for all scsi_eh
>> scopes. I.e. fc_block_host(struct Scsi_Host*) like we already have for
>> fc_block_scsi_eh(struct scsi_cmnd*) and fc_block_rport(struct fc_rport*)
>> [the latter having been introduced at the time of above eh callback
>> preparations].
>> But if zfcp is the only one needing it for host_reset, having the code
>> only in zfcp seems fine to me.
>>
>>
> Right. Just wanted to clarify that.
> If we need to use fc_block_rport() in host reset so be it; just wanted
> to clarify if this _really_ is the case (and not just some copy'n'paste
> stuff).
> I'll be reworking the patch to call fc_block_rport().

On second thought, I might have been wrong.

The argument I used with the old commit was that we must not unblock the rport
too early with regard to zfcp-internal recovery. This is fixed within the zfcp
recovery (erp) code. So after zfcp_erp_wait() in host_reset, this is still
ensured; and eventually the rport unblock will occur.

I guess I was rather worried about returning from the host_reset callback with
the async rport(s) unblock still pending. After all, (some) other reset handlers
sync with rport unblock. However, I cannot remember all the details right now.

Before you invest more time into this, maybe just drop this patch from the 
series for now and we solve it later on? I mean it's not necessary for the 
reset_handler function signature change.

Hannes Reinecke Aug. 18, 2021, 11:58 a.m. UTC | #6

On 8/18/21 1:00 PM, Steffen Maier wrote:
> On 8/17/21 4:10 PM, Hannes Reinecke wrote:
>> On 8/17/21 4:03 PM, Steffen Maier wrote:
[ .. ]
>>> It would have been nice to have a common interface for all scsi_eh
>>> scopes. I.e. fc_block_host(struct Scsi_Host*) like we already have for
>>> fc_block_scsi_eh(struct scsi_cmnd*) and fc_block_rport(struct fc_rport*)
>>> [the latter having been introduced at the time of above eh callback
>>> preparations].
>>> But if zfcp is the only one needing it for host_reset, having the code
>>> only in zfcp seems fine to me.
>>>
>>>
>> Right. Just wanted to clarify that.
>> If we need to use fc_block_rport() in host reset so be it; just wanted
>> to clarify if this _really_ is the case (and not just some copy'n'paste
>> stuff).
>> I'll be reworking the patch to call fc_block_rport().
> 
> On second thought, I might have been wrong.
> 
> The argument I used with the old commit was that we must not unblock the
> rport too early with regard to zfcp-internal recovery. This is fixed
> within the zfcp recovery (erp) code. So after zfcp_erp_wait() in host_reset,
> this is still ensured; and eventually the rport unblock will occur.
> 
> I guess I was rather worried about returning from the host_reset
> callback with the async rport(s) unblock still pending. After all,
> (some) other reset handlers sync with rport unblock. However, I cannot
> remember all the details right now.
> 
> Before you invest more time into this, maybe just drop this patch from
> the series for now and we solve it later on? I mean it's not necessary
> for the reset_handler function signature change.
> 
Well, actually it is.
With the signature change host_reset is being called with a Scsi_Host
argument, so we cannot identify 'the' rport.
But I've modified the patch to cycle through all rports and call
fc_block_rport() on each of them.
That should be good enough for now.

Cheers,

Hannes
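
A userspace sketch of the approach described in this last reply: walk every remote port of the host, wait on each in turn, and report a fast-fail result if any port timed out. This is not the reworked patch, which is not shown in this thread. All `mock_` names, the result enum, and the simulation fields are hypothetical stand-ins. The kernel-side locking (the `port_list_lock` raised in comment #1, and the fact that a sleeping wait cannot run while such a lock is held) is only noted in comments.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical result codes standing in for SUCCESS / FAST_IO_FAIL. */
enum mock_eh_result { MOCK_SUCCESS, MOCK_FAST_IO_FAIL };

/* Hypothetical userspace stand-in for a remote port; not a kernel struct. */
struct mock_rport {
	bool blocked;            /* models FC_PORTSTATE_BLOCKED */
	bool fast_fail_timedout; /* models FC_RPORT_FAST_FAIL_TIMEDOUT */
	int polls_left;          /* simulation only: polls until something happens */
	bool unblocks;           /* simulation only: true = unblock, false = time out */
};

/* Stand-in for a one-second sleep: advances the simulated port by one poll. */
static void mock_sleep_1s(struct mock_rport *rport)
{
	if (rport->polls_left > 0)
		rport->polls_left--;
	if (rport->polls_left <= 0) {
		if (rport->unblocks)
			rport->blocked = false;
		else
			rport->fast_fail_timedout = true;
	}
}

/* Per-port wait: 0 if the port is usable, nonzero if it timed out. */
static int mock_block_rport(struct mock_rport *rport)
{
	while (rport->blocked && !rport->fast_fail_timedout)
		mock_sleep_1s(rport);
	return rport->fast_fail_timedout ? -1 : 0;
}

/*
 * Wait on every port; one timed-out port makes the whole reset report
 * fast-fail, but the remaining ports are still waited on.
 *
 * In a driver the port list would have to be walked under its list lock,
 * and the sleeping wait could not happen while that lock is held. How the
 * real patch resolves this is not shown here.
 */
static enum mock_eh_result mock_block_all_rports(struct mock_rport *ports,
						 size_t nports)
{
	enum mock_eh_result ret = MOCK_SUCCESS;
	size_t i;

	assert(ports != NULL || nports == 0);

	for (i = 0; i < nports; i++) {
		if (mock_block_rport(&ports[i]) != 0)
			ret = MOCK_FAST_IO_FAIL;
	}
	return ret;
}
```

Continuing past a timed-out port, rather than stopping at the first one, is a design choice of this sketch: it keeps the "wait for all ports" intent of the commit message even when one port has already failed.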
