Message ID | 20181007083537.89131-1-hare@suse.de (mailing list archive) |
---|---|
Headers | show |
Series | libfc state machine fixes | expand |
Hannes, > here are some patches for PRLI issues in libfc we've come across > recently. The libfc ones are pretty straightforward, but the scsi > state machine one probably warrants some discussion. What happened > was that in some fabrics the RSCN might get lost or incompletely > reseived. This will then cause SCSI EH to be triggered for the lost > rports, setting the devices to offline. But later on we do get an > RSCN, which would reinstate the rports, but unfortunately the devices > will remain in OFFLINE as we cannot transition back to running. The > solution I've came up with was to allow transitions from OFFLINE to > BLOCKED, as during RSCN processing the devices will be set to blocked, > and so I found it only reasonable to allow this transition. I queued this up for now since I think it is the lesser of two evils. However, I do think we'll have to have the ability to distinguish between offlined-by-user-action, offlined-by-device-error, and offlined-by-transport-event as Ewan pointed out.