21.17. Controlling the SCSI Command Timer and Device Status
The Linux SCSI layer sets a timer on each command. When this timer expires, the SCSI layer will quiesce the host bus adapter (HBA) and wait for all outstanding commands to either time out or complete. Afterwards, the SCSI layer will activate the driver's error handler.
When the error handler is triggered, it attempts the following operations in order (until one successfully executes):
-
Abort the command.
-
Reset the device.
-
Reset the bus.
-
Reset the host.
If all of these operations fail, the device will be set to the offline
state. When this occurs, all I/O to that device will be failed, until the problem is corrected and the user sets the device to running
.
The process is different, however, if a device uses the fibre channel protocol and the rport
is blocked. In such cases, the drivers wait for several seconds for the rport
to become online again before activating the error handler. This prevents devices from becoming offline due to temporary transport problems.
To display the state of a device, use:
cat /sys/block/device-name
/device/state
To set a device to running
state, use:
echo running > /sys/block/device-name
/device/state
To control the command timer, you can write to /sys/block/device-name
/device/timeout
. To do so, run:
echo value
/sys/block/device-name
/device/timeout
Here, value
is the timeout value (in seconds) you want to implement.