Hi All,
I have an Adaptec 19160B Ultra160 SCSI controller with a
QUANTUM ATLAS10K3_36_WLS disk. Running Mandrake Linux 8.2
with its default kernel 2.4.18-6mdk. The system auto-detected
the controller and installed the AIC7XXX driver for it.
It works most of the time, however I do get some problems
time-to-time (typically when there is a heavy load on the
disk, e.g. copy/read/write some large files). In these
cases the whole system "hangs" for a minute or two, then
I see in the /var/log/messages file the following kind of
lines:
Nov 14 15:41:07 mercury kernel: scsi0:0:6:0: Attempting to queue an ABORT message
Nov 14 15:41:07 mercury kernel: scsi0:0:6:0: Command found on device queue
Nov 14 15:41:07 mercury kernel: aic7xxx_abort returns 0x2002
Nov 14 15:41:07 mercury kernel: scsi0:0:6:0: Attempting to queue an ABORT message
Nov 14 15:41:07 mercury kernel: scsi0: Dumping Card State while idle, at SEQADDR 0x9
Nov 14 15:41:07 mercury kernel: ACCUM = 0x4, SINDEX = 0x64, DINDEX = 0x65, ARG_2 = 0x7
Nov 14 15:41:07 mercury kernel: HCNT = 0x0
Nov 14 15:41:07 mercury kernel: SCSISEQ = 0x12, SBLKCTL = 0xa
Nov 14 15:41:07 mercury kernel: DFCNTRL = 0x0, DFSTATUS = 0x89
Nov 14 15:41:07 mercury kernel: LASTPHASE = 0x1, SCSISIGI = 0x0, SXFRCTL0 = 0x80
Nov 14 15:41:07 mercury kernel: SSTAT0 = 0x0, SSTAT1 = 0x8
Nov 14 15:41:07 mercury kernel: SCSIPHASE = 0x0
Nov 14 15:41:07 mercury kernel: STACK == 0x3, 0x175, 0x160, 0xe7
Nov 14 15:41:07 mercury kernel: SCB count = 112
Nov 14 15:41:07 mercury kernel: Kernel NEXTQSCB = 9
Nov 14 15:41:07 mercury kernel: Card NEXTQSCB = 9
Nov 14 15:41:07 mercury kernel: QINFIFO entries:
Nov 14 15:41:07 mercury kernel: Waiting Queue entries:
Nov 14 15:41:07 mercury kernel: Disconnected Queue entries: 27:28 20:30 11:24 15:65 13:25 26:4 18:21 7:39 14:41 5:36 2:31 12:15 24:14 28:19 10:23 23:1 0:54 29:57 9:11 16:34 22:61 25:66 17:50 31:64 3:26 8:20 21:47 6:33
1:42 19:56 30:18 4:55
Nov 14 15:41:07 mercury kernel: QOUTFIFO entries:
Nov 14 15:41:07 mercury kernel: Sequencer Free SCB List:
Nov 14 15:41:07 mercury kernel: Pending list: 28, 30, 24, 65, 25, 4, 21, 39, 41, 36, 31, 15, 7, 35, 14, 53, 13, 19, 48, 12, 16, 58, 5, 67, 8, 29, 23, 46, 62, 52, 37, 1, 22, 6, 10, 17, 40, 63, 38, 32, 61, 111, 54, 60, 27, 49, 45, 59, 51, 57, 11, 34, 44, 66, 50, 64, 26, 20, 47, 33, 42, 56, 18, 55
Nov 14 15:41:07 mercury kernel: Kernel Free SCB list: 43 104 105 106 107 100 101 102 103 96 97 98 99 92 93 94 95 88 89 90 91 84 85 86 87 80 81 82 83 76 77 78 79 72 73 74 75 68 69 70 71 3 0 2 110 109 108
Nov 14 15:41:07 mercury kernel: DevQ(0:6:0): 150 waiting
Nov 14 15:41:07 mercury kernel: (scsi0:A:6:0): Queuing a recovery SCB
Nov 14 15:41:07 mercury kernel: scsi0:0:6:0: Device is disconnected, re-queuing SCB
Nov 14 15:41:07 mercury kernel: Recovery code sleeping
Nov 14 15:41:07 mercury kernel: (scsi0:A:6:0): Abort Tag Message Sent
Nov 14 15:41:07 mercury kernel: (scsi0:A:6:0): SCB 55 - Abort Tag Completed.
Nov 14 15:41:07 mercury kernel: Recovery SCB completes
Nov 14 15:41:07 mercury kernel: Recovery code awake
Nov 14 15:41:07 mercury kernel: aic7xxx_abort returns 0x2002
Nov 14 15:41:07 mercury kernel: scsi0:0:6:0: Attempting to queue an ABORT message
Nov 14 15:41:07 mercury kernel: scsi0: Dumping Card State while idle, at SEQADDR 0x9
Nov 14 15:41:07 mercury kernel: ACCUM = 0x0, SINDEX = 0x37, DINDEX = 0xe4, ARG_2 = 0x0
Nov 14 15:41:07 mercury kernel: HCNT = 0x0
Nov 14 15:41:07 mercury kernel: SCSISEQ = 0x12, SBLKCTL = 0xa
Nov 14 15:41:07 mercury kernel: DFCNTRL = 0x0, DFSTATUS = 0x89
Nov 14 15:41:07 mercury kernel: LASTPHASE = 0x1, SCSISIGI = 0x0, SXFRCTL0 = 0x80
Nov 14 15:41:07 mercury kernel: SSTAT0 = 0x0, SSTAT1 = 0x8
Nov 14 15:41:07 mercury kernel: SCSIPHASE = 0x0
Nov 14 15:41:07 mercury kernel: STACK == 0x3, 0x108, 0x160, 0xe7
Nov 14 15:41:07 mercury kernel: SCB count = 112
Nov 14 15:41:07 mercury kernel: Kernel NEXTQSCB = 9
Nov 14 15:41:07 mercury kernel: Card NEXTQSCB = 9
Nov 14 15:41:07 mercury kernel: QINFIFO entries:
Nov 14 15:41:07 mercury kernel: Waiting Queue entries:
Nov 14 15:41:07 mercury kernel: Disconnected Queue entries: 27:28 20:30 11:24 15:65 13:25 26:4 18:21 7:39 14:41 5:36 2:31 12:15 24:14 28:19 10:23 23:1 0:54 29:57 9:11 16:34 22:61 25:66 17:50 31:64 3:26 8:20 21:47 6:33
1:42 19:56 30:18
Nov 14 15:41:07 mercury kernel: QOUTFIFO entries:
Nov 14 15:41:07 mercury kernel: Sequencer Free SCB List: 4
Nov 14 15:41:07 mercury kernel: Pending list: 28, 30, 24, 65, 25, 4, 21, 39, 41, 36, 31, 15, 7, 35, 14, 53, 13, 19, 48, 12, 16, 58, 5, 67, 8, 29, 23, 46, 62, 52, 37, 1, 22, 6, 10, 17, 40, 63, 38, 32, 61, 111, 54, 60, 27, 49, 45, 59, 51, 57, 11, 34, 44, 66, 50, 64, 26, 20, 47, 33, 42, 56, 18
Nov 14 15:41:07 mercury kernel: Kernel Free SCB list: 55 43 104 105 106 107 100 101 102 103 96 97 98 99 92 93 94 95 88 89 90 91 84 85 86 87 80 81 82 83 76 77 78 79 72 73 74 75 68 69 70 71 3 0 2 110 109 108
Nov 14 15:41:07 mercury kernel: DevQ(0:6:0): 0 waiting
Nov 14 15:41:07 mercury kernel: (scsi0:A:6:0): Queuing a recovery SCB
Nov 14 15:41:07 mercury kernel: scsi0:0:6:0: Device is disconnected, re-queuing SCB
Nov 14 15:41:07 mercury kernel: Recovery code sleeping
Nov 14 15:41:07 mercury kernel: (scsi0:A:6:0): Abort Tag Message Sent
Nov 14 15:41:07 mercury kernel: (scsi0:A:6:0): SCB 18 - Abort Tag Completed.
Nov 14 15:41:07 mercury kernel: Recovery SCB completes
Nov 14 15:41:07 mercury kernel: Recovery code awake
Nov 14 15:41:07 mercury kernel: aic7xxx_abort returns 0x2002
Sometimes the system comes back and all the files seem OK, but other
cases some files get corrupted.
Any idea what could be the problem ?
Is it a faulty disk, faulty controller or some setup problem?
Any suggestions would be greatly appreciated!
Thanks,
Zsolt