We've been having problems with SCSI stalls recently on our main
server, an HP 9000/735 running HP-UX 9.01. Every 5-30 minutes, all SCSI
activity on the machine halts for about 15 seconds. syslog logs
the following kern.info messages:
Jan 26 21:35:02 envy syslog:
Jan 26 21:35:02 envy syslog: Jan 26 21:35
Jan 26 21:35:02 envy syslog: SCSI: NCR stall (normal condition), bus = 0x1
Jan 26 21:55:01 envy syslog:
Jan 26 21:55:01 envy syslog: Jan 26 21:55
Jan 26 21:55:01 envy syslog: SCSI: NCR stall (normal condition), bus = 0x1
Jan 26 21:55:01 envy last message repeated 2 times
As far as I can remember, this behavior first showed up when we added
a new disk, a Quantum XP34300, to the bus. We've since added another
XP34300. It seems to me that the stalls are getting more frequent
as time goes on, but it's hard to tell.
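To put numbers on that impression, here is a minimal sketch that counts
the stall messages per hour. It assumes the standard syslog line layout
shown above (month, day, HH:MM:SS, host, message) and an awk that
supports associative arrays and the "in" operator. stalls_per_hour is
just a name made up for this example, and the log file location has to
be supplied by whoever runs it. "last message repeated" lines are not
counted, so the totals are a lower bound.

```shell
#!/bin/sh
# Hypothetical helper: count "NCR stall" syslog entries per hour.
# Reads the files given as arguments, or stdin if none are given.
stalls_per_hour() {
    awk '/NCR stall/ {
             # $1 = month, $2 = day, $3 = HH:MM:SS; keep only the hour
             key = $1 " " $2 " " substr($3, 1, 2) ":00"
             # remember first-seen order so output stays chronological
             if (!(key in count)) order[++n] = key
             count[key]++
         }
         END {
             for (i = 1; i <= n; i++) print order[i], count[order[i]]
         }' "$@"
}

# Usage (the path is an assumption; use whichever file your
# syslog.conf sends kern.info messages to):
#   stalls_per_hour /path/to/syslog
```

Run against a few days of logs, a rising count per hour would confirm
that the stalls really are becoming more frequent.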
Does anyone have any idea what these errors are, and how to fix them?
Does anyone know of incompatibilities between the Quantum XP34300 and
HP's SCSI hardware? The XP34300s are in APS cases with Digital
Active Termination - the drive at the end of the bus is an XP34300
with its termination turned on. Perhaps DATerm is a bad idea?
I searched the HP patch database and came up with patch PHKL_6050
(s700 9.0[357] SCSI, NFS, VM, HIL, graphics kernel megapatch), which
claims to solve this kind of problem:
    PHKL_4099:
    This patch fixes two known corner cases on the NCR53C700
    Single-Ended bus that create timing windows in the scsi_c700
    interface driver. ... The fix was for the driver to check for
    both these conditions and as a result, it addresses all known
    SCSI request timeout problems encountered to date.
Unfortunately, this patch is not for 9.01. PHKL_6049 claims to be the
equivalent of PHKL_6050 for 9.01 ("Equivalent Patches: PHKL_6050:
s700: 9.03 9.05 9.07"), but its description doesn't mention SCSI driver
fixes. Indeed, I tried installing it, and it hasn't fixed the problem.
Does anyone know if my problem is in fact the one that PHKL_4099
claims to fix?
Here's pertinent system info:
# uname -a
HP-UX envy A.09.01 A 9000/735 2010594854 two-user license
# cd /dev/rdsk ; for drive in *; do echo ; /etc/diskinfo $drive ; done
SCSI describe of c201d0s0:
    vendor: TOSHIBA
    product id: CD-ROM XM-3301TA
    type: CD-ROM
    size: 208304 Kbytes
    bytes per sector: 2048

diskinfo: can't open c201d1s0: Invalid argument

diskinfo: can't open c201d2s0: No such device or address

SCSI describe of c201d3s0:
    vendor: HP
    product id: 97560
    type: direct access
    size: 1323540 Kbytes
    bytes per sector: 512

SCSI describe of c201d4s0:
    vendor: Quantum
    product id: XP34300
    type: direct access
    size: 4199760 Kbytes
    bytes per sector: 512

SCSI describe of c201d5s0:
    vendor: Quantum
    product id: XP34300
    type: direct access
    size: 4199760 Kbytes
    bytes per sector: 512

SCSI describe of c201d6s0:
    vendor: SEAGATE
    product id: ST3600N
    type: direct access
    size: 511742 Kbytes
    bytes per sector: 512
# /etc/ioscan -f
Class       H/W Path   Driver    H/W Status     S/W Status
==========================================================
graphics    1.0.0      graph3    ok(0x577)      ok
scsi        2.0.1      c700      ok(0x7071)     ok
disk        2.0.1.0.0  scsi      ok(0x5800202)  ok
tape_drive  2.0.1.1.0  scsitape  ok(0x1800202)  ok
disk        2.0.1.3.0  scsi      ok(0x201)      ok
disk        2.0.1.4.0  scsi      ok(0x202)      ok
disk        2.0.1.5.0  scsi      ok(0x202)      ok
disk        2.0.1.6.0  scsi      ok(0x202)      ok
lan         2.0.2      lan01     ok(0x7072)     ok
hil         2.0.3      hil       ok(0x7073)     ok
serial      2.0.4      asio0     ok(0x7075)     ok
serial      2.0.5      asio0     ok(0x7075)     ok
parallel    2.0.6      parallel  ok(0x7074)     ok
scsi        2.0.7      c700      ok(0x707c)     ok
audio       2.0.8      audio     ok(0x7f)       ok
lan         4.1.0      lan01     ok(0x7650)     ok
Thanks for any help y'all can provide!
Seth