2.4.19-pre3 udma ide hang with heavy disk activity

2.4.19-pre3 udma ide hang with heavy disk activity

Post by Geoffrey Hof » Tue, 02 Apr 2002 05:50:06



On kernels 2.4.19-pre3 and later I get a disk hang any time I generate
heavy disk activity.  My test is a quick script that untars 2.4.0 and
then applys patch up to 2.4.19 and repeats.  With kernels 19-pre3
through 19-pre5, I can usually get through about the third patch and then
my hard drive activity led goes solid and I can no longer access any
disk.  This is happening specifically with a VIA controller (vt82c596b
rev 22) with a Maxtor 91536U6.  Attached are part of my dmesg,
proc/ide/via, hdinfo of /dev/hda and part of my .config.  If I use
hdparm to set the drive to mdma2 in 19-pre3 I never see any problems.  It
only happens in any udma mode.  I compiled 19-pre3 with the via
driver disabled, but the problem continues.  I also tried 19-pre5 as I
looked and saw that it contains more ide updates, but I get the same
results.

In 19-pre2 and previous kernels, If the drive is running in UDMA66, I
ocassionally get checksum errors and retries so I usually on boot put
the drive in UDMA33 mode.  I have never seen any error of any kind in
this mode.  I have tried modes udma0, 2, and 4 on 2.4.19-pre3 and they
all eventually hang.  When this hang occurs, I get no kernel messages at
all.  I set dmesg to level 8 so I should see any message on the console.
If it weren't for ext3 and reiserfs, I would be very tired of fscks by
now.  If there is anything that I can try to help narrow this problem
down, please let me know.

  dmesg
< 1K Download

  via
1K Download

  hdinfo
1K Download

  dot-config
2K Download
 
 
 

2.4.19-pre3 udma ide hang with heavy disk activity

Post by Geoffrey Hof » Tue, 02 Apr 2002 07:50:07


I just took a look at these two patches and they are already
incorporated in linux-2.4.19-pre5 which gives me the same results as
2.4.19-pre3.  Do you have any other suggestions?

Thanks to everyone who works on the IDE subsystem.  I'm sure someone
will point me to a solution in no time flat.


> Please apply these which have been submitted to Alan Cox but originate
> from Vojtech.

> Cheers,

> Andre Hedrick
> LAD Storage Consulting Group


> > On kernels 2.4.19-pre3 and later I get a disk hang any time I generate
> > heavy disk activity.  My test is a quick script that untars 2.4.0 and
> > then applys patch up to 2.4.19 and repeats.  With kernels 19-pre3
> > through 19-pre5, I can usually get through about the third patch and then
> > my hard drive activity led goes solid and I can no longer access any
> > disk.  This is happening specifically with a VIA controller (vt82c596b
> > rev 22) with a Maxtor 91536U6.  Attached are part of my dmesg,
> > proc/ide/via, hdinfo of /dev/hda and part of my .config.  If I use
> > hdparm to set the drive to mdma2 in 19-pre3 I never see any problems.  It
> > only happens in any udma mode.  I compiled 19-pre3 with the via
> > driver disabled, but the problem continues.  I also tried 19-pre5 as I
> > looked and saw that it contains more ide updates, but I get the same
> > results.

> > In 19-pre2 and previous kernels, If the drive is running in UDMA66, I
> > ocassionally get checksum errors and retries so I usually on boot put
> > the drive in UDMA33 mode.  I have never seen any error of any kind in
> > this mode.  I have tried modes udma0, 2, and 4 on 2.4.19-pre3 and they
> > all eventually hang.  When this hang occurs, I get no kernel messages at
> > all.  I set dmesg to level 8 so I should see any message on the console.
> > If it weren't for ext3 and reiserfs, I would be very tired of fscks by
> > now.  If there is anything that I can try to help narrow this problem
> > down, please let me know.

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in

More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

 
 
 

1. Promise 20267 hangs with 2.4.19-pre3 and 2.4.19-pre3-ac3

With 2.4.18 everything works fine:

  Mar 13 13:56:13 athlon kernel: PDC20267: IDE controller on PCI bus 00 dev 48
  Mar 13 13:56:13 athlon kernel: PDC20267: chipset revision 2
  Mar 13 13:56:13 athlon kernel: PDC20267: not 100%% native mode: will probe irqs later
  Mar 13 13:56:13 athlon kernel: PDC20267: (U)DMA Burst Bit ENABLED Primary PCI Mode Secondary PCI Mode.
  Mar 13 13:56:13 athlon kernel:     ide2: BM-DMA at 0x1800-0x1807, BIOS settings: hde:pio, hdf:pio
  Mar 13 13:56:13 athlon kernel:     ide3: BM-DMA at 0x1808-0x180f, BIOS settings: hdg:pio, hdh:pio

  Mar 13 13:56:13 athlon kernel: hde: Maxtor 4G120J6, ATA DISK drive
  Mar 13 13:56:13 athlon kernel: hdf: Maxtor 4G160J8, ATA DISK drive
  Mar 13 13:56:13 athlon kernel: hdg: TOSHIBA CD-ROM XM-6602B, ATAPI CD/DVD-ROM drive

  Mar 13 13:56:14 athlon kernel: hde: 240121728 sectors (122942 MB) w/2048KiB Cache, CHS=238216/16/63, UDMA(100)
  Mar 13 13:56:14 athlon kernel: hdf: 268435455 sectors (137439 MB) w/2048KiB Cache, CHS=266305/16/63, UDMA(100)

  Mar 13 13:56:14 athlon kernel: Partition check:
  Mar 13 13:56:14 athlon kernel:  hde: hde1 hde2 hde3 hde4
  Mar 13 13:56:14 athlon kernel:  hdf: hdf1

With 2.4.19-pre3 and pre3-ac3 the disk type and size are reported correctly
but the systems hangs at the partition check (I couldn't find a copy of the log on
the disk so I edited the above to match what I remembered on the screen):

  Mar 13 13:56:13 athlon kernel: hde: Maxtor 4G120J6, ATA DISK drive
  Mar 13 13:56:14 athlon kernel: hde: 240121728 sectors (122942 MB) w/2048KiB Cache, CHS=238216/16/63, UDMA(100)
  Mar 13 13:56:14 athlon kernel: Partition check:
  Mar 13 13:56:14 athlon kernel:  hde:

I tried the experiental ide timeout workaround and that does print a timeout message
after a few seconds but the system still hangs there.

Its a Tyan 2462 athlon SMP with real SMP CPUs. There is another disk and a cdrom on
this controller but due to cable lengths I can't switch things around.

I am willing to try any patch or suggestion...

Andrew

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in

More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

2. help needed with compiler errs

3. 2.4.19-pre3 kernel freeze (SMP broken between pre2 and pre3?)

4. Boot problem on Netra-t1

5. 2.4.19-pre3-ac6 kernel hang and stack dump

6. mmap(2) man page

7. MCE hangs on 2.4.19-pre3-rmap12g

8. ?? ftp 'ls'-command with 'pg' ??

9. drivers/ide/qd65xx: no cli/sti (2.4.19-pre3 & 2.5.28)

10. ide-cd oops in 2.4.19-pre3-ac5

11. 2.4.19-pre3 IDE-problem

12. OOPS: ide-cd, 2.4.19-pre3-ac5

13. IDE Panic (2.4.19-pre3-ac3)