MCE hangs on 2.4.19-pre3-rmap12g

MCE hangs on 2.4.19-pre3-rmap12g

Post by Georg Nikody » Sun, 17 Mar 2002 23:50:09



In 2.4.19-pre3, Marcelo added CONFIG_X86_MCE (Machine Check
Exception).  I'm running from Rik's bk available -rmap12g tree.

I've tried enabling this option on both a T20 (IBM) and an i8k (Dell).
(I became interested in the option when one of them started having
flakiness which I briefly thought might be heat related).

Both laptops hang very early in the boot process after emitting the
message:

    Intel machine check architecture supported

Kind of a drag since the keyboard isn't alive yet, so power cycling is
your only option at this point.

I'm not particularly concerned except that the help for this option
ends with the sentences:

    Similarly, if MCE is built in and creates a problem on some new
    non-standard machine, you can boot with "nomce" to disable it.

I think that "creates a problem" understates the observed "prevents
your machine from booting".

    MCE support simply ignores non-MCE processors like the 386 and
    486, so nearly everyone can say Y here.

This reads like an encouragement when there should really be a
warning.

Here's the cpuinfo if each machine.  Perhaps one of the x86
cognoscenti can tell me that I'm a dolt, that the flags: like clearly
indicates that CONFIG_X86_MCE can never work.

Thinkpad T20:

(lou) 1001$ cat /proc/cpuinfo
processor       : 0
vendor_id       : GenuineIntel
cpu family      : 6
model           : 8
model name      : Pentium III (Coppermine)
stepping        : 3
cpu MHz         : 746.766
cache size      : 256 KB
fdiv_bug        : no
hlt_bug         : no
f00f_bug        : no
coma_bug        : no
fpu             : yes
fpu_exception   : yes
cpuid level     : 2
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 sep mtrr pge mca cmov pat pse36 mmx fxsr sse
bogomips        : 1490.94

Dell Inspiron 8000:

(keller) 1001$ cat /proc/cpuinfo
processor       : 0
vendor_id       : GenuineIntel
cpu family      : 6
model           : 8
model name      : Pentium III (Coppermine)
stepping        : 10
cpu MHz         : 996.692
cache size      : 256 KB
fdiv_bug        : no
hlt_bug         : no
f00f_bug        : no
coma_bug        : no
fpu             : yes
fpu_exception   : yes
cpuid level     : 2
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 sep mtrr pge mca cmov pat pse36 mmx fxsr sse
bogomips        : 1985.74
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in

More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

 
 
 

MCE hangs on 2.4.19-pre3-rmap12g

Post by Dave Jone » Mon, 18 Mar 2002 01:10:10


 > Both laptops hang very early in the boot process after emitting the
 > message:
 >
 >     Intel machine check architecture supported

 Read the archives. This bug has been discussed to death already,
 and there have been at least half a dozen postings of patches.
 (hint: drop the bluesmoke.c changes from pre3)

--
| Dave Jones.        http://www.codemonkey.org.uk
| SuSE Labs
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in

More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

 
 
 

1. Promise 20267 hangs with 2.4.19-pre3 and 2.4.19-pre3-ac3

With 2.4.18 everything works fine:

  Mar 13 13:56:13 athlon kernel: PDC20267: IDE controller on PCI bus 00 dev 48
  Mar 13 13:56:13 athlon kernel: PDC20267: chipset revision 2
  Mar 13 13:56:13 athlon kernel: PDC20267: not 100%% native mode: will probe irqs later
  Mar 13 13:56:13 athlon kernel: PDC20267: (U)DMA Burst Bit ENABLED Primary PCI Mode Secondary PCI Mode.
  Mar 13 13:56:13 athlon kernel:     ide2: BM-DMA at 0x1800-0x1807, BIOS settings: hde:pio, hdf:pio
  Mar 13 13:56:13 athlon kernel:     ide3: BM-DMA at 0x1808-0x180f, BIOS settings: hdg:pio, hdh:pio

  Mar 13 13:56:13 athlon kernel: hde: Maxtor 4G120J6, ATA DISK drive
  Mar 13 13:56:13 athlon kernel: hdf: Maxtor 4G160J8, ATA DISK drive
  Mar 13 13:56:13 athlon kernel: hdg: TOSHIBA CD-ROM XM-6602B, ATAPI CD/DVD-ROM drive

  Mar 13 13:56:14 athlon kernel: hde: 240121728 sectors (122942 MB) w/2048KiB Cache, CHS=238216/16/63, UDMA(100)
  Mar 13 13:56:14 athlon kernel: hdf: 268435455 sectors (137439 MB) w/2048KiB Cache, CHS=266305/16/63, UDMA(100)

  Mar 13 13:56:14 athlon kernel: Partition check:
  Mar 13 13:56:14 athlon kernel:  hde: hde1 hde2 hde3 hde4
  Mar 13 13:56:14 athlon kernel:  hdf: hdf1

With 2.4.19-pre3 and pre3-ac3 the disk type and size are reported correctly
but the systems hangs at the partition check (I couldn't find a copy of the log on
the disk so I edited the above to match what I remembered on the screen):

  Mar 13 13:56:13 athlon kernel: hde: Maxtor 4G120J6, ATA DISK drive
  Mar 13 13:56:14 athlon kernel: hde: 240121728 sectors (122942 MB) w/2048KiB Cache, CHS=238216/16/63, UDMA(100)
  Mar 13 13:56:14 athlon kernel: Partition check:
  Mar 13 13:56:14 athlon kernel:  hde:

I tried the experiental ide timeout workaround and that does print a timeout message
after a few seconds but the system still hangs there.

Its a Tyan 2462 athlon SMP with real SMP CPUs. There is another disk and a cdrom on
this controller but due to cable lengths I can't switch things around.

I am willing to try any patch or suggestion...

Andrew

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in

More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

2. read only file system..even for root

3. 2.4.19-pre3 kernel freeze (SMP broken between pre2 and pre3?)

4. SCO buys Unixware

5. 2.4.19-pre3 udma ide hang with heavy disk activity

6. Hi! Where do I set ORGANIZATION?

7. 2.4.19-pre3-ac6 kernel hang and stack dump

8. WTMP (login) accounting file in Unix

9. 2.4.19-rc2 -> 2.4.19-rc3 : no more eth (fwd)

10. 3Ware ok 2.4.19, dies 2.4.19-ac4

11. 2.4.19-pre6aa1 (possible all kernel after 2.4.19-pre2) athlon PCI workaround

12. Kernel panic 2.4.19-pre6 AND 2.4.19-pre5-ac3 - More info - ksymoops

13. kbuild25 version 3.0 for 2.4.19-pre9 and 2.4.19-pre9-ac3