PROBLEM: Timer interrupt stopped (2.2.20, 2.4.x)

Post by Sidik Isan » Wed, 27 Mar 2002 20:00:09

Hello -

  We've been seeing intermittent (roughly weekly) lock-ups on our PCs,
  which never happened with the 2.2.16 kernel.  Today the console on one
  of them froze shortly after booting to the text login (no X).
  This time, however, the machine still accepts ssh connections.
  Is there anything I should check while it is in this state?
  /proc/interrupts shows one very interesting thing:

  0:      63444    IO-APIC-edge  timer       <-- Not incrementing anymore!
  1:          2    IO-APIC-edge  keyboard
  2:          0          XT-PIC  cascade
  8:          0    IO-APIC-edge  rtc
 12:        245    IO-APIC-edge  PS/2 Mouse
 13:          1          XT-PIC  fpu
 14:        858    IO-APIC-edge  ide0
 15:          7    IO-APIC-edge  ide1
 19:     110413   IO-APIC-level  usb-uhci, eth0
NMI:          0
ERR:          0

  Mouse and keyboard interrupts are also not incrementing, but the disk
  and network interrupts are.  The kernel seems happy (no errors logged).

Linux version 2.2.20-smp #1 SMP Sun Mar 24 18:35:47 HST 2002
Built with:  gcc, binutils
Patches:     Vanilla tree plus DEVFS and RAID 0.90
Machine:     Dell Precision 210 Workstation
Motherboard: Dell, 440BX based (supports dual processors)
Processor:   Single P-III 500 MHz
PCI Cards:   Only an NVidia graphics card (X has not been started yet.)
Memory:      1GB ECC
Disks:       2x13GB Maxtor IDE drives on hda,hdb

  The same thing happened before with a 2.4 kernel a few months ago.
  At that time the machine had 512MB of RAM.  If anything useful can
  be done to track down the problem while the machine is like this,
  please let me know and I'll try before rebooting tomorrow.


- Sidik
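
One check that can be run over ssh while the machine is in this state is to sample /proc/interrupts twice and list the counters that did not move. Below is a minimal sketch in Python, assuming the file layout shown in the post; the function names (parse_interrupts, stalled, watch) are made up for illustration and are not part of any kernel tool.

```python
import re
import time


def parse_interrupts(text):
    """Map each IRQ label (e.g. '0', 'NMI') to its count summed over all CPUs."""
    counts = {}
    for line in text.splitlines():
        m = re.match(r"\s*(\w+):\s+(.*)", line)
        if not m:
            continue  # CPU header line or blank line
        label, rest = m.groups()
        total = 0
        for field in rest.split():
            if not field.isdigit():
                break  # reached the controller / device name columns
            total += int(field)
        counts[label] = total
    return counts


def stalled(before, after):
    """Return labels that had fired at least once but did not change."""
    return sorted(irq for irq, n in before.items()
                  if n > 0 and after.get(irq) == n)


def watch(interval=5.0, path="/proc/interrupts"):
    """Take two samples `interval` seconds apart and print stalled IRQs."""
    with open(path) as f:
        first = parse_interrupts(f.read())
    time.sleep(interval)
    with open(path) as f:
        second = parse_interrupts(f.read())
    for irq in stalled(first, second):
        print("IRQ %s stuck at %d" % (irq, second[irq]))
```

Calling watch() prints every interrupt whose count stayed the same over the interval. An idle keyboard or mouse will show up too, so the interesting line is IRQ 0: on a healthy system the timer count should rise continuously.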

1. Kernel 2.2.20 compile problem

Red Hat 6.2, dual PII Xeon 450 MHz w/2MB cache & 2GB SDRAM

I have a 2.2.20 kernel that boots and runs on this system, but I was
trying to build a custom kernel exactly as I have done several times
before with older 2.2.xx kernels.

The build completes without any problems, but when I reboot the system I get
the following error and then the boot freezes.
Any help or ideas as to why this would be happening?

Unable to handle kernel NULL pointer dereference at virtual address
current->tss.cr3 = 00101000, %cr3 = 00101000
*pde = 00000000
Oops: 0002
CPU: 0
EIP: 0010:[<801fc938>]
EFLAGS: 00010246
eax: 00000000  ebx: 00000000  ecx: 8009e2c0  edx: 00000000
esi: 00000580  edi: 00000016  ebp: 0008e000  esp: 80097f68
ds: 0018  es: 0018  ss: 0018
Process swapper (pid: 1, process nr: 2, stackpage=80097000)

Stack: 00000000  0008e000  00000008  8027dde0  00000000 00000580
00000024  00000008 80296753 00000f00 80096000 802c5af0 80291384 00000f00
80096000 802c5af0 0008e000 00000018 00000078 8010a5c6 00000010 00000202
Call Trace: [<8010a5c6>] [<80115821>] [<80115314>] [<8010607e>]
Code: c7 02 00 04 00 00 03 05 94 2a 2b 80 c7 00 00 02 00 00 43 0f

At this point it freezes.
Any ideas what the problem is or how to fix it?
I have compiled older kernels using the exact same procedure and they
worked great.

Thanks in advance for help solving this problem.
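
For reading the "Oops: 0002" line above: on x86 that number is the page-fault error code, printed in hex, and its low three bits say what kind of access faulted. A small sketch of the decoding follows; the function name decode_page_fault is hypothetical and used only for illustration.

```python
def decode_page_fault(code):
    """Decode the low three bits of an x86 page-fault error code."""
    return {
        # bit 0: 0 = page not present, 1 = protection violation
        "cause": "protection violation" if code & 1 else "page not present",
        # bit 1: 0 = read access, 1 = write access
        "access": "write" if code & 2 else "read",
        # bit 2: 0 = fault taken in kernel mode, 1 = in user mode
        "mode": "user" if code & 4 else "kernel",
    }


# The value from the oops above; it is printed in hex.
info = decode_page_fault(int("0002", 16))
print(info)
```

For 0002 this gives a kernel-mode write to a page that is not present, which fits the "NULL pointer dereference" message.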
Confucius say: He who play in root, eventually kill tree.
Registered with the Linux Counter.
ID # 123538
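
The "Code:" bytes in the oops above can also be read by hand. Assuming the dump starts at the faulting EIP (my understanding of 2.2-era oops output, not something the post confirms), the first six bytes are a "mov dword ptr [reg], imm32" instruction. The sketch below decodes only that one instruction form; decode_mov_imm32 and REG32 are illustrative names, and this is nowhere near a general disassembler.

```python
# 32-bit register numbers as encoded in the ModRM r/m field.
# Entries 4 and 5 mean "SIB byte" and "disp32" when mod == 00,
# which this sketch does not handle.
REG32 = ["eax", "ecx", "edx", "ebx", None, None, "esi", "edi"]


def decode_mov_imm32(code_bytes):
    """Decode 'C7 /0 imm32' with a plain [reg] operand.

    Returns (register_name, immediate), or None for any other encoding.
    """
    if len(code_bytes) < 6 or code_bytes[0] != 0xC7:
        return None
    modrm = code_bytes[1]
    mod = modrm >> 6
    reg = (modrm >> 3) & 7
    rm = modrm & 7
    if mod != 0 or reg != 0 or REG32[rm] is None:
        return None
    imm = int.from_bytes(bytes(code_bytes[2:6]), "little")
    return REG32[rm], imm


code = bytes.fromhex("c7 02 00 04 00 00 03 05 94 2a 2b 80 c7 00 00 02 00 00 43 0f")
print(decode_mov_imm32(code))
```

Under that assumption the faulting instruction stores 0x400 through edx, and the register dump shows edx as 00000000, so it is a write through a NULL pointer. Running ksymoops on the full oops text would be the proper way to map the EIP and call trace to function names.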

