I'm running a dual P2 - 233 MHZ processor Linux server with Slackware 3.4
(Kernel 2.0.33 complied for SMP).
Here are some specs about the server
2 - 233 MHZ Pentium 2 processors
Asus P2L97-DS Motherboard (USB and Power saving features turned off)
Using Scsi adapter on the board controlling 2 Seagate Barracuda ULTRA WIDE
128 MB of DRAM
3com 905 Network Card (0.49 drivers)
Generic Trident svga card
After a period of time the server just crashes, this can be after one day
or after 14 days. When it crashes the screen turns black and the server
shows no sign of live, the Keyboard does not respond neither can you ping
the server. There are no indications on why the server crashes... (no
entries in the logs !!!).
The Power lights on the machine are still on when it crashes and the drives
are still spinning. The only way to get this machine back to live is by
turning it's power off and back on. I called Asus to find out if there may
be a problem with the settings but everything seemed to be fine, so they
told me to replace the board which I did but the machine keeps crashing.
And yes I do have the following statement in the Lilo.conf file mem=127
(total size of memory - 1MB). I even got the newest scsi drivers for the on
board 2940 adaptec. I'm running out of ideas...
Should I try a different kernel, lets say 2.1.90 ?
Is there much improvement in terms of SMP in the 2.1.xx kernels ?
Does anybody think this is caused by the kernel ?
Ps. We are running about 11 "single processor" linux servers with the same
hardware , except the board is different - none of these servers ever