> unless the alert is closed? can you explain?
> pardon my ignorance :/
We are experiencing a similar problem. We have an IBM H80 running AIX
4.3.3 ML08, with 2 processors, 2GB RAM, 2 internal disks and an
external disk array, plus Oracle 8.1.6, Oracle iAS9i, MQSeries, and
Actuate.
The box had been working fine until about 15 days ago. Now it crashes,
apparently due to hardware errors (SCSI BUS ERROR scsi0, DISK
OPERATION ERROR hdisk0 hdisk1). We are mirroring all system
filesystems, including swap space. I say "apparently" because IBM has
not found any hardware errors when they come to check it (they have
been here a lot lately!) and they think this is a software problem.
As soon as the OS detects errors, the databases "crash", giving the
following errors:
Errors in file /u01/oracle/admin/db1/bdump/ckpt_34316_db1.trc:
ORA-00206: error in writing (block 3, # blocks 1) of controlfile
ORA-00202: controlfile: '/u01/oradata/db1/redo/log1/control01.ctl'
ORA-27063: skgfospo: number of bytes read/written is incorrect
IBM AIX RISC System/6000 Error: 5: I/O error
Additional information: -1
Additional information: 8192
Mon Jul 30 14:28:06 2001
CKPT: terminating instance due to error 221
Instance terminated by CKPT, pid = 34316
(BTW, the first set of redo logs are on rootvg, the other set on the
external disk array.)
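
For anyone who wants to scan their own alert log for the same pattern,
here is a rough Python sketch that pulls the ORA- codes and the OS error
number out of an error stack like the one above. The sample text is
copied from my excerpt; the function name and the regular expressions
are my own assumptions about the line layout, not anything Oracle
documents.

```python
import re

# Sample lines copied from the alert log excerpt above. In practice the
# text would be read from the alert log or trace file instead.
SAMPLE = """\
ORA-00206: error in writing (block 3, # blocks 1) of controlfile
ORA-00202: controlfile: '/u01/oradata/db1/redo/log1/control01.ctl'
ORA-27063: skgfospo: number of bytes read/written is incorrect
IBM AIX RISC System/6000 Error: 5: I/O error
CKPT: terminating instance due to error 221
"""

# Assumed line formats, inferred only from the excerpt above.
ORA_RE = re.compile(r"^(ORA-\d{5}): (.*)$")
OS_RE = re.compile(r"^IBM AIX RISC System/6000 Error: (\d+): (.*)$")


def parse_error_stack(text):
    """Return (ora_errors, os_errors) found in the given log text.

    ora_errors is a list of (code, message) tuples, os_errors a list of
    (errno, message) tuples, both in the order they appear.
    """
    ora_errors = []
    os_errors = []
    for line in text.splitlines():
        line = line.strip()
        m = ORA_RE.match(line)
        if m:
            ora_errors.append((m.group(1), m.group(2)))
            continue
        m = OS_RE.match(line)
        if m:
            os_errors.append((int(m.group(1)), m.group(2)))
    return ora_errors, os_errors


if __name__ == "__main__":
    ora, os_errs = parse_error_stack(SAMPLE)
    for code, msg in ora:
        print(code, "-", msg)
    for errno_, msg in os_errs:
        print("OS error", errno_, "-", msg)
```

In our case the OS error at the bottom of the stack is 5 (I/O error),
which is why the ORA- errors look like a consequence of the disk
problem rather than its cause.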
From this moment on, no command works, including OS commands. The only
thing left is to reboot the machine by pressing the reset button
(shutdown doesn't work either).
We have been rebooting the machine 3-4 times every day. It has even
crashed at 2 A.M., when no one was in the office.
We have increased the maximum number of asynchronous I/O servers to
100 (using pstat we've seen a maximum of 38 when everything works),
but nothing improved.
At most we've seen a total of 0.6% of swap space used (3GB swap space:
2GB /dev/hd6, 1GB on external disk array).
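
To put that percentage in absolute terms, a quick back-of-the-envelope
calculation in Python (the helper name is just for illustration, and I
am assuming 1GB = 1024MB):

```python
def paging_space_used_mb(total_gb, percent_used):
    """Convert a paging-space usage percentage into megabytes used."""
    total_mb = total_gb * 1024  # assumes 1 GB = 1024 MB
    return total_mb * percent_used / 100.0


if __name__ == "__main__":
    # 3GB of paging space in total, 0.6% of it in use at the peak.
    print(round(paging_space_used_mb(3, 0.6), 1), "MB used")
```

That works out to roughly 18MB out of 3072MB, so the machine does not
appear to be short of paging space.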
Can someone please give further explanations regarding the Oracle bug
with async I/O?
What do you mean by using multiple dbwriters?
Any help is appreciated!