"Found Dead Dispatcher" message

"Found Dead Dispatcher" message

Post by Jim.Pick.. » Sat, 31 Jan 1998 04:00:00



Dear Oracle Gurus:

We are experiencing a "found dead dispatcher" condition once a week or so.

This is sometimes accompanied by a "found dead multi-threaded server" message
as well.  

The database continues to function "normally" after this event, but causes us DBAs
to have to run around and diagnose the cause of the trace file dump.

We run this application user Oracle7 Server Release 7.1.6.2.0 on a Dec Alpha/Digital
Unix platform.

Have any of you experienced this condition or have any ideas on why this occurs ?

The alert log will show the following:
-------------------------------------------------------
Fri Jan 30 02:52:51 1998
found dead dispatcher 'D000', pid = (19, 1)

Fri Jan 30 06:41:57 1998
found dead multi-threaded server 'S000', pid = (9, 1)
-------------------------------------------------------

Trace files are generated for both events.

The dispatcher trc file shows:

------------------------------------------------------------------------------------------------
Dump file /cincom/oracle/admin/CDS/bdump/d000_19933.trc

Oracle7 Server Release 7.1.6.2.0 - Production Release  
With the distributed, replication and parallel query options
PL/SQL Release 2.1.6.2.0 - Production                      
ORACLE_HOME = /ross/oracle          
ORACLE_SID = CDS  
Oracle process number: 19        Unix process id: 19933
System name:    OSF1
Node name:      cats
Release:        V3.2
Version:        62
Machine:        alpha

Fri Jan 30 02:52:11 1998
----- Call Stack Trace -----
                                                                     argument
                                                                     values in
                                                                     hex (?
calling                        call   entry                          means
location                       type   point                          dubious)
--------------------           ------ --------------------           ---------
Fri Jan 30 02:52:14 1998

- snip -

********************   Cursor Dump   ************************                                    

No curdef array
ssexhd: crashing the process...                                                                  
------------------------------------------------------------------------------------------------

The dead server creates the following .trc file:

------------------------------------------------------------------------------------------------
Dump file /cincom/oracle/admin/CDS/bdump/s000_7286.trc
Oracle7 Server Release 7.1.6.2.0 - Production Release
With the distributed, replication and parallel query options
PL/SQL Release 2.1.6.2.0 - Production
ORACLE_HOME = /ross/oracle
ORACLE_SID = CDS Oracle process number: 9         Unix process id: 7286
System name:    OSF1
Node name:      cats
Release:        V3.2
Version:        62
Machine:        alpha

Fri Jan 30 06:39:00 1998
Fri Jan 30 06:39:00 1998
*** SESSION ID:(59.275)
Exception signal: 11, code = 1
sc_onstack: 0000000000000002, sc_mask 0000000000000000
pc: 0000000120337ffc, ps: 0000000000000008, ownedfp: 0000000000000001

- snip -

r31: 0000000000000000
fpcr: 8d00000000000000, fp_control: 000000000205fbb0
trap_a0: 0000000000000000, trap_a1: 0000000000000001, trap_a2: 0000000000000000
fp_trap_pc: 000000011fff4f50, fp_trigger_sum: 00000000020314f0, fp_trigger_inst 0000000120020c18

----- Call Stack Trace -----
                                                                     argument  
                                                                     values in
                                                                     hex (?    
calling                        call   entry                          means    
location                       type   point                          dubious)  
--------------------           ------ --------------------           ---------
Fri Jan 30 06:39:02 1998
Exception signal: 11, code = 1
sc_onstack: 0000000000000002, sc_mask 0000000000000000
ssexhd: crashing the process...
------------------------------------------------------------------------------------------------

Here's a complete alert log for the total period of time that the database has been "up"
before the dispatcher/server dies.

=========================================================================
Starting up ORACLE RDBMS Version: 7.1.6.2.0.
System parameters with non-default values:
  processes                = 400
  timed_statistics         = TRUE
  shared_pool_size         = 65536000
  nls_date_format          = DD-MON-RR
  control_files            = /cincom/oracle/data/CDS/control01.ctl,
/cincom/oracle/data/CDS/control02.ctl, /cincom/oracle/data/CDS/c
ontrol03.ctl
  db_block_buffers         = 8000
  compatible               = 7.1.0.0
  log_archive_start        = TRUE
  log_archive_dest         = /cincom/data/oracle/archive/CDS/arch.log
  log_buffer               = 2725888
  log_checkpoint_interval  = 25600000
  log_small_entry_max_size = 200
  checkpoint_process       = TRUE
  rollback_segments        = bigrbs01, bigrbs02
  sequence_cache_entries   = 100
  sequence_cache_hash_buckets= 89
  remote_login_passwordfile= NONE
  global_names             = TRUE
  mts_service              = CDS
  mts_servers              = 10
  mts_dispatchers          = tcp,20
  mts_max_dispatchers      = 30
  mts_listener_address     = (ADDRESS=(PROTOCOL=tcp) (HOST=cats) (PORT=listener))
  audit_trail              = NONE
  sort_area_size           = 5999616
  sort_area_retained_size  = 5999616
  db_name                  = CDS
  open_cursors             = 1000
  ifile                    = /cincom/oracle/admin/CDS/pfile/configCDS.ora
  background_dump_dest     = /cincom/oracle/admin/CDS/bdump
  user_dump_dest           = /cincom/oracle/admin/CDS/udump
  max_dump_file_size       = 51200000
  core_dump_dest           = /cincom/oracle/admin/CDS/cdump
PMON started
DBWR started
ARCH started
LGWR started
Fri Jan 30 01:43:45 1998
RECO started
Fri Jan 30 01:43:45 1998
starting up 10 shared server(s) ...
Fri Jan 30 01:44:05 1998
starting up 20 dispatcher(s) for network protocol 'tcp'...                                      

Fri Jan 30 01:44:46 1998
alter database  mount exclusive
Fri Jan 30 01:44:46 1998
Completed: alter database  mount exclusive
Fri Jan 30 01:44:46 1998
alter database ARCHIVELOG
Completed: alter database ARCHIVELOG
Fri Jan 30 01:44:46 1998
alter database open
Fri Jan 30 01:44:58 1998
Thread 1 opened at log sequence 38666
  Current log# 5 seq# 38666 mem# 0: /cincom/oracle/data/CDS/redo05.rdo
Fri Jan 30 01:44:58 1998
SMON: enabling cache recovery
Fri Jan 30 01:45:01 1998
Completed: alter database open

Fri Jan 30 02:52:51 1998
found dead dispatcher 'D000', pid = (19, 1)
Fri Jan 30 06:41:57 1998
found dead multi-threaded server 'S000', pid = (9, 1)
Fri Jan 30 08:16:00 1998

Thread 1 advanced to log sequence 38667
  Current log# 6 seq# 38667 mem# 0: /cincom/oracle/data/CDS/redo6.rdo

========================================================================

======================================
  Jim Pickett
    Manager of Database Administration
    Cubic Corporation

  Work:   (619)505-2868
  Pager:  (619)494-5539
  E-mail: jim_pick...@cubic.com

                    01/30/98  09:18:23
======================================

 
 
 

1. oninit -i: "Cannot find message file"- Error Messages

Hallo!

I have a problem during initialisation of IDS 7.31 on SCO OpenServer 5.0.5
platform.
When I dry oninit -i the message "Cannot find message file" appears.

The installed IDS on SCO around 10 time on other machine and I had no
problems. So I think the enviroment and also the onconfig is correctly set.

Maybe someone had a idea...

Thanks, Gnter


CDSPELL=cdspell

CLIENT_LOCALE=german_germany.8859

COLLCHAR=0

COLUMNS=80

DBDATE=DMY4/

DBLANG=2

DBMONEY=.

DB_LOCALE=german_germany.8859

ENGINE=ONLINE

ENV=/etc/kshrc

ERRNO=13

FCEDIT=/bin/ed

HOME=/usr2/users/informix

HUSHLOGIN=FALSE

HZ=100

IFS='

'

INFORMIXDIR=/usr2/program/inf7

INFORMIXSERVER=viper

LANG=german_germany.8859

LC_COLLATE=german_germany.8859

LINENO=1

LINES=24

LOGNAME=informix

MAIL=/usr/spool/mail/informix

MAILCHECK=600


MSG_MAIL=1

MS_PROFILE=1

OLDPWD=/usr2/users/informix

ooefire

OPTARG

OPTIND=1

PATH=/bin:/usr/bin:.:/usr2/program/inf7/bin

PPID=6038


PS2='> '

PS3='#? '

PS4='+ '

PWD=/usr2/program/inf7

RANDOM=4915

SECONDS=475

SERVER_LOCALE=german_germany.8859

SHELL=/bin/ksh

TERM=r2

TMOUT=0

TZ=CET-1CETDST,M3.5.0/1,M10.5.0/1

USER=informix

2. Using TQuery for insert with bind variables

3. Error Message: "Cannot find Installable ISAM"

4. TPC-D benchmark for data query

5. "Data source not found" message

6. Database Size

7. PLEASE HELP "message file not found"

8. HostAccess to Excel

9. Eliminate "no records found" message

10. unknown error messages:"no catalog found"!!

11. found dead dispatcher

12. max of ("...","...","..")

13. "Unknown error message number "16118".