looping processes under solaris 2.5

looping processes under solaris 2.5

Post by Kim Holbur » Tue, 23 Apr 1996 04:00:00



Ever since I upgraded to Solaris 2.5 our system has had processes that
just decide suddenly to go into a loop, consuming most of the CPU
resources.  The machine in question has about 10,000 users, of whom at
any one time there are about 100 logged on.  The looping processes make
the machine very slow and eventualy almost unusable.  I have to
constantly monitor the system and kill them.  We have had no answers
from sun as yet, the problem started at the beginning of March.  The
system is not used for X so almost all the processes are text based.  
The processes all have certain characteristics.  They are invoked from a
C program with a system call.  They are all programs that require
keyboard input and access the network like "telnet".  When I truss them
they are mostly looping while doing a read and getting a "0" (zero)
back.  The processes only seem to go rogue when there are more than 400
processes running on the system in total.  I have seen these run-away
processes on other less highly used systems but much more rarely.

Could you please send any replies to me via email as my newserver is not
functioning correctly at the moment.  I will summarise responses if I
get any and anyone else is interested.
--
+-------------------------+--------------------------------------------+

| Computer Centre ISD     |  PGP Public Key :                          |

| PO Box 1 Belconnen      |  Phone    : (06)201 5512, (06)201 5500     |
| ACT  Australia  2617    |  Fax      : (06)201 5502, (06)201 5501     |
+-------------------------+--------------------------------------------+
|      born again and again and again and again and again buddhist     |
|                      well... in a previous life anyway  :-)          |
+----------------------------------------------------------------------+

 
 
 

looping processes under solaris 2.5

Post by Casper H.S. Dik - Network Security Engine » Wed, 24 Apr 1996 04:00:00



>C program with a system call.  They are all programs that require
>keyboard input and access the network like "telnet".  When I truss them
>they are mostly looping while doing a read and getting a "0" (zero)
>back.  The processes only seem to go rogue when there are more than 400
>processes running on the system in total.  I have seen these run-away
>processes on other less highly used systems but much more rarely.

A process looping while reading "0" bytes is most likely broken.
A 0 byte reads in Unix indicates an EOF condition, a program should handle it
gracefully and not hope that it will just go away.

Casper
--
Expressed in this posting are my opinions.  They are in no way related
to opinions held by my employer, Sun Microsystems.
Statements on Sun products included here are not gospel and may
be fiction rather than truth.

 
 
 

1. CPU loop with bundled CDE on Solaris 2.5

Anyone else seen the stock CDE (the one that is bundled on 2.5 HW 1/96)
with dtfile always going into a CPU loop?  I've seen this behavior
also with the first 2.5 release.

I've symlink'd /bin/true to dtfile for now.

Thanks!

Frank
--

 Software Engineer/UNIX Systems | "I went into her cave and returned without
 Tektronix CNA Division         |  the ears of a rabbit."

2. Script to create HTML doc w/Dynamic IP address in it.

3. FS: Solaris 2.5 Desktop (CDE) and/or Solaris 2.5 Server Packages for x86 desktops

4. Better TGA Driver?

5. corruption of process shared text with Solaris 2.5 on 670MP?

6. Laptop modem and sound card problems

7. Run-away lpNet process on X86 with Solaris 2.5

8. tape drive with Linux

9. A good Process Monitor for Solaris 2.5?

10. Solaris 2.5 and defunct process

11. Strange Solaris (2.5) defunct process.

12. how to install solaris 2.5 x86 to 2.5 gig WD HD

13. Solaris 2.5 & NeWSprint 2.5