nfs client problems with Sun solaris 251 server

nfs client problems with Sun solaris 251 server

Post by jeffd0.. » Wed, 09 Jun 1999 04:00:00



We mount to our linux box from a Sun E4000 NFS File server using the
following options in the linux /etc/fstab.
rw,bg,hard,intr,timeo=14,retrans=6

We have several incidents where the Sun File Server crashes, all
the NFS clients (Sun, AIX, Linux) background. When the server
comes backup the Sun and AIX jobs continue on. However on the linux
side, we get a connection refused and the jobs dies.

we are running linux 2.2.2 kernel and solaris 2.5.1 os may 1999 patch
set.

here are some of the messages

Jun  7 21:01:17 wolf65 kernel: nfs: server hac5ce not responding, still
trying
Jun  7 21:01:36 wolf65 kernel: nfs: task 16303 can't get a request slot
Jun  7 21:02:57 wolf65 kernel: nfs: task 16304 can't get a request slot
Jun  7 21:25:34 wolf65 named[276]: Cleaned cache of 0 RRs
Jun  7 21:25:34 wolf65 named[276]: USAGE 928808734 928333477
CPU=0.03u/0.01s CHILDCPU=0u/0s
Jun  7 21:25:34 wolf65 named[276]: NSTATS 928808734 928333477
Jun  7 21:25:34 wolf65 named[276]: XSTATS 928808734 928333477 RR=0
RNXD=0 RFwdR=0 RDupR=0 RFail=0 RFErr=0 RErr=0 RAXFR=0 RLame=0 ROpts=0
SSysQ=1 SAns=0 SFwdQ=0 SDupQ=89436 SErr=0 RQ=0 RIQ=0 RFwdQ=0 RDupQ=0
RTCP=0 SFwdR=0 SFail=0 SFErr=0 SNaAns=0 SNXD=0
Jun  7 22:24:31 wolf65 kernel: nfs: RPC call returned error 111
Jun  7 22:24:31 wolf65 kernel: RPC: task of released request still
queued!
Jun  7 22:24:31 wolf65 kernel: RPC: (task is on xprt_pending)
Jun  7 22:24:33 wolf65 kernel: nfs: RPC call returned error 111
Jun  7 22:24:33 wolf65 kernel: RPC: task of released request still
queued!
Jun  7 22:24:33 wolf65 kernel: RPC: (task is on xprt_pending)
Jun  7 22:24:33 wolf65 kernel: nfs_revalidate_inode:
denver3d/pottera02_beams2_w65.log getattr failed, ino=150022, error=-111
Jun  7 22:24:42 wolf65 kernel: nfs: server hac5ce not responding, still
trying
Jun  7 22:24:51 wolf65 kernel: nfs: RPC call returned error 111
Jun  7 22:24:51 wolf65 kernel: RPC: task of released request still
queued!
Jun  7 22:24:51 wolf65 kernel: RPC: (task is on xprt_pending)
Jun  7 22:24:51 wolf65 kernel: nfs_revalidate_inode:
denver3d/pottera02_beams2_w65.log getattr failed, ino=150022, error=-111
Jun  7 22:25:01 wolf65 kernel: nfs: server hac5ce not responding, still
trying
Jun  7 22:25:01 wolf65 kernel: nfs: RPC call returned error 111
Jun  7 22:25:01 wolf65 kernel: RPC: task of released request still
queued!
Jun  7 22:25:01 wolf65 kernel: RPC: (task is on xprt_pending)
Jun  7 22:25:01 wolf65 kernel: nfs_revalidate_inode:
denver3d/pottera02_beams2_w65.log getattr failed, ino=150022, error=-111
Jun  7 22:25:02 wolf65 kernel: nfs: RPC call returned error 111
Jun  7 22:25:02 wolf65 kernel: RPC: task of released request still
queued!
Jun  7 22:25:02 wolf65 kernel: RPC: (task is on xprt_pending)
Jun  7 22:25:02 wolf65 kernel: nfs_revalidate_inode:
denver3d/pottera02_beams2_w65.log getattr failed, ino=150022, error=-111
Jun  7 22:25:12 wolf65 kernel: nfs: server hac5ce not responding, still
trying
Jun  7 22:25:34 wolf65 named[276]: Cleaned cache of 0 RRs
Jun  7 22:25:34 wolf65 named[276]: USAGE 928812334 928333477
CPU=0.03u/0.01s CHILDCPU=0u/0s
Jun  7 22:25:34 wolf65 named[276]: NSTATS 928812334 928333477
Jun  7 22:25:34 wolf65 named[276]: XSTATS 928812334 928333477 RR=0
RNXD=0 RFwdR=0 RDupR=0 RFail=0 RFErr=0 RErr=0 RAXFR=0 RLame=0 ROpts=0
SSysQ=1 SAns=0 SFwdQ=0 SDupQ=90140 SErr=0 RQ=0 RIQ=0 RFwdQ=0 RDupQ=0
RTCP=0 SFwdR=0 SFail=0 SFErr=0 SNaAns=0 SNXD=0
Jun  7 22:25:38 wolf65 kernel: nfs: server hac5ce OK
Jun  7 22:25:38 wolf65 kernel: nfs: server hac5ce OK

thanks for any help

Sent via Deja.com http://www.deja.com/
Share what you know. Learn what you don't.

 
 
 

nfs client problems with Sun solaris 251 server

Post by jeffd0.. » Fri, 11 Jun 1999 04:00:00


putting the tcp option for the nfs mounts in /etc/fstab seemed
to have corrected this problem. don't know why, maybe someone can
explain.



> We mount to our linux box from a Sun E4000 NFS File server using the
> following options in the linux /etc/fstab.
> rw,bg,hard,intr,timeo=14,retrans=6

> We have several incidents where the Sun File Server crashes, all
> the NFS clients (Sun, AIX, Linux) background. When the server
> comes backup the Sun and AIX jobs continue on. However on the linux
> side, we get a connection refused and the jobs dies.

> we are running linux 2.2.2 kernel and solaris 2.5.1 os may 1999 patch
> set.

> here are some of the messages

> Jun  7 21:01:17 wolf65 kernel: nfs: server hac5ce not responding,
still
> trying
> Jun  7 21:01:36 wolf65 kernel: nfs: task 16303 can't get a request
slot
> Jun  7 21:02:57 wolf65 kernel: nfs: task 16304 can't get a request
slot
> Jun  7 21:25:34 wolf65 named[276]: Cleaned cache of 0 RRs
> Jun  7 21:25:34 wolf65 named[276]: USAGE 928808734 928333477
> CPU=0.03u/0.01s CHILDCPU=0u/0s
> Jun  7 21:25:34 wolf65 named[276]: NSTATS 928808734 928333477
> Jun  7 21:25:34 wolf65 named[276]: XSTATS 928808734 928333477 RR=0
> RNXD=0 RFwdR=0 RDupR=0 RFail=0 RFErr=0 RErr=0 RAXFR=0 RLame=0 ROpts=0
> SSysQ=1 SAns=0 SFwdQ=0 SDupQ=89436 SErr=0 RQ=0 RIQ=0 RFwdQ=0 RDupQ=0
> RTCP=0 SFwdR=0 SFail=0 SFErr=0 SNaAns=0 SNXD=0
> Jun  7 22:24:31 wolf65 kernel: nfs: RPC call returned error 111
> Jun  7 22:24:31 wolf65 kernel: RPC: task of released request still
> queued!
> Jun  7 22:24:31 wolf65 kernel: RPC: (task is on xprt_pending)
> Jun  7 22:24:33 wolf65 kernel: nfs: RPC call returned error 111
> Jun  7 22:24:33 wolf65 kernel: RPC: task of released request still
> queued!
> Jun  7 22:24:33 wolf65 kernel: RPC: (task is on xprt_pending)
> Jun  7 22:24:33 wolf65 kernel: nfs_revalidate_inode:
> denver3d/pottera02_beams2_w65.log getattr failed, ino=150022,
error=-111
> Jun  7 22:24:42 wolf65 kernel: nfs: server hac5ce not responding,
still
> trying
> Jun  7 22:24:51 wolf65 kernel: nfs: RPC call returned error 111
> Jun  7 22:24:51 wolf65 kernel: RPC: task of released request still
> queued!
> Jun  7 22:24:51 wolf65 kernel: RPC: (task is on xprt_pending)
> Jun  7 22:24:51 wolf65 kernel: nfs_revalidate_inode:
> denver3d/pottera02_beams2_w65.log getattr failed, ino=150022,
error=-111
> Jun  7 22:25:01 wolf65 kernel: nfs: server hac5ce not responding,
still
> trying
> Jun  7 22:25:01 wolf65 kernel: nfs: RPC call returned error 111
> Jun  7 22:25:01 wolf65 kernel: RPC: task of released request still
> queued!
> Jun  7 22:25:01 wolf65 kernel: RPC: (task is on xprt_pending)
> Jun  7 22:25:01 wolf65 kernel: nfs_revalidate_inode:
> denver3d/pottera02_beams2_w65.log getattr failed, ino=150022,
error=-111
> Jun  7 22:25:02 wolf65 kernel: nfs: RPC call returned error 111
> Jun  7 22:25:02 wolf65 kernel: RPC: task of released request still
> queued!
> Jun  7 22:25:02 wolf65 kernel: RPC: (task is on xprt_pending)
> Jun  7 22:25:02 wolf65 kernel: nfs_revalidate_inode:
> denver3d/pottera02_beams2_w65.log getattr failed, ino=150022,
error=-111
> Jun  7 22:25:12 wolf65 kernel: nfs: server hac5ce not responding,
still
> trying
> Jun  7 22:25:34 wolf65 named[276]: Cleaned cache of 0 RRs
> Jun  7 22:25:34 wolf65 named[276]: USAGE 928812334 928333477
> CPU=0.03u/0.01s CHILDCPU=0u/0s
> Jun  7 22:25:34 wolf65 named[276]: NSTATS 928812334 928333477
> Jun  7 22:25:34 wolf65 named[276]: XSTATS 928812334 928333477 RR=0
> RNXD=0 RFwdR=0 RDupR=0 RFail=0 RFErr=0 RErr=0 RAXFR=0 RLame=0 ROpts=0
> SSysQ=1 SAns=0 SFwdQ=0 SDupQ=90140 SErr=0 RQ=0 RIQ=0 RFwdQ=0 RDupQ=0
> RTCP=0 SFwdR=0 SFail=0 SFErr=0 SNaAns=0 SNXD=0
> Jun  7 22:25:38 wolf65 kernel: nfs: server hac5ce OK
> Jun  7 22:25:38 wolf65 kernel: nfs: server hac5ce OK

> thanks for any help

> Sent via Deja.com http://www.deja.com/
> Share what you know. Learn what you don't.

Sent via Deja.com http://www.deja.com/
Share what you know. Learn what you don't.

 
 
 

1. Wierd AIX/NFS client, Sun/NFS server problem

AIX 3.1.6 and 3.1.5 on 320s, SunOS 4.1.1 on SS2s, thinnet on the IBMs,
both on the suns.

Suddenly (why do things always happen suddenly?), NFS client speed
from IBM -> Sun is *amazingly* slow.   As in 8 minutes to copy a
400K file from the remote filesystem to /tmp.  AIX<->AIX speed
is per usual, and Sun<->Sun speed is usual.

The only thing I've changed recently is to move the gateway for our
subnet to a new IP address.  I changed my rc files to reflect the change.
I don't remember the AIX machines slowing down then, either, as I was
using them to build software on the remote file systems immediately after the
change.

I've applied nfsextfix, and that didn't help.  etherfind(1) on the sun
shows that the 320s are asking and getting nfs requests w/ no exraneous
packets inbetween requests.  It's just that there's a 5-10 second lag
between each nfs request (which are 8K in size, like they've always been.)

There are no routers or bridges between the clients and the server.  The
arp table information on the AIX clients is correct.  The routing tables
as reported by netstat -rn seem to be just fine.  The percentage of bad
packets reported by nfsstat, etc, are all well below tolerances (and
are below the rates for the sun clients!).

Any clues/hints?

--

vox: (713) 749-2126  '91 CB750, DoD# 0378

2. KDE 2.01 & SuSE Linux 6.4

3. Solaris 251 and Adaptec SCSI Card Problem

4. YDL Installation: Can't Configure Timezone

5. problem with LINUX (NFS server) and Solaris 2.7 (intel ) NFS client.

6. Shell & modem as a dumb terminal & file transfer? Is it possible?

7. NFS mount from RedHat server to Sun Solaris client, how to get it to perform?

8. Gimp-1.0.4 compiles with ccc

9. NFS Problem: Linux server and SUN/HP clients

10. Solaris 251 & Network Time Protocol

11. solaris 251 ultras & ntp?

12. possible mbuf exhaustion on solaris 251 box?

13. changing resolution in Solaris 251