Solaris 9 jumpstart from Linux server - problems

Post by Chris Philli » Fri, 18 Jun 2004 04:10:07



Here's the deal - I'm trying to set up our research lab, which has one
server (Mandrake 10.0, dual 3.2 GHz Xeon, 1 GB RAM, named "wardrobe")
and eight Sun workstations (6 Sun-Blade 100s, 2 Sun-Blade 150s). The
Sun workstations are to run Solaris 9. I'd like to do the install using
JumpStart, and I think I'm close, but the system is stalling during
the boot. I've set up the server to manage only one workstation, called
susan. Here's what my setup looks like:


Packages installed on wardrobe:
rarpd-ss981107-2mdk
bootparamd-0.17-5mdk
tftp-server-0.34-1mdk
nfs-utils-1.0.6-1mdk


/etc/ethers:
0:3:bA:8:7D:98  susan


/etc/bootparams:
susan   root=wardrobe:/var/solaris9_install/Solaris_9/Tools/Boot \
        install=wardrobe:/var/solaris9_install/Solaris_9/ \
        boottype=:in \
        rootopts=:rsize=32768


/etc/exports:
**snip**
/var/solaris9_install/ 10.22.0.0/16(ro)


/etc/hosts:
127.0.0.1               localhost
10.22.10.22             wardrobe
**snip**
10.22.10.33             susan


Listing of the tftp boot directory:
total 472
-rwxr-xr-x  1 root root 152376 Jun 15 16:37 0A160A21*
-rwxr-xr-x  1 root root 152376 Jun 15 16:37 0A160A21.SUNW,SUN-BLADE-100*
-rwxr-xr-x  1 root root 152376 Jun 15 16:37 inetboot.SUNW,SUN-BLADE-100.Solaris_9-1*
-rw-r--r--  1 root root    383 Jun 15 16:37 rm.10.22.10.33
lrwxrwxrwx  1 root root      2 Jun 15 16:37 tftpboot -> .//
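
Two details of the configuration above can be cross-checked
mechanically. This is a minimal sketch in Python, assuming (as the
listing suggests) that the boot file the client fetches over tftp is
named after its IPv4 address written as eight uppercase hex digits, and
that the export line restricts access by network. The function names
are illustrative only and are not part of any JumpStart tooling.

```python
import ipaddress


def tftp_boot_name(ip: str) -> str:
    """Return the IPv4 address as eight uppercase hex digits."""
    return "".join(f"{octet:02X}" for octet in ipaddress.IPv4Address(ip).packed)


def export_covers(network: str, ip: str) -> bool:
    """True if the client address falls inside the exported network."""
    return ipaddress.ip_address(ip) in ipaddress.ip_network(network)


if __name__ == "__main__":
    # Values taken from the hosts and exports entries in this post.
    print(tftp_boot_name("10.22.10.33"))                 # 0A160A21
    print(export_covers("10.22.0.0/16", "10.22.10.33"))  # True
```

For susan this gives 0A160A21, which matches the file names in the
listing, and it confirms that 10.22.10.33 lies inside 10.22.0.0/16.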

When I run the client with "boot net - install -v", it seems to run
fine until it gets to a line describing the CPU:

cpu0: UltraSparc-IIe (portid 0 impl 0x13 ver 0x14 clock 502Mhz)

and then proceeds to sit there forever. During the output, susan does
correctly get the IP address, the hostname, and the domain name. If
someone tells me how to capture the boot output to a log or file, I'll
post it here for you to see.

The install files were originally set up using the setup scripts from
the Solaris 9 CDs on one of the workstations, and then copied over to
the server. The four processes on the server are running, and I have
temporarily taken down the firewall (shorewall).

I have been using documentation from these sites:
http://www.docbert.org/Solaris/Jumpstart/linux.html
http://www.scsy.de/~mas/archiv/000017.html

If it makes a difference, I have the server providing NIS and NFS to
the machines for login and home directories. The /etc/nsswitch.conf
file is set up to serve through NIS, but everything (on the server)
looks to files first.

Thanks in advance for your help - let me know if there is anything
else I can provide to help debug this!

Chris Phillips
Westmont College

 
 
 

Post by Darren Dunha » Fri, 18 Jun 2004 05:24:18



> Here's the deal - I'm trying to set up our research lab, which has one
> server (Mandrake 10.0, dual 3.2 GHz Xeon, 1 GB RAM, named "wardrobe")
> and eight Sun workstations (6 Sun-Blade 100s, 2 Sun-Blade 150s). The
> Sun workstations are to run Solaris 9. I'd like to do the install using
> JumpStart, and I think I'm close, but the system is stalling during
> the boot. I've set up the server to manage only one workstation, called
> susan. Here's what my setup looks like:
> When I run the client with "boot net - install -v", it seems to run
> fine until it gets to a line describing the CPU:
> cpu0: UltraSparc-IIe (portid 0 impl 0x13 ver 0x14 clock 502Mhz)
> and then proceeds to sit there forever. During the output, susan does
> correctly get the ip address, the hostname, and the domain name. If
> someone tells me how to get the bootup output from a log or file, I'll
> post it here for you to see.

Linux and Solaris NFS apparently differ in some respects.  Take a
look at this thread...

http://groups.google.com/groups?th=7df5a88654120b61

msgs 11 and 18-20 may be relevant.

--

Senior Technical Consultant         TAOS            http://www.taos.com/
Got some Dr Pepper?                           San Francisco, CA bay area
         < This line left intentionally blank to confuse you. >

 
 
 

Post by Sami Ketol » Fri, 18 Jun 2004 05:36:29



> fine until it gets to a line describing the CPU:

> cpu0: UltraSparc-IIe (portid 0 impl 0x13 ver 0x14 clock 502Mhz)

> and then proceeds to sit there forever. During the output, susan does
> correctly get the ip address, the hostname, and the domain name. If
> someone tells me how to get the bootup output from a log or file, I'll
> post it here for you to see.

Have you tried snooping the network traffic while this happens to find
out what exactly stalls the install?

Sami

--
.signature: No such file or directory

 
 
 

Post by Dave Uhrin » Fri, 18 Jun 2004 05:57:51



> Linux and Solaris NFS apparently differ in some respects.

That must be the understatement of the week.
 
 
 

Post by Chris Philli » Fri, 18 Jun 2004 10:55:47


> Have you tried snooping the network traffic while this happens to find
> out what exactly stalls the install?

Okay, so tcpdump traffic doesn't seem to tell me anything particularly
useful, but here it is (I wanted the initial messages as well, so I
used the ether host option - the network is extremely busy, so it had
to be filtered):


tcpdump: listening on eth0, link-type EN10MB (Ethernet), capture size 96 bytes
18:31:08.315060 rarp who-is susan tell susan
18:31:12.928579 rarp who-is susan tell susan
18:31:12.952701 IP (tos 0x0, ttl 255, id 1, offset 0, flags [DF], length: 104) susan.biff > 255.255.255.255.sunrpc: UDP, length: 76
18:31:12.954722 arp who-has wardrobe (Broadcast) tell susan
18:31:12.956485 arp who-has wardrobe (Broadcast) tell susan
18:31:26.264281 rarp who-is susan tell susan
18:31:26.372478 arp who-has susan (Broadcast) tell susan
18:31:26.373633 IP (tos 0x0, ttl   1, id 5961, offset 0, flags [DF], length: 124) susan.1023 > 255.255.255.255.sunrpc: UDP, length: 96
18:31:26.533581 arp who-has wardrobe (Broadcast) tell susan
18:31:26.534819 arp who-has wardrobe (Broadcast) tell susan
18:31:28.364162 arp who-has susan (Broadcast) tell susan
18:31:30.394055 arp who-has susan (Broadcast) tell susan
18:31:32.384176 arp who-has susan (Broadcast) tell susan
18:31:34.384204 arp who-has susan (Broadcast) tell susan

Also, here is the tail of /var/log/messages:
Jun 16 18:49:38 wardrobe rpc.mountd: authenticated mount request from susan:514 for /var/solaris9_install/Solaris_9/Tools/Boot (/var/solaris9_install)
Jun 16 18:49:38 wardrobe rpc.mountd: authenticated mount request from susan:514 for /var/solaris9_install/Solaris_9/Tools/Boot (/var/solaris9_install)
Jun 16 18:49:52 wardrobe rpc.mountd: authenticated mount request from susan:1023 for /var/solaris9_install/Solaris_9/Tools/Boot (/var/solaris9_install)

As for the Linux vs. Solaris NFS issue, I tried exporting the file
system read-write instead of read-only, but got the same
result. Similarly, I tried the NFS no_root_squash option, but it failed
again.

Thanks,
Chris

 
 
 

Post by Chris Philli » Fri, 18 Jun 2004 11:07:21


> Linux and Solaris NFS apparently differ in some respects.  Take a
> look at this thread...

> http://groups.google.com/groups?th=7df5a88654120b61

> msgs 11 and 18-20 may be relevant.

Yeah, what could be more true :) - but, after reading, and rereading,
(and trying) the suggestions there, nothing seems to help. Any other
ideas?

Thanks
Chris

 
 
 

Post by Chris Philli » Fri, 18 Jun 2004 11:09:32


Ack! I feel really dumb now - I just noticed what I missed! Why aren't
there any NFS messages? There should be some in there, right?

Thanks,
Chris
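
The absence of NFS packets in a capture can be confirmed by bucketing
the tcpdump text output by protocol. This is a minimal sketch in
Python, assuming each packet sits on a single line and that tcpdump
prints the NFS and portmapper ports either by name (nfs, sunrpc) or,
with -n, as 2049 and 111. The helper names are illustrative, and
mountd traffic on dynamic ports is not recognised by this sketch.

```python
import re
from collections import Counter

# Three lines copied from the capture posted earlier in this thread.
SAMPLE = """\
18:31:08.315060 rarp who-is susan tell susan
18:31:12.952701 IP (tos 0x0, ttl 255, id 1, offset 0, flags [DF], length: 104) susan.biff > 255.255.255.255.sunrpc: UDP, length: 76
18:31:12.954722 arp who-has wardrobe (Broadcast) tell susan
"""

# A port shows up as "host.port >" (source) or "host.port:" (destination).
NFS_PORT = re.compile(r"\.(?:nfs|2049)(?= >|:)")
PORTMAP_PORT = re.compile(r"\.(?:sunrpc|111)(?= >|:)")


def classify(line: str) -> str:
    """Bucket one tcpdump text line by the protocol it appears to show."""
    fields = line.split()
    if len(fields) < 2:
        return "other"
    if fields[1] in ("arp", "rarp"):
        return fields[1]
    if NFS_PORT.search(line):
        return "nfs"
    if PORTMAP_PORT.search(line):
        return "portmap"
    return "other"


def summarize(text: str) -> Counter:
    """Count packets per bucket, skipping blank lines."""
    return Counter(classify(l) for l in text.splitlines() if l.strip())


if __name__ == "__main__":
    counts = summarize(SAMPLE)
    print(dict(counts))
    if counts["nfs"] == 0:
        print("no NFS packets seen in this capture")
```

A zero NFS count only says the capture did not record such packets; it
may also mean the capture filter excluded them.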
 
 
 

Post by Dave Uhrin » Fri, 18 Jun 2004 11:15:00



> ack! I feel really dumb now - just noticed what I missed! Why aren't
> there any NFS messages? There should be some in there right? Thanks,
> Chris

Linux NFS is broken, particularly if you used Reiserfs when you installed
the system.

Why don't you use one of the UltraSPARC machines as the install server?

 
 
 

Post by Darren Dunha » Sat, 19 Jun 2004 00:22:24



> susan:514 for /var/solaris9_install/Solaris_9/Tools/Boot
> (/var/solaris9_install)
> Jun 16 18:49:38 wardrobe rpc.mountd: authenticated mount request from
> susan:514 for /var/solaris9_install/Solaris_9/Tools/Boot
> (/var/solaris9_install)
> Jun 16 18:49:52 wardrobe rpc.mountd: authenticated mount request from
> susan:1023 for /var/solaris9_install/Solaris_9/Tools/Boot
> (/var/solaris9_install)

It would appear the root filesystem mount is failing.  That's not really
what I'd expect if you've actually gotten the boot to start printing
things about the CPU and loading kernel modules.

You may be able to turn on debugging on the server side to see if you're
getting any messages about these mounts.

--

Senior Technical Consultant         TAOS            http://www.taos.com/
Got some Dr Pepper?                           San Francisco, CA bay area
         < This line left intentionally blank to confuse you. >

 
 
 

1. Problem using linux as jumpstart server for solaris 2.6 clients

Hello,

I am trying to use a linux X86 machine as a jumpstart server for solaris
2.6 (Sparc) clients.

I have solved the main problems: ARP, bootparamd and tftpboot are
working fine. But when the SPARC machine tries to get the
'..kernel/unix' file from the Linux server, it reads only a few bytes and
then hangs (RPC Timed out).
This seems to be an NFS problem -- I have already heard about strange
problems related to NFS between Linux and Solaris :-)

The strange thing is that when the Solaris machine is up and running
(i.e. NOT during 'boot net - install'), it can mount and access the
directory where the kernel is located without any problems.

I tried launching rpc.nfsd on the Linux box with some debug switches
(-d ugid -d fhcache -d auth -n -p -F) and saw that during the boot
process the Solaris box accesses the kernel file, gets some bytes from
it, and then repeatedly tries to read an 8 KB block at position 160 of
this kernel/unix file. This block seems to be sent by the Linux box but
does not seem to reach the Solaris box.

I have already played around a bit with several NFS options
(root_squashing, anonuid, anongid) without any improvement.

Is there anybody who has managed to use a Linux box as a jumpstart
server? Can anybody tell me whether there are any fundamental differences
in the way Solaris uses NFS during jumpstart compared to normal usage?

Thanks for any help,

        Gunther
