Perfomance: tar vs ftp vs rsync vs cp vs ?

Perfomance: tar vs ftp vs rsync vs cp vs ?

Post by Andrei Gornov » Wed, 21 Mar 2001 05:25:40



Hello!

I'm looking for the fastest way to transfer files between 2 SUN
boxes sitting on a 100MB full-duplex ethernet.  Data: ~100Gb of
binary files ~30Mb each.

Copying and moving disks around is not an option - don't have
physical access to the servers.

Files are not organized by filesystem and/or directories.  I.e. I
have to transfer a list of files spread all over the places.

Things I'm considering:

1.  cd /net/src; tar cf - list_of_files | (cd /net/dst; tar xf -)
2.  for i in list_of_files; do rsync -options /net/src/$i /net/dst/$i; done
3.  for i in list_of_files; do ftp each file; done
4.  for i in list_of_files; do cp each file; done

For all 4 options I can break the list_of_files into sublists and
do them in parallel.

- Approximately 5% of files are the same on src and dst servers;
other files may have the same names, but are different and not
even similar (chunks of a database)

- Files compress with gzip 1:2 - I could reduce the total amount
of data travelling over the link by 2 via "z" option in tar or
rsync, but I'm afraid the overhead of gziping/ungzipping them
will consume more time, than I save on the network transers.
Server details: SRC server - E6500, 22 400MHz cpu, 48Gb RAM;
DST server - E10K, 4 333MHz cpu, 4Gb RAM.

Any thoughts on the above 4 methods and any other ideas on moving
things will be appreciated.

Thanks,

--
Andrei

 
 
 

Perfomance: tar vs ftp vs rsync vs cp vs ?

Post by Victor Orteg » Wed, 21 Mar 2001 06:36:47



> Any thoughts on the above 4 methods and any other ideas on moving
> things will be appreciated.

In my experience, FTP has always been faster than rcp/rsync or NFS
(I'm assuming it's over NFS that you proposed to use tar and cp).  I
would recommend moving everything via FTP and then using rsync to
regain permissions and ownerships--that combination should give you
the best results in the shortest time.  But you may want to test
various options with a small set of files first to verify.

Victor

 
 
 

Perfomance: tar vs ftp vs rsync vs cp vs ?

Post by Alberto da Silv » Wed, 21 Mar 2001 09:25:03




> > Any thoughts on the above 4 methods and any other ideas on moving
> > things will be appreciated.

> In my experience, FTP has always been faster than rcp/rsync or NFS
> (I'm assuming it's over NFS that you proposed to use tar and cp).  I
> would recommend moving everything via FTP and then using rsync to
> regain permissions and ownerships--that combination should give you
> the best results in the shortest time.  But you may want to test
> various options with a small set of files first to verify.

> Victor

In some tests I did a couple of months ago I've also found that FTP was the
fastest.
wget will help if you decide to go with FTP.

rsync / rdist can be very fast if the files are not changing a lot.

Alberto

 
 
 

Perfomance: tar vs ftp vs rsync vs cp vs ?

Post by Dan Bircha » Thu, 22 Mar 2001 06:27:14



> Hello!

> I'm looking for the fastest way to transfer files between 2 SUN
> boxes sitting on a 100MB full-duplex ethernet.  Data: ~100Gb of
> binary files ~30Mb each.

> Files are not organized by filesystem and/or directories.  I.e. I
> have to transfer a list of files spread all over the places.

> Things I'm considering:

> 1.  cd /net/src; tar cf - list_of_files | (cd /net/dst; tar xf -)
> 2.  for i in list_of_files; do rsync -options /net/src/$i /net/dst/$i; done
> 3.  for i in list_of_files; do ftp each file; done
> 4.  for i in list_of_files; do cp each file; done

If there's a chance that some of the files *won't* have changed every
time you're doing the copy, rsync's option to only copy updated files
might come in handy.  3,000 30Mb files is a good bit of data, and if
you only *need* to copy 2,000, doing so is a win.

But if you have to change their names in the process, I'm not sure how
well rsync would handle that.

Quote:> Server details: SRC server - E6500, 22 400MHz cpu, 48Gb RAM;
> DST server - E10K, 4 333MHz cpu, 4Gb RAM.

Nice toys.

-Dan

--
Dan Birchall - Palolo Valley - Honolulu HI - http://dan.scream.org/
My addresses expire... take out the hex stamp if your reply bounces

 
 
 

1. Linux vs OS2 vs NT vs Win95 vs Multics vs PDP11 vs BSD geeks

        Every machine and operating system has got its useful
purpose...

        I see no point in argueing with people which OS is better, and
which is worse, and what will survive and what wont...

        The bottom line is obviously the best OS is the one that make
the end user most productive.    Ive used quite a variety of software
from intel, ibm, MS, sun, GNU, DEC/compaq, etc,   and everything OS
has got its UPz and DOWnz, so depending on what you want to do with it
yer machine, probably determines what OS you run.

        So lets cut to the chase -  OS bashing is a waste of time,
and most of the time I'd say the person putting it down just hasn't
seen that particular OS's potential,  or should I say speciality....

      Hell,  Plan 9 has even got some interesting features.. <snicker>

       And all PC users know,  that no matter what use on a day to day
basis on the PC, that one day you will need to boot good ole ancient
DOS to do something...

2. scp UNBELIEVABLY slow

3. Slackware vs SuSE vs Debian vs Redhat vs ....

4. 4586np Seagate Autoloader

5. DOS vs. Windows vs. Mac vs. Unix vs. NS

6. Compuserve and ISDN-Card

7. KDE vs. Openlook vs. Xfree86 vs. MetroX vs. CDE

8. cant execute most of my d-series bins

9. Redhat vs Debian vs Yggdrasil vs Caldera vs ...

10. tape battle: tar vs afio vs cpio vs..

11. BSD vs S5 vs MACH vs OSF/1 (no religion, please!)

12. GTK vs Qt vs Motif vs tcl/Tk???

13. Short Summary of Linux vs. Win9X vs. Mac vs. ???