Is there a library that does hardware accelerated vectors operations
on a Sun? My loops in C just don't cut it.
Thanks,
David Jelinek
Thanks,
David Jelinek
The Sun Performance library is nice for folks doing any mathematical
work. On a single cpu, the 1000x1000 DP Linpack yield 460 Mflops. By
then setting PARALLEL=xx, one can get some nice speedups. I just tried
this on a 16 cpu system and obtained 1003 MFlops/s (400 Mhz cpus + 8MB
L2 caches). Of course, 1000x1000 is too small of a problem. On a
4000x4000 problem, one gets 3.1 GFlops, for example.
--
Hi !
----
> > Is there a library that does hardware accelerated vectors operations
> > on a Sun? My loops in C just don't cut it.
> There is/are libraries that "vectorize" some trig functions, and other
> wise provide hand-coded math libraries. The Sun Performance library has
> many routines also. You didn't provide any details, so I can't help you
> too much.
----
Bye,
Roland
--
__ . . __
(o.\ \/ /.o) Roland Mainz C/JAVA programmer
(;O/ \/ \O;) TEL +49 (0) 641/99-13193 FAX +49 (0) 641/99-41359
Here are the details: I need to do basic operations such as
subtracting two vectors and taking magnitudes of vectors. I'm doing
this many many times, so that's why speed is such an issue.
I'm working on a Sun Ultra 30.
Need to know anything else?
David Jelinek
: > Is there a library that does hardware accelerated vectors operations
: > on a Sun? My loops in C just don't cut it.
: There is/are libraries that "vectorize" some trig functions, and other
: wise provide hand-coded math libraries. The Sun Performance library has
: many routines also. You didn't provide any details, so I can't help you
: too much.
: The Sun Performance library is nice for folks doing any mathematical
: work. On a single cpu, the 1000x1000 DP Linpack yield 460 Mflops. By
: then setting PARALLEL=xx, one can get some nice speedups. I just tried
: this on a 16 cpu system and obtained 1003 MFlops/s (400 Mhz cpus + 8MB
: L2 caches). Of course, 1000x1000 is too small of a problem. On a
: 4000x4000 problem, one gets 3.1 GFlops, for example.
: --
--
> Here are the details: I need to do basic operations such as
> subtracting two vectors and taking magnitudes of vectors. I'm doing
> this many many times, so that's why speed is such an issue.
> I'm working on a Sun Ultra 30.
> Need to know anything else?
ie X + a Y. if a = -1, you can do the subtraction...but this seems like
extra overhead.
Be sure to also check the High Performance Computing O'Reilly text by Dowd.
There are things that can be done to make sure the vectors aren't "aligned"
in memory, etc.
How big are your vectors?
--
1. FAST vector multiply routine
I was wondering if there is a fast vector multiply
routine analogous to Goto's fast DAXPY routine. I
did a few tests and found that DAXPY is about
3 times as fast
as an F90 vector multiply or vector add.
i.e.
real(kind=8), dimension(1:N) :: u,v,w
u = v*w
thanks
2. Exchange behind Linux (Port forwarding)
3. Vector instructions and Sun Forte...
5. Sun CC compiler: problem with vector
6. compiling Motif 2.0 with a recent linux distro
7. remove mixture of non-atomic operations with page->flags which requires atomic operations to access
8. Need help reading IO ports.
10. HELP: Need Sun SparC Linux FAST
11. fast/wide/diff sun controller on sparc 20?
12. Fast JAVA for SUN Solaris 2.6
13. HELP: Need Sun SparC Linux FAST