Hi,
I release fast optimized BLAS routines. This time, I can only
release level 1 routine(amax, axpy, ...., dot, rot), but
these routines are written in Assembler and much faser than
generic blas routine(as fast as CXML?).
If you want this library, please see.
ftp://www.netstat.ne.jp/pub/Linux/Linux-Alpha-JP/BLAS
p.s.
I also put fast DGETRF(decomposition routine) as a test.
Please try with my fast gemm and ger routine.
Thanks,