On Wed, 18 Jun 2014, Summers, Peter wrote:
I'm also late to this party. Here are my results from 64-bit
Windows 7:
Thanks, Peter, this is encouraging with regard to the new Windows
build. Maybe this is worth trying for 32-bit Windows too.
Small comment: that poor result from openblas on the very first test
is probably just a matter of warming up the machine's vector
registers; if you were to run it a second time you'd probably see
openblas dominating from the get-go.
? matrix_perf(0)
dgemm experiment 1, variant 1, speed in Gflops
m n k vanilla openmp openblas
128 128 128 0.51479 2.4324 0.49444
128 128 256 1.3617 2.8694 7.5620
128 128 512 1.3178 3.2178 7.3135
128 128 1024 1.4547 3.2327 6.8889
128 128 2048 1.3938 3.2093 7.5701
result: openblas dominates for mnk >= 4194304
openmp dominates for mnk < 4194304
[...]
Allin