Hi,
these are my results. Machine: Intel(R) Core(TM) i7-4500U CPU @ 1.80GHz,
16GB RAM, Debian Testing/Unstable/Experimental (mixed).
Gretl configured: by "'--enable-build-doc' '--enable-openmp'
'--with-odbc''" and compiled by GCC 4.9.0.
All the Best,
Marcin
######################################################
dgemm experiment 1, variant 1, speed in Gflops
m n k vanilla openmp netlib
128 128 128 1,2844 8,4280 2,3635
128 128 256 2,3648 9,8147 8,7647
128 128 512 2,5742 10,958 8,6539
128 128 1024 2,7452 4,5530 9,1735
128 128 2048 2,7503 5,5682 9,2473
result: netlib dominates for mnk >= 16777216
openmp dominates for mnk < 16777216
dgemm experiment 1, variant 2, speed in Gflops
m n k vanilla openmp netlib
128 128 128 2,4507 3,6337 7,8852
256 256 128 2,7755 8,6371 8,1046
512 512 128 2,7741 9,9523 8,3603
1024 1024 128 2,8880 9,6166 8,5778
2048 2048 128 2,8775 8,2817 8,7284
result: netlib dominates for mnk >= 536870912
dgemm experiment 1, variant 3, speed in Gflops
m n k vanilla openmp netlib
128 128 128 2,5831 8,4664 7,7957
256 256 256 2,9340 11,584 8,6843
512 512 512 3,0497 11,710 9,7398
1024 1024 1024 2,3554 7,4442 10,085
2048 2048 2048 2,4262 5,1813 10,295
result: netlib dominates for mnk >= 1073741824
openmp dominates for mnk < 1073741824
dgemm experiment 2, variant 1, speed in Gflops
m n k vanilla openmp netlib
8 8 8 0,77332 0,21404 0,61904
16 8 8 0,86759 0,37646 0,98366
32 8 8 1,1219 0,30031 1,3463
64 8 8 1,3700 0,82423 1,6513
128 8 8 1,4883 1,2634 1,7845
256 8 8 1,5561 1,5149 1,9103
512 8 8 1,6070 1,6414 1,6082
1024 8 8 1,6016 1,8070 1,8167
2048 8 8 1,5968 1,7187 1,8652
4096 8 8 1,5807 1,4528 1,7123
result: netlib dominates for mnk >= 65536
dgemm experiment 2, variant 2, speed in Gflops
m n k vanilla openmp netlib
10 2 1000 2,0464 0,68509 2,6785
20 2 1000 2,3376 2,4716 3,0151
40 2 1000 2,0023 3,2774 3,0313
80 2 1000 2,4535 4,5667 2,8541
160 2 1000 2,7358 4,4775 2,8616
320 2 1000 2,8717 6,4803 2,8914
640 2 1000 2,4402 5,3052 1,8185
1280 2 1000 2,3685 4,9653 1,9498
2560 2 1000 2,3710 4,0467 1,6928
5120 2 1000 2,3778 4,3298 1,6417
result: openmp dominates for mnk >= 80000
netlib dominates for mnk < 80000
dgemm experiment 2, variant 3, speed in Gflops
m n k vanilla openmp netlib
10 10 1000 2,1420 2,7267 3,1789
20 10 1000 2,4351 5,0807 3,8299
40 10 1000 2,1367 7,2032 4,0107
80 10 1000 2,5392 9,3613 5,6899
160 10 1000 2,8350 10,593 5,6848
320 10 1000 2,8115 10,835 6,0015
result: openmp dominates for mnk >= 200000
netlib dominates for mnk < 200000
Operating system: Linux (64-bit)
BLAS library: Netlib
Number of processors: 4
OpenMP enabled: yes
Performance summary:
vanilla -
dominates outright in 0 out of 6 tests
openmp -
dominates outright in 0 out of 6 tests
dominates in 2 test(s) for mnk >= (80000, 200000)
dominates in 2 test(s) for mnk < (16777216, 1073741824)
netlib -
dominates outright in 0 out of 6 tests
dominates in 4 test(s) for mnk >= (16777216, 536870912, 1073741824,
65536)
dominates in 2 test(s) for mnk < (80000, 200000)
######################################################
--
Marcin Błażejowski
GG: 203127