![]() |
|
|
#626 |
|
Jun 2003
5,087 Posts |
|
|
|
|
|
|
#627 |
|
"Mr. Meeseeks"
Jan 2012
California, USA
216810 Posts |
|
|
|
|
|
|
#628 |
|
"Mr. Meeseeks"
Jan 2012
California, USA
23×271 Posts |
A old, slow, mobile Core2duo laptop
![]() Code:
Compare your results to other computers at http://www.mersenne.org/report_benchmarks Genuine Intel(R) CPU T2300 @ 1.66GHz CPU speed: 1662.59 MHz, 2 cores CPU features: Prefetch, SSE, SSE2 L1 cache size: 32 KB L2 cache size: 2 MB L1 cache line size: 64 bytes L2 cache line size: 64 bytes TLBS: 128 Prime95 32-bit version 27.9, RdtscTiming=1 Best time for 768K FFT length: 59.493 ms., avg: 61.800 ms. Best time for 896K FFT length: 62.997 ms., avg: 63.887 ms. Best time for 1024K FFT length: 69.254 ms., avg: 70.511 ms. Best time for 1280K FFT length: 88.191 ms., avg: 90.180 ms. Best time for 1536K FFT length: 124.423 ms., avg: 125.562 ms. Best time for 1792K FFT length: 130.424 ms., avg: 131.785 ms. Best time for 2048K FFT length: 170.325 ms., avg: 171.976 ms. Best time for 2560K FFT length: 187.264 ms., avg: 188.822 ms. Best time for 3072K FFT length: 262.473 ms., avg: 264.385 ms. Best time for 3584K FFT length: 275.117 ms., avg: 277.280 ms. Best time for 4096K FFT length: 358.912 ms., avg: 361.077 ms. Best time for 5120K FFT length: 460.779 ms., avg: 463.982 ms. Best time for 6144K FFT length: 474.206 ms., avg: 477.177 ms. Best time for 7168K FFT length: 594.637 ms., avg: 597.844 ms. Best time for 8192K FFT length: 668.254 ms., avg: 675.463 ms. Timing FFTs using 2 threads. Best time for 768K FFT length: 30.318 ms., avg: 31.608 ms. Best time for 896K FFT length: 32.559 ms., avg: 32.915 ms. Best time for 1024K FFT length: 35.453 ms., avg: 35.889 ms. Best time for 1280K FFT length: 44.960 ms., avg: 45.301 ms. Best time for 1536K FFT length: 63.113 ms., avg: 63.704 ms. Best time for 1792K FFT length: 66.956 ms., avg: 67.570 ms. Best time for 2048K FFT length: 86.530 ms., avg: 87.021 ms. Best time for 2560K FFT length: 94.901 ms., avg: 95.900 ms. Best time for 3072K FFT length: 133.024 ms., avg: 134.914 ms. Best time for 3584K FFT length: 140.476 ms., avg: 142.084 ms. Best time for 4096K FFT length: 181.847 ms., avg: 183.254 ms. Best time for 5120K FFT length: 234.322 ms., avg: 236.049 ms. Best time for 6144K FFT length: 242.536 ms., avg: 244.360 ms. Best time for 7168K FFT length: 311.855 ms., avg: 315.346 ms. Best time for 8192K FFT length: 346.526 ms., avg: 350.310 ms. Best time for 61 bit trial factors: 7.834 ms. Best time for 62 bit trial factors: 7.928 ms. Best time for 63 bit trial factors: 11.327 ms. Best time for 64 bit trial factors: 11.287 ms. Best time for 65 bit trial factors: 19.189 ms. Best time for 66 bit trial factors: 19.057 ms. Best time for 67 bit trial factors: 19.113 ms. Best time for 75 bit trial factors: 20.524 ms. Best time for 76 bit trial factors: 20.517 ms. Best time for 77 bit trial factors: 20.498 ms. |
|
|
|
|
|
#629 |
|
"Kieren"
Jul 2011
In My Own Galaxy!
2×3×1,693 Posts |
FX-8350, stock
Code:
Compare your results to other computers at http://www.mersenne.org/report_benchmarks AMD FX(tm)-8350 Eight-Core Processor CPU speed: 4000.00 MHz, 8 cores CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, FMA L1 cache size: 16 KB L2 cache size: 2 MB, L3 cache size: 8 MB L1 cache line size: 64 bytes L2 cache line size: 64 bytes L1 TLBS: 64 L2 TLBS: 1024 Prime95 64-bit version 27.9, RdtscTiming=1 Best time for 768K FFT length: 11.449 ms., avg: 11.733 ms. Best time for 896K FFT length: 13.723 ms., avg: 13.962 ms. Best time for 1024K FFT length: 14.997 ms., avg: 15.458 ms. Best time for 1280K FFT length: 19.966 ms., avg: 20.311 ms. Best time for 1536K FFT length: 24.877 ms., avg: 25.141 ms. Best time for 1792K FFT length: 29.441 ms., avg: 30.319 ms. Best time for 2048K FFT length: 32.728 ms., avg: 33.092 ms. Best time for 2560K FFT length: 40.641 ms., avg: 41.273 ms. Best time for 3072K FFT length: 50.707 ms., avg: 51.032 ms. Best time for 3584K FFT length: 60.318 ms., avg: 60.671 ms. Best time for 4096K FFT length: 67.115 ms., avg: 68.240 ms. Best time for 5120K FFT length: 91.828 ms., avg: 92.323 ms. Best time for 6144K FFT length: 108.159 ms., avg: 108.884 ms. Best time for 7168K FFT length: 131.838 ms., avg: 132.302 ms. Best time for 8192K FFT length: 140.918 ms., avg: 142.181 ms. Timing FFTs using 2 threads. Best time for 768K FFT length: 6.545 ms., avg: 6.795 ms. Best time for 896K FFT length: 7.886 ms., avg: 8.284 ms. Best time for 1024K FFT length: 8.763 ms., avg: 8.939 ms. Best time for 1280K FFT length: 11.198 ms., avg: 11.632 ms. Best time for 1536K FFT length: 13.951 ms., avg: 14.246 ms. Best time for 1792K FFT length: 16.746 ms., avg: 16.983 ms. Best time for 2048K FFT length: 18.563 ms., avg: 18.972 ms. Best time for 2560K FFT length: 22.994 ms., avg: 23.326 ms. Best time for 3072K FFT length: 28.796 ms., avg: 28.986 ms. Best time for 3584K FFT length: 34.248 ms., avg: 35.610 ms. Best time for 4096K FFT length: 38.535 ms., avg: 38.853 ms. Best time for 5120K FFT length: 51.785 ms., avg: 52.134 ms. Best time for 6144K FFT length: 61.322 ms., avg: 61.849 ms. Best time for 7168K FFT length: 75.646 ms., avg: 76.610 ms. Best time for 8192K FFT length: 81.370 ms., avg: 81.845 ms. Timing FFTs using 3 threads. Best time for 768K FFT length: 4.575 ms., avg: 6.202 ms. Best time for 896K FFT length: 5.524 ms., avg: 7.334 ms. Best time for 1024K FFT length: 6.273 ms., avg: 8.104 ms. Best time for 1280K FFT length: 7.981 ms., avg: 9.665 ms. Best time for 1536K FFT length: 9.892 ms., avg: 12.736 ms. Best time for 1792K FFT length: 11.554 ms., avg: 14.304 ms. Best time for 2048K FFT length: 12.919 ms., avg: 18.760 ms. Best time for 2560K FFT length: 15.876 ms., avg: 19.623 ms. Best time for 3072K FFT length: 19.966 ms., avg: 26.125 ms. Best time for 3584K FFT length: 23.578 ms., avg: 29.514 ms. Best time for 4096K FFT length: 26.563 ms., avg: 31.729 ms. Best time for 5120K FFT length: 35.349 ms., avg: 42.160 ms. Best time for 6144K FFT length: 42.092 ms., avg: 49.563 ms. Best time for 7168K FFT length: 52.563 ms., avg: 60.964 ms. Best time for 8192K FFT length: 55.373 ms., avg: 64.615 ms. Timing FFTs using 4 threads. Best time for 768K FFT length: 3.705 ms., avg: 4.746 ms. Best time for 896K FFT length: 4.300 ms., avg: 6.293 ms. Best time for 1024K FFT length: 5.024 ms., avg: 6.508 ms. Best time for 1280K FFT length: 6.097 ms., avg: 8.110 ms. Best time for 1536K FFT length: 7.718 ms., avg: 10.003 ms. Best time for 1792K FFT length: 9.235 ms., avg: 12.514 ms. Best time for 2048K FFT length: 10.236 ms., avg: 12.915 ms. Best time for 2560K FFT length: 15.134 ms., avg: 20.056 ms. Best time for 3072K FFT length: 15.696 ms., avg: 19.853 ms. Best time for 3584K FFT length: 18.634 ms., avg: 23.609 ms. Best time for 4096K FFT length: 20.994 ms., avg: 26.272 ms. Best time for 5120K FFT length: 28.256 ms., avg: 35.633 ms. Best time for 6144K FFT length: 33.219 ms., avg: 41.663 ms. Best time for 7168K FFT length: 41.325 ms., avg: 49.725 ms. Best time for 8192K FFT length: 44.421 ms., avg: 65.067 ms. Timing FFTs using 5 threads. Best time for 768K FFT length: 3.033 ms., avg: 3.882 ms. Best time for 896K FFT length: 3.606 ms., avg: 5.068 ms. Best time for 1024K FFT length: 4.060 ms., avg: 5.775 ms. Best time for 1280K FFT length: 4.955 ms., avg: 6.522 ms. Best time for 1536K FFT length: 6.351 ms., avg: 10.004 ms. Best time for 1792K FFT length: 7.297 ms., avg: 9.379 ms. Best time for 2048K FFT length: 8.427 ms., avg: 10.605 ms. Best time for 2560K FFT length: 10.456 ms., avg: 13.637 ms. Best time for 3072K FFT length: 12.964 ms., avg: 16.167 ms. Best time for 3584K FFT length: 15.269 ms., avg: 20.292 ms. Best time for 4096K FFT length: 17.218 ms., avg: 20.563 ms. Best time for 5120K FFT length: 22.814 ms., avg: 27.928 ms. Best time for 6144K FFT length: 26.994 ms., avg: 33.428 ms. Best time for 7168K FFT length: 32.807 ms., avg: 40.723 ms. Best time for 8192K FFT length: 35.733 ms., avg: 43.856 ms. Timing FFTs using 6 threads. Best time for 768K FFT length: 2.563 ms., avg: 3.510 ms. Best time for 896K FFT length: 3.065 ms., avg: 4.071 ms. Best time for 1024K FFT length: 3.580 ms., avg: 4.773 ms. Best time for 1280K FFT length: 4.392 ms., avg: 5.393 ms. Best time for 1536K FFT length: 5.476 ms., avg: 7.221 ms. Best time for 1792K FFT length: 6.497 ms., avg: 8.434 ms. Best time for 2048K FFT length: 7.411 ms., avg: 10.215 ms. Best time for 2560K FFT length: 9.189 ms., avg: 13.778 ms. Best time for 3072K FFT length: 11.411 ms., avg: 19.292 ms. Best time for 3584K FFT length: 13.261 ms., avg: 20.891 ms. Best time for 4096K FFT length: 15.204 ms., avg: 19.913 ms. [Wed Mar 05 20:59:34 2014] Best time for 5120K FFT length: 20.102 ms., avg: 30.085 ms. Best time for 6144K FFT length: 23.951 ms., avg: 34.278 ms. Best time for 7168K FFT length: 28.980 ms., avg: 35.911 ms. Best time for 8192K FFT length: 35.428 ms., avg: 46.971 ms. Timing FFTs using 7 threads. Best time for 768K FFT length: 2.318 ms., avg: 3.234 ms. Best time for 896K FFT length: 2.744 ms., avg: 3.864 ms. Best time for 1024K FFT length: 3.142 ms., avg: 4.614 ms. Best time for 1280K FFT length: 3.819 ms., avg: 5.155 ms. Best time for 1536K FFT length: 4.808 ms., avg: 6.215 ms. Best time for 1792K FFT length: 5.603 ms., avg: 7.747 ms. Best time for 2048K FFT length: 7.230 ms., avg: 22.162 ms. Best time for 2560K FFT length: 8.983 ms., avg: 12.483 ms. Best time for 3072K FFT length: 11.048 ms., avg: 15.321 ms. Best time for 3584K FFT length: 12.144 ms., avg: 47.668 ms. Best time for 4096K FFT length: 14.530 ms., avg: 19.239 ms. Best time for 5120K FFT length: 19.429 ms., avg: 42.841 ms. Best time for 6144K FFT length: 21.432 ms., avg: 54.377 ms. Best time for 7168K FFT length: 27.486 ms., avg: 58.570 ms. Best time for 8192K FFT length: 30.697 ms., avg: 59.162 ms. Timing FFTs using 8 threads. Best time for 768K FFT length: 2.043 ms., avg: 2.991 ms. Best time for 896K FFT length: 2.514 ms., avg: 3.700 ms. Best time for 1024K FFT length: 3.130 ms., avg: 8.734 ms. Best time for 1280K FFT length: 3.601 ms., avg: 5.900 ms. Best time for 1536K FFT length: 4.844 ms., avg: 6.465 ms. Best time for 1792K FFT length: 5.253 ms., avg: 20.067 ms. Best time for 2048K FFT length: 6.575 ms., avg: 8.697 ms. Best time for 2560K FFT length: 7.341 ms., avg: 14.146 ms. Best time for 3072K FFT length: 10.258 ms., avg: 14.469 ms. Best time for 3584K FFT length: 11.977 ms., avg: 23.494 ms. Best time for 4096K FFT length: 12.371 ms., avg: 17.352 ms. Best time for 5120K FFT length: 17.008 ms., avg: 22.287 ms. Best time for 6144K FFT length: 20.665 ms., avg: 29.451 ms. Best time for 7168K FFT length: 24.108 ms., avg: 32.520 ms. Best time for 8192K FFT length: 28.026 ms., avg: 41.870 ms. Best time for 61 bit trial factors: 3.105 ms. Best time for 62 bit trial factors: 3.190 ms. Best time for 63 bit trial factors: 3.315 ms. Best time for 64 bit trial factors: 3.906 ms. Best time for 65 bit trial factors: 4.867 ms. Best time for 66 bit trial factors: 5.786 ms. Best time for 67 bit trial factors: 5.746 ms. Best time for 75 bit trial factors: 5.621 ms. Best time for 76 bit trial factors: 5.595 ms. Best time for 77 bit trial factors: 5.611 ms. |
|
|
|
|
|
#630 |
|
Mar 2014
Germany
23·3·5 Posts |
i5, reasonable overclock to 4.05GHz:
Code:
Intel(R) Core(TM) i5-3570K CPU @ 3.40GHz CPU speed: 4050.66 MHz, 4 cores CPU features: Prefetch, SSE, SSE2, SSE4, AVX L1 cache size: 32 KB L2 cache size: 256 KB, L3 cache size: 6 MB L1 cache line size: 64 bytes L2 cache line size: 64 bytes TLBS: 64 Prime95 64-bit version 27.9, RdtscTiming=1 Best time for 768K FFT length: 3.915 ms., avg: 3.976 ms. Best time for 896K FFT length: 4.687 ms., avg: 4.719 ms. Best time for 1024K FFT length: 5.262 ms., avg: 5.349 ms. Best time for 1280K FFT length: 6.724 ms., avg: 6.766 ms. Best time for 1536K FFT length: 8.258 ms., avg: 8.339 ms. Best time for 1792K FFT length: 9.846 ms., avg: 9.971 ms. Best time for 2048K FFT length: 11.023 ms., avg: 11.072 ms. Best time for 2560K FFT length: 13.942 ms., avg: 14.045 ms. Best time for 3072K FFT length: 17.383 ms., avg: 17.420 ms. Best time for 3584K FFT length: 21.070 ms., avg: 21.136 ms. Best time for 4096K FFT length: 23.582 ms., avg: 23.702 ms. Best time for 5120K FFT length: 30.812 ms., avg: 30.980 ms. Best time for 6144K FFT length: 36.902 ms., avg: 37.148 ms. Best time for 7168K FFT length: 44.760 ms., avg: 45.009 ms. Best time for 8192K FFT length: 51.697 ms., avg: 51.884 ms. Timing FFTs using 2 threads. Best time for 768K FFT length: 2.069 ms., avg: 2.114 ms. Best time for 896K FFT length: 2.429 ms., avg: 2.489 ms. Best time for 1024K FFT length: 2.738 ms., avg: 2.797 ms. Best time for 1280K FFT length: 3.522 ms., avg: 3.570 ms. Best time for 1536K FFT length: 4.303 ms., avg: 4.348 ms. Best time for 1792K FFT length: 5.119 ms., avg: 5.164 ms. Best time for 2048K FFT length: 5.745 ms., avg: 5.829 ms. Best time for 2560K FFT length: 7.226 ms., avg: 7.294 ms. Best time for 3072K FFT length: 9.040 ms., avg: 9.093 ms. Best time for 3584K FFT length: 10.850 ms., avg: 10.887 ms. Best time for 4096K FFT length: 12.258 ms., avg: 12.317 ms. Best time for 5120K FFT length: 15.881 ms., avg: 16.040 ms. Best time for 6144K FFT length: 19.024 ms., avg: 19.184 ms. Best time for 7168K FFT length: 22.909 ms., avg: 23.031 ms. Best time for 8192K FFT length: 26.835 ms., avg: 26.962 ms. Timing FFTs using 3 threads. Best time for 768K FFT length: 1.452 ms., avg: 1.477 ms. Best time for 896K FFT length: 1.700 ms., avg: 1.744 ms. Best time for 1024K FFT length: 1.943 ms., avg: 1.979 ms. Best time for 1280K FFT length: 2.536 ms., avg: 2.583 ms. Best time for 1536K FFT length: 3.079 ms., avg: 3.114 ms. Best time for 1792K FFT length: 3.695 ms., avg: 3.719 ms. Best time for 2048K FFT length: 4.122 ms., avg: 4.158 ms. Best time for 2560K FFT length: 5.311 ms., avg: 5.411 ms. Best time for 3072K FFT length: 6.525 ms., avg: 6.556 ms. Best time for 3584K FFT length: 7.958 ms., avg: 8.008 ms. Best time for 4096K FFT length: 8.892 ms., avg: 8.966 ms. Best time for 5120K FFT length: 11.540 ms., avg: 11.623 ms. Best time for 6144K FFT length: 13.907 ms., avg: 13.974 ms. Best time for 7168K FFT length: 16.437 ms., avg: 16.479 ms. Best time for 8192K FFT length: 19.120 ms., avg: 19.218 ms. Timing FFTs using 4 threads. Best time for 768K FFT length: 1.149 ms., avg: 1.237 ms. Best time for 896K FFT length: 1.397 ms., avg: 1.449 ms. Best time for 1024K FFT length: 1.624 ms., avg: 1.672 ms. Best time for 1280K FFT length: 2.150 ms., avg: 2.203 ms. Best time for 1536K FFT length: 2.677 ms., avg: 2.715 ms. Best time for 1792K FFT length: 3.198 ms., avg: 3.259 ms. Best time for 2048K FFT length: 3.626 ms., avg: 3.699 ms. Best time for 2560K FFT length: 4.675 ms., avg: 4.770 ms. Best time for 3072K FFT length: 5.661 ms., avg: 5.785 ms. Best time for 3584K FFT length: 6.969 ms., avg: 7.014 ms. Best time for 4096K FFT length: 7.927 ms., avg: 7.970 ms. Best time for 5120K FFT length: 10.137 ms., avg: 10.314 ms. Best time for 6144K FFT length: 12.179 ms., avg: 12.249 ms. Best time for 7168K FFT length: 14.303 ms., avg: 14.420 ms. Best time for 8192K FFT length: 16.310 ms., avg: 16.361 ms. Best time for 61 bit trial factors: 1.737 ms. Best time for 62 bit trial factors: 1.779 ms. Best time for 63 bit trial factors: 2.005 ms. Best time for 64 bit trial factors: 2.025 ms. Best time for 65 bit trial factors: 2.394 ms. Best time for 66 bit trial factors: 2.796 ms. Best time for 67 bit trial factors: 2.761 ms. Best time for 75 bit trial factors: 2.687 ms. Best time for 76 bit trial factors: 2.681 ms. Best time for 77 bit trial factors: 2.685 ms. |
|
|
|
|
|
#631 |
|
"Mr. Meeseeks"
Jan 2012
California, USA
23×271 Posts |
i7 4770 with dual 1600.
Code:
Compare your results to other computers at http://www.mersenne.org/report_benchmarks Intel(R) Core(TM) i7-4770 CPU @ 3.40GHz CPU speed: 3888.62 MHz, 4 hyperthreaded cores CPU features: Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 32 KB L2 cache size: 256 KB, L3 cache size: 8 MB L1 cache line size: 64 bytes L2 cache line size: 64 bytes TLBS: 64 Prime95 64-bit version 28.5, RdtscTiming=1 Best time for 1024K FFT length: 3.700 ms., avg: 4.349 ms. Best time for 1280K FFT length: 4.953 ms., avg: 5.587 ms. Best time for 1536K FFT length: 5.983 ms., avg: 6.250 ms. Best time for 1792K FFT length: 7.203 ms., avg: 7.665 ms. Best time for 2048K FFT length: 8.344 ms., avg: 8.411 ms. Best time for 2560K FFT length: 10.692 ms., avg: 10.745 ms. Best time for 3072K FFT length: 13.228 ms., avg: 15.204 ms. Best time for 3584K FFT length: 15.299 ms., avg: 17.222 ms. Best time for 4096K FFT length: 17.656 ms., avg: 20.199 ms. Best time for 5120K FFT length: 22.206 ms., avg: 22.579 ms. Best time for 6144K FFT length: 26.748 ms., avg: 27.015 ms. Best time for 7168K FFT length: 31.938 ms., avg: 32.762 ms. Best time for 8192K FFT length: 36.972 ms., avg: 37.952 ms. Timing FFTs using 2 threads on 1 physical CPU. Best time for 1024K FFT length: 3.751 ms., avg: 3.792 ms. Best time for 1280K FFT length: 4.944 ms., avg: 5.096 ms. Best time for 1536K FFT length: 6.100 ms., avg: 6.278 ms. Best time for 1792K FFT length: 7.437 ms., avg: 7.734 ms. Best time for 2048K FFT length: 8.446 ms., avg: 8.594 ms. Best time for 2560K FFT length: 10.716 ms., avg: 10.985 ms. Best time for 3072K FFT length: 13.517 ms., avg: 13.820 ms. Best time for 3584K FFT length: 15.456 ms., avg: 15.831 ms. Best time for 4096K FFT length: 18.554 ms., avg: 19.385 ms. Best time for 5120K FFT length: 23.526 ms., avg: 24.448 ms. Best time for 6144K FFT length: 29.115 ms., avg: 30.631 ms. Best time for 7168K FFT length: 34.419 ms., avg: 36.019 ms. Best time for 8192K FFT length: 39.227 ms., avg: 42.706 ms. Timing FFTs using 2 threads on 2 physical CPUs. Best time for 1024K FFT length: 2.027 ms., avg: 2.318 ms. Best time for 1280K FFT length: 2.665 ms., avg: 2.873 ms. Best time for 1536K FFT length: 3.275 ms., avg: 3.769 ms. Best time for 1792K FFT length: 3.943 ms., avg: 4.069 ms. Best time for 2048K FFT length: 4.644 ms., avg: 5.560 ms. Best time for 2560K FFT length: 5.880 ms., avg: 6.841 ms. Best time for 3072K FFT length: 6.974 ms., avg: 7.046 ms. Best time for 3584K FFT length: 8.359 ms., avg: 8.543 ms. Best time for 4096K FFT length: 9.776 ms., avg: 10.464 ms. Best time for 5120K FFT length: 12.107 ms., avg: 12.249 ms. Best time for 6144K FFT length: 14.502 ms., avg: 14.654 ms. Best time for 7168K FFT length: 17.251 ms., avg: 18.875 ms. Best time for 8192K FFT length: 20.534 ms., avg: 22.736 ms. Timing FFTs using 3 threads on 3 physical CPUs. Best time for 1024K FFT length: 1.391 ms., avg: 1.617 ms. Best time for 1280K FFT length: 2.180 ms., avg: 2.355 ms. Best time for 1536K FFT length: 2.735 ms., avg: 2.929 ms. Best time for 1792K FFT length: 3.351 ms., avg: 3.803 ms. Best time for 2048K FFT length: 3.925 ms., avg: 4.378 ms. Best time for 2560K FFT length: 4.627 ms., avg: 5.106 ms. Best time for 3072K FFT length: 5.674 ms., avg: 6.073 ms. Best time for 3584K FFT length: 6.565 ms., avg: 6.684 ms. Best time for 4096K FFT length: 7.614 ms., avg: 8.048 ms. Best time for 5120K FFT length: 9.570 ms., avg: 9.697 ms. Best time for 6144K FFT length: 11.510 ms., avg: 12.524 ms. Best time for 7168K FFT length: 13.851 ms., avg: 14.461 ms. Best time for 8192K FFT length: 16.224 ms., avg: 16.676 ms. Timing FFTs using 4 threads on 4 physical CPUs. Best time for 1024K FFT length: 1.253 ms., avg: 1.544 ms. Best time for 1280K FFT length: 1.656 ms., avg: 1.724 ms. Best time for 1536K FFT length: 2.200 ms., avg: 2.250 ms. Best time for 1792K FFT length: 2.774 ms., avg: 2.865 ms. Best time for 2048K FFT length: 3.341 ms., avg: 3.409 ms. Best time for 2560K FFT length: 4.290 ms., avg: 4.367 ms. Best time for 3072K FFT length: 5.256 ms., avg: 5.295 ms. Best time for 3584K FFT length: 6.165 ms., avg: 6.288 ms. Best time for 4096K FFT length: 7.217 ms., avg: 7.356 ms. Best time for 5120K FFT length: 9.422 ms., avg: 10.111 ms. Best time for 6144K FFT length: 11.306 ms., avg: 12.053 ms. Best time for 7168K FFT length: 13.203 ms., avg: 13.532 ms. Best time for 8192K FFT length: 15.468 ms., avg: 16.178 ms. Timing FFTs using 8 threads on 4 physical CPUs. Best time for 1024K FFT length: 1.253 ms., avg: 1.409 ms. Best time for 1280K FFT length: 1.828 ms., avg: 2.052 ms. Best time for 1536K FFT length: 2.325 ms., avg: 2.491 ms. Best time for 1792K FFT length: 2.958 ms., avg: 3.103 ms. Best time for 2048K FFT length: 3.517 ms., avg: 3.658 ms. Best time for 2560K FFT length: 4.566 ms., avg: 4.761 ms. Best time for 3072K FFT length: 5.511 ms., avg: 5.667 ms. Best time for 3584K FFT length: 6.460 ms., avg: 6.653 ms. Best time for 4096K FFT length: 7.562 ms., avg: 8.044 ms. Best time for 5120K FFT length: 9.492 ms., avg: 9.647 ms. Best time for 6144K FFT length: 11.490 ms., avg: 11.762 ms. Best time for 7168K FFT length: 13.478 ms., avg: 13.875 ms. Best time for 8192K FFT length: 15.687 ms., avg: 15.831 ms. |
|
|
|
|
|
#632 |
|
"Antonio Key"
Sep 2011
UK
32·59 Posts |
i5 3570k @ 4.4GHz with 16GB dual channel 2400 (Kingston HyperX Beast)
Code:
Intel(R) Core(TM) i5-3570K CPU @ 3.40GHz CPU speed: 4400.00 MHz, 4 cores CPU features: Prefetch, SSE, SSE2, SSE4, AVX L1 cache size: 32 KB L2 cache size: 256 KB, L3 cache size: 6 MB L1 cache line size: 64 bytes L2 cache line size: 64 bytes TLBS: 64 Prime95 64-bit version 28.3, RdtscTiming=1 Best time for 768K FFT length: 3.486 ms., avg: 3.549 ms. Best time for 896K FFT length: 4.206 ms., avg: 4.347 ms. Best time for 1024K FFT length: 4.766 ms., avg: 4.864 ms. Best time for 1280K FFT length: 6.060 ms., avg: 6.260 ms. Best time for 1536K FFT length: 7.352 ms., avg: 7.599 ms. Best time for 1792K FFT length: 8.829 ms., avg: 8.940 ms. Best time for 2048K FFT length: 10.065 ms., avg: 10.192 ms. Best time for 2560K FFT length: 12.740 ms., avg: 12.888 ms. Best time for 3072K FFT length: 15.812 ms., avg: 16.003 ms. Best time for 3584K FFT length: 18.842 ms., avg: 19.098 ms. Best time for 4096K FFT length: 21.494 ms., avg: 21.922 ms. Best time for 5120K FFT length: 27.260 ms., avg: 27.492 ms. Best time for 6144K FFT length: 33.790 ms., avg: 34.478 ms. Best time for 7168K FFT length: 39.983 ms., avg: 40.488 ms. Best time for 8192K FFT length: 46.807 ms., avg: 47.279 ms. Timing FFTs using 2 threads. Best time for 768K FFT length: 1.807 ms., avg: 1.852 ms. Best time for 896K FFT length: 2.238 ms., avg: 2.358 ms. Best time for 1024K FFT length: 2.482 ms., avg: 2.545 ms. Best time for 1280K FFT length: 3.168 ms., avg: 3.220 ms. Best time for 1536K FFT length: 3.829 ms., avg: 3.927 ms. Best time for 1792K FFT length: 4.563 ms., avg: 4.639 ms. Best time for 2048K FFT length: 5.212 ms., avg: 5.304 ms. Best time for 2560K FFT length: 6.589 ms., avg: 6.865 ms. Best time for 3072K FFT length: 8.165 ms., avg: 8.266 ms. Best time for 3584K FFT length: 9.712 ms., avg: 9.933 ms. Best time for 4096K FFT length: 11.110 ms., avg: 11.362 ms. Best time for 5120K FFT length: 14.081 ms., avg: 14.210 ms. Best time for 6144K FFT length: 17.394 ms., avg: 17.694 ms. Best time for 7168K FFT length: 20.524 ms., avg: 20.711 ms. Best time for 8192K FFT length: 24.146 ms., avg: 24.403 ms. Timing FFTs using 3 threads. Best time for 768K FFT length: 1.235 ms., avg: 1.316 ms. Best time for 896K FFT length: 1.557 ms., avg: 1.603 ms. Best time for 1024K FFT length: 1.727 ms., avg: 1.775 ms. Best time for 1280K FFT length: 2.219 ms., avg: 2.263 ms. Best time for 1536K FFT length: 2.663 ms., avg: 2.710 ms. Best time for 1792K FFT length: 3.179 ms., avg: 3.231 ms. Best time for 2048K FFT length: 3.648 ms., avg: 3.698 ms. Best time for 2560K FFT length: 4.610 ms., avg: 4.711 ms. Best time for 3072K FFT length: 5.719 ms., avg: 5.824 ms. Best time for 3584K FFT length: 6.776 ms., avg: 6.886 ms. Best time for 4096K FFT length: 7.776 ms., avg: 7.885 ms. Best time for 5120K FFT length: 9.827 ms., avg: 9.952 ms. Best time for 6144K FFT length: 12.108 ms., avg: 12.271 ms. Best time for 7168K FFT length: 14.183 ms., avg: 14.389 ms. Best time for 8192K FFT length: 16.790 ms., avg: 17.062 ms. Timing FFTs using 4 threads. Best time for 768K FFT length: 0.964 ms., avg: 1.013 ms. Best time for 896K FFT length: 1.226 ms., avg: 1.264 ms. Best time for 1024K FFT length: 1.384 ms., avg: 1.422 ms. Best time for 1280K FFT length: 1.806 ms., avg: 1.875 ms. Best time for 1536K FFT length: 2.166 ms., avg: 2.207 ms. Best time for 1792K FFT length: 2.580 ms., avg: 2.732 ms. Best time for 2048K FFT length: 2.958 ms., avg: 3.034 ms. Best time for 2560K FFT length: 3.849 ms., avg: 4.091 ms. Best time for 3072K FFT length: 4.664 ms., avg: 4.756 ms. Best time for 3584K FFT length: 5.507 ms., avg: 5.629 ms. Best time for 4096K FFT length: 6.303 ms., avg: 6.419 ms. Best time for 5120K FFT length: 8.060 ms., avg: 8.205 ms. Best time for 6144K FFT length: 9.786 ms., avg: 9.906 ms. Best time for 7168K FFT length: 11.466 ms., avg: 11.675 ms. Best time for 8192K FFT length: 13.524 ms., avg: 13.953 ms. Best time for 61 bit trial factors: 1.627 ms. Best time for 62 bit trial factors: 1.661 ms. Best time for 63 bit trial factors: 1.876 ms. Best time for 64 bit trial factors: 1.901 ms. Best time for 65 bit trial factors: 2.250 ms. Best time for 66 bit trial factors: 2.616 ms. Best time for 67 bit trial factors: 2.601 ms. Best time for 75 bit trial factors: 2.519 ms. Best time for 76 bit trial factors: 2.515 ms. Best time for 77 bit trial factors: 2.517 ms. |
|
|
|
|
|
#633 |
|
Apr 2007
Spessart/Germany
2×34 Posts |
i5-4670K @ 3.8 GHz, Dual DDR3 1600, 32 GB
Complete Bench Multithreaded/Multiworker for FFT 1K-8192K Is it possible to create a bench for the throughput of the 'very small' FFTs' size 96 byte - 1k ? |
|
|
|
|
|
#634 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
19·397 Posts |
|
|
|
|
|
|
#635 | |
|
"Justin"
Feb 2015
Kansas City, KS
1 Posts |
Thought I'd share some results from tuning OC settings on an i7-5820k with 16GB DDR4-3000 CL15.
Disabled hyperthreading for the benchmark. Quote:
|
|
|
|
|
|
|
#636 |
|
Mar 2003
Melbourne
5·103 Posts |
Xeon-D 1540 8c/16t @2GHz
Great little machine for 45W Code:
Compare your results to other computers at http://www.mersenne.org/report_benchmarks Intel(R) Xeon(R) CPU D-1540 @ 2.00GHz CPU speed: 1999.97 MHz, 8 hyperthreaded cores CPU features: Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 32 KB L2 cache size: 256 KB, L3 cache size: 12 MB L1 cache line size: 64 bytes L2 cache line size: 64 bytes TLBS: 64 Prime95 64-bit version 28.7, RdtscTiming=1 Best time for 1024K FFT length: 5.928 ms., avg: 5.934 ms. Best time for 1280K FFT length: 7.586 ms., avg: 8.859 ms. Best time for 1536K FFT length: 9.252 ms., avg: 9.310 ms. Best time for 1792K FFT length: 11.354 ms., avg: 11.360 ms. Best time for 2048K FFT length: 13.330 ms., avg: 13.346 ms. Best time for 2560K FFT length: 17.025 ms., avg: 18.342 ms. Best time for 3072K FFT length: 20.565 ms., avg: 21.853 ms. Best time for 3584K FFT length: 24.552 ms., avg: 25.838 ms. Best time for 4096K FFT length: 28.031 ms., avg: 29.311 ms. Best time for 5120K FFT length: 35.517 ms., avg: 38.054 ms. Best time for 6144K FFT length: 42.686 ms., avg: 45.225 ms. Best time for 7168K FFT length: 51.095 ms., avg: 54.128 ms. Best time for 8192K FFT length: 58.593 ms., avg: 61.559 ms. Timing FFTs using 2 threads on 1 physical CPU. Best time for 1024K FFT length: 5.849 ms., avg: 5.862 ms. Best time for 1280K FFT length: 7.604 ms., avg: 7.626 ms. Best time for 1536K FFT length: 9.552 ms., avg: 9.580 ms. Best time for 1792K FFT length: 11.886 ms., avg: 13.189 ms. Best time for 2048K FFT length: 12.962 ms., avg: 14.104 ms. Best time for 2560K FFT length: 17.082 ms., avg: 18.360 ms. Best time for 3072K FFT length: 21.203 ms., avg: 22.803 ms. Best time for 3584K FFT length: 25.123 ms., avg: 26.348 ms. Best time for 4096K FFT length: 29.164 ms., avg: 30.379 ms. Best time for 5120K FFT length: 37.024 ms., avg: 39.290 ms. Best time for 6144K FFT length: 44.982 ms., avg: 47.360 ms. Best time for 7168K FFT length: 52.080 ms., avg: 55.984 ms. Best time for 8192K FFT length: 57.181 ms., avg: 61.539 ms. Timing FFTs using 2 threads on 2 physical CPUs. Best time for 1024K FFT length: 3.882 ms., avg: 4.282 ms. Best time for 1280K FFT length: 4.697 ms., avg: 6.302 ms. Best time for 1536K FFT length: 5.014 ms., avg: 6.249 ms. Best time for 1792K FFT length: 6.709 ms., avg: 8.296 ms. Best time for 2048K FFT length: 7.693 ms., avg: 8.402 ms. Best time for 2560K FFT length: 9.731 ms., avg: 10.399 ms. Best time for 3072K FFT length: 11.842 ms., avg: 13.594 ms. Best time for 3584K FFT length: 14.070 ms., avg: 15.934 ms. Best time for 4096K FFT length: 15.952 ms., avg: 17.567 ms. Best time for 5120K FFT length: 20.093 ms., avg: 21.585 ms. Best time for 6144K FFT length: 24.035 ms., avg: 25.519 ms. Best time for 7168K FFT length: 26.227 ms., avg: 29.327 ms. Best time for 8192K FFT length: 30.030 ms., avg: 33.662 ms. Timing FFTs using 3 threads on 3 physical CPUs. Best time for 1024K FFT length: 2.768 ms., avg: 4.311 ms. Best time for 1280K FFT length: 3.254 ms., avg: 3.881 ms. Best time for 1536K FFT length: 4.021 ms., avg: 4.860 ms. Best time for 1792K FFT length: 4.963 ms., avg: 5.399 ms. Best time for 2048K FFT length: 5.376 ms., avg: 5.396 ms. Best time for 2560K FFT length: 6.810 ms., avg: 8.019 ms. Best time for 3072K FFT length: 8.455 ms., avg: 9.204 ms. Best time for 3584K FFT length: 9.842 ms., avg: 10.355 ms. Best time for 4096K FFT length: 11.365 ms., avg: 11.909 ms. Best time for 5120K FFT length: 13.261 ms., avg: 15.482 ms. Best time for 6144K FFT length: 15.834 ms., avg: 16.926 ms. Best time for 7168K FFT length: 18.817 ms., avg: 19.886 ms. Best time for 8192K FFT length: 20.354 ms., avg: 24.017 ms. Timing FFTs using 4 threads on 4 physical CPUs. Best time for 1024K FFT length: 2.125 ms., avg: 3.778 ms. Best time for 1280K FFT length: 3.057 ms., avg: 3.536 ms. Best time for 1536K FFT length: 3.358 ms., avg: 4.033 ms. Best time for 1792K FFT length: 3.850 ms., avg: 4.750 ms. Best time for 2048K FFT length: 4.285 ms., avg: 5.194 ms. Best time for 2560K FFT length: 5.555 ms., avg: 6.554 ms. Best time for 3072K FFT length: 6.623 ms., avg: 7.706 ms. Best time for 3584K FFT length: 7.646 ms., avg: 7.996 ms. Best time for 4096K FFT length: 8.569 ms., avg: 10.784 ms. Best time for 5120K FFT length: 10.921 ms., avg: 12.209 ms. Best time for 6144K FFT length: 13.010 ms., avg: 13.289 ms. Best time for 7168K FFT length: 15.650 ms., avg: 17.291 ms. Best time for 8192K FFT length: 17.739 ms., avg: 19.504 ms. Timing FFTs using 5 threads on 5 physical CPUs. Best time for 1024K FFT length: 2.186 ms., avg: 2.542 ms. Best time for 1280K FFT length: 2.669 ms., avg: 3.112 ms. Best time for 1536K FFT length: 3.170 ms., avg: 3.665 ms. Best time for 1792K FFT length: 3.250 ms., avg: 4.023 ms. Best time for 2048K FFT length: 3.666 ms., avg: 4.793 ms. Best time for 2560K FFT length: 4.739 ms., avg: 5.941 ms. Best time for 3072K FFT length: 5.325 ms., avg: 6.445 ms. Best time for 3584K FFT length: 6.477 ms., avg: 7.344 ms. Best time for 4096K FFT length: 7.230 ms., avg: 9.374 ms. Best time for 5120K FFT length: 9.106 ms., avg: 11.074 ms. Best time for 6144K FFT length: 10.863 ms., avg: 13.026 ms. Best time for 7168K FFT length: 12.882 ms., avg: 13.557 ms. Best time for 8192K FFT length: 14.670 ms., avg: 16.912 ms. Timing FFTs using 6 threads on 6 physical CPUs. Best time for 1024K FFT length: 1.965 ms., avg: 2.457 ms. Best time for 1280K FFT length: 2.104 ms., avg: 2.420 ms. Best time for 1536K FFT length: 2.550 ms., avg: 2.957 ms. Best time for 1792K FFT length: 2.579 ms., avg: 4.876 ms. Best time for 2048K FFT length: 3.264 ms., avg: 4.054 ms. Best time for 2560K FFT length: 4.243 ms., avg: 4.590 ms. Best time for 3072K FFT length: 4.926 ms., avg: 5.814 ms. Best time for 3584K FFT length: 5.723 ms., avg: 6.481 ms. Best time for 4096K FFT length: 6.290 ms., avg: 7.283 ms. Best time for 5120K FFT length: 7.841 ms., avg: 8.952 ms. Best time for 6144K FFT length: 9.432 ms., avg: 10.570 ms. Best time for 7168K FFT length: 11.231 ms., avg: 12.449 ms. Best time for 8192K FFT length: 12.716 ms., avg: 13.554 ms. Timing FFTs using 7 threads on 7 physical CPUs. Best time for 1024K FFT length: 1.682 ms., avg: 1.851 ms. Best time for 1280K FFT length: 2.106 ms., avg: 2.511 ms. Best time for 1536K FFT length: 2.519 ms., avg: 2.915 ms. Best time for 1792K FFT length: 2.827 ms., avg: 3.458 ms. Best time for 2048K FFT length: 3.328 ms., avg: 4.011 ms. Best time for 2560K FFT length: 3.519 ms., avg: 4.295 ms. Best time for 3072K FFT length: 4.351 ms., avg: 5.002 ms. Best time for 3584K FFT length: 5.082 ms., avg: 5.791 ms. Best time for 4096K FFT length: 5.692 ms., avg: 6.762 ms. Best time for 5120K FFT length: 7.424 ms., avg: 8.448 ms. Best time for 6144K FFT length: 8.619 ms., avg: 9.552 ms. Best time for 7168K FFT length: 10.336 ms., avg: 11.331 ms. Best time for 8192K FFT length: 11.465 ms., avg: 12.392 ms. Timing FFTs using 8 threads on 8 physical CPUs. Best time for 1024K FFT length: 1.659 ms., avg: 1.853 ms. Best time for 1280K FFT length: 1.910 ms., avg: 2.297 ms. Best time for 1536K FFT length: 2.269 ms., avg: 2.746 ms. Best time for 1792K FFT length: 2.799 ms., avg: 3.228 ms. Best time for 2048K FFT length: 3.106 ms., avg: 3.689 ms. Best time for 2560K FFT length: 3.451 ms., avg: 4.168 ms. Best time for 3072K FFT length: 4.187 ms., avg: 4.819 ms. Best time for 3584K FFT length: 4.746 ms., avg: 5.844 ms. Best time for 4096K FFT length: 5.399 ms., avg: 6.123 ms. Best time for 5120K FFT length: 6.756 ms., avg: 7.752 ms. Best time for 6144K FFT length: 8.084 ms., avg: 8.823 ms. Best time for 7168K FFT length: 9.668 ms., avg: 10.538 ms. Best time for 8192K FFT length: 10.951 ms., avg: 11.672 ms. |
|
|
|
|
|
#637 |
|
(loop (#_fork))
Feb 2006
Cambridge, England
23·11·73 Posts |
i7/5820K, default clock and 2400MHz memory:
Code:
[Work thread Dec 29 16:19] Iteration: 4044000 / 40955561 [9.87%], ms/iter: 1.957 Code:
[Work thread Dec 29 16:21] Iteration: 32905000 / 40947989 [80.35%], ms/iter: 4.033 |
|
|
|
|
|
#638 |
|
May 2005
23·7·29 Posts |
Attached FullBench results of Xeon E5 v3 @ 2.5GHz 10c/20t (105W CPU), with 4x4GB 2133 DDR4 SR ECC-R (gotta love those abbreviations
Some observations:
|
|
|
|
|
|
#639 | |
|
Serpentine Vermin Jar
Jul 2014
3,313 Posts |
Quote:
Also when benchmarking you may as well disable testing the HT cores. It really will hurt performance. You may have seen some gains because the HT core was actually on a different core that wasn't being used yet as a worker thread. George also explained that depending on the exponent size (and I guess that means the FFT length?) it may split the load among multiple threads in a single worker more optimally. Small FFT sizes scale horribly with a lot of cores, but the larger the FFT the better it scales... the work can be broken into more chunks which makes distributing to the multiple workers a more balanced affair. |
|
|
|
|
|
|
#640 |
|
Aug 2002
2·3·29 Posts |
First run on my Single Xeon E5-2697V3 128GB ECC DDR4-2133
![]() The 28 threads result is very much screwed. Apart from that (the HT results) there are improvements as more cores are added. More to come later. Code:
Intel(R) Xeon(R) CPU E5-2697 v3 @ 2.60GHz CPU speed: 2878.03 MHz, 14 hyperthreaded cores CPU features: Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 32 KB L2 cache size: 256 KB, L3 cache size: 35 MB L1 cache line size: 64 bytes L2 cache line size: 64 bytes TLBS: 64 Prime95 64-bit version 28.7, RdtscTiming=1 Best time for 1024K FFT length: 4.793 ms., avg: 4.811 ms. Best time for 1280K FFT length: 6.172 ms., avg: 6.334 ms. Best time for 1536K FFT length: 7.478 ms., avg: 7.492 ms. Best time for 1792K FFT length: 9.018 ms., avg: 9.051 ms. Best time for 2048K FFT length: 10.307 ms., avg: 10.315 ms. Best time for 2560K FFT length: 12.986 ms., avg: 12.999 ms. Best time for 3072K FFT length: 15.933 ms., avg: 16.499 ms. Best time for 3584K FFT length: 18.924 ms., avg: 18.983 ms. Best time for 4096K FFT length: 21.385 ms., avg: 21.413 ms. Best time for 5120K FFT length: 27.668 ms., avg: 27.746 ms. Best time for 6144K FFT length: 34.277 ms., avg: 34.338 ms. Best time for 7168K FFT length: 41.400 ms., avg: 41.499 ms. Best time for 8192K FFT length: 47.571 ms., avg: 47.702 ms. Timing FFTs using 2 threads on 1 physical CPU. Best time for 1024K FFT length: 4.905 ms., avg: 4.974 ms. Best time for 1280K FFT length: 6.330 ms., avg: 6.391 ms. Best time for 1536K FFT length: 7.685 ms., avg: 7.776 ms. Best time for 1792K FFT length: 9.335 ms., avg: 9.601 ms. Best time for 2048K FFT length: 10.181 ms., avg: 10.200 ms. Best time for 2560K FFT length: 12.705 ms., avg: 12.723 ms. Best time for 3072K FFT length: 16.425 ms., avg: 16.866 ms. Best time for 3584K FFT length: 18.850 ms., avg: 18.961 ms. Best time for 4096K FFT length: 22.294 ms., avg: 22.514 ms. Best time for 5120K FFT length: 29.846 ms., avg: 29.933 ms. Best time for 6144K FFT length: 38.068 ms., avg: 38.292 ms. Best time for 7168K FFT length: 45.381 ms., avg: 45.985 ms. Best time for 8192K FFT length: 51.352 ms., avg: 51.703 ms. Timing FFTs using 2 threads on 2 physical CPUs. Best time for 1024K FFT length: 2.692 ms., avg: 2.969 ms. Best time for 1280K FFT length: 3.612 ms., avg: 3.914 ms. Best time for 1536K FFT length: 4.326 ms., avg: 4.632 ms. Best time for 1792K FFT length: 5.180 ms., avg: 5.525 ms. Best time for 2048K FFT length: 5.733 ms., avg: 6.017 ms. Best time for 2560K FFT length: 7.122 ms., avg: 7.486 ms. Best time for 3072K FFT length: 8.819 ms., avg: 8.944 ms. Best time for 3584K FFT length: 10.321 ms., avg: 10.460 ms. Best time for 4096K FFT length: 11.996 ms., avg: 12.220 ms. Best time for 5120K FFT length: 14.406 ms., avg: 14.915 ms. Best time for 6144K FFT length: 17.746 ms., avg: 18.433 ms. Best time for 7168K FFT length: 21.377 ms., avg: 22.592 ms. Best time for 8192K FFT length: 24.572 ms., avg: 25.116 ms. Timing FFTs using 3 threads on 3 physical CPUs. Best time for 1024K FFT length: 1.870 ms., avg: 2.142 ms. Best time for 1280K FFT length: 2.543 ms., avg: 2.827 ms. Best time for 1536K FFT length: 3.033 ms., avg: 3.405 ms. Best time for 1792K FFT length: 3.614 ms., avg: 3.951 ms. Best time for 2048K FFT length: 3.978 ms., avg: 4.337 ms. Best time for 2560K FFT length: 4.989 ms., avg: 5.296 ms. Best time for 3072K FFT length: 6.084 ms., avg: 6.584 ms. Best time for 3584K FFT length: 7.081 ms., avg: 7.577 ms. Best time for 4096K FFT length: 8.280 ms., avg: 8.682 ms. Best time for 5120K FFT length: 10.449 ms., avg: 10.751 ms. Best time for 6144K FFT length: 12.006 ms., avg: 12.684 ms. Best time for 7168K FFT length: 14.439 ms., avg: 15.151 ms. Best time for 8192K FFT length: 16.587 ms., avg: 17.212 ms. Timing FFTs using 4 threads on 4 physical CPUs. Best time for 1024K FFT length: 1.434 ms., avg: 1.866 ms. Best time for 1280K FFT length: 1.915 ms., avg: 2.116 ms. Best time for 1536K FFT length: 2.494 ms., avg: 2.937 ms. Best time for 1792K FFT length: 2.995 ms., avg: 3.457 ms. Best time for 2048K FFT length: 3.268 ms., avg: 3.776 ms. Best time for 2560K FFT length: 4.234 ms., avg: 4.628 ms. Best time for 3072K FFT length: 4.354 ms., avg: 4.776 ms. Best time for 3584K FFT length: 5.378 ms., avg: 5.798 ms. Best time for 4096K FFT length: 6.349 ms., avg: 6.733 ms. Best time for 5120K FFT length: 7.948 ms., avg: 8.342 ms. Best time for 6144K FFT length: 9.688 ms., avg: 9.909 ms. Best time for 7168K FFT length: 10.968 ms., avg: 11.825 ms. Best time for 8192K FFT length: 12.826 ms., avg: 13.745 ms. Timing FFTs using 5 threads on 5 physical CPUs. Best time for 1024K FFT length: 1.177 ms., avg: 1.525 ms. Best time for 1280K FFT length: 1.529 ms., avg: 2.143 ms. Best time for 1536K FFT length: 1.878 ms., avg: 2.314 ms. Best time for 1792K FFT length: 2.239 ms., avg: 2.841 ms. Best time for 2048K FFT length: 2.514 ms., avg: 2.915 ms. Best time for 2560K FFT length: 3.099 ms., avg: 3.808 ms. Best time for 3072K FFT length: 3.738 ms., avg: 4.138 ms. Best time for 3584K FFT length: 4.346 ms., avg: 4.836 ms. Best time for 4096K FFT length: 5.078 ms., avg: 5.404 ms. Best time for 5120K FFT length: 6.424 ms., avg: 6.765 ms. Best time for 6144K FFT length: 7.334 ms., avg: 7.846 ms. Best time for 7168K FFT length: 9.388 ms., avg: 9.865 ms. Best time for 8192K FFT length: 10.973 ms., avg: 11.361 ms. Timing FFTs using 6 threads on 6 physical CPUs. Best time for 1024K FFT length: 1.182 ms., avg: 1.461 ms. Best time for 1280K FFT length: 1.425 ms., avg: 1.853 ms. Best time for 1536K FFT length: 1.678 ms., avg: 2.094 ms. Best time for 1792K FFT length: 1.857 ms., avg: 2.409 ms. Best time for 2048K FFT length: 2.100 ms., avg: 2.598 ms. Best time for 2560K FFT length: 2.610 ms., avg: 3.135 ms. Best time for 3072K FFT length: 3.168 ms., avg: 3.591 ms. Best time for 3584K FFT length: 3.685 ms., avg: 4.050 ms. Best time for 4096K FFT length: 4.283 ms., avg: 4.721 ms. Best time for 5120K FFT length: 5.438 ms., avg: 5.776 ms. Best time for 6144K FFT length: 6.590 ms., avg: 6.846 ms. Best time for 7168K FFT length: 8.428 ms., avg: 8.624 ms. Best time for 8192K FFT length: 9.133 ms., avg: 9.448 ms. Timing FFTs using 7 threads on 7 physical CPUs. Best time for 1024K FFT length: 0.921 ms., avg: 1.409 ms. Best time for 1280K FFT length: 1.198 ms., avg: 1.736 ms. Best time for 1536K FFT length: 1.394 ms., avg: 1.886 ms. Best time for 1792K FFT length: 1.657 ms., avg: 2.128 ms. Best time for 2048K FFT length: 1.865 ms., avg: 2.330 ms. Best time for 2560K FFT length: 2.270 ms., avg: 2.834 ms. Best time for 3072K FFT length: 2.734 ms., avg: 3.216 ms. Best time for 3584K FFT length: 3.192 ms., avg: 3.710 ms. Best time for 4096K FFT length: 4.242 ms., avg: 4.901 ms. Best time for 5120K FFT length: 4.732 ms., avg: 5.248 ms. Best time for 6144K FFT length: 5.710 ms., avg: 6.123 ms. Best time for 7168K FFT length: 6.856 ms., avg: 7.115 ms. Best time for 8192K FFT length: 7.901 ms., avg: 8.278 ms. Timing FFTs using 8 threads on 8 physical CPUs. Best time for 1024K FFT length: 1.007 ms., avg: 1.203 ms. Best time for 1280K FFT length: 1.138 ms., avg: 1.477 ms. Best time for 1536K FFT length: 1.238 ms., avg: 1.689 ms. Best time for 1792K FFT length: 1.563 ms., avg: 1.920 ms. Best time for 2048K FFT length: 1.680 ms., avg: 2.123 ms. Best time for 2560K FFT length: 2.040 ms., avg: 2.738 ms. Best time for 3072K FFT length: 2.400 ms., avg: 3.054 ms. Best time for 3584K FFT length: 2.835 ms., avg: 3.233 ms. Best time for 4096K FFT length: 3.276 ms., avg: 3.973 ms. Best time for 5120K FFT length: 4.076 ms., avg: 4.716 ms. Best time for 6144K FFT length: 5.037 ms., avg: 5.514 ms. Best time for 7168K FFT length: 6.083 ms., avg: 6.396 ms. Best time for 8192K FFT length: 7.017 ms., avg: 7.649 ms. Timing FFTs using 9 threads on 9 physical CPUs. Best time for 1024K FFT length: 1.300 ms., avg: 1.346 ms. Best time for 1280K FFT length: 0.959 ms., avg: 1.739 ms. Best time for 1536K FFT length: 1.129 ms., avg: 1.580 ms. Best time for 1792K FFT length: 1.305 ms., avg: 1.750 ms. Best time for 2048K FFT length: 1.518 ms., avg: 2.216 ms. Best time for 2560K FFT length: 1.809 ms., avg: 2.373 ms. Best time for 3072K FFT length: 2.143 ms., avg: 2.668 ms. Best time for 3584K FFT length: 2.573 ms., avg: 3.073 ms. Best time for 4096K FFT length: 2.951 ms., avg: 3.561 ms. Best time for 5120K FFT length: 3.738 ms., avg: 4.276 ms. Best time for 6144K FFT length: 4.598 ms., avg: 5.062 ms. Best time for 7168K FFT length: 5.483 ms., avg: 5.940 ms. Best time for 8192K FFT length: 6.292 ms., avg: 6.690 ms. Timing FFTs using 10 threads on 10 physical CPUs. Best time for 1024K FFT length: 0.728 ms., avg: 1.032 ms. Best time for 1280K FFT length: 0.999 ms., avg: 1.347 ms. Best time for 1536K FFT length: 1.012 ms., avg: 1.402 ms. Best time for 1792K FFT length: 1.187 ms., avg: 1.793 ms. Best time for 2048K FFT length: 1.400 ms., avg: 1.889 ms. Best time for 2560K FFT length: 1.702 ms., avg: 2.253 ms. Best time for 3072K FFT length: 1.939 ms., avg: 2.417 ms. Best time for 3584K FFT length: 2.356 ms., avg: 2.915 ms. [Sat Jan 30 00:26:03 2016] Best time for 4096K FFT length: 2.691 ms., avg: 3.513 ms. Best time for 5120K FFT length: 3.377 ms., avg: 3.906 ms. Best time for 6144K FFT length: 4.139 ms., avg: 4.465 ms. Best time for 7168K FFT length: 5.060 ms., avg: 5.456 ms. Best time for 8192K FFT length: 5.911 ms., avg: 6.458 ms. Timing FFTs using 11 threads on 11 physical CPUs. Best time for 1024K FFT length: 0.691 ms., avg: 1.012 ms. Best time for 1280K FFT length: 1.285 ms., avg: 1.368 ms. Best time for 1536K FFT length: 0.932 ms., avg: 1.311 ms. Best time for 1792K FFT length: 1.116 ms., avg: 1.630 ms. Best time for 2048K FFT length: 1.337 ms., avg: 2.201 ms. Best time for 2560K FFT length: 1.590 ms., avg: 2.089 ms. Best time for 3072K FFT length: 1.793 ms., avg: 2.225 ms. Best time for 3584K FFT length: 2.147 ms., avg: 2.659 ms. Best time for 4096K FFT length: 2.487 ms., avg: 3.080 ms. Best time for 5120K FFT length: 3.151 ms., avg: 3.620 ms. Best time for 6144K FFT length: 3.930 ms., avg: 4.468 ms. Best time for 7168K FFT length: 4.688 ms., avg: 4.821 ms. Best time for 8192K FFT length: 5.445 ms., avg: 5.852 ms. Timing FFTs using 12 threads on 12 physical CPUs. Best time for 1024K FFT length: 0.773 ms., avg: 0.828 ms. Best time for 1280K FFT length: 1.369 ms., avg: 1.406 ms. Best time for 1536K FFT length: 0.932 ms., avg: 1.386 ms. Best time for 1792K FFT length: 1.084 ms., avg: 1.703 ms. Best time for 2048K FFT length: 1.270 ms., avg: 1.824 ms. Best time for 2560K FFT length: 1.484 ms., avg: 2.105 ms. Best time for 3072K FFT length: 1.644 ms., avg: 2.379 ms. Best time for 3584K FFT length: 2.053 ms., avg: 2.905 ms. Best time for 4096K FFT length: 2.329 ms., avg: 2.814 ms. Best time for 5120K FFT length: 2.924 ms., avg: 3.592 ms. Best time for 6144K FFT length: 3.762 ms., avg: 3.848 ms. Best time for 7168K FFT length: 4.520 ms., avg: 4.866 ms. Best time for 8192K FFT length: 5.338 ms., avg: 5.746 ms. Timing FFTs using 13 threads on 13 physical CPUs. Best time for 1024K FFT length: 1.023 ms., avg: 1.083 ms. Best time for 1280K FFT length: 0.835 ms., avg: 1.145 ms. Best time for 1536K FFT length: 1.363 ms., avg: 1.469 ms. Best time for 1792K FFT length: 1.172 ms., avg: 1.490 ms. Best time for 2048K FFT length: 1.221 ms., avg: 1.812 ms. Best time for 2560K FFT length: 1.472 ms., avg: 2.020 ms. Best time for 3072K FFT length: 1.562 ms., avg: 2.188 ms. Best time for 3584K FFT length: 1.891 ms., avg: 2.571 ms. Best time for 4096K FFT length: 2.150 ms., avg: 2.738 ms. Best time for 5120K FFT length: 2.797 ms., avg: 3.612 ms. Best time for 6144K FFT length: 3.472 ms., avg: 3.843 ms. Best time for 7168K FFT length: 4.593 ms., avg: 4.946 ms. Best time for 8192K FFT length: 5.153 ms., avg: 5.493 ms. Timing FFTs using 14 threads on 14 physical CPUs. Best time for 1024K FFT length: 0.975 ms., avg: 1.039 ms. Best time for 1280K FFT length: 0.809 ms., avg: 1.115 ms. Best time for 1536K FFT length: 0.814 ms., avg: 1.289 ms. Best time for 1792K FFT length: 0.960 ms., avg: 1.483 ms. Best time for 2048K FFT length: 1.223 ms., avg: 1.695 ms. Best time for 2560K FFT length: 1.635 ms., avg: 1.927 ms. Best time for 3072K FFT length: 1.522 ms., avg: 2.320 ms. Best time for 3584K FFT length: 1.818 ms., avg: 2.653 ms. Best time for 4096K FFT length: 2.059 ms., avg: 2.665 ms. Best time for 5120K FFT length: 2.646 ms., avg: 3.227 ms. Best time for 6144K FFT length: 3.395 ms., avg: 4.085 ms. Best time for 7168K FFT length: 4.240 ms., avg: 4.636 ms. Best time for 8192K FFT length: 4.905 ms., avg: 5.378 ms. Timing FFTs using 28 threads on 14 physical CPUs. Best time for 1024K FFT length: 3.022 ms., avg: 3.683 ms. Best time for 1280K FFT length: 3.287 ms., avg: 4.911 ms. Best time for 1536K FFT length: 3.430 ms., avg: 4.703 ms. Best time for 1792K FFT length: 3.735 ms., avg: 4.901 ms. Best time for 2048K FFT length: 5.461 ms., avg: 7.825 ms. Best time for 2560K FFT length: 5.530 ms., avg: 8.088 ms. Best time for 3072K FFT length: 6.958 ms., avg: 8.445 ms. Best time for 3584K FFT length: 7.098 ms., avg: 10.531 ms. Best time for 4096K FFT length: 8.573 ms., avg: 11.630 ms. Best time for 5120K FFT length: 9.140 ms., avg: 12.934 ms. Best time for 6144K FFT length: 8.914 ms., avg: 12.554 ms. Best time for 7168K FFT length: 12.851 ms., avg: 16.634 ms. Best time for 8192K FFT length: 14.782 ms., avg: 24.383 ms. Timings for 1024K FFT length (1 cpu, 1 worker): 4.26 ms. Throughput: 234.57 iter/sec. Timings for 1024K FFT length (2 cpus, 2 workers): 4.27, 4.27 ms. Throughput: 468.06 iter/sec. Timings for 1024K FFT length (3 cpus, 3 workers): 4.54, 4.51, 4.59 ms. Throughput: 659.74 iter/sec. Timings for 1024K FFT length (4 cpus, 4 workers): 4.80, 4.78, 4.81, 4.83 ms. Throughput: 832.58 iter/sec. Timings for 1024K FFT length (5 cpus, 5 workers): 5.01, 5.05, 5.04, 5.04, 5.05 ms. Throughput: 992.16 iter/sec. Timings for 1024K FFT length (6 cpus, 6 workers): 5.19, 5.20, 5.20, 5.19, 5.17, 5.18 ms. Throughput: 1156.39 iter/sec. Timings for 1024K FFT length (7 cpus, 7 workers): 5.34, 5.37, 5.36, 5.34, 5.34, 5.34, 5.33 ms. Throughput: 1309.17 iter/sec. Timings for 1024K FFT length (8 cpus, 8 workers): 5.59, 5.60, 5.60, 5.60, 5.60, 5.60, 5.60, 5.58 ms. Throughput: 1430.07 iter/sec. Timings for 1024K FFT length (9 cpus, 9 workers): 6.04, 6.04, 6.05, 6.05, 6.05, 6.05, 6.05, 6.05, 6.02 ms. Throughput: 1488.82 iter/sec. Timings for 1024K FFT length (10 cpus, 10 workers): 6.55, 6.54, 6.54, 6.54, 6.55, 6.55, 6.55, 6.54, 6.55, 6.53 ms. Throughput: 1528.25 iter/sec. Timings for 1024K FFT length (11 cpus, 11 workers): 7.11, 7.07, 7.06, 7.05, 7.04, 7.04, 7.04, 7.04, 7.04, 7.04, 7.04 ms. Throughput: 1560.24 iter/sec. Timings for 1024K FFT length (12 cpus, 12 workers): 7.60, 7.61, 7.57, 7.57, 7.55, 7.58, 7.55, 7.57, 7.56, 7.55, 7.59, 7.55 ms. Throughput: 1585.16 iter/sec. Timings for 1024K FFT length (13 cpus, 13 workers): 8.34, 8.29, 8.28, 8.28, 8.22, 8.27, 8.26, 8.24, 8.27, 8.27, 8.30, 8.30, 8.26 ms. Throughput: 1571.02 iter/sec. Timings for 1024K FFT length (14 cpus, 14 workers): 9.12, 9.05, 9.04, 9.04, 9.04, 9.05, 8.96, 8.96, 8.99, 8.96, 9.04, 9.08, 9.12, 8.99 ms. Throughput: 1550.39 iter/sec. |
|
|
|
|
|
#641 | |
|
Serpentine Vermin Jar
Jul 2014
63618 Posts |
Quote:
I forget the options to add to the config files to enable that (and also you may as well disable testing any HT cores... it adds to the noise from the test results and it won't be any faster). These many-cored systems are indeed awesome tools. |
|
|
|
|
|
|
#642 |
|
Aug 2002
2×3×29 Posts |
Is there a way just to test the 1 worker 4096K FFT speed? For 1-14 cores without the need to run the full bench?
The initial results on 12 threads on 12 cpu (leaving me with 2 cpu for other tasks), seems to work well for me. The test result don't deviate much from the actual single work 12t LL test time which suggests the work is well multithreaded. With expected completion time of a current first time LL at about 52 hours. Win10 pro seems to divide the work well to real cpu (from looking at the task manager) without needing to manually fixating threads to cores. The Xeon's turbo is complicated I can only get 2.6GHz which is base speed at 12 threads. I was hoping for more. Last fiddled with by xtreme2k on 2016-01-31 at 23:59 |
|
|
|
|
|
#643 | |
|
Serpentine Vermin Jar
Jul 2014
1100111100012 Posts |
Quote:
MinBenchFFT=4096 MaxBenchFFT=4096 BenchHyperthreads=0 BenchMultithreads=1 That last one tells it to benchmark multiple cores in a single worker. I didn't see a way to force it to only benchmark with a single worker (but with all the possible # of cores per worker). At least with those you'll test just the 4096K FFT, no HT benchmarking, and see how your throughput increases (or possibly not) with multiple core workers. Hope that helps. |
|
|
|
|
|
|
#644 |
|
Aug 2002
2·3·29 Posts |
Thanks I will give that a try tonight.
As I see it the 12 workers 12 threads benchmark time (2.329 ms) is very close to the actual LL 1 worker 12 threads (around 2.3-2.4 ms), I would expect the benchmark to show similar results. ![]() These Xeons are so powerful. |
|
|
|
|
|
#645 |
|
Aug 2002
AE16 Posts |
It's interesting to see 1 worker vs multiple workers that 1 worker is actually faster throughout. Am I reading this right?
eg - 14 cpu 1 worker - 509 it/s 14 cpu 14 workers - 371 it/s Code:
Intel(R) Xeon(R) CPU E5-2697 v3 @ 2.60GHz CPU speed: 2594.02 MHz, 14 hyperthreaded cores CPU features: Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 32 KB L2 cache size: 256 KB, L3 cache size: 35 MB L1 cache line size: 64 bytes L2 cache line size: 64 bytes TLBS: 64 Prime95 64-bit version 28.7, RdtscTiming=1 Best time for 4096K FFT length: 23.861 ms., avg: 23.901 ms. Timing FFTs using 2 threads on 2 physical CPUs. Best time for 4096K FFT length: 12.534 ms., avg: 12.581 ms. Timing FFTs using 3 threads on 3 physical CPUs. Best time for 4096K FFT length: 8.445 ms., avg: 8.487 ms. Timing FFTs using 4 threads on 4 physical CPUs. Best time for 4096K FFT length: 6.512 ms., avg: 6.543 ms. Timing FFTs using 5 threads on 5 physical CPUs. Best time for 4096K FFT length: 4.655 ms., avg: 4.701 ms. Timing FFTs using 6 threads on 6 physical CPUs. Best time for 4096K FFT length: 3.915 ms., avg: 3.950 ms. Timing FFTs using 7 threads on 7 physical CPUs. Best time for 4096K FFT length: 3.340 ms., avg: 3.409 ms. Timing FFTs using 8 threads on 8 physical CPUs. Best time for 4096K FFT length: 2.942 ms., avg: 2.977 ms. Timing FFTs using 9 threads on 9 physical CPUs. Best time for 4096K FFT length: 2.626 ms., avg: 2.652 ms. Timing FFTs using 10 threads on 10 physical CPUs. Best time for 4096K FFT length: 2.444 ms., avg: 2.893 ms. Timing FFTs using 11 threads on 11 physical CPUs. Best time for 4096K FFT length: 2.318 ms., avg: 2.691 ms. Timing FFTs using 12 threads on 12 physical CPUs. Best time for 4096K FFT length: 2.167 ms., avg: 2.607 ms. Timing FFTs using 13 threads on 13 physical CPUs. Best time for 4096K FFT length: 2.037 ms., avg: 2.733 ms. Timing FFTs using 14 threads on 14 physical CPUs. Best time for 4096K FFT length: 1.929 ms., avg: 2.475 ms. Timings for 4096K FFT length (1 cpu, 1 worker): 19.44 ms. Throughput: 51.43 iter/sec. Timings for 4096K FFT length (2 cpus, 1 worker): 10.36 ms. Throughput: 96.56 iter/sec. Timings for 4096K FFT length (2 cpus, 2 workers): 24.33, 24.52 ms. Throughput: 81.87 iter/sec. Timings for 4096K FFT length (3 cpus, 1 worker): 7.34 ms. Throughput: 136.33 iter/sec. Timings for 4096K FFT length (3 cpus, 3 workers): 22.74, 22.71, 23.15 ms. Throughput: 131.19 iter/sec. Timings for 4096K FFT length (4 cpus, 1 worker): 5.75 ms. Throughput: 173.87 iter/sec. Timings for 4096K FFT length (4 cpus, 2 workers): 13.39, 13.33 ms. Throughput: 149.67 iter/sec. Timings for 4096K FFT length (4 cpus, 4 workers): 23.80, 23.79, 23.33, 23.35 ms. Throughput: 169.73 iter/sec. Timings for 4096K FFT length (5 cpus, 1 worker): 4.70 ms. Throughput: 212.96 iter/sec. Timings for 4096K FFT length (5 cpus, 5 workers): 24.01, 24.00, 23.86, 23.97, 23.68 ms. Throughput: 209.18 iter/sec. Timings for 4096K FFT length (6 cpus, 1 worker): 3.93 ms. Throughput: 254.21 iter/sec. Timings for 4096K FFT length (6 cpus, 2 workers): 9.17, 9.03 ms. Throughput: 219.75 iter/sec. Timings for 4096K FFT length (6 cpus, 3 workers): 12.45, 12.66, 12.46 ms. Throughput: 239.54 iter/sec. Timings for 4096K FFT length (6 cpus, 6 workers): 24.41, 24.27, 24.50, 24.14, 24.02, 24.03 ms. Throughput: 247.64 iter/sec. Timings for 4096K FFT length (7 cpus, 1 worker): 3.36 ms. Throughput: 297.56 iter/sec. Timings for 4096K FFT length (7 cpus, 7 workers): 24.62, 24.70, 24.57, 24.69, 24.66, 24.67, 24.52 ms. Throughput: 284.19 iter/sec. Timings for 4096K FFT length (8 cpus, 1 worker): 2.98 ms. Throughput: 336.02 iter/sec. Timings for 4096K FFT length (8 cpus, 2 workers): 7.01, 6.89 ms. Throughput: 287.92 iter/sec. Timings for 4096K FFT length (8 cpus, 4 workers): 13.00, 13.03, 12.98, 12.85 ms. Throughput: 308.53 iter/sec. Timings for 4096K FFT length (8 cpus, 8 workers): 25.55, 25.41, 25.48, 25.42, 25.30, 25.54, 25.21, 25.42 ms. Throughput: 314.77 iter/sec. Timings for 4096K FFT length (9 cpus, 1 worker): 2.64 ms. Throughput: 379.15 iter/sec. Timings for 4096K FFT length (9 cpus, 3 workers): 9.04, 8.96, 8.91 ms. Throughput: 334.42 iter/sec. Timings for 4096K FFT length (9 cpus, 9 workers): 26.42, 26.60, 26.72, 26.55, 26.65, 26.51, 26.68, 26.43, 26.75 ms. Throughput: 338.45 iter/sec. Timings for 4096K FFT length (10 cpus, 1 worker): 2.38 ms. Throughput: 419.87 iter/sec. Timings for 4096K FFT length (10 cpus, 2 workers): 5.64, 5.57 ms. Throughput: 356.92 iter/sec. [Mon Feb 01 21:57:10 2016] Timings for 4096K FFT length (10 cpus, 5 workers): 14.19, 14.15, 14.27, 14.03, 14.10 ms. Throughput: 353.40 iter/sec. Timings for 4096K FFT length (10 cpus, 10 workers): 28.33, 28.31, 28.23, 28.26, 28.22, 27.95, 28.18, 28.26, 28.09, 28.09 ms. Throughput: 354.71 iter/sec. Timings for 4096K FFT length (11 cpus, 1 worker): 2.20 ms. Throughput: 453.70 iter/sec. Timings for 4096K FFT length (11 cpus, 11 workers): 30.87, 31.17, 31.11, 30.83, 30.94, 31.16, 30.93, 30.65, 30.67, 30.95, 31.12 ms. Throughput: 355.48 iter/sec. Timings for 4096K FFT length (12 cpus, 1 worker): 2.25 ms. Throughput: 443.49 iter/sec. Timings for 4096K FFT length (12 cpus, 2 workers): 5.21, 5.21 ms. Throughput: 383.78 iter/sec. Timings for 4096K FFT length (12 cpus, 3 workers): 8.17, 8.17, 8.17 ms. Throughput: 367.36 iter/sec. Timings for 4096K FFT length (12 cpus, 4 workers): 11.20, 10.94, 10.83, 10.94 ms. Throughput: 364.53 iter/sec. Timings for 4096K FFT length (12 cpus, 6 workers): 16.54, 16.46, 16.34, 16.41, 16.47, 16.81 ms. Throughput: 363.58 iter/sec. Timings for 4096K FFT length (12 cpus, 12 workers): 32.97, 33.15, 33.15, 33.10, 32.65, 33.05, 32.88, 32.55, 32.73, 32.49, 33.04, 33.46 ms. Throughput: 364.37 iter/sec. Timings for 4096K FFT length (13 cpus, 1 worker): 2.08 ms. Throughput: 480.50 iter/sec. Timings for 4096K FFT length (13 cpus, 13 workers): 35.57, 35.26, 35.35, 34.97, 34.98, 35.27, 35.22, 35.10, 35.00, 35.07, 35.30, 35.47, 35.32 ms. Throughput: 369.11 iter/sec. Timings for 4096K FFT length (14 cpus, 1 worker): 1.96 ms. Throughput: 509.45 iter/sec. Timings for 4096K FFT length (14 cpus, 2 workers): 5.02, 5.10 ms. Throughput: 395.34 iter/sec. Timings for 4096K FFT length (14 cpus, 7 workers): 18.77, 18.78, 18.79, 18.53, 18.77, 19.21, 18.79 ms. Throughput: 372.26 iter/sec. Timings for 4096K FFT length (14 cpus, 14 workers): 37.86, 38.03, 37.62, 37.28, 37.40, 37.76, 37.46, 37.71, 37.65, 37.35, 37.68, 38.25, 37.94, 38.33 ms. Throughput: 371.01 iter/sec. |
|
|
|
|
|
#646 | |
|
Serpentine Vermin Jar
Jul 2014
3,313 Posts |
Quote:
My theory is that with 14 workers, the memory contention becomes a big bottleneck, so the CPUs are actually idling more as it waits for memory. Running 1 worker with all 14 cores means the memory won't be the bottleneck anymore, and it's once again limited by the CPUs. A quick test is to look at the CPU graphs (one graph per core) when the workers are running. Doing this while benchmarking isn't as useful because you need to see performance over a longer period of time, like 10-15 seconds... you could adjust the benchmark settings to have it run longer... Anyway, when the cores are busy, if the physical cores aren't at 100% usage then it's waiting on something else (probably memory). You can also see the interesting effects of a multi-threaded worker with a sub-optimal threading scheme. For instance, if you take a somewhat small exponent in the 10M range and have a 14-core worker attack it with full force, what you'll see is the first core around 100% and then the other 13 cores will only be using 50-75%. That's because the smaller FFT sizes don't tend to distribute themselves as well among a lot of cores. For something like that (small FFTs) you'd want to limit it to a couple of cores at most, otherwise the program itself is too inefficient to cope with it. At the larger FFT sizes (2M and above seem decent enough) it's not much of a problem, and the larger the FFT, the more efficient it seems to get at distributing the load between cores. |
|
|
|
|
|
|
#647 |
|
May 2005
23×7×29 Posts |
Your CPU has TDP of 145W and it probably reaches that value with 12 cores utlizing Prime95 (you can verify that with some monitoring software). My CPU is 10c/20t and @ 2.5GHz I am hitting 105W with 9 cores running LLR / Prime95. Afterwards CPU clock occassionally falls to 2.4GHz (on single core) to stay below 105W TDP.
|
|
|
|
|
|
#648 | |
|
Serpentine Vermin Jar
Jul 2014
CF116 Posts |
Quote:
I say "almost" because there seems to be something common on many of the Proliants I've tested this on... CPU #1 will reach full turbo speeds, but CPU #2 is one turbo boost lower than the max. I don't know if it's the placement of the CPUs in relation to the bank of fans, or maybe the way CPU #2 is closer to where I have the hard drives in front (incoming air passes over the drives, through the fans, and then into the air handling baffles). Whatever the case, CPU #2 must be a little hotter so it steps down a bit. Not enough for me to get too concerned about, but I do keep the larger FFT sized tests on CPU #1 just to give them more oomph to get the work done. |
|
|
|
|
|
|
#649 |
|
Aug 2002
AE16 Posts |
It's interesting with these Xeon E5v3/i7 5xxx it is best to run 1 worker with multiple threads. It is clearly better than any other combinations. Not sure on the underlying reason but on the surface the parallelism and efficiency of 1 worker multiple threads is way better than 2 or more workers all fighting for cache and memory bandwidth.
There is some magic in there when these CPU is running just 1 worker. This is at least true for 4096K Last fiddled with by xtreme2k on 2016-02-03 at 05:12 |
|
|
|
|
|
#650 |
|
Jun 2003
5,087 Posts |
4096K FFT consumes 32MB of memory (plus change). This can run (almost) entirely out of the huge 35MB L3 cache of the Xeon. So despite the loss of efficiency due to multithreading, the 1 worker setup wins out.
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Perpetual "interesting video" thread... | Xyzzy | Lounge | 43 | 2021-07-17 00:00 |
| LLR benchmark thread | Oddball | Riesel Prime Search | 5 | 2010-08-02 00:11 |
| Perpetual I'm pi**ed off thread | rogue | Soap Box | 19 | 2009-10-28 19:17 |
| Perpetual autostereogram thread... | Xyzzy | Lounge | 10 | 2006-09-28 00:36 |
| Perpetual ECM factoring challenge thread... | Xyzzy | Factoring | 65 | 2005-09-05 08:16 |