mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   Hardware (https://www.mersenneforum.org/forumdisplay.php?f=9)
-   -   Perpetual benchmark thread... (https://www.mersenneforum.org/showthread.php?t=59)

Mark Rose 2016-08-27 00:59

i5-6600 with DDR-2133:

[Work thread Aug 24 10:24] Benchmarking multiple workers to measure the impact of memory bandwidth
[Work thread Aug 24 10:27] Timing 2048K FFT, 4 cpus, 1 worker. Average times: 2.57 ms. Total throughput: 388.96 iter/sec.
[Work thread Aug 24 10:27] Timing 2048K FFT, 4 cpus, 2 workers. Average times: 5.30, 5.30 ms. Total throughput: 377.27 iter/sec.
[Work thread Aug 24 10:27] Timing 2048K FFT, 4 cpus, 4 workers. Average times: 10.71, 10.71, 10.71, 10.71 ms. Total throughput: 373.41 iter/sec.
...
[Work thread Aug 24 10:29] Timing 4096K FFT, 4 cpus, 1 worker. Average times: 5.40 ms. Total throughput: 185.15 iter/sec.
[Work thread Aug 24 10:29] Timing 4096K FFT, 4 cpus, 2 workers. Average times: 10.88, 10.78 ms. Total throughput: 184.67 iter/sec.
[Work thread Aug 24 10:29] Timing 4096K FFT, 4 cpus, 4 workers. Average times: 21.67, 21.70, 21.65, 21.64 ms. Total throughput: 184.64 iter/sec.

Stock-clocked i7-4770k with DDR3-2400:

[Work thread Aug 24 10:30] Timing 2048K FFT, 4 cpus, 1 worker. Average times: 2.63 ms. Total throughput: 380.55 iter/sec.
[Work thread Aug 24 10:30] Timing 2048K FFT, 4 cpus, 2 workers. Average times: 5.90, 5.58 ms. Total throughput: 348.60 iter/sec.
[Work thread Aug 24 10:30] Timing 2048K FFT, 4 cpus, 4 workers. Average times: 10.49, 10.42, 10.54, 10.42 ms. Total throughput: 382.13 iter/sec.
[Work thread Aug 24 10:31] Timing 2560K FFT, 4 cpus, 1 worker. Average times: 3.56 ms. Total throughput: 280.91 iter/sec.
[Work thread Aug 24 10:31] Timing 2560K FFT, 4 cpus, 2 workers. Average times: 6.81, 6.80 ms. Total throughput: 294.08 iter/sec.
[Work thread Aug 24 10:31] Timing 2560K FFT, 4 cpus, 4 workers. Average times: 13.53, 13.56, 13.67, 13.58 ms. Total throughput: 294.46 iter/sec.
[Work thread Aug 24 10:31] Timing 3072K FFT, 4 cpus, 1 worker. Average times: 4.36 ms. Total throughput: 229.51 iter/sec.
[Work thread Aug 24 10:31] Timing 3072K FFT, 4 cpus, 2 workers. Average times: 8.07, 8.09 ms. Total throughput: 247.56 iter/sec.
[Work thread Aug 24 10:32] Timing 3072K FFT, 4 cpus, 4 workers. Average times: 16.12, 16.11, 16.25, 16.13 ms. Total throughput: 247.63 iter/sec.
[Work thread Aug 24 10:32] Timing 3584K FFT, 4 cpus, 1 worker. Average times: 4.81 ms. Total throughput: 207.94 iter/sec.
[Work thread Aug 24 10:32] Timing 3584K FFT, 4 cpus, 2 workers. Average times: 9.69, 9.66 ms. Total throughput: 206.67 iter/sec.
[Work thread Aug 24 10:32] Timing 3584K FFT, 4 cpus, 4 workers. Average times: 19.07, 18.99, 18.99, 19.09 ms. Total throughput: 210.14 iter/sec.
[Work thread Aug 24 10:32] Timing 4096K FFT, 4 cpus, 1 worker. Average times: 5.51 ms. Total throughput: 181.46 iter/sec.
[Work thread Aug 24 10:33] Timing 4096K FFT, 4 cpus, 2 workers. Average times: 10.85, 10.83 ms. Total throughput: 184.52 iter/sec.
[Work thread Aug 24 10:33] Timing 4096K FFT, 4 cpus, 4 workers. Average times: 21.96, 21.98, 21.76, 21.81 ms. Total throughput: 182.84 iter/sec.

And a 4770 with DDR3-1600:

[Work thread Aug 26 20:48] Timing 1024K FFT, 4 cpus, 1 worker. Average times: 1.31 ms. Total throughput: 760.95 iter/sec.
[Work thread Aug 26 20:48] Timing 1024K FFT, 4 cpus, 2 workers. Average times: 3.30, 3.70 ms. Total throughput: 573.21 iter/sec.
[Work thread Aug 26 20:48] Timing 1024K FFT, 4 cpus, 4 workers. Average times: 7.04, 7.16, 7.98, 7.27 ms. Total throughput: 544.37 iter/sec.
[Work thread Aug 26 20:48] Timing 1280K FFT, 4 cpus, 1 worker. Average times: 1.99 ms. Total throughput: 502.73 iter/sec.
[Work thread Aug 26 20:48] Timing 1280K FFT, 4 cpus, 2 workers. Average times: 4.38, 4.43 ms. Total throughput: 453.94 iter/sec.
[Work thread Aug 26 20:48] Timing 1280K FFT, 4 cpus, 4 workers. Average times: 9.18, 9.07, 9.10, 9.17 ms. Total throughput: 438.16 iter/sec.
[Work thread Aug 26 20:49] Timing 1536K FFT, 4 cpus, 1 worker. Average times: 2.43 ms. Total throughput: 411.77 iter/sec.
[Work thread Aug 26 20:49] Timing 1536K FFT, 4 cpus, 2 workers. Average times: 5.29, 5.44 ms. Total throughput: 372.94 iter/sec.
[Work thread Aug 26 20:49] Timing 1536K FFT, 4 cpus, 4 workers. Average times: 10.91, 10.94, 11.06, 11.04 ms. Total throughput: 363.99 iter/sec.
[Work thread Aug 26 20:49] Timing 1792K FFT, 4 cpus, 1 worker. Average times: 3.06 ms. Total throughput: 326.68 iter/sec.
[Work thread Aug 26 20:49] Timing 1792K FFT, 4 cpus, 2 workers. Average times: 6.55, 6.56 ms. Total throughput: 305.11 iter/sec.
[Work thread Aug 26 20:50] Timing 1792K FFT, 4 cpus, 4 workers. Average times: 13.18, 13.22, 13.06, 13.43 ms. Total throughput: 302.52 iter/sec.
[Work thread Aug 26 20:50] Timing 2048K FFT, 4 cpus, 1 worker. Average times: 3.49 ms. Total throughput: 286.67 iter/sec.
[Work thread Aug 26 20:50] Timing 2048K FFT, 4 cpus, 2 workers. Average times: 8.08, 7.51 ms. Total throughput: 256.88 iter/sec.
[Work thread Aug 26 20:50] Timing 2048K FFT, 4 cpus, 4 workers. Average times: 14.76, 15.04, 14.82, 15.09 ms. Total throughput: 267.96 iter/sec.
[Work thread Aug 26 20:50] Timing 2560K FFT, 4 cpus, 1 worker. Average times: 4.63 ms. Total throughput: 216.13 iter/sec.
[Work thread Aug 26 20:51] Timing 2560K FFT, 4 cpus, 2 workers. Average times: 9.50, 9.62 ms. Total throughput: 209.26 iter/sec.
[Work thread Aug 26 20:51] Timing 2560K FFT, 4 cpus, 4 workers. Average times: 18.46, 19.41, 19.10, 18.90 ms. Total throughput: 210.98 iter/sec.
[Work thread Aug 26 20:51] Timing 3072K FFT, 4 cpus, 1 worker. Average times: 5.66 ms. Total throughput: 176.75 iter/sec.
[Work thread Aug 26 20:51] Timing 3072K FFT, 4 cpus, 2 workers. Average times: 11.60, 11.86 ms. Total throughput: 170.48 iter/sec.
[Work thread Aug 26 20:51] Timing 3072K FFT, 4 cpus, 4 workers. Average times: 23.17, 22.32, 22.55, 23.19 ms. Total throughput: 175.43 iter/sec.
[Work thread Aug 26 20:52] Timing 3584K FFT, 4 cpus, 1 worker. Average times: 6.68 ms. Total throughput: 149.66 iter/sec.
[Work thread Aug 26 20:52] Timing 3584K FFT, 4 cpus, 2 workers. Average times: 13.62, 13.55 ms. Total throughput: 147.24 iter/sec.
[Work thread Aug 26 20:52] Timing 3584K FFT, 4 cpus, 4 workers. Average times: 26.65, 26.95, 27.47, 27.59 ms. Total throughput: 147.27 iter/sec.
[Work thread Aug 26 20:52] Timing 4096K FFT, 4 cpus, 1 worker. Average times: 8.05 ms. Total throughput: 124.17 iter/sec.
[Work thread Aug 26 20:52] Timing 4096K FFT, 4 cpus, 2 workers. Average times: 15.30, 15.29 ms. Total throughput: 130.74 iter/sec.
[Work thread Aug 26 20:53] Timing 4096K FFT, 4 cpus, 4 workers. Average times: 30.15, 30.57, 30.68, 30.75 ms. Total throughput: 131.00 iter/sec.

petrw1 2016-08-31 16:53

[QUOTE=Mark Rose;440799]This machine? [url]http://www.acer.com/ac/en/SG/content/model/DT.B1HSG.001[/url]

It takes DDR3[b]L[/b]-1600 at 1.35V.

The memory at both those links won't work. Try this:

[url]http://www.newegg.ca/Product/Product.aspx?Item=N82E16820156047[/url][/QUOTE]

I really appreciate the help (from all of you) but what in the first link tells me it can only handle DDR3-1600 at 1.35V?

Mark Rose 2016-08-31 17:25

[QUOTE=petrw1;441206]I really appreciate the help (from all of you) but what in the first link tells me it can only handle DDR3-1600 at 1.35V?[/QUOTE]

It's a specification of the CPU. [url]http://ark.intel.com/products/88185/Intel-Core-i5-6400-Processor-6M-Cache-up-to-3_30-GHz[/url]

When it's being used with DDR3, which is what the Acer page says the motherboard takes, it's limited to 1600 MHz and it must be low voltage DDR3, or DDR3L.

petrw1 2016-09-12 04:47

[QUOTE=Mark Rose;441215]It's a specification of the CPU. [url]http://ark.intel.com/products/88185/Intel-Core-i5-6400-Processor-6M-Cache-up-to-3_30-GHz[/url]

When it's being used with DDR3, which is what the Acer page says the motherboard takes, it's limited to 1600 MHz and it must be low voltage DDR3, or DDR3L.[/QUOTE]

So we ordered and installed 2X8G DDR3 - 1600 but CPU-Z says it is running at 800. Is it a simple MB-Settings thing or are we SOL.

The before and after benchmarks are virtually the same but 4 cores doing DC-LL is about 12% faster due to balanced Dual RAM.

henryzz 2016-09-12 13:16

[QUOTE=petrw1;442279]So we ordered and installed 2X8G DDR3 - 1600 but CPU-Z says it is running at 800. Is it a simple MB-Settings thing or are we SOL.

The before and after benchmarks are virtually the same but 4 cores doing DC-LL is about 12% faster due to balanced Dual RAM.[/QUOTE]

Cpu-z usually reports half the speed.

petrw1 2016-09-12 16:18

[QUOTE=petrw1;442279]So we ordered and installed 2X8G DDR3 - 1600 but CPU-Z says it is running at 800. Is it a simple MB-Settings thing or are we SOL.

The before and after benchmarks are virtually the same but 4 cores doing DC-LL is about 12% faster due to balanced Dual RAM.[/QUOTE]

Correction 25% faster.

17 to under 13 ms on a 37.5M DC for all 4 cores

Benchmark says it should be 13.5 for a 2048FFT with 4 cores....seems he is doing a little better
(7.68 for 1 core alone)

Antonio 2016-09-12 16:46

[QUOTE=henryzz;442309]Cpu-z usually reports half the speed.[/QUOTE]

Cpu-z gives the memory clock speed correctly. The factor of 2 difference comes from DDR memory transferring data on both the rising and the falling edge of the clock.

petrw1 2016-09-12 17:22

[QUOTE=Antonio;442324]Cpu-z gives the memory clock speed correctly. The factor of 2 difference comes from DDR memory transferring data on both the rising and the falling edge of the clock.[/QUOTE]

So to clarify if CPU-Z says 800 then for some reason my RAM is running at 800 and not 1600 as it is capable of?

If so is there anything I can do to get to that speed?
Or will that make no difference, that is, will the MB or some other component limit me to 800 anyway?

kladner 2016-09-12 17:27

[QUOTE=petrw1;442330]So to clarify if CPU-Z says 800 then for some reason my RAM is running at 800 and not 1600 as it is capable of?

If so is there anything I can do to get to that speed?
Or will that make no difference, that is, will the MB or some other component limit me to 800 anyway?[/QUOTE]
Your RAM is running at the correct speed. As explained, DDR RAM gets two operations per clock cycle. CPUZ reports the base clock, not what the RAM is doing.

James Heinrich 2016-09-13 03:40

[url]https://en.wikipedia.org/wiki/Double_data_rate[/url]

The RAM is running on an 800MHz (million cycles per second) clock, and transferring data at 1600MT/s (million transactions per second). Both are correct, even if that is confusing.

storm5510 2016-11-16 00:00

Intel(R) Core(TM) i5-3570 CPU @ 3.40GHz
CPU speed: 3557.21 MHz, 4 cores
CPU features: 3DNow!, SSE, SSE2, SSE4, AVX
L1 cache size: 32 KB
L2 cache size: 256 KB, L3 cache size: 6 MB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
TLBS: 64
Prime95 64-bit version 28.10, RdtscTiming=1

Timing FFTs using 1 thread.
Best time for 1024K FFT length: 9.508 ms., avg: 9.768 ms.
Best time for 1280K FFT length: 12.303 ms., avg: 12.418 ms.
Best time for 1536K FFT length: 14.999 ms., avg: 15.142 ms.
Best time for 1792K FFT length: 18.287 ms., avg: 18.356 ms.
Best time for 2048K FFT length: 20.227 ms., avg: 20.361 ms.
Best time for 2560K FFT length: 26.262 ms., avg: 26.380 ms.
Best time for 3072K FFT length: 31.668 ms., avg: 31.762 ms.
Best time for 3584K FFT length: 38.144 ms., avg: 38.364 ms.
Best time for 4096K FFT length: 42.237 ms., avg: 42.404 ms.
Best time for 5120K FFT length: 54.871 ms., avg: 54.996 ms.
Best time for 6144K FFT length: 68.655 ms., avg: 68.826 ms.
Best time for 7168K FFT length: 82.420 ms., avg: 82.663 ms.
Best time for 8192K FFT length: 90.886 ms., avg: 91.456 ms.

Timing FFTs using 2 threads.
Best time for 1024K FFT length: 4.864 ms., avg: 4.918 ms.
Best time for 1280K FFT length: 6.314 ms., avg: 6.388 ms.
Best time for 1536K FFT length: 7.647 ms., avg: 7.741 ms.
Best time for 1792K FFT length: 9.385 ms., avg: 9.449 ms.
Best time for 2048K FFT length: 10.340 ms., avg: 10.423 ms.
Best time for 2560K FFT length: 13.370 ms., avg: 13.465 ms.
Best time for 3072K FFT length: 16.091 ms., avg: 16.292 ms.
Best time for 3584K FFT length: 19.393 ms., avg: 19.624 ms.
Best time for 4096K FFT length: 21.453 ms., avg: 21.588 ms.
Best time for 5120K FFT length: 27.850 ms., avg: 28.476 ms.
Best time for 6144K FFT length: 34.854 ms., avg: 35.100 ms.
Best time for 7168K FFT length: 41.837 ms., avg: 42.006 ms.
Best time for 8192K FFT length: 46.029 ms., avg: 46.188 ms.

Timing FFTs using 3 threads.
Best time for 1024K FFT length: 3.412 ms., avg: 3.462 ms.
Best time for 1280K FFT length: 4.457 ms., avg: 4.533 ms.
Best time for 1536K FFT length: 5.287 ms., avg: 5.401 ms.
Best time for 1792K FFT length: 6.556 ms., avg: 6.645 ms.
Best time for 2048K FFT length: 7.277 ms., avg: 7.350 ms.
Best time for 2560K FFT length: 9.316 ms., avg: 9.495 ms.
Best time for 3072K FFT length: 11.275 ms., avg: 11.354 ms.
Best time for 3584K FFT length: 13.431 ms., avg: 13.660 ms.
Best time for 4096K FFT length: 14.977 ms., avg: 15.127 ms.
Best time for 5120K FFT length: 19.226 ms., avg: 19.463 ms.
Best time for 6144K FFT length: 24.403 ms., avg: 24.689 ms.
Best time for 7168K FFT length: 28.934 ms., avg: 29.159 ms.
Best time for 8192K FFT length: 31.774 ms., avg: 32.199 ms.

Timing FFTs using 4 threads.
Best time for 1024K FFT length: 2.730 ms., avg: 2.804 ms.
Best time for 1280K FFT length: 3.577 ms., avg: 3.688 ms.
Best time for 1536K FFT length: 4.234 ms., avg: 4.320 ms.
Best time for 1792K FFT length: 5.218 ms., avg: 5.423 ms.
Best time for 2048K FFT length: 5.985 ms., avg: 6.203 ms.
Best time for 2560K FFT length: 7.661 ms., avg: 7.855 ms.
Best time for 3072K FFT length: 9.270 ms., avg: 9.394 ms.
Best time for 3584K FFT length: 11.039 ms., avg: 11.281 ms.
Best time for 4096K FFT length: 12.421 ms., avg: 12.651 ms.
Best time for 5120K FFT length: 15.811 ms., avg: 15.986 ms.
Best time for 6144K FFT length: 20.161 ms., avg: 20.647 ms.
Best time for 7168K FFT length: 23.814 ms., avg: 24.115 ms.
Best time for 8192K FFT length: 26.010 ms., avg: 26.269 ms.

Timings for 1024K FFT length (4 cpus, 4 workers): 11.17, 11.15, 11.15, 11.16 ms. Throughput: 358.47 iter/sec.
Timings for 1280K FFT length (4 cpus, 4 workers): 14.62, 14.55, 14.55, 14.57 ms. Throughput: 274.43 iter/sec.
Timings for 1536K FFT length (4 cpus, 4 workers): 16.92, 16.81, 16.74, 16.74 ms. Throughput: 238.06 iter/sec.
Timings for 1792K FFT length (4 cpus, 4 workers): 20.66, 20.37, 20.73, 20.36 ms. Throughput: 194.87 iter/sec.
Timings for 2048K FFT length (4 cpus, 4 workers): 26.05, 25.39, 26.14, 26.14 ms. Throughput: 154.29 iter/sec.
Timings for 2560K FFT length (4 cpus, 4 workers): 28.62, 28.13, 28.07, 28.23 ms. Throughput: 141.54 iter/sec.
Timings for 3072K FFT length (4 cpus, 4 workers): 37.11, 38.15, 36.84, 37.20 ms. Throughput: 107.18 iter/sec.
Timings for 3584K FFT length (4 cpus, 4 workers): 42.48, 42.02, 41.94, 42.10 ms. Throughput: 94.94 iter/sec.
Timings for 4096K FFT length (4 cpus, 4 workers): 47.66, 47.15, 46.92, 46.69 ms. Throughput: 84.92 iter/sec.
Timings for 5120K FFT length (4 cpus, 4 workers): 61.24, 60.11, 59.79, 60.09 ms. Throughput: 66.33 iter/sec.
Timings for 6144K FFT length (4 cpus, 4 workers): 75.05, 74.09, 73.57, 74.10 ms. Throughput: 53.91 iter/sec.
Timings for 7168K FFT length (4 cpus, 4 workers): 92.21, 90.88, 90.51, 91.44 ms. Throughput: 43.83 iter/sec.
Timings for 8192K FFT length (4 cpus, 4 workers): 106.61, 108.07, 111.84, 106.36 ms. Throughput: 36.98 iter/sec.


All times are UTC. The time now is 07:04.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.