mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware

Reply
Thread Tools
Old 2017-04-13, 05:29   #751
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

2×5×293 Posts
Default

The @ 4.0GHz is part of the model name when queried from the processor. It has nothing to do with the actual running processor frequency. It has confused me in the past as well.


Kieren, are you running 12 GB of RAM? Kind of an odd amount. Not having matched sticks is probably hampering performance. That being said, you're getting 32% more throughput with a 27% higher CPU clock and a 50% higher memory clock compared to my systems (for 4 cores, 1 worker, 4096K FFT). That extra memory bandwidth is helping.

Your benchmark also tells me I'm still memory constrained at 2133 with 4 cores at 3.3 GHz. I may try poking around the bios to see if there's a way to under a locked CPU besides disabling turbo.
Mark Rose is offline   Reply With Quote
Old 2017-04-13, 12:00   #752
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

236568 Posts
Default

Quote:
Originally Posted by Mark Rose View Post
The @ 4.0GHz is part of the model name when queried from the processor. It has nothing to do with the actual running processor frequency. It has confused me in the past as well.


Kieren, are you running 12 GB of RAM? Kind of an odd amount. Not having matched sticks is probably hampering performance. That being said, you're getting 32% more throughput with a 27% higher CPU clock and a 50% higher memory clock compared to my systems (for 4 cores, 1 worker, 4096K FFT). That extra memory bandwidth is helping.

Your benchmark also tells me I'm still memory constrained at 2133 with 4 cores at 3.3 GHz. I may try poking around the bios to see if there's a way to under a locked CPU besides disabling turbo.
Does it indicate 12 GB somewhere? It should be 16GB, dual channel, dual rank. They are rated at 2666MHz, running at 3200.
Attached Thumbnails
Click image for larger version

Name:	16GB-dual rank.JPG
Views:	139
Size:	42.3 KB
ID:	15929  
kladner is offline   Reply With Quote
Old 2017-04-13, 13:23   #753
VictordeHolland
 
VictordeHolland's Avatar
 
"Victor de Hollander"
Aug 2011
the Netherlands

23×3×72 Posts
Default

Quote:
Originally Posted by db597 View Post
So from the benchmarks it looks like 8 Ryzen cores is still slower than 4 Skylake/Kabylake cores:

Ryzen @ 3.3GHz:
Code:
Timings for 4096K FFT length (8 cpus, 1 worker): 6.92 ms. Throughput: 144.58 iter/sec.
i7-6700K @ 4.2GHz:
Code:
Timings for 4096K FFT length (4 cpus, 1 worker):  4.07 ms.  Throughput: 245.91 iter/sec.
Ryzen is about as fast as my 5?6? year old SandyBridge

i5-2500k @4.0GHz DDR3-2133
Code:
Best time for 4096K FFT length: 6.839 ms., avg: 7.155 ms.
Timings for 4096K FFT length (4 cpus, 4 workers): 27.12, 26.85, 27.58, 27.00 ms.  Throughput: 147.41 iter/sec.
VictordeHolland is offline   Reply With Quote
Old 2017-04-13, 15:36   #754
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

2·5·293 Posts
Default

Quote:
Originally Posted by kladner View Post
Does it indicate 12 GB somewhere? It should be 16GB, dual channel, dual rank. They are rated at 2666MHz, running at 3200.
Yeah, it does:

Code:
 Machine#0 (total=12649168KB, Backend=Windows, hwlocVersion=1.11.6, ProcessName=prime95.exe)
I doubt it will affect Prime95 though, now that you've confirmed the install RAM.
Mark Rose is offline   Reply With Quote
Old 2017-04-13, 18:08   #755
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

1101011000112 Posts
Default

Quote:
Originally Posted by Mark Rose View Post
The @ 4.0GHz is part of the model name when queried from the processor. It has nothing to do with the actual running processor frequency.
I don't think it's the "@ 4.00 GHz" that was concerning, it was the "CPU speed: 4008.14 MHz".

Quote:
Originally Posted by Mark Rose View Post
are you running 12 GB of RAM? Kind of an odd amount. Not having matched sticks is probably hampering performance.
That depends on the system configuration. My i7-920 system has 12GB, but it's triple-channel so 12GB is a balanced configuration. Granted most systems are dual-channel (I'm odd and my two systems are 3-channel [i7-920] and 4-channel [i7-3930K] )

As for the RAM reported, I believe that's what's available to Prime95, not total system RAM. In my case I have 64GB installed and it logs as
Code:
Machine#0 (total=54609356KB)
which is 52GB. Although I'm not entirely sure how it pulled up that number, since I have 5 workers, specified at 11000MB each, maximum 4 high-memory workers, and an overall maximum of 44000MB. But in any case, it's clearly not the installed system RAM amount.
James Heinrich is offline   Reply With Quote
Old 2017-04-13, 21:08   #756
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

1015810 Posts
Default

Quote:
I don't think it's the "@ 4.00 GHz" that was concerning, it was the "CPU speed: 4008.14 MHz".
I see numbers like that when running stock. I think it must come from variations in the base clock.
kladner is offline   Reply With Quote
Old 2017-04-13, 21:16   #757
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

23×149 Posts
Default

Quote:
Originally Posted by kladner View Post
I see numbers like that when running stock. I think it must come from variations in the base clock.
It's not that it wasn't exactly 4000.00MHz, but rather that kladner was expecting ~4.2GHz, not ~4.0GHz, hence my suggestion to monitor the frequency in realtime.
James Heinrich is offline   Reply With Quote
Old 2017-04-14, 00:41   #758
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

2·3·1,693 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
It's not that it wasn't exactly 4000.00MHz, but rather that kladner was expecting ~4.2GHz, not ~4.0GHz, hence my suggestion to monitor the frequency in realtime.
I did try watching CPU-Z when starting the benchmark. It takes thinning out other CPU users to get a baseline. As mentioned, in the "Sync all cores" option on the Asus board seems to make the frequency (multiplier) jump around a lot. When things were quiet, and the core clock was only occasionally hitting 4200 MHz, moving the mouse of clicking on something would make it peak.

The jump to 42x seems virtually simultaneous with clicking to start the benchmark, at least to human-scaled perceptions.
kladner is offline   Reply With Quote
Old 2017-04-17, 14:12   #759
FSund
 
Apr 2017

2 Posts
Default

Quote:
Originally Posted by db597 View Post
The 8192K FFT performance looks incredible on this version of Prime95, especially when all 8 cores are thrown at it. Would be good if someone can post results from a similarly priced Intel i7 7700K on Prime95 v29.1 Build 15 for comparison (I expect the i7 is a lot faster per core, but at the end of the day having double the cores may make it a rather close competition).
I have just gotten a 7700k, and I'm in the process of overclocking it now. Will try to report back with benchmarks when I'm done.

At the moment it's looking like I'll have to accept 4.7 GHz for Prime95 runs, any higher and my temperatures get too high. Or to be more specific, at 4.7 GHz have to increase the voltage to 1.280 V to do the the Prime95 torture tests without errors, and at those voltages I get peak temperatures of 85 C, which is a bit too high for my comfort.
FSund is offline   Reply With Quote
Old 2017-04-17, 15:50   #760
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

55628 Posts
Default

Do remember that Haswell and later bump the VCore +0.1 when doing AVX2/FMA3 (which Prime95 will use).
Mark Rose is offline   Reply With Quote
Old 2017-04-18, 06:53   #761
db597
 
db597's Avatar
 
Jan 2003

7·29 Posts
Default

Quote:
Originally Posted by VictordeHolland View Post
Ryzen is about as fast as my 5?6? year old SandyBridge
The current Core architecture has been around for 6-7 years and has lots of time for software optimisations to be done for it. Ryzen has been out for 1 month... I remember back in the v26.xx days, Prime95 was a lot slower before optimisation.

My Ryzen is mainly working on World Community Grid projects (generally not highly optimised code). There I see a 2.5x the points per day compared to my 2600K @ stock clockspeeds. At the same clockspeed, the IPC is around Haswell level and it has double the number of cores.
db597 is offline   Reply With Quote
Old 2017-04-23, 11:23   #762
FSund
 
Apr 2017

216 Posts
Default

Quote:
Originally Posted by db597 View Post
@LaurV... thanks for the comparison benchmark.

So for the case of both systems running on 8 physical cores, it's 7.136ms for the i7-6950X @ 3.0GHz vs 12.69ms for the Ryzen 1700 @ 3.3GHz. Looks like Intel wins big in terms of IPC.

Would still be interesting to see the results from a i7-7700K (half the cores, but higher IPC and higher clockspeed)... to compare at a similar cost level (a Ryzen 1700 system being still a bit cheaper than a comparable i7-7700K system).
Here are my results

Hardware:
CPU: i7-7700K - at 4.9 GHz with AVX Core Ratio Negative Offset of 2 (which reduces the multiplier to 47 when running Prime95, giving a clock speed of 4.7 GHz)
CPU Cooler: Noctua NH-D15
Motherboard: ASUS TUF Z270 Mark 1 - BIOS version 0906
PSU: EVGA SuperNOVA 550 G3
RAM: 2x Corsair Vengeance LPX DDR4 3200MHz 8GB - at XMP 3200 MHz
OS Drive: Corsair Force MP500 240GB M.2 PCIe SSD

Code:
[Sun Apr 23 11:02:28 2017]
Compare your results to other computers at http://www.mersenne.org/report_benchmarks
Intel(R) Core(TM) i7-7700K CPU @ 4.20GHz
CPU speed: 4551.15 MHz, 4 hyperthreaded cores
CPU features: Prefetchw, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 32 KB
L2 cache size: 256 KB, L3 cache size: 8 MB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
TLBS: 64
Prime95 64-bit version 29.1, RdtscTiming=1
Timings for 2048K FFT length (4 cpus, 1 worker):  2.19 ms.  Throughput: 457.47 iter/sec.
Timings for 2048K FFT length (4 cpus, 4 workers): 10.28,  9.28,  9.05,  9.78 ms.  Throughput: 417.80 iter/sec.
Timings for 2048K FFT length (4 cpus hyperthreaded, 1 worker):  2.50 ms.  Throughput: 400.18 iter/sec.
Timings for 2048K FFT length (4 cpus hyperthreaded, 4 workers): 10.57, 10.02, 13.69, 10.16 ms.  Throughput: 365.92 iter/sec.
Timings for 2560K FFT length (4 cpus, 1 worker):  2.82 ms.  Throughput: 354.13 iter/sec.
Timings for 2560K FFT length (4 cpus, 4 workers): 12.16, 11.62, 11.63, 13.58 ms.  Throughput: 327.87 iter/sec.
Timings for 2560K FFT length (4 cpus hyperthreaded, 1 worker):  3.18 ms.  Throughput: 314.12 iter/sec.
Timings for 2560K FFT length (4 cpus hyperthreaded, 4 workers): 13.08, 12.59, 16.81, 12.99 ms.  Throughput: 292.38 iter/sec.
Timings for 3072K FFT length (4 cpus, 1 worker):  3.45 ms.  Throughput: 289.75 iter/sec.
Timings for 3072K FFT length (4 cpus, 4 workers): 15.42, 14.62, 14.27, 15.11 ms.  Throughput: 269.51 iter/sec.
Timings for 3072K FFT length (4 cpus hyperthreaded, 1 worker):  3.96 ms.  Throughput: 252.73 iter/sec.
Timings for 3072K FFT length (4 cpus hyperthreaded, 4 workers): 16.08, 15.22, 20.47, 15.33 ms.  Throughput: 242.01 iter/sec.
Timings for 3584K FFT length (4 cpus, 1 worker):  4.11 ms.  Throughput: 243.02 iter/sec.
Timings for 3584K FFT length (4 cpus, 4 workers): 19.07, 17.39, 16.95, 17.41 ms.  Throughput: 226.40 iter/sec.
Timings for 3584K FFT length (4 cpus hyperthreaded, 1 worker):  4.73 ms.  Throughput: 211.63 iter/sec.
Timings for 3584K FFT length (4 cpus hyperthreaded, 4 workers): 18.80, 18.03, 24.42, 17.97 ms.  Throughput: 205.23 iter/sec.
Timings for 4096K FFT length (4 cpus, 1 worker):  4.79 ms.  Throughput: 208.79 iter/sec.
Timings for 4096K FFT length (4 cpus, 4 workers): 21.20, 19.72, 19.60, 20.85 ms.  Throughput: 196.85 iter/sec.
[Sun Apr 23 11:07:29 2017]
Timings for 4096K FFT length (4 cpus hyperthreaded, 1 worker):  5.46 ms.  Throughput: 183.31 iter/sec.
Timings for 4096K FFT length (4 cpus hyperthreaded, 4 workers): 21.19, 20.58, 27.73, 21.08 ms.  Throughput: 179.28 iter/sec.
Timings for 5120K FFT length (4 cpus, 1 worker):  6.09 ms.  Throughput: 164.13 iter/sec.
Timings for 5120K FFT length (4 cpus, 4 workers): 28.35, 24.43, 24.09, 24.33 ms.  Throughput: 158.81 iter/sec.
Timings for 5120K FFT length (4 cpus hyperthreaded, 1 worker):  6.97 ms.  Throughput: 143.50 iter/sec.
Timings for 5120K FFT length (4 cpus hyperthreaded, 4 workers): 27.05, 25.88, 34.61, 25.76 ms.  Throughput: 143.33 iter/sec.
Timings for 6144K FFT length (4 cpus, 1 worker):  7.94 ms.  Throughput: 125.99 iter/sec.
Timings for 6144K FFT length (4 cpus, 4 workers): 33.12, 30.44, 30.03, 34.13 ms.  Throughput: 125.65 iter/sec.
Timings for 6144K FFT length (4 cpus hyperthreaded, 1 worker):  8.88 ms.  Throughput: 112.61 iter/sec.
Timings for 6144K FFT length (4 cpus hyperthreaded, 4 workers): 33.69, 32.84, 46.51, 32.49 ms.  Throughput: 112.42 iter/sec.
Timings for 7168K FFT length (4 cpus, 1 worker):  9.31 ms.  Throughput: 107.45 iter/sec.
Timings for 7168K FFT length (4 cpus, 4 workers): 35.75, 35.32, 35.11, 45.61 ms.  Throughput: 106.69 iter/sec.
Timings for 7168K FFT length (4 cpus hyperthreaded, 1 worker): 10.54 ms.  Throughput: 94.88 iter/sec.
Timings for 7168K FFT length (4 cpus hyperthreaded, 4 workers): 39.98, 38.35, 54.51, 40.08 ms.  Throughput: 94.39 iter/sec.
Timings for 8192K FFT length (4 cpus, 1 worker): 10.75 ms.  Throughput: 93.03 iter/sec.
Timings for 8192K FFT length (4 cpus, 4 workers): 43.70, 41.72, 40.36, 46.95 ms.  Throughput: 92.93 iter/sec.
Timings for 8192K FFT length (4 cpus hyperthreaded, 1 worker): 12.24 ms.  Throughput: 81.67 iter/sec.
Timings for 8192K FFT length (4 cpus hyperthreaded, 4 workers): 47.77, 45.73, 62.07, 45.13 ms.  Throughput: 81.07 iter/sec.
FSund is offline   Reply With Quote
Old 2017-04-23, 13:26   #763
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

1011011100102 Posts
Default

If there ever were a case for more memory bandwidth...

It's nice to see a 4 core with over 200 iter/ms at 4096K FFT though!
Mark Rose is offline   Reply With Quote
Old 2017-04-23, 14:27   #764
retina
Undefined
 
retina's Avatar
 
"The unspeakable one"
Jun 2006
My evil lair

22·32·173 Posts
Default

Quote:
Originally Posted by Mark Rose View Post
It's nice to see a 4 core with over 200 iter/ms at 4096K FFT though!
I think you have an extra 'm' in there. Maybe with the magical optical computers (using yellow of course) we will see 200 iter/ms. But as of today, I think not.
retina is online now   Reply With Quote
Old 2017-04-23, 15:42   #765
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

2×5×293 Posts
Default

Quote:
Originally Posted by retina View Post
I think you have an extra 'm' in there. Maybe with the magical optical computers (using yellow of course) we will see 200 iter/ms. But as of today, I think not.
lol yes. Good eyes.
Mark Rose is offline   Reply With Quote
Old 2017-05-28, 19:21   #766
Dresdenboy
 
Dresdenboy's Avatar
 
Apr 2003
Berlin, Germany

192 Posts
Default

CPU: Ryzen 7 1700X stock (mostly running 3.5GHz on all cores with Prime95)
CPU Cooler: Noctua NH-D15 SE-AM4
Motherboard: ASRock Taichi X370
BIOS version: 2.34 (Beta with AGESA 1.0.0.6)
RAM: 2x16GB G.Skill TridentZ DDR4 3200MHz 14-14-14-36-1T dual rank
OS: Win10Pro x64

Code:
Compare your results to other computers at http://www.mersenne.org/report_benchmarks
AMD Ryzen 7 1700X Eight-Core Processor         
CPU speed: 3400.29 MHz, 8 hyperthreaded cores
CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 32 KB
L2 cache size: 512 KB, L3 cache size: 16 MB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
L1 TLBS: 64
L2 TLBS: 1536
Prime95 64-bit version 29.1, RdtscTiming=1
Best time for 2048K FFT length: 13.522 ms., avg: 13.721 ms.
Best time for 2560K FFT length: 17.576 ms., avg: 18.112 ms.
Best time for 3072K FFT length: 21.111 ms., avg: 21.577 ms.
Best time for 3584K FFT length: 25.393 ms., avg: 26.027 ms.
Best time for 4096K FFT length: 28.571 ms., avg: 30.210 ms.
Best time for 5120K FFT length: 36.353 ms., avg: 36.910 ms.
Best time for 6144K FFT length: 43.307 ms., avg: 43.586 ms.
Best time for 7168K FFT length: 51.546 ms., avg: 51.943 ms.
Best time for 8192K FFT length: 58.803 ms., avg: 60.538 ms.
Timing FFTs using 2 threads on 1 core.
Best time for 2048K FFT length: 15.871 ms., avg: 16.169 ms.
Best time for 2560K FFT length: 20.416 ms., avg: 20.792 ms.
Best time for 3072K FFT length: 24.707 ms., avg: 25.089 ms.
Best time for 3584K FFT length: 30.205 ms., avg: 30.626 ms.
Best time for 4096K FFT length: 34.890 ms., avg: 35.156 ms.
Best time for 5120K FFT length: 42.726 ms., avg: 44.333 ms.
Best time for 6144K FFT length: 50.388 ms., avg: 51.474 ms.
Best time for 7168K FFT length: 59.980 ms., avg: 60.897 ms.
Best time for 8192K FFT length: 70.055 ms., avg: 70.686 ms.
Timing FFTs using 8 threads on 8 cores.
Best time for 2048K FFT length: 2.107 ms., avg: 2.420 ms.
Best time for 2560K FFT length: 3.006 ms., avg: 3.540 ms.
Best time for 3072K FFT length: 3.573 ms., avg: 4.106 ms.
Best time for 3584K FFT length: 4.235 ms., avg: 4.553 ms.
Best time for 4096K FFT length: 4.794 ms., avg: 5.213 ms.
Best time for 5120K FFT length: 5.536 ms., avg: 5.879 ms.
Best time for 6144K FFT length: 6.778 ms., avg: 6.975 ms.
Best time for 7168K FFT length: 7.995 ms., avg: 8.094 ms.
Best time for 8192K FFT length: 9.107 ms., avg: 9.352 ms.
Timing FFTs using 16 threads on 8 cores.
Best time for 2048K FFT length: 2.381 ms., avg: 2.501 ms.
Best time for 2560K FFT length: 3.175 ms., avg: 3.610 ms.
Best time for 3072K FFT length: 3.882 ms., avg: 4.355 ms.
Best time for 3584K FFT length: 4.646 ms., avg: 5.146 ms.
Best time for 4096K FFT length: 5.256 ms., avg: 5.451 ms.
Best time for 5120K FFT length: 6.213 ms., avg: 6.490 ms.
Best time for 6144K FFT length: 7.405 ms., avg: 7.650 ms.
Best time for 7168K FFT length: 8.841 ms., avg: 8.956 ms.
Best time for 8192K FFT length: 10.258 ms., avg: 10.349 ms.
Throughput results (w/o SMT):
Code:
Prime95 64-bit version 29.1, RdtscTiming=1
Timings for 2048K FFT length (1 cpu, 1 worker): 14.30 ms.  Throughput: 69.93 iter/sec.
Timings for 2048K FFT length (8 cpus, 1 worker):  2.14 ms.  Throughput: 467.81 iter/sec.
Timings for 2048K FFT length (8 cpus, 8 workers): 16.24, 16.03, 15.99, 16.03, 16.02, 15.95, 16.18, 16.03 ms.  Throughput: 498.14 iter/sec.
Timings for 2560K FFT length (1 cpu, 1 worker): 18.11 ms.  Throughput: 55.23 iter/sec.
Timings for 2560K FFT length (8 cpus, 1 worker):  3.03 ms.  Throughput: 330.57 iter/sec.
Timings for 2560K FFT length (8 cpus, 8 workers): 22.72, 22.41, 22.23, 22.18, 22.13, 22.05, 22.34, 22.21 ms.  Throughput: 359.04 iter/sec.
Timings for 3072K FFT length (1 cpu, 1 worker): 21.96 ms.  Throughput: 45.54 iter/sec.
Timings for 3072K FFT length (8 cpus, 1 worker):  3.63 ms.  Throughput: 275.83 iter/sec.
Timings for 3072K FFT length (8 cpus, 8 workers): 27.70, 27.15, 26.61, 26.67, 26.34, 26.48, 26.85, 26.81 ms.  Throughput: 298.28 iter/sec.
Timings for 3584K FFT length (1 cpu, 1 worker): 26.18 ms.  Throughput: 38.20 iter/sec.
Timings for 3584K FFT length (8 cpus, 1 worker):  4.28 ms.  Throughput: 233.54 iter/sec.
Timings for 3584K FFT length (8 cpus, 8 workers): 32.17, 31.36, 31.35, 31.24, 31.32, 31.04, 31.43, 31.45 ms.  Throughput: 254.64 iter/sec.
[Sun May 28 21:29:40 2017]
Timings for 4096K FFT length (1 cpu, 1 worker): 29.84 ms.  Throughput: 33.51 iter/sec.
Timings for 4096K FFT length (8 cpus, 1 worker):  4.87 ms.  Throughput: 205.25 iter/sec.
Timings for 4096K FFT length (8 cpus, 8 workers): 36.36, 35.91, 35.76, 35.53, 35.79, 35.88, 36.13, 36.07 ms.  Throughput: 222.68 iter/sec.
Timings for 5120K FFT length (1 cpu, 1 worker): 37.73 ms.  Throughput: 26.50 iter/sec.
Timings for 5120K FFT length (8 cpus, 1 worker):  5.53 ms.  Throughput: 180.79 iter/sec.
Timings for 5120K FFT length (8 cpus, 8 workers): 43.10, 42.08, 42.24, 41.99, 41.81, 41.76, 41.92, 42.09 ms.  Throughput: 189.93 iter/sec.
Timings for 6144K FFT length (1 cpu, 1 worker): 44.95 ms.  Throughput: 22.25 iter/sec.
Timings for 6144K FFT length (8 cpus, 1 worker):  6.86 ms.  Throughput: 145.67 iter/sec.
Timings for 6144K FFT length (8 cpus, 8 workers): 51.06, 50.97, 50.29, 50.15, 50.18, 50.24, 50.41, 50.41 ms.  Throughput: 158.54 iter/sec.
Timings for 7168K FFT length (1 cpu, 1 worker): 53.47 ms.  Throughput: 18.70 iter/sec.
Timings for 7168K FFT length (8 cpus, 1 worker):  8.10 ms.  Throughput: 123.49 iter/sec.
Timings for 7168K FFT length (8 cpus, 8 workers): 61.59, 60.51, 59.82, 60.19, 60.22, 59.97, 60.62, 60.58 ms.  Throughput: 132.37 iter/sec.
Timings for 8192K FFT length (1 cpu, 1 worker): 60.89 ms.  Throughput: 16.42 iter/sec.
Timings for 8192K FFT length (8 cpus, 1 worker):  9.13 ms.  Throughput: 109.52 iter/sec.
Timings for 8192K FFT length (8 cpus, 8 workers): 70.80, 69.62, 69.27, 70.24, 69.69, 68.67, 69.21, 69.33 ms.  Throughput: 114.95 iter/sec.

Last fiddled with by Dresdenboy on 2017-05-28 at 19:58
Dresdenboy is offline   Reply With Quote
Old 2017-05-28, 21:41   #767
Dresdenboy
 
Dresdenboy's Avatar
 
Apr 2003
Berlin, Germany

192 Posts
Default

With new data at hand, we could do some more comparisons.

Quote:
Originally Posted by db597 View Post
I posted the below results from my Ryzen 1700 (non-X) in the AMD Zen speculation thread earlier. Just thought I'd consolidate the results together with all the other benchmarks in this thread and also add a bit more detail on the setup.

CPU: AMD Ryzen 1700 (non-X)
Frequency: 3.32GHz @ 1.031V (stock rating 3GHz / Turbo 3.7GHz)
Heatsink: AMD Wraith Spire
Memory: Corsair 8GBx2 @ 2933GHz CAS16 (single rank)
Motherboard Asus X370-Pro
BIOS: 0604 (AGESA 1.0.0.4a)
Operating system: Windows 10 x64 Creators Update
Prime95 version: 29.1 Build 15

Code:
[snip]
Timings for 8192K FFT length (8 cpus, 8 workers): 99.83, 99.12, 96.13, 97.41, 96.20, 96.03, 96.76, 96.01 ms. Throughput: 82.33 iter/sec.
The 8192K FFT performance looks incredible on this version of Prime95, especially when all 8 cores are thrown at it. Would be good if someone can post results from a similarly priced Intel i7 7700K on Prime95 v29.1 Build 15 for comparison (I expect the i7 is a lot faster per core, but at the end of the day having double the cores may make it a rather close competition).
It seems, memory plays an important role. I got 30% lower times with the new beta BIOS, 3200-14-14-14-36 DR, and running at 3.5GHz (stock CPB):
Code:
Timings for 8192K FFT length (8 cpus, 8 workers): 70.80, 69.62, 69.27, 70.24, 69.69, 68.67, 69.21, 69.33 ms.  Throughput: 114.95 iter/sec.

Quote:
Originally Posted by LaurV View Post
Well, not exactly the same price range, but for a comparison term: i7-6950X @ 3.00GHz (yes, underclocked, having momentarily problems with cooling, April is Thai summer, the hottest period of the year, ~45°C outside), with single worker, working on 8 cores (from 10), on the required FFT size, Prime95 64-bit version 28.10:

<snip>
Timing FFTs using 8 threads on 8 physical CPUs.
<snip>
Best time for 8192K FFT length: 7.136 ms., avg: 7.291 ms.
<snip>
From my Ryzen 7 result:
Code:
Timing FFTs using 8 threads on 8 cores.
<snip>
Best time for 8192K FFT length: 9.107 ms., avg: 9.352 ms.
Normalized to 3GHz this would be 10.625 ms, or 67% the speed of your result. That's the penalty for half the AVX + cache throughput and mem channels.

Last fiddled with by Dresdenboy on 2017-05-28 at 21:44
Dresdenboy is offline   Reply With Quote
Old 2017-06-20, 22:26   #768
storm5510
Random Account
 
storm5510's Avatar
 
Aug 2009

13·151 Posts
Default

Below are the timings for my new build. The hardware specs are as follows:

Intel i7-7700, 3.6 GHZ, Turbo 4.2 GHz.
RAM: Kingston Hyper Fury DDR-4 2400 2*4GB
Main Board: Asus Prime B250M-A
CPU Cooler: Arctic i11 (Recommended by Mark Rose.)
Prime95 v29.1 Build 14


Quote:
[Tue Jun 20 18:01:25 2017]
Compare your results to other computers at http://www.mersenne.org/report_benchmarks
Intel(R) Core(TM) i7-7700 CPU @ 3.60GHz
CPU speed: 4078.95 MHz, 4 hyperthreaded cores
CPU features: Prefetchw, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 32 KB
L2 cache size: 256 KB, L3 cache size: 8 MB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
TLBS: 64
Machine topology as determined by hwloc library:
Machine#0 (total=6990040KB, Backend=Windows, hwlocVersion=1.11.6, ProcessName=prime95.exe)
NUMANode#0 (local=6990040KB, total=6990040KB)
Package#0 (CPUVendor=GenuineIntel, CPUFamilyNumber=6, CPUModelNumber=158, CPUModel="Intel(R) Core(TM) i7-7700 CPU @ 3.60GHz", CPUStepping=9)
L3 (size=8192KB, linesize=64, ways=16, Inclusive=1)
L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
Core (cpuset: 0x00000003)
PU#0 (cpuset: 0x00000001)
PU#1 (cpuset: 0x00000002)
L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
Core (cpuset: 0x0000000c)
PU#2 (cpuset: 0x00000004)
PU#3 (cpuset: 0x00000008)
L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
Core (cpuset: 0x00000030)
PU#4 (cpuset: 0x00000010)
PU#5 (cpuset: 0x00000020)
L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
Core (cpuset: 0x000000c0)
PU#6 (cpuset: 0x00000040)
PU#7 (cpuset: 0x00000080)
Prime95 64-bit version 29.1, RdtscTiming=1
Timings for 2048K FFT length (4 cpus, 1 worker): 2.43 ms. Throughput: 412.20 iter/sec.
Timings for 2048K FFT length (4 cpus, 4 workers): 10.87, 10.83, 10.88, 10.84 ms. Throughput: 368.51 iter/sec.
Timings for 2048K FFT length (4 cpus hyperthreaded, 1 worker): 2.64 ms. Throughput: 379.40 iter/sec.
Timings for 2048K FFT length (4 cpus hyperthreaded, 4 workers): 11.82, 11.53, 11.55, 11.65 ms. Throughput: 343.76 iter/sec.
Timings for 2560K FFT length (4 cpus, 1 worker): 3.16 ms. Throughput: 316.92 iter/sec.
Timings for 2560K FFT length (4 cpus, 4 workers): 13.89, 13.85, 13.92, 13.88 ms. Throughput: 288.05 iter/sec.
Timings for 2560K FFT length (4 cpus hyperthreaded, 1 worker): 3.38 ms. Throughput: 295.71 iter/sec.
Timings for 2560K FFT length (4 cpus hyperthreaded, 4 workers): 14.69, 14.55, 14.54, 14.65 ms. Throughput: 273.83 iter/sec.
Timings for 3072K FFT length (4 cpus, 1 worker): 3.88 ms. Throughput: 257.76 iter/sec.
Timings for 3072K FFT length (4 cpus, 4 workers): 16.83, 16.78, 16.81, 16.90 ms. Throughput: 237.67 iter/sec.
Timings for 3072K FFT length (4 cpus hyperthreaded, 1 worker): 4.20 ms. Throughput: 238.23 iter/sec.
Timings for 3072K FFT length (4 cpus hyperthreaded, 4 workers): 17.86, 17.48, 17.81, 17.42 ms. Throughput: 226.76 iter/sec.
Timings for 3584K FFT length (4 cpus, 1 worker): 4.65 ms. Throughput: 214.84 iter/sec.
Timings for 3584K FFT length (4 cpus, 4 workers): 19.77, 19.75, 19.85, 19.80 ms. Throughput: 202.10 iter/sec.
Timings for 3584K FFT length (4 cpus hyperthreaded, 1 worker): 5.01 ms. Throughput: 199.58 iter/sec.
Timings for 3584K FFT length (4 cpus hyperthreaded, 4 workers): 20.65, 20.55, 20.56, 20.64 ms. Throughput: 194.19 iter/sec.
Timings for 4096K FFT length (4 cpus, 1 worker): 5.32 ms. Throughput: 188.08 iter/sec.
Timings for 4096K FFT length (4 cpus, 4 workers): 22.75, 22.28, 22.42, 22.61 ms. Throughput: 177.67 iter/sec.
[Tue Jun 20 18:06:28 2017]
Timings for 4096K FFT length (4 cpus hyperthreaded, 1 worker): 5.81 ms. Throughput: 172.10 iter/sec.
Timings for 4096K FFT length (4 cpus hyperthreaded, 4 workers): 23.71, 23.68, 23.61, 23.63 ms. Throughput: 169.10 iter/sec.
Timings for 5120K FFT length (4 cpus, 1 worker): 6.79 ms. Throughput: 147.20 iter/sec.
Timings for 5120K FFT length (4 cpus, 4 workers): 28.19, 27.94, 28.18, 28.03 ms. Throughput: 142.44 iter/sec.
Timings for 5120K FFT length (4 cpus hyperthreaded, 1 worker): 7.37 ms. Throughput: 135.77 iter/sec.
Timings for 5120K FFT length (4 cpus hyperthreaded, 4 workers): 29.78, 29.50, 29.67, 29.47 ms. Throughput: 135.12 iter/sec.
Timings for 6144K FFT length (4 cpus, 1 worker): 8.58 ms. Throughput: 116.61 iter/sec.
Timings for 6144K FFT length (4 cpus, 4 workers): 35.04, 35.72, 35.79, 35.68 ms. Throughput: 112.50 iter/sec.
Timings for 6144K FFT length (4 cpus hyperthreaded, 1 worker): 9.65 ms. Throughput: 103.64 iter/sec.
Timings for 6144K FFT length (4 cpus hyperthreaded, 4 workers): 38.22, 35.56, 40.69, 39.95 ms. Throughput: 103.89 iter/sec.
Timings for 7168K FFT length (4 cpus, 1 worker): 10.03 ms. Throughput: 99.73 iter/sec.
Timings for 7168K FFT length (4 cpus, 4 workers): 40.22, 39.93, 40.05, 40.20 ms. Throughput: 99.75 iter/sec.
Timings for 7168K FFT length (4 cpus hyperthreaded, 1 worker): 11.24 ms. Throughput: 88.94 iter/sec.
Timings for 7168K FFT length (4 cpus hyperthreaded, 4 workers): 45.00, 44.19, 44.71, 44.65 ms. Throughput: 89.62 iter/sec.
Timings for 8192K FFT length (4 cpus, 1 worker): 11.71 ms. Throughput: 85.42 iter/sec.
Timings for 8192K FFT length (4 cpus, 4 workers): 46.02, 45.88, 45.80, 45.96 ms. Throughput: 87.11 iter/sec.
Timings for 8192K FFT length (4 cpus hyperthreaded, 1 worker): 13.02 ms. Throughput: 76.83 iter/sec.
Timings for 8192K FFT length (4 cpus hyperthreaded, 4 workers): 52.32, 51.69, 52.79, 51.94 ms. Throughput: 76.65 iter/sec.
storm5510 is offline   Reply With Quote
Old 2017-06-21, 00:25   #769
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

2×5×293 Posts
Default

Quote:
Originally Posted by storm5510 View Post
Below are the timings for my new build. The hardware specs are as follows:

Intel i7-7700, 3.6 GHZ, Turbo 4.2 GHz.
RAM: Kingston Hyper Fury DDR-4 2400 2*4GB
Main Board: Asus Prime B250M-A
CPU Cooler: Arctic i11 (Recommended by Mark Rose.)
Prime95 v29.1 Build 14
Timings look good!

If you want to save a little power, you can go into your BIOS and tweak the 4-core Turbo speed and lower it to 3.6 GHz. You should still get almost the same throughput. That way if you're running mprime the power usage and heat will be lower, but when running anything that doesn't use all four 4 cores it will get the full turbo speed.
Mark Rose is offline   Reply With Quote
Old 2017-06-21, 02:54   #770
storm5510
Random Account
 
storm5510's Avatar
 
Aug 2009

13×151 Posts
Default

Quote:
Originally Posted by Mark Rose View Post
Timings look good!

If you want to save a little power, you can go into your BIOS and tweak the 4-core Turbo speed and lower it to 3.6 GHz. You should still get almost the same throughput. That way if you're running mprime the power usage and heat will be lower, but when running anything that doesn't use all four 4 cores it will get the full turbo speed.
I had a battle on my hands for a while. It was pulling 350 watts into the PSU. CPU cores running above 90°C. It was blowing hot air like a furnace. This was with Prime95 only!

I've made several trips back into the BIOS since. I loaded the default settings and everything started running much cooler. The power pull now is 142 watts and core temps are running in the low to mid 60's. Again, with Prime95 only, running a P-1. I don't know what settings the BIOS originally had. The first time I powered up, I just sat here and looked at it for a while. There is a lot in there.

The CPU is hovering between 3.96 and 4.02 GHz. However, and this is something I will probably need to put in the Prime95 area. It's only using four, of the eight, threads available, and will not allow me to use more than one worker. Curious!
storm5510 is offline   Reply With Quote
Old 2017-06-21, 04:31   #771
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

2×5×293 Posts
Default

Quote:
Originally Posted by storm5510 View Post
The CPU is hovering between 3.96 and 4.02 GHz. However, and this is something I will probably need to put in the Prime95 area. It's only using four, of the eight, threads available, and will not allow me to use more than one worker. Curious!
That's working as expected. The CPU can't hit the top turbo speed when using all four cores. Also, Prime95 is so efficiently coded that using hyperthreads (which is basically two threads taking turns on the core) doesn't speed things up.

With regards to only one worker, I believe it will force that if you're working on very large exponents.
Mark Rose is offline   Reply With Quote
Old 2017-11-12, 13:41   #772
bayanne
 
bayanne's Avatar
 
"Tony Gott"
Aug 2002
Yell, Shetland, UK

14C16 Posts
Default

Intel(R) Core(TM) i7-4771 CPU @ 3.50GHz
CPU speed: 3491.94 MHz, 4 hyperthreaded cores
CPU features: Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 32 KB
L2 cache size: 256 KB, L3 cache size: 8 MB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
TLBS: 64
Attached Files
File Type: txt results.txt (13.4 KB, 246 views)
bayanne is offline   Reply With Quote
Old 2017-11-27, 03:34   #773
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

2·3·1,693 Posts
Default

Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz
CPU speed: 4034.80 MHz, 4 hyperthreaded cores
CPU features: Prefetchw, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 32 KB
L2 cache size: 256 KB, L3 cache size: 8 MB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
TLBS: 64
[Actual CPU speed 4300MHz]
Attached Files
File Type: txt bench results.txt (23.4 KB, 319 views)
kladner is offline   Reply With Quote
Old 2017-12-31, 23:03   #774
obiwantoby
 
Dec 2017

1 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
Almost useful to me, except you only posted timing for 2 cores, not a the single-thread test I need for benchmarks.
Is there a standardization for prime.txt ? I'll send my machine through it.
obiwantoby is offline   Reply With Quote
Old 2018-01-07, 20:15   #775
charliedill
 
Jan 2018

2 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
Almost useful to me, except you only posted timing for 2 cores, not a the single-thread test I need for benchmarks.
Hello.

As a brand shiny new noob, should I run the benchmarks non-HT and single core for best and most useful results?
charliedill is offline   Reply With Quote
Old 2018-01-07, 22:04   #776
VictordeHolland
 
VictordeHolland's Avatar
 
"Victor de Hollander"
Aug 2011
the Netherlands

23×3×72 Posts
Default

Quote:
Originally Posted by charliedill View Post
Hello.

As a brand shiny new noob, should I run the benchmarks non-HT and single core for best and most useful results?
In Prime95 <options><benchmark> and let it run for a couple of minutes. Copy the output from results.txt to the link provided above.
VictordeHolland is offline   Reply With Quote
Old 2018-01-17, 00:42   #777
charliedill
 
Jan 2018

2 Posts
Default NUC benchmark

Intel(R) Core(TM) i3-4010U CPU @ 1.70GHz
CPU speed: 1696.09 MHz, 2 hyperthreaded cores
CPU features: Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 32 KB
L2 cache size: 256 KB, L3 cache size: 3 MB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
TLBS: 64
Attached Files
File Type: txt results.txt (20.2 KB, 197 views)
charliedill is offline   Reply With Quote
Old 2018-01-28, 11:27   #778
wfgarnett3
 
wfgarnett3's Avatar
 
"William Garnett III"
Oct 2002
Bensalem, PA

10101102 Posts
Default

Prime95 Version 29.4 build 5

Quote:
Intel(R) Core(TM) i3-4150 CPU @ 3.50GHz
CPU speed: 3491.96 MHz, 2 hyperthreaded cores
CPU features: Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 32 KB
L2 cache size: 256 KB, L3 cache size: 3 MB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
TLBS: 64
Machine topology as determined by hwloc library:
Machine#0 (total=4973676KB, Backend=Windows, hwlocVersion=1.11.6, ProcessName=prime95.exe)
NUMANode#0 (local=4973676KB, total=4973676KB)
Package#0 (CPUVendor=GenuineIntel, CPUFamilyNumber=6, CPUModelNumber=60, CPUModel="Intel(R) Core(TM) i3-4150 CPU @ 3.50GHz", CPUStepping=3)
L3 (size=3072KB, linesize=64, ways=12, Inclusive=1)
L2 (size=256KB, linesize=64, ways=8, Inclusive=0)
L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
Core (cpuset: 0x00000003)
PU#0 (cpuset: 0x00000001)
PU#1 (cpuset: 0x00000002)
L2 (size=256KB, linesize=64, ways=8, Inclusive=0)
L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
Core (cpuset: 0x0000000c)
PU#2 (cpuset: 0x00000004)
PU#3 (cpuset: 0x00000008)
Prime95 64-bit version 29.4, RdtscTiming=1
Best time for 1024K FFT length: 4.630 ms., avg: 5.160 ms.
Best time for 1120K FFT length: 5.052 ms., avg: 5.141 ms.
Best time for 1152K FFT length: 5.328 ms., avg: 6.172 ms.
Best time for 1200K FFT length: 5.723 ms., avg: 5.833 ms.
Best time for 1280K FFT length: 5.893 ms., avg: 6.026 ms.
Best time for 1344K FFT length: 6.330 ms., avg: 6.405 ms.
Best time for 1440K FFT length: 6.865 ms., avg: 6.954 ms.
Best time for 1536K FFT length: 7.072 ms., avg: 7.236 ms.
Best time for 1600K FFT length: 7.478 ms., avg: 8.150 ms.
Best time for 1680K FFT length: 8.138 ms., avg: 8.427 ms.
Best time for 1728K FFT length: 8.485 ms., avg: 8.654 ms.
Best time for 1792K FFT length: 8.477 ms., avg: 8.611 ms.
Best time for 1920K FFT length: 8.973 ms., avg: 9.156 ms.
Best time for 2016K FFT length: 9.718 ms., avg: 9.855 ms.
Best time for 2048K FFT length: 9.719 ms., avg: 9.858 ms.
Best time for 2304K FFT length: 10.768 ms., avg: 11.027 ms.
Best time for 2400K FFT length: 11.717 ms., avg: 11.840 ms.
Best time for 2560K FFT length: 12.101 ms., avg: 12.200 ms.
Best time for 2688K FFT length: 12.713 ms., avg: 13.114 ms.
Best time for 2880K FFT length: 14.093 ms., avg: 14.350 ms.
Best time for 3072K FFT length: 14.865 ms., avg: 15.132 ms.
Best time for 3200K FFT length: 16.011 ms., avg: 16.101 ms.
Best time for 3360K FFT length: 17.166 ms., avg: 17.375 ms.
Best time for 3456K FFT length: 17.543 ms., avg: 17.767 ms.
Best time for 3584K FFT length: 17.755 ms., avg: 18.060 ms.
Best time for 3840K FFT length: 18.882 ms., avg: 19.065 ms.
Best time for 4096K FFT length: 20.490 ms., avg: 20.721 ms.
Best time for 4480K FFT length: 22.489 ms., avg: 22.622 ms.
Best time for 4608K FFT length: 22.685 ms., avg: 22.799 ms.
Best time for 4800K FFT length: 24.021 ms., avg: 24.161 ms.
Best time for 5120K FFT length: 25.999 ms., avg: 26.188 ms.
Best time for 5376K FFT length: 26.821 ms., avg: 26.990 ms.
Best time for 5760K FFT length: 30.495 ms., avg: 30.619 ms.
Best time for 6144K FFT length: 31.718 ms., avg: 31.957 ms.
Best time for 6400K FFT length: 32.829 ms., avg: 35.509 ms.
Best time for 6720K FFT length: 37.190 ms., avg: 37.478 ms.
Best time for 6912K FFT length: 36.606 ms., avg: 38.893 ms.
Best time for 7168K FFT length: 40.236 ms., avg: 42.278 ms.
Best time for 7680K FFT length: 39.521 ms., avg: 42.656 ms.
Best time for 8064K FFT length: 44.332 ms., avg: 44.742 ms.
Best time for 8192K FFT length: 50.249 ms., avg: 50.680 ms.
Timing FFTs using 2 threads on 1 core.
Best time for 1024K FFT length: 4.671 ms., avg: 4.758 ms.
Best time for 1120K FFT length: 5.097 ms., avg: 5.166 ms.
Best time for 1152K FFT length: 5.251 ms., avg: 5.353 ms.
Best time for 1200K FFT length: 5.744 ms., avg: 5.883 ms.
Best time for 1280K FFT length: 5.888 ms., avg: 5.984 ms.
Best time for 1344K FFT length: 6.200 ms., avg: 6.584 ms.
Best time for 1440K FFT length: 6.827 ms., avg: 7.493 ms.
Best time for 1536K FFT length: 7.103 ms., avg: 7.171 ms.
Best time for 1600K FFT length: 7.417 ms., avg: 9.013 ms.
Best time for 1680K FFT length: 8.186 ms., avg: 8.903 ms.
Best time for 1728K FFT length: 8.350 ms., avg: 8.564 ms.
Best time for 1792K FFT length: 8.446 ms., avg: 9.772 ms.
Best time for 1920K FFT length: 9.204 ms., avg: 9.290 ms.
Best time for 2016K FFT length: 9.719 ms., avg: 9.907 ms.
Best time for 2048K FFT length: 9.797 ms., avg: 10.044 ms.
Best time for 2304K FFT length: 10.964 ms., avg: 11.210 ms.
Best time for 2400K FFT length: 11.552 ms., avg: 11.890 ms.
Best time for 2560K FFT length: 13.521 ms., avg: 14.232 ms.
Best time for 2688K FFT length: 13.153 ms., avg: 13.195 ms.
Best time for 2880K FFT length: 13.793 ms., avg: 14.136 ms.
Best time for 3072K FFT length: 14.923 ms., avg: 15.075 ms.
Best time for 3200K FFT length: 16.292 ms., avg: 17.322 ms.
Best time for 3360K FFT length: 17.391 ms., avg: 17.594 ms.
Best time for 3456K FFT length: 17.062 ms., avg: 17.491 ms.
Best time for 3584K FFT length: 18.191 ms., avg: 18.442 ms.
Best time for 3840K FFT length: 19.293 ms., avg: 19.608 ms.
Best time for 4096K FFT length: 21.194 ms., avg: 22.201 ms.
Best time for 4480K FFT length: 22.956 ms., avg: 23.277 ms.
Best time for 4608K FFT length: 24.249 ms., avg: 24.700 ms.
Best time for 4800K FFT length: 25.237 ms., avg: 25.791 ms.
Best time for 5120K FFT length: 30.279 ms., avg: 31.833 ms.
Best time for 5376K FFT length: 29.188 ms., avg: 29.600 ms.
Best time for 5760K FFT length: 35.958 ms., avg: 36.456 ms.
Best time for 6144K FFT length: 37.969 ms., avg: 38.287 ms.
Best time for 6400K FFT length: 34.875 ms., avg: 35.177 ms.
Best time for 6720K FFT length: 51.903 ms., avg: 52.427 ms.
Best time for 6912K FFT length: 43.171 ms., avg: 43.968 ms.
Best time for 7168K FFT length: 54.672 ms., avg: 55.560 ms.
Best time for 7680K FFT length: 46.683 ms., avg: 52.059 ms.
Best time for 8064K FFT length: 62.162 ms., avg: 64.955 ms.
Best time for 8192K FFT length: 64.755 ms., avg: 65.370 ms.
Timing FFTs using 2 threads on 2 cores.
Best time for 1024K FFT length: 2.581 ms., avg: 2.645 ms.
Best time for 1120K FFT length: 2.861 ms., avg: 2.952 ms.
Best time for 1152K FFT length: 2.923 ms., avg: 3.025 ms.
Best time for 1200K FFT length: 3.150 ms., avg: 3.421 ms.
Best time for 1280K FFT length: 3.276 ms., avg: 3.396 ms.
Best time for 1344K FFT length: 3.496 ms., avg: 3.618 ms.
Best time for 1440K FFT length: 3.830 ms., avg: 3.919 ms.
Best time for 1536K FFT length: 3.929 ms., avg: 4.063 ms.
Best time for 1600K FFT length: 4.217 ms., avg: 4.317 ms.
Best time for 1680K FFT length: 4.540 ms., avg: 4.778 ms.
Best time for 1728K FFT length: 4.769 ms., avg: 4.882 ms.
Best time for 1792K FFT length: 4.787 ms., avg: 4.861 ms.
Best time for 1920K FFT length: 6.049 ms., avg: 6.362 ms.
Best time for 2016K FFT length: 5.498 ms., avg: 6.821 ms.
Best time for 2048K FFT length: 5.516 ms., avg: 6.538 ms.
Best time for 2304K FFT length: 6.241 ms., avg: 7.533 ms.
Best time for 2400K FFT length: 6.424 ms., avg: 6.537 ms.
Best time for 2560K FFT length: 6.840 ms., avg: 6.922 ms.
Best time for 2688K FFT length: 7.463 ms., avg: 7.912 ms.
Best time for 2880K FFT length: 7.740 ms., avg: 7.819 ms.
Best time for 3072K FFT length: 8.316 ms., avg: 8.449 ms.
Best time for 3200K FFT length: 8.818 ms., avg: 8.983 ms.
Best time for 3360K FFT length: 9.679 ms., avg: 9.764 ms.
Best time for 3456K FFT length: 9.939 ms., avg: 10.233 ms.
Best time for 3584K FFT length: 10.106 ms., avg: 10.277 ms.
Best time for 3840K FFT length: 10.977 ms., avg: 11.092 ms.
Best time for 4096K FFT length: 11.635 ms., avg: 11.709 ms.
Best time for 4480K FFT length: 12.596 ms., avg: 12.747 ms.
Best time for 4608K FFT length: 12.894 ms., avg: 12.981 ms.
Best time for 4800K FFT length: 13.477 ms., avg: 13.567 ms.
Best time for 5120K FFT length: 14.498 ms., avg: 16.366 ms.
Best time for 5376K FFT length: 15.325 ms., avg: 15.578 ms.
Best time for 5760K FFT length: 16.820 ms., avg: 16.911 ms.
Best time for 6144K FFT length: 17.747 ms., avg: 18.339 ms.
Best time for 6400K FFT length: 18.410 ms., avg: 18.679 ms.
Best time for 6720K FFT length: 20.368 ms., avg: 20.793 ms.
Best time for 6912K FFT length: 20.689 ms., avg: 20.923 ms.
Best time for 7168K FFT length: 22.040 ms., avg: 22.209 ms.
Best time for 7680K FFT length: 22.246 ms., avg: 22.600 ms.
Best time for 8064K FFT length: 24.693 ms., avg: 24.931 ms.
Best time for 8192K FFT length: 27.080 ms., avg: 27.238 ms.
Timing FFTs using 4 threads on 2 cores.
Best time for 1024K FFT length: 2.669 ms., avg: 2.733 ms.
Best time for 1120K FFT length: 2.924 ms., avg: 2.964 ms.
Best time for 1152K FFT length: 2.976 ms., avg: 3.042 ms.
Best time for 1200K FFT length: 3.240 ms., avg: 3.293 ms.
Best time for 1280K FFT length: 3.386 ms., avg: 3.481 ms.
Best time for 1344K FFT length: 3.605 ms., avg: 3.684 ms.
Best time for 1440K FFT length: 4.498 ms., avg: 4.752 ms.
Best time for 1536K FFT length: 4.086 ms., avg: 4.173 ms.
Best time for 1600K FFT length: 4.206 ms., avg: 4.338 ms.
Best time for 1680K FFT length: 4.699 ms., avg: 4.839 ms.
Best time for 1728K FFT length: 4.843 ms., avg: 4.953 ms.
Best time for 1792K FFT length: 4.899 ms., avg: 4.968 ms.
Best time for 1920K FFT length: 5.282 ms., avg: 5.388 ms.
Best time for 2016K FFT length: 5.606 ms., avg: 5.863 ms.
Best time for 2048K FFT length: 5.660 ms., avg: 6.341 ms.
Best time for 2304K FFT length: 6.294 ms., avg: 6.418 ms.
Best time for 2400K FFT length: 6.656 ms., avg: 7.033 ms.
Best time for 2560K FFT length: 7.087 ms., avg: 7.654 ms.
Best time for 2688K FFT length: 7.613 ms., avg: 7.686 ms.
Best time for 2880K FFT length: 7.980 ms., avg: 8.165 ms.
Best time for 3072K FFT length: 8.586 ms., avg: 8.704 ms.
Best time for 3200K FFT length: 9.286 ms., avg: 9.371 ms.
Best time for 3360K FFT length: 10.396 ms., avg: 10.515 ms.
Best time for 3456K FFT length: 10.007 ms., avg: 10.302 ms.
Best time for 3584K FFT length: 10.214 ms., avg: 11.566 ms.
Best time for 3840K FFT length: 10.983 ms., avg: 11.225 ms.
Best time for 4096K FFT length: 12.676 ms., avg: 13.817 ms.
Best time for 4480K FFT length: 13.352 ms., avg: 13.562 ms.
Best time for 4608K FFT length: 13.421 ms., avg: 14.911 ms.
Best time for 4800K FFT length: 14.476 ms., avg: 14.614 ms.
Best time for 5120K FFT length: 17.831 ms., avg: 18.038 ms.
Best time for 5376K FFT length: 16.555 ms., avg: 16.804 ms.
Best time for 5760K FFT length: 19.178 ms., avg: 19.731 ms.
Best time for 6144K FFT length: 20.792 ms., avg: 21.024 ms.
Best time for 6400K FFT length: 19.889 ms., avg: 20.122 ms.
Best time for 6720K FFT length: 26.683 ms., avg: 27.220 ms.
Best time for 6912K FFT length: 23.420 ms., avg: 24.070 ms.
Best time for 7168K FFT length: 28.628 ms., avg: 28.723 ms.
Best time for 7680K FFT length: 25.325 ms., avg: 27.802 ms.
Best time for 8064K FFT length: 32.853 ms., avg: 33.457 ms.
Best time for 8192K FFT length: 32.941 ms., avg: 34.227 ms.
wfgarnett3 is offline   Reply With Quote
Old 2018-10-30, 03:26   #779
petrw1
1976 Toyota Corona years forever!
 
petrw1's Avatar
 
"Wayne"
Nov 2006
Saskatchewan, Canada

469510 Posts
Default I7-7820X WITH DDR4 3600

Code:
Intel(R) Core(TM) i7-7820X CPU @ 3.60GHz
CPU speed: 3600.01 MHz, 8 hyperthreaded cores
CPU features: Prefetchw, SSE, SSE2, SSE4, AVX, AVX2, FMA, AVX512F
L1 cache size: 32 KB
L2 cache size: 256 KB, L3 cache size: 11 MB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
TLBS: 64
Machine topology as determined by hwloc library:
 Machine#0 (total=31049680KB, Backend=Windows, hwlocVersion=1.11.9, ProcessName=prime95.exe)
  NUMANode#0 (local=31049680KB, total=31049680KB)
    Package#0 (CPUVendor=GenuineIntel, CPUFamilyNumber=6, CPUModelNumber=85, CPUModel="Intel(R) Core(TM) i7-7820X CPU @ 3.60GHz", CPUStepping=4)
      L3 (size=11264KB, linesize=64, ways=11, Inclusive=0)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000003)
              PU#0 (cpuset: 0x00000001)
              PU#1 (cpuset: 0x00000002)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x0000000c)
              PU#2 (cpuset: 0x00000004)
              PU#3 (cpuset: 0x00000008)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000030)
              PU#4 (cpuset: 0x00000010)
              PU#5 (cpuset: 0x00000020)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x000000c0)
              PU#6 (cpuset: 0x00000040)
              PU#7 (cpuset: 0x00000080)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000300)
              PU#8 (cpuset: 0x00000100)
              PU#9 (cpuset: 0x00000200)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000c00)
              PU#10 (cpuset: 0x00000400)
              PU#11 (cpuset: 0x00000800)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00003000)
              PU#12 (cpuset: 0x00001000)
              PU#13 (cpuset: 0x00002000)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x0000c000)
              PU#14 (cpuset: 0x00004000)
              PU#15 (cpuset: 0x00008000)
Prime95 64-bit version 29.4, RdtscTiming=1
Best time for 2048K FFT length: 7.487 ms., avg: 7.596 ms.
Best time for 2304K FFT length: 8.205 ms., avg: 8.273 ms.
Best time for 2400K FFT length: 9.011 ms., avg: 9.126 ms.
Best time for 2560K FFT length: 9.634 ms., avg: 9.754 ms.
Best time for 2688K FFT length: 9.895 ms., avg: 10.002 ms.
Best time for 2880K FFT length: 10.918 ms., avg: 11.031 ms.
Best time for 3072K FFT length: 11.642 ms., avg: 11.785 ms.
Best time for 3200K FFT length: 11.866 ms., avg: 11.958 ms.
[Mon Oct 29 21:05:19 2018]
Best time for 3360K FFT length: 13.693 ms., avg: 13.791 ms.
Best time for 3456K FFT length: 13.325 ms., avg: 13.447 ms.
Best time for 3584K FFT length: 14.122 ms., avg: 14.208 ms.
Best time for 3840K FFT length: 14.411 ms., avg: 14.536 ms.
Best time for 4096K FFT length: 16.186 ms., avg: 16.229 ms.
Best time for 4480K FFT length: 17.272 ms., avg: 17.348 ms.
Best time for 4608K FFT length: 17.478 ms., avg: 17.547 ms.
Best time for 4800K FFT length: 18.971 ms., avg: 19.047 ms.
Best time for 5120K FFT length: 20.418 ms., avg: 20.522 ms.
Best time for 5376K FFT length: 20.466 ms., avg: 20.603 ms.
Best time for 5760K FFT length: 23.524 ms., avg: 23.645 ms.
Best time for 6144K FFT length: 24.153 ms., avg: 24.302 ms.
Best time for 6400K FFT length: 24.736 ms., avg: 24.881 ms.
Best time for 6720K FFT length: 28.553 ms., avg: 28.751 ms.
Best time for 6912K FFT length: 27.605 ms., avg: 27.737 ms.
Best time for 7168K FFT length: 29.507 ms., avg: 29.643 ms.
Best time for 7680K FFT length: 30.237 ms., avg: 30.350 ms.
Best time for 8064K FFT length: 33.558 ms., avg: 33.718 ms.
Best time for 8192K FFT length: 33.456 ms., avg: 33.561 ms.
Timing FFTs using 2 threads on 1 core.
Best time for 2048K FFT length: 6.593 ms., avg: 6.770 ms.
Best time for 2304K FFT length: 7.688 ms., avg: 7.909 ms.
Best time for 2400K FFT length: 8.208 ms., avg: 8.399 ms.
Best time for 2560K FFT length: 8.520 ms., avg: 8.727 ms.
Best time for 2688K FFT length: 9.155 ms., avg: 9.349 ms.
Best time for 2880K FFT length: 9.869 ms., avg: 10.082 ms.
Best time for 3072K FFT length: 10.319 ms., avg: 10.502 ms.
Best time for 3200K FFT length: 11.103 ms., avg: 11.482 ms.
Best time for 3360K FFT length: 12.664 ms., avg: 13.060 ms.
Best time for 3456K FFT length: 12.047 ms., avg: 12.439 ms.
Best time for 3584K FFT length: 12.263 ms., avg: 12.559 ms.
Best time for 3840K FFT length: 13.625 ms., avg: 13.955 ms.
Best time for 4096K FFT length: 14.252 ms., avg: 14.597 ms.
Best time for 4480K FFT length: 15.846 ms., avg: 16.023 ms.
Best time for 4608K FFT length: 16.406 ms., avg: 16.703 ms.
Best time for 4800K FFT length: 17.697 ms., avg: 17.931 ms.
Best time for 5120K FFT length: 18.146 ms., avg: 18.363 ms.
Best time for 5376K FFT length: 19.178 ms., avg: 19.587 ms.
Best time for 5760K FFT length: 23.242 ms., avg: 23.563 ms.
Best time for 6144K FFT length: 22.699 ms., avg: 22.787 ms.
Best time for 6400K FFT length: 23.764 ms., avg: 24.262 ms.
Best time for 6720K FFT length: 27.039 ms., avg: 27.709 ms.
Best time for 6912K FFT length: 26.864 ms., avg: 26.970 ms.
Best time for 7168K FFT length: 26.775 ms., avg: 27.100 ms.
Best time for 7680K FFT length: 28.262 ms., avg: 28.689 ms.
Best time for 8064K FFT length: 31.014 ms., avg: 31.358 ms.
Best time for 8192K FFT length: 31.270 ms., avg: 31.852 ms.
Timing FFTs using 2 threads on 2 cores.
Best time for 2048K FFT length: 4.012 ms., avg: 4.117 ms.
Best time for 2304K FFT length: 4.319 ms., avg: 4.392 ms.
Best time for 2400K FFT length: 4.703 ms., avg: 4.755 ms.
Best time for 2560K FFT length: 5.059 ms., avg: 5.125 ms.
Best time for 2688K FFT length: 5.197 ms., avg: 5.242 ms.
Best time for 2880K FFT length: 5.648 ms., avg: 5.693 ms.
Best time for 3072K FFT length: 6.135 ms., avg: 6.206 ms.
Best time for 3200K FFT length: 6.217 ms., avg: 6.285 ms.
Best time for 3360K FFT length: 7.119 ms., avg: 7.184 ms.
Best time for 3456K FFT length: 6.996 ms., avg: 7.078 ms.
Best time for 3584K FFT length: 7.349 ms., avg: 7.418 ms.
Best time for 3840K FFT length: 7.560 ms., avg: 7.622 ms.
Best time for 4096K FFT length: 8.395 ms., avg: 8.475 ms.
Best time for 4480K FFT length: 8.859 ms., avg: 8.938 ms.
Best time for 4608K FFT length: 9.113 ms., avg: 9.177 ms.
Best time for 4800K FFT length: 9.777 ms., avg: 9.856 ms.
Best time for 5120K FFT length: 10.532 ms., avg: 10.599 ms.
Best time for 5376K FFT length: 10.584 ms., avg: 10.652 ms.
Best time for 5760K FFT length: 12.046 ms., avg: 12.121 ms.
Best time for 6144K FFT length: 12.467 ms., avg: 12.536 ms.
Best time for 6400K FFT length: 12.762 ms., avg: 12.868 ms.
Best time for 6720K FFT length: 14.614 ms., avg: 14.683 ms.
Best time for 6912K FFT length: 14.280 ms., avg: 14.368 ms.
Best time for 7168K FFT length: 15.213 ms., avg: 15.306 ms.
Best time for 7680K FFT length: 15.545 ms., avg: 15.622 ms.
Best time for 8064K FFT length: 17.346 ms., avg: 17.426 ms.
Best time for 8192K FFT length: 17.234 ms., avg: 17.346 ms.
Timing FFTs using 4 threads on 4 cores.
Best time for 2048K FFT length: 2.117 ms., avg: 2.227 ms.
Best time for 2304K FFT length: 2.270 ms., avg: 2.383 ms.
Best time for 2400K FFT length: 2.482 ms., avg: 2.586 ms.
Best time for 2560K FFT length: 2.689 ms., avg: 2.793 ms.
Best time for 2688K FFT length: 2.701 ms., avg: 2.775 ms.
Best time for 2880K FFT length: 2.972 ms., avg: 3.010 ms.
Best time for 3072K FFT length: 3.213 ms., avg: 3.255 ms.
Best time for 3200K FFT length: 3.271 ms., avg: 3.367 ms.
Best time for 3360K FFT length: 3.845 ms., avg: 3.859 ms.
Best time for 3456K FFT length: 3.660 ms., avg: 3.753 ms.
Best time for 3584K FFT length: 3.841 ms., avg: 3.915 ms.
Best time for 3840K FFT length: 3.975 ms., avg: 4.008 ms.
Best time for 4096K FFT length: 4.453 ms., avg: 4.476 ms.
Best time for 4480K FFT length: 4.642 ms., avg: 4.662 ms.
Best time for 4608K FFT length: 4.855 ms., avg: 4.922 ms.
Best time for 4800K FFT length: 5.107 ms., avg: 5.168 ms.
Best time for 5120K FFT length: 5.464 ms., avg: 5.492 ms.
Best time for 5376K FFT length: 5.526 ms., avg: 5.547 ms.
Best time for 5760K FFT length: 6.286 ms., avg: 6.323 ms.
Best time for 6144K FFT length: 6.515 ms., avg: 6.550 ms.
Best time for 6400K FFT length: 6.683 ms., avg: 6.730 ms.
Best time for 6720K FFT length: 7.702 ms., avg: 7.726 ms.
Best time for 6912K FFT length: 7.420 ms., avg: 7.439 ms.
Best time for 7168K FFT length: 7.900 ms., avg: 7.925 ms.
Best time for 7680K FFT length: 8.087 ms., avg: 8.119 ms.
Best time for 8064K FFT length: 9.088 ms., avg: 9.120 ms.
Best time for 8192K FFT length: 9.067 ms., avg: 9.098 ms.
Timing FFTs using 8 threads on 8 cores.
Best time for 2048K FFT length: 1.149 ms., avg: 1.292 ms.
Best time for 2304K FFT length: 1.238 ms., avg: 1.394 ms.
Best time for 2400K FFT length: 1.350 ms., avg: 1.524 ms.
Best time for 2560K FFT length: 1.452 ms., avg: 1.623 ms.
Best time for 2688K FFT length: 1.489 ms., avg: 1.636 ms.
Best time for 2880K FFT length: 1.634 ms., avg: 1.814 ms.
Best time for 3072K FFT length: 1.763 ms., avg: 1.946 ms.
Best time for 3200K FFT length: 1.850 ms., avg: 2.022 ms.
Best time for 3360K FFT length: 2.051 ms., avg: 2.170 ms.
Best time for 3456K FFT length: 2.056 ms., avg: 2.230 ms.
Best time for 3584K FFT length: 2.125 ms., avg: 2.267 ms.
Best time for 3840K FFT length: 2.272 ms., avg: 2.424 ms.
Best time for 4096K FFT length: 2.473 ms., avg: 2.541 ms.
Best time for 4480K FFT length: 2.722 ms., avg: 2.828 ms.
Best time for 4608K FFT length: 2.778 ms., avg: 2.882 ms.
Best time for 4800K FFT length: 2.995 ms., avg: 3.128 ms.
Best time for 5120K FFT length: 3.127 ms., avg: 3.204 ms.
Best time for 5376K FFT length: 3.292 ms., avg: 3.336 ms.
Best time for 5760K FFT length: 3.654 ms., avg: 3.735 ms.
Best time for 6144K FFT length: 3.791 ms., avg: 3.869 ms.
Best time for 6400K FFT length: 4.033 ms., avg: 4.064 ms.
Best time for 6720K FFT length: 4.364 ms., avg: 4.391 ms.
Best time for 6912K FFT length: 4.462 ms., avg: 4.531 ms.
Best time for 7168K FFT length: 4.510 ms., avg: 4.611 ms.
Best time for 7680K FFT length: 4.857 ms., avg: 4.912 ms.
Best time for 8064K FFT length: 5.319 ms., avg: 5.362 ms.
Best time for 8192K FFT length: 5.253 ms., avg: 5.309 ms.
Timing FFTs using 16 threads on 8 cores.
Best time for 2048K FFT length: 1.034 ms., avg: 1.223 ms.
Best time for 2304K FFT length: 1.182 ms., avg: 1.373 ms.
Best time for 2400K FFT length: 1.285 ms., avg: 1.406 ms.
Best time for 2560K FFT length: 1.310 ms., avg: 1.504 ms.
Best time for 2688K FFT length: 1.427 ms., avg: 1.531 ms.
Best time for 2880K FFT length: 1.583 ms., avg: 1.745 ms.
Best time for 3072K FFT length: 1.625 ms., avg: 1.786 ms.
Best time for 3200K FFT length: 1.856 ms., avg: 1.959 ms.
Best time for 3360K FFT length: 1.933 ms., avg: 2.110 ms.
Best time for 3456K FFT length: 1.950 ms., avg: 2.027 ms.
Best time for 3584K FFT length: 1.971 ms., avg: 2.093 ms.
Best time for 3840K FFT length: 2.257 ms., avg: 2.318 ms.
Best time for 4096K FFT length: 2.329 ms., avg: 2.429 ms.
Best time for 4480K FFT length: 2.739 ms., avg: 2.818 ms.
Best time for 4608K FFT length: 2.761 ms., avg: 2.850 ms.
Best time for 4800K FFT length: 2.959 ms., avg: 2.995 ms.
Best time for 5120K FFT length: 3.054 ms., avg: 3.098 ms.
Best time for 5376K FFT length: 3.372 ms., avg: 3.460 ms.
Best time for 5760K FFT length: 3.940 ms., avg: 3.966 ms.
Best time for 6144K FFT length: 4.162 ms., avg: 4.213 ms.
Best time for 6400K FFT length: 4.169 ms., avg: 4.221 ms.
Best time for 6720K FFT length: 4.377 ms., avg: 4.408 ms.
Best time for 6912K FFT length: 4.798 ms., avg: 4.879 ms.
Best time for 7168K FFT length: 4.656 ms., avg: 4.705 ms.
Best time for 7680K FFT length: 4.985 ms., avg: 5.015 ms.
Best time for 8064K FFT length: 5.350 ms., avg: 5.374 ms.
Best time for 8192K FFT length: 5.615 ms., avg: 5.686 ms.
petrw1 is offline   Reply With Quote
Old 2018-12-07, 21:54   #780
simon389
 
Aug 2013

3×29 Posts
Default i7 9700k - 32GB dual rank 3600Mhz DDR4 - MSI MPG Z390I Gaming Edge AC Mini ITX mobs

Great numbers:

Code:
Intel(R) Core(TM) i7-9700K CPU @ 3.60GHz
CPU speed: 4618.26 MHz, 8 cores
CPU features: Prefetchw, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 32 KB
L2 cache size: 256 KB, L3 cache size: 12 MB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
TLBS: 64
Machine topology as determined by hwloc library:
 Machine#0 (total=31361316KB, Backend=Windows, hwlocVersion=1.11.9, ProcessName=prime95.exe)
  NUMANode#0 (local=31361316KB, total=31361316KB)
    Package#0 (CPUVendor=GenuineIntel, CPUFamilyNumber=6, CPUModelNumber=158, CPUModel="Intel(R) Core(TM) i7-9700K CPU @ 3.60GHz", CPUStepping=12)
      L3 (size=12288KB, linesize=64, ways=12, Inclusive=1)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000001)
              PU#0 (cpuset: 0x00000001)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000002)
              PU#1 (cpuset: 0x00000002)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000004)
              PU#2 (cpuset: 0x00000004)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000008)
              PU#3 (cpuset: 0x00000008)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000010)
              PU#4 (cpuset: 0x00000010)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000020)
              PU#5 (cpuset: 0x00000020)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000040)
              PU#6 (cpuset: 0x00000040)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000080)
              PU#7 (cpuset: 0x00000080)
Prime95 64-bit version 29.4, RdtscTiming=1
Timings for 2048K FFT length (8 cores, 1 worker):  1.31 ms.  Throughput: 764.95 iter/sec.
Timings for 2048K FFT length (8 cores, 8 workers): 14.82, 14.72, 14.52, 14.49, 14.71, 14.60, 14.51, 14.28 ms.  Throughput: 548.75 iter/sec.
Timings for 2304K FFT length (8 cores, 1 worker):  1.60 ms.  Throughput: 625.21 iter/sec.
Timings for 2304K FFT length (8 cores, 8 workers): 16.49, 16.10, 16.49, 16.10, 16.49, 16.46, 16.53, 16.42 ms.  Throughput: 488.30 iter/sec.
Timings for 2400K FFT length (8 cores, 1 worker):  1.70 ms.  Throughput: 587.97 iter/sec.
Timings for 2400K FFT length (8 cores, 8 workers): 17.40, 17.26, 17.26, 17.22, 17.38, 17.26, 17.26, 17.22 ms.  Throughput: 462.85 iter/sec.
Timings for 2560K FFT length (8 cores, 1 worker):  1.86 ms.  Throughput: 536.39 iter/sec.
Timings for 2560K FFT length (8 cores, 8 workers): 18.25, 18.16, 18.12, 18.10, 18.23, 18.10, 18.16, 17.98 ms.  Throughput: 441.09 iter/sec.
Timings for 2688K FFT length (8 cores, 1 worker):  2.02 ms.  Throughput: 494.11 iter/sec.
Timings for 2688K FFT length (8 cores, 8 workers): 19.10, 18.99, 18.99, 19.01, 19.07, 19.05, 19.11, 18.88 ms.  Throughput: 420.53 iter/sec.
[Wed Dec 05 10:04:08 2018]
Timings for 2880K FFT length (8 cores, 1 worker):  2.23 ms.  Throughput: 448.18 iter/sec.
Timings for 2880K FFT length (8 cores, 8 workers): 20.88, 20.76, 20.73, 20.64, 20.93, 20.67, 20.85, 20.64 ms.  Throughput: 385.35 iter/sec.
Timings for 3072K FFT length (8 cores, 1 worker):  2.39 ms.  Throughput: 418.09 iter/sec.
Timings for 3072K FFT length (8 cores, 8 workers): 22.02, 21.93, 21.79, 21.89, 21.81, 21.87, 21.87, 21.64 ms.  Throughput: 366.11 iter/sec.
Timings for 3200K FFT length (8 cores, 1 worker):  2.61 ms.  Throughput: 383.37 iter/sec.
Timings for 3200K FFT length (8 cores, 8 workers): 23.59, 23.51, 23.44, 23.40, 23.51, 23.47, 23.44, 23.40 ms.  Throughput: 340.86 iter/sec.
Timings for 3360K FFT length (8 cores, 1 worker):  2.70 ms.  Throughput: 370.59 iter/sec.
Timings for 3360K FFT length (8 cores, 8 workers): 24.24, 23.94, 24.10, 23.87, 24.09, 23.87, 23.98, 23.98 ms.  Throughput: 333.24 iter/sec.
Timings for 3456K FFT length (8 cores, 1 worker):  2.81 ms.  Throughput: 355.90 iter/sec.
Timings for 3456K FFT length (8 cores, 8 workers): 24.80, 25.09, 24.88, 24.77, 24.96, 24.94, 24.81, 24.70 ms.  Throughput: 321.69 iter/sec.
Timings for 3584K FFT length (8 cores, 1 worker):  2.93 ms.  Throughput: 341.43 iter/sec.
Timings for 3584K FFT length (8 cores, 8 workers): 25.79, 25.65, 25.61, 25.61, 25.62, 25.66, 25.65, 25.41 ms.  Throughput: 312.22 iter/sec.
Timings for 3840K FFT length (8 cores, 1 worker):  3.19 ms.  Throughput: 313.04 iter/sec.
Timings for 3840K FFT length (8 cores, 8 workers): 28.05, 28.29, 27.96, 28.26, 28.04, 28.19, 27.91, 27.67 ms.  Throughput: 285.26 iter/sec.
Timings for 4096K FFT length (8 cores, 1 worker):  3.43 ms.  Throughput: 291.60 iter/sec.
Timings for 4096K FFT length (8 cores, 8 workers): 29.63, 29.70, 29.52, 29.50, 29.57, 29.46, 29.38, 29.02 ms.  Throughput: 271.45 iter/sec.
Timings for 4480K FFT length (8 cores, 1 worker):  3.92 ms.  Throughput: 255.31 iter/sec.
Timings for 4480K FFT length (8 cores, 8 workers): 33.30, 33.44, 33.48, 33.48, 33.03, 33.12, 33.02, 32.89 ms.  Throughput: 240.83 iter/sec.
Timings for 4608K FFT length (8 cores, 1 worker):  3.93 ms.  Throughput: 254.74 iter/sec.
[Wed Dec 05 10:09:11 2018]
Timings for 4608K FFT length (8 cores, 8 workers): 33.55, 34.11, 33.76, 33.55, 33.49, 33.39, 34.16, 32.82 ms.  Throughput: 238.12 iter/sec.
Timings for 4800K FFT length (8 cores, 1 worker):  4.24 ms.  Throughput: 235.64 iter/sec.
Timings for 4800K FFT length (8 cores, 8 workers): 36.05, 35.27, 35.09, 34.97, 34.99, 36.64, 35.96, 36.60 ms.  Throughput: 224.19 iter/sec.
Timings for 5120K FFT length (8 cores, 1 worker):  4.40 ms.  Throughput: 227.11 iter/sec.
Timings for 5120K FFT length (8 cores, 8 workers): 37.30, 37.07, 36.99, 37.37, 37.14, 37.03, 37.25, 37.36 ms.  Throughput: 215.12 iter/sec.
Timings for 5376K FFT length (8 cores, 1 worker):  4.76 ms.  Throughput: 210.21 iter/sec.
Timings for 5376K FFT length (8 cores, 8 workers): 39.38, 39.36, 40.05, 39.34, 39.48, 40.46, 39.40, 39.37 ms.  Throughput: 202.01 iter/sec.
Timings for 5760K FFT length (8 cores, 1 worker):  5.13 ms.  Throughput: 194.98 iter/sec.
Timings for 5760K FFT length (8 cores, 8 workers): 42.89, 42.68, 42.10, 42.46, 42.64, 42.15, 42.42, 42.86 ms.  Throughput: 188.14 iter/sec.
Timings for 6144K FFT length (8 cores, 1 worker):  5.45 ms.  Throughput: 183.45 iter/sec.
Timings for 6144K FFT length (8 cores, 8 workers): 45.87, 45.47, 45.26, 44.99, 45.58, 45.03, 45.07, 45.31 ms.  Throughput: 176.52 iter/sec.
Timings for 6400K FFT length (8 cores, 1 worker):  5.87 ms.  Throughput: 170.44 iter/sec.
Timings for 6400K FFT length (8 cores, 8 workers): 48.17, 49.77, 47.79, 47.47, 49.77, 48.04, 48.74, 48.39 ms.  Throughput: 164.94 iter/sec.
Timings for 6720K FFT length (8 cores, 1 worker):  5.89 ms.  Throughput: 169.71 iter/sec.
Timings for 6720K FFT length (8 cores, 8 workers): 49.38, 49.70, 48.96, 48.72, 49.43, 48.83, 48.99, 49.28 ms.  Throughput: 162.73 iter/sec.
Timings for 6912K FFT length (8 cores, 1 worker):  6.36 ms.  Throughput: 157.17 iter/sec.
Timings for 6912K FFT length (8 cores, 8 workers): 51.82, 51.95, 51.54, 51.18, 51.56, 51.36, 51.48, 51.45 ms.  Throughput: 155.21 iter/sec.
Timings for 7168K FFT length (8 cores, 1 worker):  6.29 ms.  Throughput: 158.98 iter/sec.
Timings for 7168K FFT length (8 cores, 8 workers): 52.49, 52.82, 52.38, 52.47, 52.64, 52.17, 52.21, 52.62 ms.  Throughput: 152.45 iter/sec.
[Wed Dec 05 10:14:24 2018]
Timings for 7680K FFT length (8 cores, 1 worker):  6.89 ms.  Throughput: 145.19 iter/sec.
Timings for 7680K FFT length (8 cores, 8 workers): 56.80, 57.22, 56.94, 56.40, 56.49, 56.42, 56.53, 56.73 ms.  Throughput: 141.12 iter/sec.
Timings for 8064K FFT length (8 cores, 1 worker):  7.30 ms.  Throughput: 137.06 iter/sec.
Timings for 8064K FFT length (8 cores, 8 workers): 59.67, 60.04, 59.01, 58.81, 59.64, 59.21, 59.41, 59.50 ms.  Throughput: 134.66 iter/sec.
Timings for 8192K FFT length (8 cores, 1 worker):  7.34 ms.  Throughput: 136.20 iter/sec.
Timings for 8192K FFT length (8 cores, 8 workers): 61.18, 60.44, 60.17, 60.01, 61.28, 60.67, 60.56, 60.50 ms.  Throughput: 132.01 iter/sec.
simon389 is offline   Reply With Quote
Old 2018-12-08, 17:00   #781
VictordeHolland
 
VictordeHolland's Avatar
 
"Victor de Hollander"
Aug 2011
the Netherlands

23×3×72 Posts
Default

CPU: Intel Core i5-6500 @3.3Ghz
Memory: 2x8GB DDR4-2133
Mobo: MSI H110M-ECO
Code:
Intel(R) Core(TM) i5-6500 CPU @ 3.20GHz
CPU speed: 3279.11 MHz, 4 cores
CPU features: Prefetchw, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 32 KB
L2 cache size: 256 KB, L3 cache size: 6 MB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
TLBS: 64
Machine topology as determined by hwloc library:
 Machine#0 (total=14968420KB, Backend=Windows, hwlocVersion=1.11.9, ProcessName=prime95.exe)
  NUMANode#0 (local=14968420KB, total=14968420KB)
    Package#0 (CPUVendor=GenuineIntel, CPUFamilyNumber=6, CPUModelNumber=94, CPUModel="Intel(R) Core(TM) i5-6500 CPU @ 3.20GHz", CPUStepping=3)
      L3 (size=6144KB, linesize=64, ways=12, Inclusive=1)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000001)
              PU#0 (cpuset: 0x00000001)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000002)
              PU#1 (cpuset: 0x00000002)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000004)
              PU#2 (cpuset: 0x00000004)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000008)
              PU#3 (cpuset: 0x00000008)
Prime95 64-bit version 29.4, RdtscTiming=1

Timings for 2048K FFT length (4 cores, 1 worker):  2.87 ms.  Throughput: 348.96 iter/sec.
Timings for 2048K FFT length (4 cores, 4 workers): 12.41, 12.33, 12.38, 12.46 ms.  Throughput: 322.77 iter/sec.
Timings for 2304K FFT length (4 cores, 1 worker):  3.24 ms.  Throughput: 308.64 iter/sec.
Timings for 2304K FFT length (4 cores, 4 workers): 13.96, 13.88, 13.76, 13.83 ms.  Throughput: 288.67 iter/sec.
Timings for 2400K FFT length (4 cores, 1 worker):  3.45 ms.  Throughput: 289.71 iter/sec.
Timings for 2400K FFT length (4 cores, 4 workers): 14.77, 14.44, 14.49, 14.63 ms.  Throughput: 274.31 iter/sec.
Timings for 2560K FFT length (4 cores, 1 worker):  3.65 ms.  Throughput: 274.08 iter/sec.
Timings for 2560K FFT length (4 cores, 4 workers): 15.62, 15.71, 15.58, 15.69 ms.  Throughput: 255.62 iter/sec.
Timings for 2688K FFT length (4 cores, 1 worker):  3.88 ms.  Throughput: 257.87 iter/sec.
Timings for 2688K FFT length (4 cores, 4 workers): 16.46, 16.27, 16.41, 16.29 ms.  Throughput: 244.60 iter/sec.
Timings for 2880K FFT length (4 cores, 1 worker):  4.21 ms.  Throughput: 237.54 iter/sec.
Timings for 2880K FFT length (4 cores, 4 workers): 17.59, 17.73, 17.52, 17.49 ms.  Throughput: 227.48 iter/sec.
Timings for 3072K FFT length (4 cores, 1 worker):  4.37 ms.  Throughput: 228.73 iter/sec.
Timings for 3072K FFT length (4 cores, 4 workers): 18.96, 18.61, 18.94, 19.00 ms.  Throughput: 211.88 iter/sec.
Timings for 3200K FFT length (4 cores, 1 worker):  4.74 ms.  Throughput: 210.98 iter/sec.
Timings for 3200K FFT length (4 cores, 4 workers): 19.87, 19.79, 19.83, 19.75 ms.  Throughput: 201.91 iter/sec.
Timings for 3360K FFT length (4 cores, 1 worker):  5.12 ms.  Throughput: 195.13 iter/sec.
Timings for 3360K FFT length (4 cores, 4 workers): 20.87, 20.87, 20.83, 20.84 ms.  Throughput: 191.80 iter/sec.
Timings for 3456K FFT length (4 cores, 1 worker):  5.09 ms.  Throughput: 196.44 iter/sec.
Timings for 3456K FFT length (4 cores, 4 workers): 21.56, 21.37, 21.30, 21.46 ms.  Throughput: 186.72 iter/sec.
Timings for 3584K FFT length (4 cores, 1 worker):  5.28 ms.  Throughput: 189.55 iter/sec.
Timings for 3584K FFT length (4 cores, 4 workers): 22.57, 22.18, 22.41, 22.11 ms.  Throughput: 179.23 iter/sec.
Timings for 3840K FFT length (4 cores, 1 worker):  5.71 ms.  Throughput: 175.01 iter/sec.
Timings for 3840K FFT length (4 cores, 4 workers): 24.02, 23.87, 23.76, 24.08 ms.  Throughput: 167.15 iter/sec.
Timings for 4096K FFT length (4 cores, 1 worker):  6.06 ms.  Throughput: 164.89 iter/sec.
Timings for 4096K FFT length (4 cores, 4 workers): 25.63, 25.35, 25.72, 25.35 ms.  Throughput: 156.79 iter/sec.
Timings for 4480K FFT length (4 cores, 1 worker):  6.82 ms.  Throughput: 146.55 iter/sec.
Timings for 4480K FFT length (4 cores, 4 workers): 28.06, 28.09, 28.02, 28.39 ms.  Throughput: 142.14 iter/sec.
Timings for 4608K FFT length (4 cores, 1 worker):  6.89 ms.  Throughput: 145.13 iter/sec.
Timings for 4608K FFT length (4 cores, 4 workers): 28.61, 28.60, 28.88, 28.89 ms.  Throughput: 139.16 iter/sec.
Timings for 4800K FFT length (4 cores, 1 worker):  7.37 ms.  Throughput: 135.67 iter/sec.
Timings for 4800K FFT length (4 cores, 4 workers): 31.28, 30.41, 30.00, 29.62 ms.  Throughput: 131.95 iter/sec.
Timings for 5120K FFT length (4 cores, 1 worker):  7.76 ms.  Throughput: 128.86 iter/sec.
Timings for 5120K FFT length (4 cores, 4 workers): 32.27, 32.17, 31.72, 32.45 ms.  Throughput: 124.42 iter/sec.
Timings for 5376K FFT length (4 cores, 1 worker):  8.48 ms.  Throughput: 117.95 iter/sec.
Timings for 5376K FFT length (4 cores, 4 workers): 33.89, 33.84, 33.47, 33.89 ms.  Throughput: 118.44 iter/sec.
Timings for 5760K FFT length (4 cores, 1 worker):  9.07 ms.  Throughput: 110.29 iter/sec.
Timings for 5760K FFT length (4 cores, 4 workers): 37.29, 37.40, 35.05, 37.95 ms.  Throughput: 108.44 iter/sec.
Timings for 6144K FFT length (4 cores, 1 worker):  9.51 ms.  Throughput: 105.18 iter/sec.
Timings for 6144K FFT length (4 cores, 4 workers): 39.48, 40.29, 37.66, 39.16 ms.  Throughput: 102.24 iter/sec.
Timings for 6400K FFT length (4 cores, 1 worker): 10.04 ms.  Throughput: 99.64 iter/sec.
Timings for 6400K FFT length (4 cores, 4 workers): 40.67, 40.67, 41.47, 40.65 ms.  Throughput: 97.90 iter/sec.
Timings for 6720K FFT length (4 cores, 1 worker): 10.66 ms.  Throughput: 93.83 iter/sec.
Timings for 6720K FFT length (4 cores, 4 workers): 43.60, 43.34, 42.20, 42.71 ms.  Throughput: 93.12 iter/sec.
Timings for 6912K FFT length (4 cores, 1 worker): 11.01 ms.  Throughput: 90.84 iter/sec.
Timings for 6912K FFT length (4 cores, 4 workers): 45.53, 45.45, 43.88, 43.03 ms.  Throughput: 89.99 iter/sec.
Timings for 7168K FFT length (4 cores, 1 worker): 11.20 ms.  Throughput: 89.28 iter/sec.
Timings for 7168K FFT length (4 cores, 4 workers): 45.86, 45.78, 45.86, 45.78 ms.  Throughput: 87.30 iter/sec.
Timings for 7680K FFT length (4 cores, 1 worker): 11.98 ms.  Throughput: 83.44 iter/sec.
Timings for 7680K FFT length (4 cores, 4 workers): 50.12, 50.72, 49.10, 47.77 ms.  Throughput: 80.97 iter/sec.
Timings for 8064K FFT length (4 cores, 1 worker): 12.98 ms.  Throughput: 77.01 iter/sec.
Timings for 8064K FFT length (4 cores, 4 workers): 51.71, 53.23, 51.69, 52.00 ms.  Throughput: 76.70 iter/sec.
Timings for 8192K FFT length (4 cores, 1 worker): 13.11 ms.  Throughput: 76.26 iter/sec.
Timings for 8192K FFT length (4 cores, 4 workers): 53.35, 53.09, 53.06, 53.53 ms.  Throughput: 75.11 iter/sec.
Code:
Prime95 64-bit version 29.4, RdtscTiming=1
Timings for 2560K FFT length (1 core, 1 worker): 10.90 ms.  Throughput: 91.76 iter/sec.
Timings for 2560K FFT length (2 cores, 1 worker):  5.70 ms.  Throughput: 175.50 iter/sec.
Timings for 2560K FFT length (3 cores, 1 worker):  4.07 ms.  Throughput: 245.69 iter/sec.
 Timings for 2560K FFT length (4 cores, 1 worker):  3.63 ms.  Throughput: 275.22 iter/sec.
1st core: 91.76
2nd core: +83.74
3rd core: +70.19
4th core: +29.53
I'm probably going to disable turbo-boost as it looks like it is pretty much memory bottle-necked (going from 3 to 4 cores only adds 30 iters/sec.
VictordeHolland is offline   Reply With Quote
Old 2018-12-09, 04:34   #782
simon389
 
Aug 2013

5716 Posts
Default i9 9900k - 32GB dual rank 3600Mhz DDR4 - MSI MPG Z390I Gaming Edge AC Mini ITX mobo

Also flies on this mobo with this RAM. Getting 3.75ms for a 2^80,000,000-1 LL test, which is amazing considering it consumes 1/2 the power of my 7820X which iterates a similar test at 2.8ms per iteration.

Code:
Intel(R) Core(TM) i9-9900K CPU @ 3.60GHz
CPU speed: 4726.39 MHz, 8 hyperthreaded cores
CPU features: Prefetchw, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 32 KB
L2 cache size: 256 KB, L3 cache size: 16 MB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
TLBS: 64
Machine topology as determined by hwloc library:
 Machine#0 (total=31511272KB, Backend=Windows, hwlocVersion=1.11.9, ProcessName=prime95.exe)
  NUMANode#0 (local=31511272KB, total=31511272KB)
    Package#0 (CPUVendor=GenuineIntel, CPUFamilyNumber=6, CPUModelNumber=158, CPUModel="Intel(R) Core(TM) i9-9900K CPU @ 3.60GHz", CPUStepping=12)
      L3 (size=16384KB, linesize=64, ways=16, Inclusive=1)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000003)
              PU#0 (cpuset: 0x00000001)
              PU#1 (cpuset: 0x00000002)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x0000000c)
              PU#2 (cpuset: 0x00000004)
              PU#3 (cpuset: 0x00000008)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000030)
              PU#4 (cpuset: 0x00000010)
              PU#5 (cpuset: 0x00000020)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x000000c0)
              PU#6 (cpuset: 0x00000040)
              PU#7 (cpuset: 0x00000080)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000300)
              PU#8 (cpuset: 0x00000100)
              PU#9 (cpuset: 0x00000200)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000c00)
              PU#10 (cpuset: 0x00000400)
              PU#11 (cpuset: 0x00000800)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00003000)
              PU#12 (cpuset: 0x00001000)
              PU#13 (cpuset: 0x00002000)
        L2 (size=256KB, linesize=64, ways=4, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x0000c000)
              PU#14 (cpuset: 0x00004000)
              PU#15 (cpuset: 0x00008000)
Prime95 64-bit version 29.4, RdtscTiming=1
Timings for 2048K FFT length (8 cores, 1 worker):  0.96 ms.  Throughput: 1036.60 iter/sec.
Timings for 2048K FFT length (8 cores, 8 workers): 14.73, 14.52, 14.76, 14.67, 14.67, 14.70, 14.65, 14.70 ms.  Throughput: 545.27 iter/sec.
Timings for 2304K FFT length (8 cores, 1 worker):  1.15 ms.  Throughput: 870.29 iter/sec.
Timings for 2304K FFT length (8 cores, 8 workers): 16.38, 16.63, 16.38, 16.49, 16.47, 16.42, 16.54, 16.77 ms.  Throughput: 484.55 iter/sec.
Timings for 2400K FFT length (8 cores, 1 worker):  1.28 ms.  Throughput: 784.01 iter/sec.
Timings for 2400K FFT length (8 cores, 8 workers): 17.36, 17.35, 17.20, 17.33, 17.23, 17.44, 17.36, 17.49 ms.  Throughput: 461.21 iter/sec.
Timings for 2560K FFT length (8 cores, 1 worker):  1.43 ms.  Throughput: 696.89 iter/sec.
Timings for 2560K FFT length (8 cores, 8 workers): 18.41, 18.56, 18.42, 18.34, 18.42, 18.56, 18.48, 18.50 ms.  Throughput: 433.34 iter/sec.
Timings for 2688K FFT length (8 cores, 1 worker):  1.67 ms.  Throughput: 597.61 iter/sec.
Timings for 2688K FFT length (8 cores, 8 workers): 19.34, 19.31, 19.33, 19.62, 19.35, 19.38, 19.36, 19.40 ms.  Throughput: 412.69 iter/sec.
Timings for 2880K FFT length (8 cores, 1 worker):  1.84 ms.  Throughput: 544.23 iter/sec.
Timings for 2880K FFT length (8 cores, 8 workers): 20.91, 20.98, 20.76, 20.69, 20.64, 21.02, 20.69, 21.04 ms.  Throughput: 383.87 iter/sec.
Timings for 3072K FFT length (8 cores, 1 worker):  2.05 ms.  Throughput: 487.94 iter/sec.
Timings for 3072K FFT length (8 cores, 8 workers): 22.40, 22.16, 22.14, 22.43, 22.16, 22.10, 22.01, 22.36 ms.  Throughput: 360.06 iter/sec.
Timings for 3200K FFT length (8 cores, 1 worker):  2.24 ms.  Throughput: 445.78 iter/sec.
[Sat Dec 08 20:08:28 2018]
Timings for 3200K FFT length (8 cores, 8 workers): 23.90, 23.59, 23.49, 23.54, 23.79, 23.57, 23.59, 23.66 ms.  Throughput: 338.39 iter/sec.
Timings for 3360K FFT length (8 cores, 1 worker):  2.46 ms.  Throughput: 406.74 iter/sec.
Timings for 3360K FFT length (8 cores, 8 workers): 24.61, 25.10, 24.56, 24.42, 24.27, 24.42, 24.51, 27.90 ms.  Throughput: 320.95 iter/sec.
Timings for 3456K FFT length (8 cores, 1 worker):  2.53 ms.  Throughput: 395.58 iter/sec.
Timings for 3456K FFT length (8 cores, 8 workers): 25.16, 25.16, 25.18, 25.13, 25.06, 25.25, 25.20, 25.48 ms.  Throughput: 317.41 iter/sec.
Timings for 3584K FFT length (8 cores, 1 worker):  2.71 ms.  Throughput: 368.74 iter/sec.
Timings for 3584K FFT length (8 cores, 8 workers): 25.79, 26.01, 26.02, 26.02, 26.02, 25.96, 26.03, 25.96 ms.  Throughput: 307.98 iter/sec.
Timings for 3840K FFT length (8 cores, 1 worker):  2.93 ms.  Throughput: 341.31 iter/sec.
Timings for 3840K FFT length (8 cores, 8 workers): 28.31, 28.36, 28.01, 28.34, 27.82, 28.33, 28.48, 28.44 ms.  Throughput: 283.09 iter/sec.
Timings for 4096K FFT length (8 cores, 1 worker):  3.17 ms.  Throughput: 315.36 iter/sec.
Timings for 4096K FFT length (8 cores, 8 workers): 29.30, 29.99, 30.19, 29.44, 29.62, 30.00, 29.71, 30.14 ms.  Throughput: 268.49 iter/sec.
Timings for 4480K FFT length (8 cores, 1 worker):  3.62 ms.  Throughput: 276.38 iter/sec.
Timings for 4480K FFT length (8 cores, 8 workers): 33.41, 33.05, 33.12, 32.98, 32.88, 33.52, 33.13, 33.25 ms.  Throughput: 241.21 iter/sec.
Timings for 4608K FFT length (8 cores, 1 worker):  3.68 ms.  Throughput: 272.09 iter/sec.
Timings for 4608K FFT length (8 cores, 8 workers): 33.85, 33.93, 33.77, 34.00, 33.78, 33.92, 33.60, 33.76 ms.  Throughput: 236.51 iter/sec.
Timings for 4800K FFT length (8 cores, 1 worker):  4.03 ms.  Throughput: 248.35 iter/sec.
Timings for 4800K FFT length (8 cores, 8 workers): 35.77, 35.43, 35.40, 35.56, 35.17, 35.15, 35.28, 35.19 ms.  Throughput: 226.20 iter/sec.
Timings for 5120K FFT length (8 cores, 1 worker):  4.23 ms.  Throughput: 236.34 iter/sec.
Timings for 5120K FFT length (8 cores, 8 workers): 37.17, 37.35, 37.13, 36.61, 37.15, 37.09, 37.23, 37.17 ms.  Throughput: 215.57 iter/sec.
[Sat Dec 08 20:13:32 2018]
Timings for 5376K FFT length (8 cores, 1 worker):  4.52 ms.  Throughput: 221.09 iter/sec.
Timings for 5376K FFT length (8 cores, 8 workers): 38.98, 39.02, 39.32, 39.12, 39.03, 40.72, 39.60, 39.37 ms.  Throughput: 203.11 iter/sec.
Timings for 5760K FFT length (8 cores, 1 worker):  4.80 ms.  Throughput: 208.24 iter/sec.
Timings for 5760K FFT length (8 cores, 8 workers): 43.67, 42.70, 41.27, 41.90, 41.93, 41.31, 42.79, 42.66 ms.  Throughput: 189.29 iter/sec.
Timings for 6144K FFT length (8 cores, 1 worker):  5.15 ms.  Throughput: 194.36 iter/sec.
Timings for 6144K FFT length (8 cores, 8 workers): 45.19, 45.31, 44.79, 45.39, 44.42, 45.32, 45.00, 44.56 ms.  Throughput: 177.80 iter/sec.
Timings for 6400K FFT length (8 cores, 1 worker):  5.63 ms.  Throughput: 177.48 iter/sec.
Timings for 6400K FFT length (8 cores, 8 workers): 47.85, 48.36, 48.03, 47.93, 47.79, 48.01, 48.31, 48.07 ms.  Throughput: 166.52 iter/sec.
Timings for 6720K FFT length (8 cores, 1 worker):  5.65 ms.  Throughput: 177.03 iter/sec.
Timings for 6720K FFT length (8 cores, 8 workers): 49.33, 48.69, 49.38, 48.53, 48.40, 48.27, 48.60, 48.84 ms.  Throughput: 164.09 iter/sec.
Timings for 6912K FFT length (8 cores, 1 worker):  6.03 ms.  Throughput: 165.73 iter/sec.
Timings for 6912K FFT length (8 cores, 8 workers): 53.00, 50.55, 50.83, 50.25, 50.37, 52.23, 50.99, 51.21 ms.  Throughput: 156.36 iter/sec.
Timings for 7168K FFT length (8 cores, 1 worker):  6.07 ms.  Throughput: 164.80 iter/sec.
Timings for 7168K FFT length (8 cores, 8 workers): 52.21, 51.85, 51.73, 52.16, 52.03, 51.91, 51.57, 52.26 ms.  Throughput: 153.96 iter/sec.
Timings for 7680K FFT length (8 cores, 1 worker):  6.68 ms.  Throughput: 149.59 iter/sec.
Timings for 7680K FFT length (8 cores, 8 workers): 56.78, 56.50, 56.31, 55.16, 56.39, 55.90, 56.19, 56.42 ms.  Throughput: 142.34 iter/sec.
Timings for 8064K FFT length (8 cores, 1 worker):  7.04 ms.  Throughput: 142.08 iter/sec.
Timings for 8064K FFT length (8 cores, 8 workers): 58.85, 58.56, 58.47, 59.16, 58.93, 59.04, 58.63, 58.60 ms.  Throughput: 136.11 iter/sec.
Timings for 8192K FFT length (8 cores, 1 worker):  7.06 ms.  Throughput: 141.62 iter/sec.
[Sat Dec 08 20:18:46 2018]
Timings for 8192K FFT length (8 cores, 8 workers): 59.48, 59.73, 59.74, 60.57, 59.29, 59.66, 59.70, 61.10 ms.  Throughput: 133.55 iter/sec.
simon389 is offline   Reply With Quote
Old 2018-12-10, 18:01   #783
simon389
 
Aug 2013

1278 Posts
Default i7 9800x with DDR4 3600 19-20-20-40 and EVGA x399 Micro ATX

Seems like it can complete a 2^88,000,000-1 test in about 2.6 days

Code:
Intel(R) Core(TM) i7-9800X CPU @ 3.80GHz
CPU speed: 3792.01 MHz, 8 hyperthreaded cores
CPU features: Prefetchw, SSE, SSE2, SSE4, AVX, AVX2, FMA, AVX512F
L1 cache size: 32 KB
L2 cache size: 256 KB, L3 cache size: 16896 KB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
TLBS: 64
Machine topology as determined by hwloc library:
 Machine#0 (total=64533332KB, Backend=Windows, hwlocVersion=1.11.9, ProcessName=prime95.exe)
  NUMANode#0 (local=64533332KB, total=64533332KB)
    Package#0 (CPUVendor=GenuineIntel, CPUFamilyNumber=6, CPUModelNumber=85, CPUModel="Intel(R) Core(TM) i7-9800X CPU @ 3.80GHz", CPUStepping=4)
      L3 (size=16896KB, linesize=64, ways=11, Inclusive=0)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000003)
              PU#0 (cpuset: 0x00000001)
              PU#1 (cpuset: 0x00000002)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x0000000c)
              PU#2 (cpuset: 0x00000004)
              PU#3 (cpuset: 0x00000008)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000030)
              PU#4 (cpuset: 0x00000010)
              PU#5 (cpuset: 0x00000020)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x000000c0)
              PU#6 (cpuset: 0x00000040)
              PU#7 (cpuset: 0x00000080)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000300)
              PU#8 (cpuset: 0x00000100)
              PU#9 (cpuset: 0x00000200)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000c00)
              PU#10 (cpuset: 0x00000400)
              PU#11 (cpuset: 0x00000800)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00003000)
              PU#12 (cpuset: 0x00001000)
              PU#13 (cpuset: 0x00002000)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x0000c000)
              PU#14 (cpuset: 0x00004000)
              PU#15 (cpuset: 0x00008000)
Prime95 64-bit version 29.4, RdtscTiming=1
Timing FFTs using 8 threads on 8 cores.
Best time for 2048K FFT length: 1.033 ms., avg: 1.042 ms.
Best time for 2304K FFT length: 1.094 ms., avg: 1.109 ms.
Best time for 2400K FFT length: 1.197 ms., avg: 1.209 ms.
Best time for 2560K FFT length: 1.290 ms., avg: 1.355 ms.
Best time for 2688K FFT length: 1.305 ms., avg: 1.320 ms.
Best time for 2880K FFT length: 1.432 ms., avg: 1.449 ms.
Best time for 3072K FFT length: 1.533 ms., avg: 1.542 ms.
Best time for 3200K FFT length: 1.605 ms., avg: 1.617 ms.
Best time for 3360K FFT length: 1.809 ms., avg: 1.826 ms.
Best time for 3456K FFT length: 1.791 ms., avg: 1.805 ms.
Best time for 3584K FFT length: 1.826 ms., avg: 1.844 ms.
Best time for 3840K FFT length: 1.931 ms., avg: 1.946 ms.
Best time for 4096K FFT length: 2.093 ms., avg: 2.119 ms.
Best time for 4480K FFT length: 2.267 ms., avg: 2.292 ms.
Best time for 4608K FFT length: 2.345 ms., avg: 2.361 ms.
Best time for 4800K FFT length: 2.529 ms., avg: 2.545 ms.
Best time for 5120K FFT length: 2.655 ms., avg: 2.678 ms.
Best time for 5376K FFT length: 2.746 ms., avg: 2.770 ms.
Best time for 5760K FFT length: 3.097 ms., avg: 3.118 ms.
Best time for 6144K FFT length: 3.222 ms., avg: 3.250 ms.
Best time for 6400K FFT length: 3.422 ms., avg: 3.442 ms.
Best time for 6720K FFT length: 3.744 ms., avg: 3.776 ms.
Best time for 6912K FFT length: 3.792 ms., avg: 3.825 ms.
Best time for 7168K FFT length: 3.872 ms., avg: 3.892 ms.
Best time for 7680K FFT length: 4.122 ms., avg: 4.139 ms.
Best time for 8064K FFT length: 4.537 ms., avg: 4.569 ms.
Best time for 8192K FFT length: 4.514 ms., avg: 4.544 ms.
Compare your results to other computers at http://www.mersenne.org/report_benchmarks
Intel(R) Core(TM) i7-9800X CPU @ 3.80GHz
CPU speed: 3792.00 MHz, 8 hyperthreaded cores
CPU features: Prefetchw, SSE, SSE2, SSE4, AVX, AVX2, FMA, AVX512F
L1 cache size: 32 KB
L2 cache size: 256 KB, L3 cache size: 16896 KB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
TLBS: 64
Machine topology as determined by hwloc library:
 Machine#0 (total=64533332KB, Backend=Windows, hwlocVersion=1.11.9, ProcessName=prime95.exe)
  NUMANode#0 (local=64533332KB, total=64533332KB)
    Package#0 (CPUVendor=GenuineIntel, CPUFamilyNumber=6, CPUModelNumber=85, CPUModel="Intel(R) Core(TM) i7-9800X CPU @ 3.80GHz", CPUStepping=4)
      L3 (size=16896KB, linesize=64, ways=11, Inclusive=0)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000003)
              PU#0 (cpuset: 0x00000001)
              PU#1 (cpuset: 0x00000002)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x0000000c)
              PU#2 (cpuset: 0x00000004)
              PU#3 (cpuset: 0x00000008)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000030)
              PU#4 (cpuset: 0x00000010)
              PU#5 (cpuset: 0x00000020)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x000000c0)
              PU#6 (cpuset: 0x00000040)
              PU#7 (cpuset: 0x00000080)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000300)
              PU#8 (cpuset: 0x00000100)
              PU#9 (cpuset: 0x00000200)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00000c00)
              PU#10 (cpuset: 0x00000400)
              PU#11 (cpuset: 0x00000800)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x00003000)
              PU#12 (cpuset: 0x00001000)
              PU#13 (cpuset: 0x00002000)
        L2 (size=1024KB, linesize=64, ways=16, Inclusive=0)
          L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
            Core (cpuset: 0x0000c000)
              PU#14 (cpuset: 0x00004000)
              PU#15 (cpuset: 0x00008000)
Prime95 64-bit version 29.4, RdtscTiming=1
Timings for 2048K FFT length (8 cores, 1 worker):  1.04 ms.  Throughput: 958.23 iter/sec.
Timings for 2304K FFT length (8 cores, 1 worker):  1.11 ms.  Throughput: 899.33 iter/sec.
Timings for 2400K FFT length (8 cores, 1 worker):  1.22 ms.  Throughput: 816.48 iter/sec.
Timings for 2560K FFT length (8 cores, 1 worker):  1.32 ms.  Throughput: 760.26 iter/sec.
Timings for 2688K FFT length (8 cores, 1 worker):  1.33 ms.  Throughput: 751.84 iter/sec.
Timings for 2880K FFT length (8 cores, 1 worker):  1.46 ms.  Throughput: 683.17 iter/sec.
Timings for 3072K FFT length (8 cores, 1 worker):  1.54 ms.  Throughput: 649.60 iter/sec.
Timings for 3200K FFT length (8 cores, 1 worker):  1.62 ms.  Throughput: 617.88 iter/sec.
Timings for 3360K FFT length (8 cores, 1 worker):  1.80 ms.  Throughput: 554.46 iter/sec.
Timings for 3456K FFT length (8 cores, 1 worker):  1.79 ms.  Throughput: 557.51 iter/sec.
Timings for 3584K FFT length (8 cores, 1 worker):  1.85 ms.  Throughput: 539.60 iter/sec.
Timings for 3840K FFT length (8 cores, 1 worker):  1.95 ms.  Throughput: 511.78 iter/sec.
Timings for 4096K FFT length (8 cores, 1 worker):  2.11 ms.  Throughput: 475.04 iter/sec.
Timings for 4480K FFT length (8 cores, 1 worker):  2.30 ms.  Throughput: 434.97 iter/sec.
[Mon Dec 10 09:32:49 2018]
Timings for 4608K FFT length (8 cores, 1 worker):  2.36 ms.  Throughput: 423.04 iter/sec.
Timings for 4800K FFT length (8 cores, 1 worker):  2.56 ms.  Throughput: 389.99 iter/sec.
Timings for 5120K FFT length (8 cores, 1 worker):  2.72 ms.  Throughput: 367.83 iter/sec.
Timings for 5376K FFT length (8 cores, 1 worker):  2.77 ms.  Throughput: 360.57 iter/sec.
Timings for 5760K FFT length (8 cores, 1 worker):  3.11 ms.  Throughput: 321.75 iter/sec.
Timings for 6144K FFT length (8 cores, 1 worker):  3.25 ms.  Throughput: 308.12 iter/sec.
Timings for 6400K FFT length (8 cores, 1 worker):  3.45 ms.  Throughput: 290.16 iter/sec.
Timings for 6720K FFT length (8 cores, 1 worker):  3.73 ms.  Throughput: 268.19 iter/sec.
Timings for 6912K FFT length (8 cores, 1 worker):  3.81 ms.  Throughput: 262.29 iter/sec.
Timings for 7168K FFT length (8 cores, 1 worker):  3.93 ms.  Throughput: 254.74 iter/sec.
Timings for 7680K FFT length (8 cores, 1 worker):  4.15 ms.  Throughput: 241.06 iter/sec.
Timings for 8064K FFT length (8 cores, 1 worker):  4.57 ms.  Throughput: 218.77 iter/sec.
Timings for 8192K FFT length (8 cores, 1 worker):  4.57 ms.  Throughput: 218.81 iter/sec.
simon389 is offline   Reply With Quote
Old 2019-07-03, 02:57   #784
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

26×5×17 Posts
Default gpu tf bottom of the totem pole or near it

Quadro NVS295, 1.74Ghz-D/day. This is on a 1x/16x PCIE extender to make use of the pcie 1x slot in the motherboard. Requires registry edit to lengthen TdrDelay.
Code:
mfaktc v0.21 (64bit built)

Compiletime options
  THREADS_PER_BLOCK         256
  SIEVE_SIZE_LIMIT          32kiB
  SIEVE_SIZE                193154bits
  SIEVE_SPLIT               250
  MORE_CLASSES              enabled

Runtime options
  SievePrimes               25000
  SievePrimesAdjust         1
  SievePrimesMin            5000
  SievePrimesMax            100000
  NumStreams                3
  CPUStreams                3
  GridSize                  3
  GPU Sieving               enabled
  GPUSievePrimes            82486
  GPUSieveSize              64Mi bits
  GPUSieveProcessSize       16Ki bits
  Checkpoints               enabled
  CheckpointDelay           900s
  WorkFileAddDelay          3600s
  Stages                    enabled
  StopAfterFactor           bitlevel
  PrintMode                 full
  V5UserID                  kriesel
  ComputerID                eaglet-nvs295
  AllowSleep                no
  TimeStampInResults        yes

CUDA version info
  binary compiled for CUDA  6.50
  CUDA runtime version      6.50
  CUDA driver version       6.50

CUDA device info
  name                      Quadro NVS 295
  compute capability        1.1
  max threads per block     512
  max shared memory per MP  16384 byte
  number of multiprocessors 1
  CUDA cores per MP         8
  CUDA cores - total        8
  clock rate (CUDA cores)   1300MHz
  memory clock rate:        695MHz
  memory bus width:         64 bit

Automatic parameters
  threads per grid          1048576
  GPUSievePrimes (adjusted) 82486
  GPUsieve minimum exponent 1055144

running a simple selftest...
Selftest statistics
  number of tests           107
  successfull tests         107

selftest PASSED!

got assignment: exp=119998999 bit_min=72 bit_max=73 (7.97 GHz-days)
Starting trial factoring M119998999 from 2^72 to 2^73 (7.97 GHz-days)
 k_min =  19676691147960
 k_max =  39353382296711
Using GPU kernel "barrett76_mul32_gs"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Jul 02 21:05 |    0   0.1% | 411.97   4d13h |      1.74    82485    n.a.%
Jul 02 21:11 |    5   0.2% | 411.85   4d13h |      1.74    82485    n.a.%
Jul 02 21:18 |    9   0.3% | 411.49   4d13h |      1.74    82485    n.a.%
Jul 02 21:25 |   12   0.4% | 411.39   4d13h |      1.74    82485    n.a.%

Last fiddled with by kriesel on 2019-07-03 at 02:59
kriesel is offline   Reply With Quote
Old 2019-09-08, 15:18   #785
scan80269
 
"Sam"
Jun 2019
California, USA

3×11 Posts
Default i9-9900T, 32GB DDR4-3600 2R, ASRock Z390 Phantom Gaming-ITX/ac mobo

Good energy efficiency at this throughput with the CPU consuming 35W.
scan80269 is offline   Reply With Quote
Old 2019-12-02, 18:57   #786
Meikel
 
Nov 2008

32 Posts
Default AMD Ryzen 9 3900X 12-Core, 4216.62 MHz, 32GB DDR4-3200, X570 AORUS

Seems like AMD has a quite OK CPU for mprime now...


Throughput-Test:
Code:
AMD Ryzen 9 3900X 12-Core Processor            
CPU speed: 4239.72 MHz, 12 hyperthreaded cores
CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 12x32 KB, L2 cache size: 12x512 KB, L3 cache size: 4x16 MB
L1 cache line size: 64 bytes, L2 cache line size: 64 bytes
Machine topology as determined by hwloc library:
 Machine#0 (total=29788576KB, Backend=Windows, hwlocVersion=2.0.4, ProcessName=prime95.exe)
  Package (total=29788576KB, CPUVendor=AuthenticAMD, CPUFamilyNumber=23, CPUModelNumber=113, CPUModel="AMD Ryzen 9 3900X 12-Core Processor            ", CPUStepping=0)
    L3 (size=16384KB, linesize=64, ways=16, Inclusive=0)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00000003)
            PU#0 (cpuset: 0x00000001)
            PU#1 (cpuset: 0x00000002)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x0000000c)
            PU#2 (cpuset: 0x00000004)
            PU#3 (cpuset: 0x00000008)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00000030)
            PU#4 (cpuset: 0x00000010)
            PU#5 (cpuset: 0x00000020)
    L3 (size=16384KB, linesize=64, ways=16, Inclusive=0)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x000000c0)
            PU#6 (cpuset: 0x00000040)
            PU#7 (cpuset: 0x00000080)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00000300)
            PU#8 (cpuset: 0x00000100)
            PU#9 (cpuset: 0x00000200)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00000c00)
            PU#10 (cpuset: 0x00000400)
            PU#11 (cpuset: 0x00000800)
    L3 (size=16384KB, linesize=64, ways=16, Inclusive=0)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00003000)
            PU#12 (cpuset: 0x00001000)
            PU#13 (cpuset: 0x00002000)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x0000c000)
            PU#14 (cpuset: 0x00004000)
            PU#15 (cpuset: 0x00008000)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00030000)
            PU#16 (cpuset: 0x00010000)
            PU#17 (cpuset: 0x00020000)
    L3 (size=16384KB, linesize=64, ways=16, Inclusive=0)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x000c0000)
            PU#18 (cpuset: 0x00040000)
            PU#19 (cpuset: 0x00080000)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00300000)
            PU#20 (cpuset: 0x00100000)
            PU#21 (cpuset: 0x00200000)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00c00000)
            PU#22 (cpuset: 0x00400000)
            PU#23 (cpuset: 0x00800000)
Prime95 64-bit version 29.8, RdtscTiming=1
Timings for 2048K FFT length (12 cores, 1 worker):  1.12 ms.  Throughput: 893.57 iter/sec.
Timings for 2048K FFT length (12 cores, 4 workers):  2.75,  2.72,  2.74,  2.82 ms.  Throughput: 1450.51 iter/sec.
Timings for 2048K FFT length (12 cores, 12 workers): 21.92, 21.87, 22.31, 22.07, 22.08, 22.06, 21.93, 21.93, 21.94, 22.68, 21.97, 21.73 ms.  Throughput: 544.49 iter/sec.
Timings for 2240K FFT length (12 cores, 1 worker):  1.04 ms.  Throughput: 957.66 iter/sec.
Timings for 2240K FFT length (12 cores, 4 workers):  4.32,  4.25,  4.37,  4.41 ms.  Throughput: 922.99 iter/sec.
Timings for 2240K FFT length (12 cores, 12 workers): 24.77, 24.56, 24.65, 24.49, 24.73, 24.55, 24.51, 24.48, 24.45, 25.31, 24.73, 24.47 ms.  Throughput: 487.01 iter/sec.
Timings for 2304K FFT length (12 cores, 1 worker):  1.06 ms.  Throughput: 942.24 iter/sec.
[Mon Dec 02 19:13:17 2019]
Timings for 2304K FFT length (12 cores, 4 workers):  4.82,  4.70,  4.73,  4.90 ms.  Throughput: 835.64 iter/sec.
Timings for 2304K FFT length (12 cores, 12 workers): 25.45, 25.22, 25.44, 25.41, 25.43, 25.48, 25.15, 25.17, 25.18, 26.10, 25.52, 25.16 ms.  Throughput: 472.62 iter/sec.
Timings for 2400K FFT length (12 cores, 1 worker):  1.13 ms.  Throughput: 881.23 iter/sec.
Timings for 2400K FFT length (12 cores, 4 workers):  5.70,  5.61,  5.69,  5.82 ms.  Throughput: 700.99 iter/sec.
Timings for 2400K FFT length (12 cores, 12 workers): 26.79, 26.72, 26.79, 26.84, 26.80, 26.78, 26.42, 26.39, 26.42, 27.35, 26.67, 26.64 ms.  Throughput: 449.18 iter/sec.
Timings for 2560K FFT length (12 cores, 1 worker):  1.20 ms.  Throughput: 835.60 iter/sec.
Timings for 2560K FFT length (12 cores, 4 workers):  6.19,  5.98,  6.16,  6.19 ms.  Throughput: 652.72 iter/sec.
Timings for 2560K FFT length (12 cores, 12 workers): 28.10, 27.90, 28.05, 28.04, 28.20, 28.10, 27.92, 27.91, 27.91, 28.74, 27.85, 27.80 ms.  Throughput: 427.97 iter/sec.
Timings for 2688K FFT length (12 cores, 1 worker):  1.22 ms.  Throughput: 817.77 iter/sec.
Timings for 2688K FFT length (12 cores, 4 workers):  7.51,  7.05,  7.13,  7.35 ms.  Throughput: 551.39 iter/sec.
[Mon Dec 02 19:18:23 2019]
Timings for 2688K FFT length (12 cores, 12 workers): 29.97, 29.81, 30.32, 29.98, 30.07, 29.95, 29.71, 29.68, 29.68, 30.92, 29.87, 29.76 ms.  Throughput: 400.36 iter/sec.
Timings for 2800K FFT length (12 cores, 1 worker):  1.31 ms.  Throughput: 763.36 iter/sec.
Timings for 2800K FFT length (12 cores, 4 workers):  8.00,  7.96,  7.97,  8.08 ms.  Throughput: 499.88 iter/sec.
Timings for 2800K FFT length (12 cores, 12 workers): 31.66, 31.52, 31.62, 31.62, 31.66, 31.67, 31.44, 31.31, 31.40, 32.27, 31.56, 31.35 ms.  Throughput: 379.87 iter/sec.
Timings for 2880K FFT length (12 cores, 1 worker):  1.36 ms.  Throughput: 733.59 iter/sec.
Timings for 2880K FFT length (12 cores, 4 workers):  8.44,  8.24,  8.17,  8.47 ms.  Throughput: 480.35 iter/sec.
Timings for 2880K FFT length (12 cores, 12 workers): 32.23, 32.10, 32.36, 32.28, 32.48, 32.38, 31.98, 32.20, 31.97, 33.02, 32.38, 31.98 ms.  Throughput: 371.77 iter/sec.
Timings for 3072K FFT length (12 cores, 1 worker):  1.49 ms.  Throughput: 669.26 iter/sec.
Timings for 3072K FFT length (12 cores, 4 workers):  9.20,  9.09,  9.10,  9.22 ms.  Throughput: 437.06 iter/sec.
Timings for 3072K FFT length (12 cores, 12 workers): 34.09, 33.95, 34.37, 34.06, 34.13, 34.11, 33.83, 33.65, 33.81, 35.26, 34.33, 33.92 ms.  Throughput: 351.68 iter/sec.
[Mon Dec 02 19:23:30 2019]
Timings for 3200K FFT length (12 cores, 1 worker):  1.49 ms.  Throughput: 672.05 iter/sec.
Timings for 3200K FFT length (12 cores, 4 workers):  9.60,  9.39,  9.53,  9.61 ms.  Throughput: 419.61 iter/sec.
Timings for 3200K FFT length (12 cores, 12 workers): 35.56, 35.38, 35.53, 35.49, 35.58, 35.56, 35.42, 35.39, 35.31, 35.96, 35.62, 35.03 ms.  Throughput: 338.18 iter/sec.
Timings for 3360K FFT length (12 cores, 1 worker):  1.49 ms.  Throughput: 669.62 iter/sec.
Timings for 3360K FFT length (12 cores, 4 workers): 10.70, 10.57, 10.56, 10.64 ms.  Throughput: 376.76 iter/sec.
Timings for 3360K FFT length (12 cores, 12 workers): 37.94, 37.78, 38.10, 37.91, 38.13, 37.94, 37.65, 37.58, 37.63, 38.72, 38.25, 37.66 ms.  Throughput: 316.31 iter/sec.
Timings for 3584K FFT length (12 cores, 1 worker):  1.69 ms.  Throughput: 591.59 iter/sec.
Timings for 3584K FFT length (12 cores, 4 workers): 11.35, 11.43, 11.30, 11.46 ms.  Throughput: 351.35 iter/sec.
Timings for 3584K FFT length (12 cores, 12 workers): 40.37, 39.68, 39.90, 40.04, 39.86, 39.87, 39.73, 39.53, 39.73, 40.92, 39.82, 39.79 ms.  Throughput: 300.50 iter/sec.
Timings for 3840K FFT length (12 cores, 1 worker):  1.74 ms.  Throughput: 576.01 iter/sec.
[Mon Dec 02 19:28:38 2019]
Timings for 3840K FFT length (12 cores, 4 workers): 12.57, 12.50, 12.41, 12.56 ms.  Throughput: 319.71 iter/sec.
Timings for 3840K FFT length (12 cores, 12 workers): 43.17, 42.85, 43.46, 43.19, 43.60, 43.30, 43.12, 43.13, 43.09, 44.42, 43.16, 42.47 ms.  Throughput: 277.51 iter/sec.
Timings for 4096K FFT length (12 cores, 1 worker):  1.93 ms.  Throughput: 519.20 iter/sec.
Timings for 4096K FFT length (12 cores, 4 workers): 13.82, 13.72, 13.63, 13.89 ms.  Throughput: 290.64 iter/sec.
Timings for 4096K FFT length (12 cores, 12 workers): 45.88, 45.30, 45.93, 45.72, 45.81, 45.81, 45.46, 45.42, 45.52, 46.72, 45.88, 45.34 ms.  Throughput: 262.41 iter/sec.
Timings for 4480K FFT length (12 cores, 1 worker):  2.06 ms.  Throughput: 485.48 iter/sec.
Timings for 4480K FFT length (12 cores, 4 workers): 15.46, 15.40, 15.36, 15.69 ms.  Throughput: 258.44 iter/sec.
Timings for 4480K FFT length (12 cores, 12 workers): 51.14, 50.93, 51.53, 51.09, 51.28, 51.14, 50.73, 50.61, 50.73, 52.62, 51.07, 50.62 ms.  Throughput: 234.75 iter/sec.
Timings for 4608K FFT length (12 cores, 1 worker):  2.08 ms.  Throughput: 480.39 iter/sec.
Timings for 4608K FFT length (12 cores, 4 workers): 15.81, 15.79, 15.66, 15.97 ms.  Throughput: 253.09 iter/sec.
[Mon Dec 02 19:33:48 2019]
Timings for 4608K FFT length (12 cores, 12 workers): 52.06, 51.76, 52.24, 52.10, 52.30, 52.30, 52.14, 51.92, 51.98, 53.46, 51.77, 51.47 ms.  Throughput: 230.23 iter/sec.
Timings for 4800K FFT length (12 cores, 1 worker):  2.24 ms.  Throughput: 446.97 iter/sec.
Timings for 4800K FFT length (12 cores, 4 workers): 16.47, 16.41, 16.45, 16.45 ms.  Throughput: 243.21 iter/sec.
Timings for 4800K FFT length (12 cores, 12 workers): 54.42, 53.86, 54.01, 54.13, 54.65, 54.19, 54.08, 54.11, 54.04, 55.34, 53.77, 53.52 ms.  Throughput: 221.51 iter/sec.
Timings for 5120K FFT length (12 cores, 1 worker):  2.28 ms.  Throughput: 439.02 iter/sec.
Timings for 5120K FFT length (12 cores, 4 workers): 17.74, 17.72, 17.74, 17.74 ms.  Throughput: 225.57 iter/sec.
Timings for 5120K FFT length (12 cores, 12 workers): 57.75, 57.34, 57.90, 57.71, 58.00, 57.86, 57.53, 57.61, 57.55, 58.87, 57.57, 56.88 ms.  Throughput: 207.93 iter/sec.
Timings for 5376K FFT length (12 cores, 1 worker):  2.43 ms.  Throughput: 412.23 iter/sec.
Timings for 5376K FFT length (12 cores, 4 workers): 18.74, 18.72, 18.73, 18.73 ms.  Throughput: 213.55 iter/sec.
Timings for 5376K FFT length (12 cores, 12 workers): 60.16, 59.70, 60.45, 60.12, 60.29, 60.44, 60.11, 60.27, 60.21, 61.21, 60.17, 59.42 ms.  Throughput: 199.30 iter/sec.
[Mon Dec 02 19:38:59 2019]
Timings for 5600K FFT length (12 cores, 1 worker):  2.55 ms.  Throughput: 391.78 iter/sec.
Timings for 5600K FFT length (12 cores, 4 workers): 19.84, 19.85, 19.85, 19.88 ms.  Throughput: 201.44 iter/sec.
Timings for 5600K FFT length (12 cores, 12 workers): 63.53, 62.96, 63.67, 63.29, 63.39, 63.18, 62.95, 62.71, 62.97, 64.61, 63.14, 62.97 ms.  Throughput: 189.64 iter/sec.
Timings for 5760K FFT length (12 cores, 1 worker):  2.69 ms.  Throughput: 371.66 iter/sec.
Timings for 5760K FFT length (12 cores, 4 workers): 21.13, 20.93, 20.91, 21.07 ms.  Throughput: 190.37 iter/sec.
Timings for 5760K FFT length (12 cores, 12 workers): 65.81, 65.36, 66.20, 65.83, 65.75, 65.72, 65.32, 65.54, 64.88, 67.88, 65.54, 65.59 ms.  Throughput: 182.43 iter/sec.
Timings for 6144K FFT length (12 cores, 1 worker):  2.87 ms.  Throughput: 348.96 iter/sec.
Timings for 6144K FFT length (12 cores, 4 workers): 21.77, 21.76, 21.76, 21.76 ms.  Throughput: 183.78 iter/sec.
Timings for 6144K FFT length (12 cores, 12 workers): 69.00, 68.32, 68.96, 69.09, 69.06, 69.15, 68.57, 68.86, 69.11, 70.14, 68.17, 67.99 ms.  Throughput: 174.26 iter/sec.
Timings for 6400K FFT length (12 cores, 1 worker):  2.98 ms.  Throughput: 336.13 iter/sec.
[Mon Dec 02 19:44:12 2019]
Timings for 6400K FFT length (12 cores, 4 workers): 23.01, 23.02, 22.97, 22.91 ms.  Throughput: 174.09 iter/sec.
Timings for 6400K FFT length (12 cores, 12 workers): 72.24, 71.64, 72.34, 72.18, 72.03, 72.29, 71.89, 71.82, 72.03, 72.80, 72.07, 71.39 ms.  Throughput: 166.53 iter/sec.
Timings for 6720K FFT length (12 cores, 1 worker):  3.34 ms.  Throughput: 299.21 iter/sec.
Timings for 6720K FFT length (12 cores, 4 workers): 24.48, 24.63, 24.43, 24.44 ms.  Throughput: 163.30 iter/sec.
Timings for 6720K FFT length (12 cores, 12 workers): 75.94, 75.65, 76.91, 76.22, 76.27, 76.24, 75.89, 75.45, 75.92, 77.83, 76.30, 75.74 ms.  Throughput: 157.50 iter/sec.
Timings for 7168K FFT length (12 cores, 1 worker):  3.80 ms.  Throughput: 262.89 iter/sec.
Timings for 7168K FFT length (12 cores, 4 workers): 25.84, 25.86, 25.86, 25.84 ms.  Throughput: 154.74 iter/sec.
Timings for 7168K FFT length (12 cores, 12 workers): 80.85, 79.94, 80.52, 80.73, 80.37, 80.54, 80.18, 80.20, 80.30, 82.08, 80.22, 79.41 ms.  Throughput: 149.18 iter/sec.
Timings for 7680K FFT length (12 cores, 1 worker):  4.80 ms.  Throughput: 208.34 iter/sec.
Timings for 7680K FFT length (12 cores, 4 workers): 28.67, 28.69, 28.65, 28.67 ms.  Throughput: 139.52 iter/sec.
[Mon Dec 02 19:49:28 2019]
Timings for 7680K FFT length (12 cores, 12 workers): 88.35, 87.48, 88.29, 88.06, 88.40, 88.15, 87.51, 87.06, 87.58, 90.22, 87.87, 87.14 ms.  Throughput: 136.36 iter/sec.
Timings for 8000K FFT length (12 cores, 1 worker):  4.77 ms.  Throughput: 209.53 iter/sec.
Timings for 8000K FFT length (12 cores, 4 workers): 29.32, 29.42, 29.49, 29.26 ms.  Throughput: 136.18 iter/sec.
Timings for 8000K FFT length (12 cores, 12 workers): 90.34, 89.92, 91.53, 90.77, 91.25, 90.92, 90.71, 90.60, 90.98, 92.36, 90.38, 89.06 ms.  Throughput: 132.26 iter/sec.
Timings for 8064K FFT length (12 cores, 1 worker):  4.82 ms.  Throughput: 207.44 iter/sec.
Timings for 8064K FFT length (12 cores, 4 workers): 29.42, 29.50, 29.47, 29.48 ms.  Throughput: 135.73 iter/sec.
Timings for 8064K FFT length (12 cores, 12 workers): 90.63, 89.95, 91.74, 91.02, 91.00, 91.26, 90.67, 90.72, 90.87, 91.64, 89.91, 89.87 ms.  Throughput: 132.21 iter/sec.
Timings for 8192K FFT length (12 cores, 1 worker):  5.23 ms.  Throughput: 191.24 iter/sec.
Timings for 8192K FFT length (12 cores, 4 workers): 30.05, 30.39, 30.18, 30.38 ms.  Throughput: 132.25 iter/sec.
Timings for 8192K FFT length (12 cores, 12 workers): 92.85, 91.51, 92.40, 92.79, 92.76, 92.82, 92.59, 92.26, 92.58, 93.38, 91.43, 91.16 ms.  Throughput: 129.91 iter/sec.
FFT-Timings:
Code:
AMD Ryzen 9 3900X 12-Core Processor            
CPU speed: 4217.00 MHz, 12 hyperthreaded cores
CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 12x32 KB, L2 cache size: 12x512 KB, L3 cache size: 4x16 MB
L1 cache line size: 64 bytes, L2 cache line size: 64 bytes
Machine topology as determined by hwloc library:
 Machine#0 (total=29788576KB, Backend=Windows, hwlocVersion=2.0.4, ProcessName=prime95.exe)
  Package (total=29788576KB, CPUVendor=AuthenticAMD, CPUFamilyNumber=23, CPUModelNumber=113, CPUModel="AMD Ryzen 9 3900X 12-Core Processor            ", CPUStepping=0)
    L3 (size=16384KB, linesize=64, ways=16, Inclusive=0)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00000003)
            PU#0 (cpuset: 0x00000001)
            PU#1 (cpuset: 0x00000002)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x0000000c)
            PU#2 (cpuset: 0x00000004)
            PU#3 (cpuset: 0x00000008)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00000030)
            PU#4 (cpuset: 0x00000010)
            PU#5 (cpuset: 0x00000020)
    L3 (size=16384KB, linesize=64, ways=16, Inclusive=0)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x000000c0)
            PU#6 (cpuset: 0x00000040)
            PU#7 (cpuset: 0x00000080)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00000300)
            PU#8 (cpuset: 0x00000100)
            PU#9 (cpuset: 0x00000200)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00000c00)
            PU#10 (cpuset: 0x00000400)
            PU#11 (cpuset: 0x00000800)
    L3 (size=16384KB, linesize=64, ways=16, Inclusive=0)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00003000)
            PU#12 (cpuset: 0x00001000)
            PU#13 (cpuset: 0x00002000)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x0000c000)
            PU#14 (cpuset: 0x00004000)
            PU#15 (cpuset: 0x00008000)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00030000)
            PU#16 (cpuset: 0x00010000)
            PU#17 (cpuset: 0x00020000)
    L3 (size=16384KB, linesize=64, ways=16, Inclusive=0)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x000c0000)
            PU#18 (cpuset: 0x00040000)
            PU#19 (cpuset: 0x00080000)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00300000)
            PU#20 (cpuset: 0x00100000)
            PU#21 (cpuset: 0x00200000)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00c00000)
            PU#22 (cpuset: 0x00400000)
            PU#23 (cpuset: 0x00800000)
Prime95 64-bit version 29.8, RdtscTiming=1
Timing FFTs using 12 threads on 12 cores.
Best time for 2048K FFT length: 1.262 ms., avg: 1.309 ms.
Best time for 2240K FFT length: 1.025 ms., avg: 1.042 ms.
Best time for 2304K FFT length: 1.107 ms., avg: 1.122 ms.
Best time for 2400K FFT length: 1.171 ms., avg: 1.180 ms.
Best time for 2560K FFT length: 1.163 ms., avg: 1.178 ms.
Best time for 2688K FFT length: 1.225 ms., avg: 1.253 ms.
Best time for 2800K FFT length: 1.323 ms., avg: 1.340 ms.
Best time for 2880K FFT length: 1.416 ms., avg: 1.443 ms.
Best time for 3072K FFT length: 1.483 ms., avg: 1.524 ms.
Best time for 3200K FFT length: 1.463 ms., avg: 1.478 ms.
Best time for 3360K FFT length: 1.461 ms., avg: 1.478 ms.
Best time for 3584K FFT length: 1.665 ms., avg: 1.680 ms.
Best time for 3840K FFT length: 1.706 ms., avg: 1.743 ms.
Best time for 4096K FFT length: 1.887 ms., avg: 1.927 ms.
Best time for 4480K FFT length: 2.074 ms., avg: 2.108 ms.
Best time for 4608K FFT length: 2.061 ms., avg: 2.087 ms.
Best time for 4800K FFT length: 2.136 ms., avg: 2.161 ms.
Best time for 5120K FFT length: 2.215 ms., avg: 2.251 ms.
Best time for 5376K FFT length: 2.411 ms., avg: 2.447 ms.
Best time for 5600K FFT length: 2.566 ms., avg: 2.604 ms.
Best time for 5760K FFT length: 2.620 ms., avg: 2.741 ms.
Best time for 6144K FFT length: 2.788 ms., avg: 2.853 ms.
Best time for 6400K FFT length: 3.047 ms., avg: 3.214 ms.
Best time for 6720K FFT length: 3.212 ms., avg: 3.355 ms.
Best time for 7168K FFT length: 3.660 ms., avg: 3.755 ms.
Best time for 7680K FFT length: 4.364 ms., avg: 4.562 ms.
Best time for 8000K FFT length: 4.483 ms., avg: 4.617 ms.
Best time for 8064K FFT length: 4.731 ms., avg: 4.860 ms.
Best time for 8192K FFT length: 4.974 ms., avg: 5.100 ms.
And finally the trial factoring benchmark:
Code:
AMD Ryzen 9 3900X 12-Core Processor            
CPU speed: 4216.22 MHz, 12 hyperthreaded cores
CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 12x32 KB, L2 cache size: 12x512 KB, L3 cache size: 4x16 MB
L1 cache line size: 64 bytes, L2 cache line size: 64 bytes
Machine topology as determined by hwloc library:
 Machine#0 (total=29788576KB, Backend=Windows, hwlocVersion=2.0.4, ProcessName=prime95.exe)
  Package (total=29788576KB, CPUVendor=AuthenticAMD, CPUFamilyNumber=23, CPUModelNumber=113, CPUModel="AMD Ryzen 9 3900X 12-Core Processor            ", CPUStepping=0)
    L3 (size=16384KB, linesize=64, ways=16, Inclusive=0)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00000003)
            PU#0 (cpuset: 0x00000001)
            PU#1 (cpuset: 0x00000002)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x0000000c)
            PU#2 (cpuset: 0x00000004)
            PU#3 (cpuset: 0x00000008)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00000030)
            PU#4 (cpuset: 0x00000010)
            PU#5 (cpuset: 0x00000020)
    L3 (size=16384KB, linesize=64, ways=16, Inclusive=0)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x000000c0)
            PU#6 (cpuset: 0x00000040)
            PU#7 (cpuset: 0x00000080)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00000300)
            PU#8 (cpuset: 0x00000100)
            PU#9 (cpuset: 0x00000200)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00000c00)
            PU#10 (cpuset: 0x00000400)
            PU#11 (cpuset: 0x00000800)
    L3 (size=16384KB, linesize=64, ways=16, Inclusive=0)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00003000)
            PU#12 (cpuset: 0x00001000)
            PU#13 (cpuset: 0x00002000)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x0000c000)
            PU#14 (cpuset: 0x00004000)
            PU#15 (cpuset: 0x00008000)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00030000)
            PU#16 (cpuset: 0x00010000)
            PU#17 (cpuset: 0x00020000)
    L3 (size=16384KB, linesize=64, ways=16, Inclusive=0)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x000c0000)
            PU#18 (cpuset: 0x00040000)
            PU#19 (cpuset: 0x00080000)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00300000)
            PU#20 (cpuset: 0x00100000)
            PU#21 (cpuset: 0x00200000)
      L2 (size=512KB, linesize=64, ways=8, Inclusive=1)
        L1d (size=32KB, linesize=64, ways=8, Inclusive=0)
          Core (cpuset: 0x00c00000)
            PU#22 (cpuset: 0x00400000)
            PU#23 (cpuset: 0x00800000)
Prime95 64-bit version 29.8, RdtscTiming=1
Best time for 61 bit trial factors: 0.686 ms.
Best time for 62 bit trial factors: 0.711 ms.
Best time for 63 bit trial factors: 0.705 ms.
Best time for 64 bit trial factors: 0.708 ms.
Best time for 65 bit trial factors: 0.706 ms.
Best time for 66 bit trial factors: 0.698 ms.
Best time for 67 bit trial factors: 0.695 ms.
Best time for 75 bit trial factors: 0.694 ms.
Best time for 76 bit trial factors: 0.686 ms.
Best time for 77 bit trial factors: 0.693 ms.
Meikel is offline   Reply With Quote
Old 2020-01-29, 19:52   #787
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

11·101 Posts
Default

Hi,

Quote:
Originally Posted by Meikel View Post
Seems like AMD has a quite OK CPU for mprime now...
you might want to include benchmarks for 2 workers and it is even better for certain ranges (FFT data fits twice into L3 cache(s) but not 4 times). Current LL Doublecheck fall into this range!

Stock Ryzen 9 3900X with dual DDR4-3200 (dual rank):
Code:
Prime95 64-bit version 29.8, RdtscTiming=1
Timings for 2880K FFT length (12 cores, 1 worker):  1.33 ms.  Throughput: 750.80 iter/sec.
Timings for 2880K FFT length (12 cores, 2 workers):  2.10,  2.10 ms.  Throughput: 954.10 iter/sec.
Oliver
TheJudger is offline   Reply With Quote
Old 2020-01-31, 21:20   #788
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

100010101112 Posts
Default

Hi, some fun with my Ryzen 9 3900X, I think most impressive is part 3!

BIOS defaults (142 W PPT), dual channel DDR4-2400 (dual rank):
2048K, 5760K, 6144K and 6400K flawed by some background processes (e.g. Windows Update?)
Code:
Prime95 64-bit version 29.8, RdtscTiming=1
Timings for 2048K FFT length (12 cores, 1 worker):  1.73 ms.  Throughput: 579.64 iter/sec.
Timings for 2048K FFT length (12 cores, 2 workers):  1.77,  1.77 ms.  Throughput: 1127.98 iter/sec.
Timings for 2240K FFT length (12 cores, 1 worker):  1.38 ms.  Throughput: 724.20 iter/sec.
Timings for 2240K FFT length (12 cores, 2 workers):  1.88,  1.89 ms.  Throughput: 1061.54 iter/sec.
Timings for 2304K FFT length (12 cores, 1 worker):  1.45 ms.  Throughput: 689.66 iter/sec.
Timings for 2304K FFT length (12 cores, 2 workers):  1.95,  1.95 ms.  Throughput: 1024.97 iter/sec.
Timings for 2400K FFT length (12 cores, 1 worker):  1.51 ms.  Throughput: 663.27 iter/sec.
Timings for 2400K FFT length (12 cores, 2 workers):  2.12,  2.06 ms.  Throughput: 957.64 iter/sec.
Timings for 2560K FFT length (12 cores, 1 worker):  1.53 ms.  Throughput: 654.30 iter/sec.
Timings for 2560K FFT length (12 cores, 2 workers):  2.12,  2.13 ms.  Throughput: 940.81 iter/sec.
Timings for 2688K FFT length (12 cores, 1 worker):  1.65 ms.  Throughput: 605.34 iter/sec.
Timings for 2688K FFT length (12 cores, 2 workers):  2.31,  2.27 ms.  Throughput: 873.05 iter/sec.
Timings for 2800K FFT length (12 cores, 1 worker):  2.36 ms.  Throughput: 423.44 iter/sec.
Timings for 2800K FFT length (12 cores, 2 workers):  2.38,  2.38 ms.  Throughput: 839.83 iter/sec.
Timings for 2880K FFT length (12 cores, 1 worker):  1.76 ms.  Throughput: 567.54 iter/sec.
Timings for 2880K FFT length (12 cores, 2 workers):  2.50,  2.53 ms.  Throughput: 795.85 iter/sec.
Timings for 3072K FFT length (12 cores, 1 worker):  2.24 ms.  Throughput: 446.77 iter/sec.
Timings for 3072K FFT length (12 cores, 2 workers):  3.37,  3.22 ms.  Throughput: 607.02 iter/sec.
Timings for 3200K FFT length (12 cores, 1 worker):  2.09 ms.  Throughput: 478.34 iter/sec.
Timings for 3200K FFT length (12 cores, 2 workers):  3.43,  3.38 ms.  Throughput: 586.73 iter/sec.
Timings for 3360K FFT length (12 cores, 1 worker):  2.21 ms.  Throughput: 452.61 iter/sec.
Timings for 3360K FFT length (12 cores, 2 workers):  3.52,  3.74 ms.  Throughput: 551.65 iter/sec.
Timings for 3584K FFT length (12 cores, 1 worker):  2.35 ms.  Throughput: 425.02 iter/sec.
Timings for 3584K FFT length (12 cores, 2 workers):  3.96,  4.31 ms.  Throughput: 484.73 iter/sec.
Timings for 3840K FFT length (12 cores, 1 worker):  2.39 ms.  Throughput: 418.29 iter/sec.
Timings for 3840K FFT length (12 cores, 2 workers):  4.65,  5.26 ms.  Throughput: 405.50 iter/sec.
Timings for 4096K FFT length (12 cores, 1 worker):  2.69 ms.  Throughput: 372.10 iter/sec.
Timings for 4096K FFT length (12 cores, 2 workers):  5.38,  5.87 ms.  Throughput: 356.12 iter/sec.
Timings for 4480K FFT length (12 cores, 1 worker):  3.06 ms.  Throughput: 326.65 iter/sec.
Timings for 4480K FFT length (12 cores, 2 workers):  7.02,  7.73 ms.  Throughput: 271.71 iter/sec.
Timings for 4608K FFT length (12 cores, 1 worker):  2.83 ms.  Throughput: 353.26 iter/sec.
Timings for 4608K FFT length (12 cores, 2 workers):  7.43,  7.13 ms.  Throughput: 274.79 iter/sec.
Timings for 4800K FFT length (12 cores, 1 worker):  2.87 ms.  Throughput: 347.90 iter/sec.
Timings for 4800K FFT length (12 cores, 2 workers):  7.08,  7.92 ms.  Throughput: 267.43 iter/sec.
Timings for 5120K FFT length (12 cores, 1 worker):  3.00 ms.  Throughput: 332.95 iter/sec.
Timings for 5120K FFT length (12 cores, 2 workers):  8.08,  9.03 ms.  Throughput: 234.51 iter/sec.
Timings for 5376K FFT length (12 cores, 1 worker):  3.00 ms.  Throughput: 333.79 iter/sec.
Timings for 5376K FFT length (12 cores, 2 workers):  8.87,  9.06 ms.  Throughput: 223.13 iter/sec.
Timings for 5600K FFT length (12 cores, 1 worker):  3.36 ms.  Throughput: 297.95 iter/sec.
Timings for 5600K FFT length (12 cores, 2 workers):  9.97,  9.96 ms.  Throughput: 200.63 iter/sec.
Timings for 5760K FFT length (12 cores, 1 worker):  3.49 ms.  Throughput: 286.59 iter/sec.
Timings for 5760K FFT length (12 cores, 2 workers): 11.58, 10.94 ms.  Throughput: 177.71 iter/sec.
Timings for 6144K FFT length (12 cores, 1 worker):  3.56 ms.  Throughput: 280.75 iter/sec.
Timings for 6144K FFT length (12 cores, 2 workers): 20.44, 20.29 ms.  Throughput: 98.21 iter/sec.
Timings for 6400K FFT length (12 cores, 1 worker): 15.05 ms.  Throughput: 66.43 iter/sec.
Timings for 6400K FFT length (12 cores, 2 workers): 45.33, 19.30 ms.  Throughput: 73.89 iter/sec.
Timings for 6720K FFT length (12 cores, 1 worker):  3.92 ms.  Throughput: 254.78 iter/sec.
Timings for 6720K FFT length (12 cores, 2 workers): 13.35, 13.36 ms.  Throughput: 149.78 iter/sec.
Timings for 7168K FFT length (12 cores, 1 worker):  4.50 ms.  Throughput: 222.41 iter/sec.
Timings for 7168K FFT length (12 cores, 2 workers): 14.06, 14.19 ms.  Throughput: 141.58 iter/sec.
Timings for 7680K FFT length (12 cores, 1 worker):  5.37 ms.  Throughput: 186.05 iter/sec.
Timings for 7680K FFT length (12 cores, 2 workers): 16.17, 16.17 ms.  Throughput: 123.71 iter/sec.
Timings for 8000K FFT length (12 cores, 1 worker):  5.22 ms.  Throughput: 191.54 iter/sec.
Timings for 8000K FFT length (12 cores, 2 workers): 16.20, 16.32 ms.  Throughput: 122.99 iter/sec.
Timings for 8064K FFT length (12 cores, 1 worker):  5.49 ms.  Throughput: 182.26 iter/sec.
Timings for 8064K FFT length (12 cores, 2 workers): 16.50, 16.69 ms.  Throughput: 120.53 iter/sec.
Timings for 8192K FFT length (12 cores, 1 worker):  5.89 ms.  Throughput: 169.83 iter/sec.
Timings for 8192K FFT length (12 cores, 2 workers): 16.95, 17.09 ms.  Throughput: 117.51 iter/sec.
BIOS defaults (142 W PPT), dual channel DDR4-3200 (dual rank):
Code:
Prime95 64-bit version 29.8, RdtscTiming=1
Timings for 2048K FFT length (12 cores, 1 worker):  1.07 ms.  Throughput: 934.93 iter/sec.
Timings for 2048K FFT length (12 cores, 2 workers):  1.53,  1.53 ms.  Throughput: 1307.78 iter/sec.
Timings for 2240K FFT length (12 cores, 1 worker):  1.03 ms.  Throughput: 968.92 iter/sec.
Timings for 2240K FFT length (12 cores, 2 workers):  1.66,  1.71 ms.  Throughput: 1187.84 iter/sec.
Timings for 2304K FFT length (12 cores, 1 worker):  1.08 ms.  Throughput: 925.87 iter/sec.
Timings for 2304K FFT length (12 cores, 2 workers):  1.63,  1.67 ms.  Throughput: 1212.94 iter/sec.
Timings for 2400K FFT length (12 cores, 1 worker):  1.18 ms.  Throughput: 845.48 iter/sec.
Timings for 2400K FFT length (12 cores, 2 workers):  1.80,  1.77 ms.  Throughput: 1121.96 iter/sec.
Timings for 2560K FFT length (12 cores, 1 worker):  1.18 ms.  Throughput: 846.87 iter/sec.
Timings for 2560K FFT length (12 cores, 2 workers):  2.01,  1.96 ms.  Throughput: 1007.45 iter/sec.
Timings for 2688K FFT length (12 cores, 1 worker):  1.22 ms.  Throughput: 817.39 iter/sec.
Timings for 2688K FFT length (12 cores, 2 workers):  1.97,  1.96 ms.  Throughput: 1015.97 iter/sec.
Timings for 2800K FFT length (12 cores, 1 worker):  1.31 ms.  Throughput: 764.73 iter/sec.
Timings for 2800K FFT length (12 cores, 2 workers):  2.07,  2.03 ms.  Throughput: 975.56 iter/sec.
Timings for 2880K FFT length (12 cores, 1 worker):  1.34 ms.  Throughput: 747.34 iter/sec.
Timings for 2880K FFT length (12 cores, 2 workers):  2.08,  2.12 ms.  Throughput: 952.38 iter/sec.
Timings for 3072K FFT length (12 cores, 1 worker):  1.44 ms.  Throughput: 692.45 iter/sec.
Timings for 3072K FFT length (12 cores, 2 workers):  2.25,  2.27 ms.  Throughput: 884.15 iter/sec.
Timings for 3200K FFT length (12 cores, 1 worker):  1.50 ms.  Throughput: 668.50 iter/sec.
Timings for 3200K FFT length (12 cores, 2 workers):  2.40,  2.40 ms.  Throughput: 833.02 iter/sec.
Timings for 3360K FFT length (12 cores, 1 worker):  1.50 ms.  Throughput: 665.21 iter/sec.
Timings for 3360K FFT length (12 cores, 2 workers):  2.76,  2.75 ms.  Throughput: 725.20 iter/sec.
Timings for 3584K FFT length (12 cores, 1 worker):  1.68 ms.  Throughput: 595.78 iter/sec.
Timings for 3584K FFT length (12 cores, 2 workers):  2.97,  2.98 ms.  Throughput: 671.48 iter/sec.
Timings for 3840K FFT length (12 cores, 1 worker):  1.71 ms.  Throughput: 584.73 iter/sec.
Timings for 3840K FFT length (12 cores, 2 workers):  3.35,  3.36 ms.  Throughput: 596.38 iter/sec.
Timings for 4096K FFT length (12 cores, 1 worker):  1.88 ms.  Throughput: 532.71 iter/sec.
Timings for 4096K FFT length (12 cores, 2 workers):  4.07,  4.06 ms.  Throughput: 492.05 iter/sec.
Timings for 4480K FFT length (12 cores, 1 worker):  2.09 ms.  Throughput: 478.71 iter/sec.
Timings for 4480K FFT length (12 cores, 2 workers):  5.39,  5.32 ms.  Throughput: 373.51 iter/sec.
Timings for 4608K FFT length (12 cores, 1 worker):  2.05 ms.  Throughput: 488.26 iter/sec.
Timings for 4608K FFT length (12 cores, 2 workers):  5.24,  5.23 ms.  Throughput: 382.13 iter/sec.
Timings for 4800K FFT length (12 cores, 1 worker):  2.13 ms.  Throughput: 470.50 iter/sec.
Timings for 4800K FFT length (12 cores, 2 workers):  5.76,  5.76 ms.  Throughput: 347.27 iter/sec.
Timings for 5120K FFT length (12 cores, 1 worker):  2.21 ms.  Throughput: 452.76 iter/sec.
Timings for 5120K FFT length (12 cores, 2 workers):  6.52,  6.53 ms.  Throughput: 306.55 iter/sec.
Timings for 5376K FFT length (12 cores, 1 worker):  2.39 ms.  Throughput: 418.74 iter/sec.
Timings for 5376K FFT length (12 cores, 2 workers):  7.23,  7.37 ms.  Throughput: 273.98 iter/sec.
Timings for 5600K FFT length (12 cores, 1 worker):  2.54 ms.  Throughput: 393.36 iter/sec.
Timings for 5600K FFT length (12 cores, 2 workers):  8.02,  8.02 ms.  Throughput: 249.24 iter/sec.
Timings for 5760K FFT length (12 cores, 1 worker):  2.63 ms.  Throughput: 380.26 iter/sec.
Timings for 5760K FFT length (12 cores, 2 workers):  8.79,  8.64 ms.  Throughput: 229.51 iter/sec.
Timings for 6144K FFT length (12 cores, 1 worker):  2.78 ms.  Throughput: 359.64 iter/sec.
Timings for 6144K FFT length (12 cores, 2 workers):  9.16,  9.13 ms.  Throughput: 218.77 iter/sec.
Timings for 6400K FFT length (12 cores, 1 worker):  2.84 ms.  Throughput: 352.44 iter/sec.
Timings for 6400K FFT length (12 cores, 2 workers):  9.85,  9.85 ms.  Throughput: 203.12 iter/sec.
Timings for 6720K FFT length (12 cores, 1 worker):  3.23 ms.  Throughput: 309.43 iter/sec.
Timings for 6720K FFT length (12 cores, 2 workers): 10.81, 10.64 ms.  Throughput: 186.49 iter/sec.
Timings for 7168K FFT length (12 cores, 1 worker):  3.65 ms.  Throughput: 274.28 iter/sec.
Timings for 7168K FFT length (12 cores, 2 workers): 11.48, 11.48 ms.  Throughput: 174.26 iter/sec.
Timings for 7680K FFT length (12 cores, 1 worker):  4.40 ms.  Throughput: 227.38 iter/sec.
Timings for 7680K FFT length (12 cores, 2 workers): 13.00, 13.02 ms.  Throughput: 153.75 iter/sec.
Timings for 8000K FFT length (12 cores, 1 worker):  4.46 ms.  Throughput: 224.21 iter/sec.
Timings for 8000K FFT length (12 cores, 2 workers): 13.24, 13.24 ms.  Throughput: 151.08 iter/sec.
Timings for 8064K FFT length (12 cores, 1 worker):  4.65 ms.  Throughput: 214.92 iter/sec.
Timings for 8064K FFT length (12 cores, 2 workers): 13.42, 13.42 ms.  Throughput: 149.06 iter/sec.
Timings for 8192K FFT length (12 cores, 1 worker):  4.85 ms.  Throughput: 205.99 iter/sec.
Timings for 8192K FFT length (12 cores, 2 workers): 13.79, 13.78 ms.  Throughput: 145.07 iter/sec.
BIOS defaults (100 W PPT set with Ryzen Master), dual channel DDR4-3200 (dual rank):
Code:
Prime95 64-bit version 29.8, RdtscTiming=1
Timings for 2048K FFT length (12 cores, 1 worker):  1.32 ms.  Throughput: 759.28 iter/sec.
Timings for 2048K FFT length (12 cores, 2 workers):  1.60,  1.59 ms.  Throughput: 1254.81 iter/sec.
Timings for 2240K FFT length (12 cores, 1 worker):  1.07 ms.  Throughput: 935.16 iter/sec.
Timings for 2240K FFT length (12 cores, 2 workers):  1.75,  1.76 ms.  Throughput: 1139.24 iter/sec.
Timings for 2304K FFT length (12 cores, 1 worker):  1.09 ms.  Throughput: 917.06 iter/sec.
Timings for 2304K FFT length (12 cores, 2 workers):  1.71,  1.73 ms.  Throughput: 1162.16 iter/sec.
Timings for 2400K FFT length (12 cores, 1 worker):  1.15 ms.  Throughput: 870.42 iter/sec.
Timings for 2400K FFT length (12 cores, 2 workers):  1.84,  1.85 ms.  Throughput: 1086.44 iter/sec.
Timings for 2560K FFT length (12 cores, 1 worker):  1.23 ms.  Throughput: 814.20 iter/sec.
Timings for 2560K FFT length (12 cores, 2 workers):  2.01,  2.01 ms.  Throughput: 994.44 iter/sec.
Timings for 2688K FFT length (12 cores, 1 worker):  1.24 ms.  Throughput: 809.06 iter/sec.
Timings for 2688K FFT length (12 cores, 2 workers):  2.05,  2.05 ms.  Throughput: 975.72 iter/sec.
Timings for 2800K FFT length (12 cores, 1 worker):  1.87 ms.  Throughput: 534.27 iter/sec.
Timings for 2800K FFT length (12 cores, 2 workers):  2.16,  2.16 ms.  Throughput: 925.65 iter/sec.
Timings for 2880K FFT length (12 cores, 1 worker):  1.37 ms.  Throughput: 731.17 iter/sec.
Timings for 2880K FFT length (12 cores, 2 workers):  2.20,  2.19 ms.  Throughput: 911.47 iter/sec.
Timings for 3072K FFT length (12 cores, 1 worker):  1.49 ms.  Throughput: 670.10 iter/sec.
Timings for 3072K FFT length (12 cores, 2 workers):  2.35,  2.27 ms.  Throughput: 865.36 iter/sec.
Timings for 3200K FFT length (12 cores, 1 worker):  1.52 ms.  Throughput: 658.08 iter/sec.
Timings for 3200K FFT length (12 cores, 2 workers):  2.61,  2.51 ms.  Throughput: 781.95 iter/sec.
Timings for 3360K FFT length (12 cores, 1 worker):  1.53 ms.  Throughput: 652.05 iter/sec.
Timings for 3360K FFT length (12 cores, 2 workers):  2.67,  2.69 ms.  Throughput: 747.01 iter/sec.
Timings for 3584K FFT length (12 cores, 1 worker):  1.70 ms.  Throughput: 587.06 iter/sec.
Timings for 3584K FFT length (12 cores, 2 workers):  2.99,  3.02 ms.  Throughput: 665.09 iter/sec.
Timings for 3840K FFT length (12 cores, 1 worker):  1.75 ms.  Throughput: 569.94 iter/sec.
Timings for 3840K FFT length (12 cores, 2 workers):  3.37,  3.38 ms.  Throughput: 592.54 iter/sec.
Timings for 4096K FFT length (12 cores, 1 worker):  1.92 ms.  Throughput: 520.63 iter/sec.
Timings for 4096K FFT length (12 cores, 2 workers):  4.13,  4.05 ms.  Throughput: 489.30 iter/sec.
Timings for 4480K FFT length (12 cores, 1 worker):  2.10 ms.  Throughput: 477.04 iter/sec.
Timings for 4480K FFT length (12 cores, 2 workers):  5.47,  5.32 ms.  Throughput: 370.75 iter/sec.
Timings for 4608K FFT length (12 cores, 1 worker):  2.09 ms.  Throughput: 478.53 iter/sec.
Timings for 4608K FFT length (12 cores, 2 workers):  5.31,  5.38 ms.  Throughput: 374.50 iter/sec.
Timings for 4800K FFT length (12 cores, 1 worker):  2.15 ms.  Throughput: 464.74 iter/sec.
Timings for 4800K FFT length (12 cores, 2 workers):  5.80,  5.76 ms.  Throughput: 346.07 iter/sec.
Timings for 5120K FFT length (12 cores, 1 worker):  2.25 ms.  Throughput: 445.05 iter/sec.
Timings for 5120K FFT length (12 cores, 2 workers):  6.67,  6.62 ms.  Throughput: 301.14 iter/sec.
Timings for 5376K FFT length (12 cores, 1 worker):  2.45 ms.  Throughput: 407.43 iter/sec.
Timings for 5376K FFT length (12 cores, 2 workers):  7.29,  7.31 ms.  Throughput: 274.12 iter/sec.
Timings for 5600K FFT length (12 cores, 1 worker):  2.56 ms.  Throughput: 391.23 iter/sec.
Timings for 5600K FFT length (12 cores, 2 workers):  8.10,  8.11 ms.  Throughput: 246.73 iter/sec.
Timings for 5760K FFT length (12 cores, 1 worker):  2.69 ms.  Throughput: 371.30 iter/sec.
Timings for 5760K FFT length (12 cores, 2 workers):  8.75,  8.84 ms.  Throughput: 227.40 iter/sec.
Timings for 6144K FFT length (12 cores, 1 worker):  2.83 ms.  Throughput: 353.38 iter/sec.
Timings for 6144K FFT length (12 cores, 2 workers):  9.36,  9.20 ms.  Throughput: 215.52 iter/sec.
Timings for 6400K FFT length (12 cores, 1 worker):  2.90 ms.  Throughput: 344.26 iter/sec.
Timings for 6400K FFT length (12 cores, 2 workers):  9.84,  9.85 ms.  Throughput: 203.16 iter/sec.
Timings for 6720K FFT length (12 cores, 1 worker):  3.34 ms.  Throughput: 299.58 iter/sec.
Timings for 6720K FFT length (12 cores, 2 workers): 10.83, 10.85 ms.  Throughput: 184.52 iter/sec.
Timings for 7168K FFT length (12 cores, 1 worker):  3.71 ms.  Throughput: 269.23 iter/sec.
Timings for 7168K FFT length (12 cores, 2 workers): 11.47, 11.57 ms.  Throughput: 173.60 iter/sec.
Timings for 7680K FFT length (12 cores, 1 worker):  4.42 ms.  Throughput: 226.09 iter/sec.
Timings for 7680K FFT length (12 cores, 2 workers): 13.11, 12.99 ms.  Throughput: 153.28 iter/sec.
Timings for 8000K FFT length (12 cores, 1 worker):  4.52 ms.  Throughput: 221.24 iter/sec.
Timings for 8000K FFT length (12 cores, 2 workers): 13.17, 13.15 ms.  Throughput: 152.00 iter/sec.
Timings for 8064K FFT length (12 cores, 1 worker):  4.64 ms.  Throughput: 215.55 iter/sec.
Timings for 8064K FFT length (12 cores, 2 workers): 13.43, 13.51 ms.  Throughput: 148.48 iter/sec.
Timings for 8192K FFT length (12 cores, 1 worker):  4.96 ms.  Throughput: 201.73 iter/sec.
Timings for 8192K FFT length (12 cores, 2 workers): 13.93, 13.93 ms.  Throughput: 143.56 iter/sec.
TheJudger is offline   Reply With Quote
Old 2020-02-01, 15:21   #789
JCoveiro
 
"Jorge Coveiro"
Nov 2006
Moura, Portugal

1A16 Posts
Default AMD Ryzen9 3950X 16-Core, 64GB DDR4-3200 Corsair RGB Pro, X570 ASUS Crosshair VIII Hero (Wi-Fi)

Here are some AMD 3950x benchmarks:

Stock + RamCache III + Memory Clock: 1600 + Fabric Clock: 1600
(Throughput + FFT + Trial) Benchmarks:
Code:
#########################################################
Throughput Benchmark:

AMD Ryzen 9 3950X 16-Core Processor            
CPU speed: 4239.86 MHz, 16 hyperthreaded cores
CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB
L1 cache line size: 64 bytes, L2 cache line size: 64 bytes
Prime95 64-bit version 29.8, RdtscTiming=1
Timings for 2048K FFT length (16 cores, 1 worker):  1.52 ms.  Throughput: 659.51 iter/sec.
Timings for 2048K FFT length (16 cores, 2 workers):  1.34,  1.34 ms.  Throughput: 1491.01 iter/sec.
Timings for 2240K FFT length (16 cores, 1 worker):  1.04 ms.  Throughput: 963.01 iter/sec.
Timings for 2240K FFT length (16 cores, 2 workers):  1.42,  1.42 ms.  Throughput: 1405.76 iter/sec.
Timings for 2304K FFT length (16 cores, 1 worker):  1.07 ms.  Throughput: 934.74 iter/sec.
Timings for 2304K FFT length (16 cores, 2 workers):  1.42,  1.42 ms.  Throughput: 1409.12 iter/sec.
Timings for 2400K FFT length (16 cores, 1 worker):  1.17 ms.  Throughput: 851.33 iter/sec.
Timings for 2400K FFT length (16 cores, 2 workers):  1.58,  1.54 ms.  Throughput: 1282.51 iter/sec.
Timings for 2560K FFT length (16 cores, 1 worker):  1.17 ms.  Throughput: 857.44 iter/sec.
Timings for 2560K FFT length (16 cores, 2 workers):  1.61,  1.63 ms.  Throughput: 1233.47 iter/sec.
Timings for 2688K FFT length (16 cores, 1 worker):  1.21 ms.  Throughput: 828.15 iter/sec.
Timings for 2688K FFT length (16 cores, 2 workers):  1.69,  1.70 ms.  Throughput: 1177.94 iter/sec.
Timings for 2800K FFT length (16 cores, 1 worker):  1.94 ms.  Throughput: 515.93 iter/sec.
Timings for 2800K FFT length (16 cores, 2 workers):  1.79,  1.81 ms.  Throughput: 1108.87 iter/sec.
Timings for 2880K FFT length (16 cores, 1 worker):  1.34 ms.  Throughput: 749.01 iter/sec.
Timings for 2880K FFT length (16 cores, 2 workers):  1.80,  1.86 ms.  Throughput: 1091.24 iter/sec.
Timings for 3072K FFT length (16 cores, 1 worker):  1.45 ms.  Throughput: 689.52 iter/sec.
Timings for 3072K FFT length (16 cores, 2 workers):  2.17,  2.17 ms.  Throughput: 921.70 iter/sec.
Timings for 3200K FFT length (16 cores, 1 worker):  1.41 ms.  Throughput: 708.09 iter/sec.
Timings for 3200K FFT length (16 cores, 2 workers):  2.18,  2.18 ms.  Throughput: 918.07 iter/sec.
Timings for 3360K FFT length (16 cores, 1 worker):  1.53 ms.  Throughput: 655.59 iter/sec.
Timings for 3360K FFT length (16 cores, 2 workers):  2.96,  2.93 ms.  Throughput: 679.02 iter/sec.
Timings for 3584K FFT length (16 cores, 1 worker):  1.65 ms.  Throughput: 604.66 iter/sec.
Timings for 3584K FFT length (16 cores, 2 workers):  2.99,  3.03 ms.  Throughput: 664.45 iter/sec.
Timings for 3840K FFT length (16 cores, 1 worker):  1.63 ms.  Throughput: 614.97 iter/sec.
Timings for 3840K FFT length (16 cores, 2 workers):  3.58,  3.39 ms.  Throughput: 574.03 iter/sec.
Timings for 4096K FFT length (16 cores, 1 worker):  1.89 ms.  Throughput: 528.07 iter/sec.
Timings for 4096K FFT length (16 cores, 2 workers):  4.27,  4.27 ms.  Throughput: 468.29 iter/sec.
Timings for 4480K FFT length (16 cores, 1 worker):  2.08 ms.  Throughput: 480.12 iter/sec.
Timings for 4480K FFT length (16 cores, 2 workers):  5.33,  5.32 ms.  Throughput: 375.61 iter/sec.
Timings for 4608K FFT length (16 cores, 1 worker):  1.94 ms.  Throughput: 514.56 iter/sec.
Timings for 4608K FFT length (16 cores, 2 workers):  5.42,  5.57 ms.  Throughput: 363.87 iter/sec.
Timings for 4800K FFT length (16 cores, 1 worker):  2.01 ms.  Throughput: 498.65 iter/sec.
Timings for 4800K FFT length (16 cores, 2 workers):  5.68,  5.71 ms.  Throughput: 351.21 iter/sec.
Timings for 5120K FFT length (16 cores, 1 worker):  2.12 ms.  Throughput: 470.89 iter/sec.
Timings for 5120K FFT length (16 cores, 2 workers):  6.44,  6.51 ms.  Throughput: 309.04 iter/sec.
Timings for 5376K FFT length (16 cores, 1 worker):  2.32 ms.  Throughput: 430.39 iter/sec.
Timings for 5376K FFT length (16 cores, 2 workers):  7.05,  7.11 ms.  Throughput: 282.55 iter/sec.
Timings for 5600K FFT length (16 cores, 1 worker):  2.39 ms.  Throughput: 418.58 iter/sec.
Timings for 5600K FFT length (16 cores, 2 workers):  7.77,  7.75 ms.  Throughput: 257.70 iter/sec.
Timings for 5760K FFT length (16 cores, 1 worker):  2.59 ms.  Throughput: 386.79 iter/sec.
Timings for 5760K FFT length (16 cores, 2 workers):  8.52,  8.53 ms.  Throughput: 234.56 iter/sec.
Timings for 6144K FFT length (16 cores, 1 worker):  2.66 ms.  Throughput: 375.77 iter/sec.
Timings for 6144K FFT length (16 cores, 2 workers):  8.89,  8.88 ms.  Throughput: 225.04 iter/sec.
Timings for 6400K FFT length (16 cores, 1 worker):  2.80 ms.  Throughput: 356.80 iter/sec.
Timings for 6400K FFT length (16 cores, 2 workers):  9.41,  9.47 ms.  Throughput: 211.91 iter/sec.
Timings for 6720K FFT length (16 cores, 1 worker):  3.43 ms.  Throughput: 291.55 iter/sec.
Timings for 6720K FFT length (16 cores, 2 workers): 10.74, 10.65 ms.  Throughput: 187.03 iter/sec.
Timings for 7168K FFT length (16 cores, 1 worker):  3.57 ms.  Throughput: 280.15 iter/sec.
Timings for 7168K FFT length (16 cores, 2 workers): 11.22, 11.26 ms.  Throughput: 177.94 iter/sec.
Timings for 7680K FFT length (16 cores, 1 worker):  4.46 ms.  Throughput: 224.28 iter/sec.
Timings for 7680K FFT length (16 cores, 2 workers): 12.94, 12.94 ms.  Throughput: 154.59 iter/sec.
Timings for 8000K FFT length (16 cores, 1 worker):  4.35 ms.  Throughput: 229.82 iter/sec.
Timings for 8000K FFT length (16 cores, 2 workers): 13.07, 13.03 ms.  Throughput: 153.31 iter/sec.
Timings for 8064K FFT length (16 cores, 1 worker):  4.55 ms.  Throughput: 219.93 iter/sec.
Timings for 8064K FFT length (16 cores, 2 workers): 13.25, 13.20 ms.  Throughput: 151.25 iter/sec.
Timings for 8192K FFT length (16 cores, 1 worker):  4.81 ms.  Throughput: 208.04 iter/sec.
Timings for 8192K FFT length (16 cores, 2 workers): 13.61, 13.53 ms.  Throughput: 147.36 iter/sec.

#########################################################
FFT Timings Benchmark:

AMD Ryzen 9 3950X 16-Core Processor            
CPU speed: 4239.35 MHz, 16 hyperthreaded cores
CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB
L1 cache line size: 64 bytes, L2 cache line size: 64 bytes
Prime95 64-bit version 29.8, RdtscTiming=1
Timing FFTs using 16 threads on 16 cores.
Best time for 2048K FFT length: 1.470 ms., avg: 1.505 ms.
Best time for 2240K FFT length: 1.043 ms., avg: 1.071 ms.
Best time for 2304K FFT length: 1.025 ms., avg: 1.068 ms.
Best time for 2400K FFT length: 1.107 ms., avg: 1.139 ms.
Best time for 2560K FFT length: 1.103 ms., avg: 1.139 ms.
Best time for 2688K FFT length: 1.178 ms., avg: 1.212 ms.
Best time for 2800K FFT length: 1.437 ms., avg: 1.490 ms.
Best time for 2880K FFT length: 1.369 ms., avg: 1.439 ms.
Best time for 3072K FFT length: 1.418 ms., avg: 1.441 ms.
Best time for 3200K FFT length: 1.373 ms., avg: 1.412 ms.
Best time for 3360K FFT length: 1.459 ms., avg: 1.500 ms.
Best time for 3584K FFT length: 1.627 ms., avg: 1.647 ms.
Best time for 3840K FFT length: 1.590 ms., avg: 1.622 ms.
Best time for 4096K FFT length: 1.825 ms., avg: 1.851 ms.
Best time for 4480K FFT length: 2.056 ms., avg: 2.081 ms.
Best time for 4608K FFT length: 1.916 ms., avg: 1.978 ms.
Best time for 4800K FFT length: 1.980 ms., avg: 2.014 ms.
Best time for 5120K FFT length: 2.061 ms., avg: 2.091 ms.
Best time for 5376K FFT length: 2.248 ms., avg: 2.301 ms.
Best time for 5600K FFT length: 2.348 ms., avg: 2.434 ms.
Best time for 5760K FFT length: 2.507 ms., avg: 2.669 ms.
Best time for 6144K FFT length: 2.605 ms., avg: 2.646 ms.
Best time for 6400K FFT length: 2.746 ms., avg: 2.862 ms.
Best time for 6720K FFT length: 3.049 ms., avg: 3.141 ms.
Best time for 7168K FFT length: 3.429 ms., avg: 3.610 ms.
Best time for 7680K FFT length: 4.324 ms., avg: 4.512 ms.
Best time for 8000K FFT length: 4.246 ms., avg: 4.360 ms.
Best time for 8064K FFT length: 4.450 ms., avg: 4.538 ms.
Best time for 8192K FFT length: 4.640 ms., avg: 4.816 ms.

#########################################################
Trial Factoring Benchmark:

AMD Ryzen 9 3950X 16-Core Processor            
CPU speed: 4239.79 MHz, 16 hyperthreaded cores
CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB
L1 cache line size: 64 bytes, L2 cache line size: 64 bytes
Prime95 64-bit version 29.8, RdtscTiming=1
Best time for 61 bit trial factors: 0.697 ms.
Best time for 62 bit trial factors: 0.720 ms.
Best time for 63 bit trial factors: 0.718 ms.
Best time for 64 bit trial factors: 0.729 ms.
Best time for 65 bit trial factors: 0.721 ms.
Best time for 66 bit trial factors: 0.680 ms.
Best time for 67 bit trial factors: 0.662 ms.
Best time for 75 bit trial factors: 0.942 ms.
Best time for 76 bit trial factors: 0.715 ms.
Best time for 77 bit trial factors: 0.702 ms.
PBO Overclock + RamCache III + RamDisk + Memory Overclock: 1800 + Fabric Overclock: 1800
(Throughput + FFT + Trial) Benchmarks:
Code:
#########################################################
Throughput Benchmark:

AMD Ryzen 9 3950X 16-Core Processor            
CPU speed: 4240.84 MHz, 16 hyperthreaded cores
CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB
L1 cache line size: 64 bytes, L2 cache line size: 64 bytes
Prime95 64-bit version 29.8, RdtscTiming=1
Timings for 2048K FFT length (16 cores, 1 worker):  1.38 ms.  Throughput: 723.23 iter/sec.
Timings for 2048K FFT length (16 cores, 2 workers):  1.27,  1.26 ms.  Throughput: 1579.14 iter/sec.
Timings for 2240K FFT length (16 cores, 1 worker):  0.91 ms.  Throughput: 1104.29 iter/sec.
Timings for 2240K FFT length (16 cores, 2 workers):  1.33,  1.34 ms.  Throughput: 1500.80 iter/sec.
Timings for 2304K FFT length (16 cores, 1 worker):  1.19 ms.  Throughput: 837.94 iter/sec.
Timings for 2304K FFT length (16 cores, 2 workers):  1.33,  1.38 ms.  Throughput: 1477.86 iter/sec.
Timings for 2400K FFT length (16 cores, 1 worker):  1.40 ms.  Throughput: 713.60 iter/sec.
Timings for 2400K FFT length (16 cores, 2 workers):  1.42,  1.43 ms.  Throughput: 1403.26 iter/sec.
Timings for 2560K FFT length (16 cores, 1 worker):  1.01 ms.  Throughput: 990.78 iter/sec.
Timings for 2560K FFT length (16 cores, 2 workers):  1.54,  1.53 ms.  Throughput: 1302.31 iter/sec.
Timings for 2688K FFT length (16 cores, 1 worker):  1.07 ms.  Throughput: 938.33 iter/sec.
Timings for 2688K FFT length (16 cores, 2 workers):  2.10,  2.16 ms.  Throughput: 938.32 iter/sec.
Timings for 2800K FFT length (16 cores, 1 worker):  1.42 ms.  Throughput: 702.78 iter/sec.
Timings for 2800K FFT length (16 cores, 2 workers):  1.67,  1.68 ms.  Throughput: 1193.15 iter/sec.
Timings for 2880K FFT length (16 cores, 1 worker):  1.16 ms.  Throughput: 861.84 iter/sec.
Timings for 2880K FFT length (16 cores, 2 workers):  1.67,  1.68 ms.  Throughput: 1196.04 iter/sec.
Timings for 3072K FFT length (16 cores, 1 worker):  1.28 ms.  Throughput: 782.42 iter/sec.
Timings for 3072K FFT length (16 cores, 2 workers):  1.81,  1.83 ms.  Throughput: 1101.26 iter/sec.
Timings for 3200K FFT length (16 cores, 1 worker):  1.26 ms.  Throughput: 794.87 iter/sec.
Timings for 3200K FFT length (16 cores, 2 workers):  3.09,  3.12 ms.  Throughput: 644.20 iter/sec.
Timings for 3360K FFT length (16 cores, 1 worker):  1.29 ms.  Throughput: 775.47 iter/sec.
Timings for 3360K FFT length (16 cores, 2 workers):  2.33,  2.38 ms.  Throughput: 850.57 iter/sec.
Timings for 3584K FFT length (16 cores, 1 worker):  1.47 ms.  Throughput: 682.57 iter/sec.
Timings for 3584K FFT length (16 cores, 2 workers):  2.74,  2.71 ms.  Throughput: 733.36 iter/sec.
Timings for 3840K FFT length (16 cores, 1 worker):  1.46 ms.  Throughput: 686.72 iter/sec.
Timings for 3840K FFT length (16 cores, 2 workers):  2.86,  2.99 ms.  Throughput: 684.17 iter/sec.
Timings for 4096K FFT length (16 cores, 1 worker):  1.67 ms.  Throughput: 599.08 iter/sec.
Timings for 4096K FFT length (16 cores, 2 workers):  3.60,  3.67 ms.  Throughput: 550.61 iter/sec.
Timings for 4480K FFT length (16 cores, 1 worker):  1.83 ms.  Throughput: 546.01 iter/sec.
Timings for 4480K FFT length (16 cores, 2 workers):  4.77,  4.87 ms.  Throughput: 415.00 iter/sec.
Timings for 4608K FFT length (16 cores, 1 worker):  1.78 ms.  Throughput: 561.78 iter/sec.
Timings for 4608K FFT length (16 cores, 2 workers):  4.55,  4.65 ms.  Throughput: 434.67 iter/sec.
Timings for 4800K FFT length (16 cores, 1 worker):  1.81 ms.  Throughput: 551.30 iter/sec.
Timings for 4800K FFT length (16 cores, 2 workers):  5.08,  5.17 ms.  Throughput: 390.10 iter/sec.
Timings for 5120K FFT length (16 cores, 1 worker):  1.89 ms.  Throughput: 529.66 iter/sec.
Timings for 5120K FFT length (16 cores, 2 workers):  5.82,  5.93 ms.  Throughput: 340.44 iter/sec.
Timings for 5376K FFT length (16 cores, 1 worker):  2.07 ms.  Throughput: 483.40 iter/sec.
Timings for 5376K FFT length (16 cores, 2 workers):  6.74,  6.66 ms.  Throughput: 298.58 iter/sec.
Timings for 5600K FFT length (16 cores, 1 worker):  2.16 ms.  Throughput: 462.27 iter/sec.
Timings for 5600K FFT length (16 cores, 2 workers):  7.26,  7.33 ms.  Throughput: 274.15 iter/sec.
Timings for 5760K FFT length (16 cores, 1 worker):  2.32 ms.  Throughput: 430.87 iter/sec.
Timings for 5760K FFT length (16 cores, 2 workers):  7.86,  7.98 ms.  Throughput: 252.51 iter/sec.
Timings for 6144K FFT length (16 cores, 1 worker):  2.39 ms.  Throughput: 418.61 iter/sec.
Timings for 6144K FFT length (16 cores, 2 workers):  7.76,  7.89 ms.  Throughput: 255.60 iter/sec.
Timings for 6400K FFT length (16 cores, 1 worker):  2.53 ms.  Throughput: 395.61 iter/sec.
Timings for 6400K FFT length (16 cores, 2 workers):  8.54,  8.67 ms.  Throughput: 232.40 iter/sec.
Timings for 6720K FFT length (16 cores, 1 worker):  2.88 ms.  Throughput: 347.72 iter/sec.
Timings for 6720K FFT length (16 cores, 2 workers):  9.67,  9.70 ms.  Throughput: 206.46 iter/sec.
Timings for 7168K FFT length (16 cores, 1 worker):  3.18 ms.  Throughput: 314.60 iter/sec.
Timings for 7168K FFT length (16 cores, 2 workers): 10.22, 10.22 ms.  Throughput: 195.70 iter/sec.
Timings for 7680K FFT length (16 cores, 1 worker):  4.04 ms.  Throughput: 247.63 iter/sec.
Timings for 7680K FFT length (16 cores, 2 workers): 11.46, 11.81 ms.  Throughput: 171.99 iter/sec.
Timings for 8000K FFT length (16 cores, 1 worker):  4.01 ms.  Throughput: 249.18 iter/sec.
Timings for 8000K FFT length (16 cores, 2 workers): 11.65, 11.84 ms.  Throughput: 170.29 iter/sec.
Timings for 8064K FFT length (16 cores, 1 worker):  4.13 ms.  Throughput: 242.19 iter/sec.
Timings for 8064K FFT length (16 cores, 2 workers): 11.89, 11.88 ms.  Throughput: 168.27 iter/sec.
Timings for 8192K FFT length (16 cores, 1 worker):  4.40 ms.  Throughput: 227.32 iter/sec.
Timings for 8192K FFT length (16 cores, 2 workers): 12.20, 12.21 ms.  Throughput: 163.84 iter/sec.

#########################################################
FFT Timings Benchmark:

AMD Ryzen 9 3950X 16-Core Processor            
CPU speed: 4212.31 MHz, 16 hyperthreaded cores
CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB
L1 cache line size: 64 bytes, L2 cache line size: 64 bytes
Prime95 64-bit version 29.8, RdtscTiming=1
Timing FFTs using 16 threads on 16 cores.
Best time for 2048K FFT length: 1.327 ms., avg: 1.372 ms.
Best time for 2240K FFT length: 0.893 ms., avg: 0.922 ms.
Best time for 2304K FFT length: 1.140 ms., avg: 1.172 ms.
Best time for 2400K FFT length: 1.001 ms., avg: 1.019 ms.
Best time for 2560K FFT length: 1.024 ms., avg: 1.046 ms.
Best time for 2688K FFT length: 1.040 ms., avg: 1.055 ms.
Best time for 2800K FFT length: 1.452 ms., avg: 1.493 ms.
Best time for 2880K FFT length: 1.126 ms., avg: 1.173 ms.
Best time for 3072K FFT length: 1.274 ms., avg: 1.291 ms.
Best time for 3200K FFT length: 1.224 ms., avg: 1.251 ms.
Best time for 3360K FFT length: 1.319 ms., avg: 1.334 ms.
Best time for 3584K FFT length: 1.460 ms., avg: 1.477 ms.
Best time for 3840K FFT length: 1.459 ms., avg: 1.474 ms.
Best time for 4096K FFT length: 1.619 ms., avg: 1.643 ms.
Best time for 4480K FFT length: 1.792 ms., avg: 1.831 ms.
Best time for 4608K FFT length: 1.718 ms., avg: 1.736 ms.
Best time for 4800K FFT length: 1.793 ms., avg: 1.817 ms.
Best time for 5120K FFT length: 1.860 ms., avg: 1.882 ms.
Best time for 5376K FFT length: 2.026 ms., avg: 2.051 ms.
Best time for 5600K FFT length: 2.120 ms., avg: 2.189 ms.
Best time for 5760K FFT length: 2.244 ms., avg: 2.288 ms.
Best time for 6144K FFT length: 2.310 ms., avg: 2.368 ms.
Best time for 6400K FFT length: 2.463 ms., avg: 2.527 ms.
Best time for 6720K FFT length: 2.790 ms., avg: 2.867 ms.
Best time for 7168K FFT length: 3.072 ms., avg: 3.203 ms.
Best time for 7680K FFT length: 3.927 ms., avg: 4.069 ms.
Best time for 8000K FFT length: 3.864 ms., avg: 3.961 ms.
Best time for 8064K FFT length: 4.018 ms., avg: 4.177 ms.
Best time for 8192K FFT length: 4.306 ms., avg: 4.464 ms.

#########################################################
Trial Factoring Benchmark:

AMD Ryzen 9 3950X 16-Core Processor            
CPU speed: 4212.78 MHz, 16 hyperthreaded cores
CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB
L1 cache line size: 64 bytes, L2 cache line size: 64 bytes
Prime95 64-bit version 29.8, RdtscTiming=1
Best time for 61 bit trial factors: 0.718 ms.
Best time for 62 bit trial factors: 0.730 ms.
Best time for 63 bit trial factors: 0.670 ms.
Best time for 64 bit trial factors: 0.721 ms.
Best time for 65 bit trial factors: 0.674 ms.
Best time for 66 bit trial factors: 0.708 ms.
Best time for 67 bit trial factors: 0.715 ms.
Best time for 75 bit trial factors: 0.711 ms.
Best time for 76 bit trial factors: 0.703 ms.
Best time for 77 bit trial factors: 0.699 ms.
JCoveiro is offline   Reply With Quote
Old 2020-02-02, 17:38   #790
JCoveiro
 
"Jorge Coveiro"
Nov 2006
Moura, Portugal

2×13 Posts
Default AMD Ryzen9 3950X 16-Core - UPDATE!

I also decided to test with "manual overclocking".

Manual Overclocking - 4.2GHz@1.25v + Memory Overclock: 1800 + Fabric Overclock: 1800 + RamCache III

Here are the benchmarks:

Code:
#################################################
Throughput Benchmark:

AMD Ryzen 9 3950X 16-Core Processor            
CPU speed: 4190.69 MHz, 16 hyperthreaded cores
CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB
L1 cache line size: 64 bytes, L2 cache line size: 64 bytes
Prime95 64-bit version 29.8, RdtscTiming=1
Timings for 2048K FFT length (16 cores, 1 worker):  1.03 ms.  Throughput: 974.93 iter/sec.
Timings for 2048K FFT length (16 cores, 2 workers):  1.24,  1.21 ms.  Throughput: 1637.97 iter/sec.
Timings for 2240K FFT length (16 cores, 1 worker):  0.95 ms.  Throughput: 1047.98 iter/sec.
Timings for 2240K FFT length (16 cores, 2 workers):  1.30,  1.30 ms.  Throughput: 1539.05 iter/sec.
Timings for 2304K FFT length (16 cores, 1 worker):  0.95 ms.  Throughput: 1056.92 iter/sec.
Timings for 2304K FFT length (16 cores, 2 workers):  1.31,  1.33 ms.  Throughput: 1519.39 iter/sec.
Timings for 2400K FFT length (16 cores, 1 worker):  1.47 ms.  Throughput: 681.38 iter/sec.
Timings for 2400K FFT length (16 cores, 2 workers):  1.41,  1.43 ms.  Throughput: 1408.85 iter/sec.
Timings for 2560K FFT length (16 cores, 1 worker):  1.00 ms.  Throughput: 1002.11 iter/sec.
Timings for 2560K FFT length (16 cores, 2 workers):  1.46,  1.48 ms.  Throughput: 1363.25 iter/sec.
Timings for 2688K FFT length (16 cores, 1 worker):  1.05 ms.  Throughput: 948.17 iter/sec.
Timings for 2688K FFT length (16 cores, 2 workers):  1.52,  1.54 ms.  Throughput: 1308.20 iter/sec.
Timings for 2800K FFT length (16 cores, 1 worker):  1.42 ms.  Throughput: 704.31 iter/sec.
Timings for 2800K FFT length (16 cores, 2 workers):  1.66,  1.67 ms.  Throughput: 1203.02 iter/sec.
Timings for 2880K FFT length (16 cores, 1 worker):  1.46 ms.  Throughput: 686.80 iter/sec.
Timings for 2880K FFT length (16 cores, 2 workers):  1.64,  1.66 ms.  Throughput: 1215.08 iter/sec.
Timings for 3072K FFT length (16 cores, 1 worker):  1.29 ms.  Throughput: 776.18 iter/sec.
Timings for 3072K FFT length (16 cores, 2 workers):  1.80,  1.81 ms.  Throughput: 1106.82 iter/sec.
Timings for 3200K FFT length (16 cores, 1 worker):  1.24 ms.  Throughput: 809.10 iter/sec.
Timings for 3200K FFT length (16 cores, 2 workers):  2.29,  2.40 ms.  Throughput: 852.48 iter/sec.
Timings for 3360K FFT length (16 cores, 1 worker):  1.28 ms.  Throughput: 783.01 iter/sec.
Timings for 3360K FFT length (16 cores, 2 workers):  2.08,  2.10 ms.  Throughput: 957.96 iter/sec.
Timings for 3584K FFT length (16 cores, 1 worker):  1.47 ms.  Throughput: 680.75 iter/sec.
Timings for 3584K FFT length (16 cores, 2 workers):  2.58,  2.65 ms.  Throughput: 764.92 iter/sec.
Timings for 3840K FFT length (16 cores, 1 worker):  1.43 ms.  Throughput: 698.16 iter/sec.
Timings for 3840K FFT length (16 cores, 2 workers):  2.99,  2.98 ms.  Throughput: 669.79 iter/sec.
Timings for 4096K FFT length (16 cores, 1 worker):  1.67 ms.  Throughput: 598.28 iter/sec.
Timings for 4096K FFT length (16 cores, 2 workers):  3.58,  3.73 ms.  Throughput: 547.76 iter/sec.
Timings for 4480K FFT length (16 cores, 1 worker):  1.84 ms.  Throughput: 542.99 iter/sec.
Timings for 4480K FFT length (16 cores, 2 workers):  5.13,  4.99 ms.  Throughput: 395.23 iter/sec.
Timings for 4608K FFT length (16 cores, 1 worker):  1.77 ms.  Throughput: 566.01 iter/sec.
Timings for 4608K FFT length (16 cores, 2 workers):  5.80,  5.69 ms.  Throughput: 348.11 iter/sec.
Timings for 4800K FFT length (16 cores, 1 worker):  1.78 ms.  Throughput: 562.41 iter/sec.
Timings for 4800K FFT length (16 cores, 2 workers):  5.60,  5.56 ms.  Throughput: 358.40 iter/sec.
Timings for 5120K FFT length (16 cores, 1 worker):  1.90 ms.  Throughput: 525.25 iter/sec.
Timings for 5120K FFT length (16 cores, 2 workers):  5.87,  5.89 ms.  Throughput: 340.13 iter/sec.
Timings for 5376K FFT length (16 cores, 1 worker):  2.04 ms.  Throughput: 491.12 iter/sec.
Timings for 5376K FFT length (16 cores, 2 workers):  6.46,  6.46 ms.  Throughput: 309.57 iter/sec.
Timings for 5600K FFT length (16 cores, 1 worker):  2.15 ms.  Throughput: 465.32 iter/sec.
Timings for 5600K FFT length (16 cores, 2 workers):  7.29,  7.29 ms.  Throughput: 274.39 iter/sec.
Timings for 5760K FFT length (16 cores, 1 worker):  2.30 ms.  Throughput: 434.51 iter/sec.
Timings for 5760K FFT length (16 cores, 2 workers):  7.95,  7.95 ms.  Throughput: 251.48 iter/sec.
Timings for 6144K FFT length (16 cores, 1 worker):  2.37 ms.  Throughput: 421.43 iter/sec.
Timings for 6144K FFT length (16 cores, 2 workers):  8.16,  8.16 ms.  Throughput: 245.05 iter/sec.
Timings for 6400K FFT length (16 cores, 1 worker):  2.61 ms.  Throughput: 383.66 iter/sec.
Timings for 6400K FFT length (16 cores, 2 workers):  8.45,  8.76 ms.  Throughput: 232.49 iter/sec.
Timings for 6720K FFT length (16 cores, 1 worker):  2.90 ms.  Throughput: 345.24 iter/sec.
Timings for 6720K FFT length (16 cores, 2 workers):  9.65,  9.61 ms.  Throughput: 207.74 iter/sec.
Timings for 7168K FFT length (16 cores, 1 worker):  3.18 ms.  Throughput: 314.10 iter/sec.
Timings for 7168K FFT length (16 cores, 2 workers): 10.23, 10.15 ms.  Throughput: 196.29 iter/sec.
Timings for 7680K FFT length (16 cores, 1 worker):  4.09 ms.  Throughput: 244.50 iter/sec.
Timings for 7680K FFT length (16 cores, 2 workers): 11.65, 11.61 ms.  Throughput: 171.92 iter/sec.
Timings for 8000K FFT length (16 cores, 1 worker):  4.02 ms.  Throughput: 248.51 iter/sec.
Timings for 8000K FFT length (16 cores, 2 workers): 11.67, 11.68 ms.  Throughput: 171.31 iter/sec.
Timings for 8064K FFT length (16 cores, 1 worker):  4.13 ms.  Throughput: 241.98 iter/sec.
Timings for 8064K FFT length (16 cores, 2 workers): 11.97, 11.89 ms.  Throughput: 167.65 iter/sec.
Timings for 8192K FFT length (16 cores, 1 worker):  4.38 ms.  Throughput: 228.46 iter/sec.
Timings for 8192K FFT length (16 cores, 2 workers): 11.98, 12.11 ms.  Throughput: 166.05 iter/sec.

####################################################
FFT Timings Benchmark:

AMD Ryzen 9 3950X 16-Core Processor            
CPU speed: 4185.26 MHz, 16 hyperthreaded cores
CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB
L1 cache line size: 64 bytes, L2 cache line size: 64 bytes
Prime95 64-bit version 29.8, RdtscTiming=1
Timing FFTs using 16 threads on 16 cores.
Best time for 2048K FFT length: 1.013 ms., avg: 1.044 ms.
Best time for 2240K FFT length: 0.877 ms., avg: 0.897 ms.
Best time for 2304K FFT length: 1.168 ms., avg: 1.208 ms.
Best time for 2400K FFT length: 1.017 ms., avg: 1.031 ms.
Best time for 2560K FFT length: 1.013 ms., avg: 1.047 ms.
Best time for 2688K FFT length: 1.035 ms., avg: 1.054 ms.
Best time for 2800K FFT length: 1.945 ms., avg: 1.988 ms.
Best time for 2880K FFT length: 1.134 ms., avg: 1.160 ms.
Best time for 3072K FFT length: 1.287 ms., avg: 1.298 ms.
Best time for 3200K FFT length: 1.248 ms., avg: 1.271 ms.
Best time for 3360K FFT length: 1.276 ms., avg: 1.300 ms.
Best time for 3584K FFT length: 1.466 ms., avg: 1.480 ms.
Best time for 3840K FFT length: 1.412 ms., avg: 1.443 ms.
Best time for 4096K FFT length: 1.653 ms., avg: 1.682 ms.
Best time for 4480K FFT length: 1.840 ms., avg: 1.877 ms.
Best time for 4608K FFT length: 1.704 ms., avg: 1.724 ms.
Best time for 4800K FFT length: 1.764 ms., avg: 1.779 ms.
Best time for 5120K FFT length: 1.867 ms., avg: 1.884 ms.
Best time for 5376K FFT length: 2.021 ms., avg: 2.042 ms.
Best time for 5600K FFT length: 2.115 ms., avg: 2.138 ms.
Best time for 5760K FFT length: 2.271 ms., avg: 2.311 ms.
Best time for 6144K FFT length: 2.359 ms., avg: 2.405 ms.
Best time for 6400K FFT length: 2.460 ms., avg: 2.518 ms.
Best time for 6720K FFT length: 2.804 ms., avg: 2.887 ms.
Best time for 7168K FFT length: 3.096 ms., avg: 3.152 ms.
Best time for 7680K FFT length: 3.958 ms., avg: 4.032 ms.
Best time for 8000K FFT length: 3.967 ms., avg: 4.045 ms.
Best time for 8064K FFT length: 4.016 ms., avg: 4.134 ms.
Best time for 8192K FFT length: 4.271 ms., avg: 4.396 ms.

####################################################
Trial Factoring Benchmark:

AMD Ryzen 9 3950X 16-Core Processor            
CPU speed: 4185.88 MHz, 16 hyperthreaded cores
CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB
L1 cache line size: 64 bytes, L2 cache line size: 64 bytes
Prime95 64-bit version 29.8, RdtscTiming=1
Best time for 61 bit trial factors: 0.710 ms.
Best time for 62 bit trial factors: 0.714 ms.
Best time for 63 bit trial factors: 0.710 ms.
Best time for 64 bit trial factors: 0.713 ms.
Best time for 65 bit trial factors: 0.710 ms.
Best time for 66 bit trial factors: 0.694 ms.
Best time for 67 bit trial factors: 0.692 ms.
Best time for 75 bit trial factors: 0.688 ms.
Best time for 76 bit trial factors: 0.689 ms.
Best time for 77 bit trial factors: 0.690 ms.

Last fiddled with by JCoveiro on 2020-02-02 at 17:46
JCoveiro is offline   Reply With Quote
Old 2020-03-25, 04:24   #791
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

2·3·1,693 Posts
Default

Here are some interesting comparisons. I ran benchmarks for 5760K FFT, on a 6700K CPU at 4400, 4200, and 4000 MHz. 32 GiB dual rank DDR4-3000 RAM. By narrow margins 4200 MHz came out best, but all three sets top out within a few it/sec of each other.

4400 was the first run. After that, I dropped the 4-core-1-worker test because 2 workers were coming out better.
Attached Files
File Type: txt CPU clock bench.txt (9.1 KB, 108 views)

Last fiddled with by kladner on 2020-03-25 at 04:27
kladner is offline   Reply With Quote
Old 2020-03-25, 14:29   #792
Viliam Furik
 
"Viliam Furík"
Jul 2018
Martin, Slovakia

54 Posts
Default

Quote:
Originally Posted by kladner View Post
Here are some interesting comparisons. I ran benchmarks for 5760K FFT, on a 6700K CPU at 4400, 4200, and 4000 MHz. 32 GiB dual rank DDR4-3000 RAM. By narrow margins 4200 MHz came out best, but all three sets top out within a few it/sec of each other.

4400 was the first run. After that, I dropped the 4-core-1-worker test because 2 workers were coming out better.
Could you do it on range of FFTs (e.g. 2048 to 8192), and without testing all implementations?
Viliam Furik is offline   Reply With Quote
Old 2020-03-25, 16:59   #793
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

2·3·1,693 Posts
Default

Quote:
Originally Posted by Viliam Furik View Post
Could you do it on range of FFTs (e.g. 2048 to 8192), and without testing all implementations?
It will take a little while, but under lockdown I have time on my hands.
EDIT: Here ya go.
Attached Files
File Type: txt CPU clock bench 02.txt (9.6 KB, 193 views)

Last fiddled with by kladner on 2020-03-25 at 17:49
kladner is offline   Reply With Quote
Old 2020-03-25, 21:18   #794
Viliam Furik
 
"Viliam Furík"
Jul 2018
Martin, Slovakia

27116 Posts
Default

Quote:
Originally Posted by kladner View Post
It will take a little while, but under lockdown I have time on my hands.
EDIT: Here ya go.
Thank you. I've looked at the results, and it seems like there is little advantage from RAM speed with those speeds and that CPU. And for some reason, the 4000 turns out fastest on some FFT lengths, but it is not that much so it may be a measurement error (some background tasks).

Is it possible that when RAM is faster than CPU clock, it will not be used the same as with faster CPU? I'm thinking about this exact thing, because I have 3200 MHz RAM now, and I want to know whether I should buy 4000 MHz or 4400 MHz RAM because my CPU is manually overclocked to 4,1 GHz (Ryzen 9 3900X).
Viliam Furik is offline   Reply With Quote
Old 2020-03-25, 22:02   #795
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

165678 Posts
Default

Quote:
Originally Posted by Viliam Furik View Post
Thank you. I've looked at the results, and it seems like there is little advantage from RAM speed with those speeds and that CPU.
Note the massive L3 cache on that chip. It is not surprising that RAM speed is not a factor.
Prime95 is offline   Reply With Quote
Old 2020-03-26, 01:51   #796
axn
 
axn's Avatar
 
Jun 2003

13DF16 Posts
Default

Quote:
Originally Posted by Prime95 View Post
Note the massive L3 cache on that chip. It is not surprising that RAM speed is not a factor.
L3 is 8MB on a 6700K, which is mediocre.
axn is offline   Reply With Quote
Old 2020-03-26, 02:25   #797
VBCurtis
 
VBCurtis's Avatar
 
"Curtis"
Feb 2005
Riverside, CA

2·2,437 Posts
Default

I think he was referring to the Ryzen?
Edit: 70MB L3, if my search result is to be believed.

Last fiddled with by VBCurtis on 2020-03-26 at 02:26
VBCurtis is offline   Reply With Quote
Old 2020-03-26, 03:04   #798
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

19×397 Posts
Default

Quote:
Originally Posted by axn View Post
L3 is 8MB on a 6700K, which is mediocre.
Sorry, confused the benchmark with JCoveiro's Ryzen.
Prime95 is offline   Reply With Quote
Old 2020-03-26, 03:40   #799
axn
 
axn's Avatar
 
Jun 2003

10011110111112 Posts
Default

Quote:
Originally Posted by Prime95 View Post
Sorry, confused the benchmark with JCoveiro's Ryzen.
Ok, makes sense. That still leaves the question of why mem speed is not affecting performance. I suppose, with just 4 cores, all of the tested RAM speeds are sufficient to feed the CPU.

Quote:
Originally Posted by VBCurtis View Post
I think he was referring to the Ryzen?
Edit: 70MB L3, if my search result is to be believed.
Ryzen L3 are all power-of-2, but since it is a victim cache, AMD specs list L2+L3 as a single "cache" number. So if you see 70MB for a processor, that is 64MB L3 + 6MB (12*512KB) L2.
axn is offline   Reply With Quote
Old 2020-03-26, 16:15   #800
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

2·3·1,693 Posts
Default

I have to note that the RAM speed was the same in all these runs. Only the CPU clock was different. I took the very similar results as an indication that this system is memory-bound.
EDIT: Also, these tests were not run under strict lab-like conditions. I did shut down obvious cycle-stealing apps like performance monitors and browsers. In line with normal operation on the machine I deliberately left mfaktc running on the GPU as part of the environment. Allowance has to be made for margin-of-error.

Last fiddled with by kladner on 2020-03-26 at 16:21
kladner is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Perpetual "interesting video" thread... Xyzzy Lounge 43 2021-07-17 00:00
LLR benchmark thread Oddball Riesel Prime Search 5 2010-08-02 00:11
Perpetual I'm pi**ed off thread rogue Soap Box 19 2009-10-28 19:17
Perpetual autostereogram thread... Xyzzy Lounge 10 2006-09-28 00:36
Perpetual ECM factoring challenge thread... Xyzzy Factoring 65 2005-09-05 08:16

All times are UTC. The time now is 21:51.


Fri Aug 6 21:51:21 UTC 2021 up 14 days, 16:20, 1 user, load averages: 2.68, 2.54, 2.51

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.