![]() |
![]() |
#760 |
"/X\(‘-‘)/X\"
Jan 2013
29×101 Posts |
![]()
Do remember that Haswell and later bump the VCore +0.1 when doing AVX2/FMA3 (which Prime95 will use).
|
![]() |
![]() |
![]() |
#761 |
Jan 2003
7×29 Posts |
![]()
The current Core architecture has been around for 6-7 years and has lots of time for software optimisations to be done for it. Ryzen has been out for 1 month... I remember back in the v26.xx days, Prime95 was a lot slower before optimisation.
My Ryzen is mainly working on World Community Grid projects (generally not highly optimised code). There I see a 2.5x the points per day compared to my 2600K @ stock clockspeeds. At the same clockspeed, the IPC is around Haswell level and it has double the number of cores. |
![]() |
![]() |
![]() |
#762 | |
Apr 2017
2 Posts |
![]() Quote:
Hardware: CPU: i7-7700K - at 4.9 GHz with AVX Core Ratio Negative Offset of 2 (which reduces the multiplier to 47 when running Prime95, giving a clock speed of 4.7 GHz) CPU Cooler: Noctua NH-D15 Motherboard: ASUS TUF Z270 Mark 1 - BIOS version 0906 PSU: EVGA SuperNOVA 550 G3 RAM: 2x Corsair Vengeance LPX DDR4 3200MHz 8GB - at XMP 3200 MHz OS Drive: Corsair Force MP500 240GB M.2 PCIe SSD Code:
[Sun Apr 23 11:02:28 2017] Compare your results to other computers at http://www.mersenne.org/report_benchmarks Intel(R) Core(TM) i7-7700K CPU @ 4.20GHz CPU speed: 4551.15 MHz, 4 hyperthreaded cores CPU features: Prefetchw, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 32 KB L2 cache size: 256 KB, L3 cache size: 8 MB L1 cache line size: 64 bytes L2 cache line size: 64 bytes TLBS: 64 Prime95 64-bit version 29.1, RdtscTiming=1 Timings for 2048K FFT length (4 cpus, 1 worker): 2.19 ms. Throughput: 457.47 iter/sec. Timings for 2048K FFT length (4 cpus, 4 workers): 10.28, 9.28, 9.05, 9.78 ms. Throughput: 417.80 iter/sec. Timings for 2048K FFT length (4 cpus hyperthreaded, 1 worker): 2.50 ms. Throughput: 400.18 iter/sec. Timings for 2048K FFT length (4 cpus hyperthreaded, 4 workers): 10.57, 10.02, 13.69, 10.16 ms. Throughput: 365.92 iter/sec. Timings for 2560K FFT length (4 cpus, 1 worker): 2.82 ms. Throughput: 354.13 iter/sec. Timings for 2560K FFT length (4 cpus, 4 workers): 12.16, 11.62, 11.63, 13.58 ms. Throughput: 327.87 iter/sec. Timings for 2560K FFT length (4 cpus hyperthreaded, 1 worker): 3.18 ms. Throughput: 314.12 iter/sec. Timings for 2560K FFT length (4 cpus hyperthreaded, 4 workers): 13.08, 12.59, 16.81, 12.99 ms. Throughput: 292.38 iter/sec. Timings for 3072K FFT length (4 cpus, 1 worker): 3.45 ms. Throughput: 289.75 iter/sec. Timings for 3072K FFT length (4 cpus, 4 workers): 15.42, 14.62, 14.27, 15.11 ms. Throughput: 269.51 iter/sec. Timings for 3072K FFT length (4 cpus hyperthreaded, 1 worker): 3.96 ms. Throughput: 252.73 iter/sec. Timings for 3072K FFT length (4 cpus hyperthreaded, 4 workers): 16.08, 15.22, 20.47, 15.33 ms. Throughput: 242.01 iter/sec. Timings for 3584K FFT length (4 cpus, 1 worker): 4.11 ms. Throughput: 243.02 iter/sec. Timings for 3584K FFT length (4 cpus, 4 workers): 19.07, 17.39, 16.95, 17.41 ms. Throughput: 226.40 iter/sec. Timings for 3584K FFT length (4 cpus hyperthreaded, 1 worker): 4.73 ms. Throughput: 211.63 iter/sec. Timings for 3584K FFT length (4 cpus hyperthreaded, 4 workers): 18.80, 18.03, 24.42, 17.97 ms. Throughput: 205.23 iter/sec. Timings for 4096K FFT length (4 cpus, 1 worker): 4.79 ms. Throughput: 208.79 iter/sec. Timings for 4096K FFT length (4 cpus, 4 workers): 21.20, 19.72, 19.60, 20.85 ms. Throughput: 196.85 iter/sec. [Sun Apr 23 11:07:29 2017] Timings for 4096K FFT length (4 cpus hyperthreaded, 1 worker): 5.46 ms. Throughput: 183.31 iter/sec. Timings for 4096K FFT length (4 cpus hyperthreaded, 4 workers): 21.19, 20.58, 27.73, 21.08 ms. Throughput: 179.28 iter/sec. Timings for 5120K FFT length (4 cpus, 1 worker): 6.09 ms. Throughput: 164.13 iter/sec. Timings for 5120K FFT length (4 cpus, 4 workers): 28.35, 24.43, 24.09, 24.33 ms. Throughput: 158.81 iter/sec. Timings for 5120K FFT length (4 cpus hyperthreaded, 1 worker): 6.97 ms. Throughput: 143.50 iter/sec. Timings for 5120K FFT length (4 cpus hyperthreaded, 4 workers): 27.05, 25.88, 34.61, 25.76 ms. Throughput: 143.33 iter/sec. Timings for 6144K FFT length (4 cpus, 1 worker): 7.94 ms. Throughput: 125.99 iter/sec. Timings for 6144K FFT length (4 cpus, 4 workers): 33.12, 30.44, 30.03, 34.13 ms. Throughput: 125.65 iter/sec. Timings for 6144K FFT length (4 cpus hyperthreaded, 1 worker): 8.88 ms. Throughput: 112.61 iter/sec. Timings for 6144K FFT length (4 cpus hyperthreaded, 4 workers): 33.69, 32.84, 46.51, 32.49 ms. Throughput: 112.42 iter/sec. Timings for 7168K FFT length (4 cpus, 1 worker): 9.31 ms. Throughput: 107.45 iter/sec. Timings for 7168K FFT length (4 cpus, 4 workers): 35.75, 35.32, 35.11, 45.61 ms. Throughput: 106.69 iter/sec. Timings for 7168K FFT length (4 cpus hyperthreaded, 1 worker): 10.54 ms. Throughput: 94.88 iter/sec. Timings for 7168K FFT length (4 cpus hyperthreaded, 4 workers): 39.98, 38.35, 54.51, 40.08 ms. Throughput: 94.39 iter/sec. Timings for 8192K FFT length (4 cpus, 1 worker): 10.75 ms. Throughput: 93.03 iter/sec. Timings for 8192K FFT length (4 cpus, 4 workers): 43.70, 41.72, 40.36, 46.95 ms. Throughput: 92.93 iter/sec. Timings for 8192K FFT length (4 cpus hyperthreaded, 1 worker): 12.24 ms. Throughput: 81.67 iter/sec. Timings for 8192K FFT length (4 cpus hyperthreaded, 4 workers): 47.77, 45.73, 62.07, 45.13 ms. Throughput: 81.07 iter/sec. |
|
![]() |
![]() |
![]() |
#763 |
"/X\(‘-‘)/X\"
Jan 2013
292910 Posts |
![]()
If there ever were a case for more memory bandwidth...
It's nice to see a 4 core with over 200 iter/ms at 4096K FFT though! |
![]() |
![]() |
![]() |
#764 |
Undefined
"The unspeakable one"
Jun 2006
My evil lair
2×3×1,019 Posts |
![]() |
![]() |
![]() |
![]() |
#765 |
"/X\(‘-‘)/X\"
Jan 2013
29·101 Posts |
![]() |
![]() |
![]() |
![]() |
#766 |
Apr 2003
Berlin, Germany
192 Posts |
![]()
CPU: Ryzen 7 1700X stock (mostly running 3.5GHz on all cores with Prime95)
CPU Cooler: Noctua NH-D15 SE-AM4 Motherboard: ASRock Taichi X370 BIOS version: 2.34 (Beta with AGESA 1.0.0.6) RAM: 2x16GB G.Skill TridentZ DDR4 3200MHz 14-14-14-36-1T dual rank OS: Win10Pro x64 Code:
Compare your results to other computers at http://www.mersenne.org/report_benchmarks AMD Ryzen 7 1700X Eight-Core Processor CPU speed: 3400.29 MHz, 8 hyperthreaded cores CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 32 KB L2 cache size: 512 KB, L3 cache size: 16 MB L1 cache line size: 64 bytes L2 cache line size: 64 bytes L1 TLBS: 64 L2 TLBS: 1536 Prime95 64-bit version 29.1, RdtscTiming=1 Best time for 2048K FFT length: 13.522 ms., avg: 13.721 ms. Best time for 2560K FFT length: 17.576 ms., avg: 18.112 ms. Best time for 3072K FFT length: 21.111 ms., avg: 21.577 ms. Best time for 3584K FFT length: 25.393 ms., avg: 26.027 ms. Best time for 4096K FFT length: 28.571 ms., avg: 30.210 ms. Best time for 5120K FFT length: 36.353 ms., avg: 36.910 ms. Best time for 6144K FFT length: 43.307 ms., avg: 43.586 ms. Best time for 7168K FFT length: 51.546 ms., avg: 51.943 ms. Best time for 8192K FFT length: 58.803 ms., avg: 60.538 ms. Timing FFTs using 2 threads on 1 core. Best time for 2048K FFT length: 15.871 ms., avg: 16.169 ms. Best time for 2560K FFT length: 20.416 ms., avg: 20.792 ms. Best time for 3072K FFT length: 24.707 ms., avg: 25.089 ms. Best time for 3584K FFT length: 30.205 ms., avg: 30.626 ms. Best time for 4096K FFT length: 34.890 ms., avg: 35.156 ms. Best time for 5120K FFT length: 42.726 ms., avg: 44.333 ms. Best time for 6144K FFT length: 50.388 ms., avg: 51.474 ms. Best time for 7168K FFT length: 59.980 ms., avg: 60.897 ms. Best time for 8192K FFT length: 70.055 ms., avg: 70.686 ms. Timing FFTs using 8 threads on 8 cores. Best time for 2048K FFT length: 2.107 ms., avg: 2.420 ms. Best time for 2560K FFT length: 3.006 ms., avg: 3.540 ms. Best time for 3072K FFT length: 3.573 ms., avg: 4.106 ms. Best time for 3584K FFT length: 4.235 ms., avg: 4.553 ms. Best time for 4096K FFT length: 4.794 ms., avg: 5.213 ms. Best time for 5120K FFT length: 5.536 ms., avg: 5.879 ms. Best time for 6144K FFT length: 6.778 ms., avg: 6.975 ms. Best time for 7168K FFT length: 7.995 ms., avg: 8.094 ms. Best time for 8192K FFT length: 9.107 ms., avg: 9.352 ms. Timing FFTs using 16 threads on 8 cores. Best time for 2048K FFT length: 2.381 ms., avg: 2.501 ms. Best time for 2560K FFT length: 3.175 ms., avg: 3.610 ms. Best time for 3072K FFT length: 3.882 ms., avg: 4.355 ms. Best time for 3584K FFT length: 4.646 ms., avg: 5.146 ms. Best time for 4096K FFT length: 5.256 ms., avg: 5.451 ms. Best time for 5120K FFT length: 6.213 ms., avg: 6.490 ms. Best time for 6144K FFT length: 7.405 ms., avg: 7.650 ms. Best time for 7168K FFT length: 8.841 ms., avg: 8.956 ms. Best time for 8192K FFT length: 10.258 ms., avg: 10.349 ms. Code:
Prime95 64-bit version 29.1, RdtscTiming=1 Timings for 2048K FFT length (1 cpu, 1 worker): 14.30 ms. Throughput: 69.93 iter/sec. Timings for 2048K FFT length (8 cpus, 1 worker): 2.14 ms. Throughput: 467.81 iter/sec. Timings for 2048K FFT length (8 cpus, 8 workers): 16.24, 16.03, 15.99, 16.03, 16.02, 15.95, 16.18, 16.03 ms. Throughput: 498.14 iter/sec. Timings for 2560K FFT length (1 cpu, 1 worker): 18.11 ms. Throughput: 55.23 iter/sec. Timings for 2560K FFT length (8 cpus, 1 worker): 3.03 ms. Throughput: 330.57 iter/sec. Timings for 2560K FFT length (8 cpus, 8 workers): 22.72, 22.41, 22.23, 22.18, 22.13, 22.05, 22.34, 22.21 ms. Throughput: 359.04 iter/sec. Timings for 3072K FFT length (1 cpu, 1 worker): 21.96 ms. Throughput: 45.54 iter/sec. Timings for 3072K FFT length (8 cpus, 1 worker): 3.63 ms. Throughput: 275.83 iter/sec. Timings for 3072K FFT length (8 cpus, 8 workers): 27.70, 27.15, 26.61, 26.67, 26.34, 26.48, 26.85, 26.81 ms. Throughput: 298.28 iter/sec. Timings for 3584K FFT length (1 cpu, 1 worker): 26.18 ms. Throughput: 38.20 iter/sec. Timings for 3584K FFT length (8 cpus, 1 worker): 4.28 ms. Throughput: 233.54 iter/sec. Timings for 3584K FFT length (8 cpus, 8 workers): 32.17, 31.36, 31.35, 31.24, 31.32, 31.04, 31.43, 31.45 ms. Throughput: 254.64 iter/sec. [Sun May 28 21:29:40 2017] Timings for 4096K FFT length (1 cpu, 1 worker): 29.84 ms. Throughput: 33.51 iter/sec. Timings for 4096K FFT length (8 cpus, 1 worker): 4.87 ms. Throughput: 205.25 iter/sec. Timings for 4096K FFT length (8 cpus, 8 workers): 36.36, 35.91, 35.76, 35.53, 35.79, 35.88, 36.13, 36.07 ms. Throughput: 222.68 iter/sec. Timings for 5120K FFT length (1 cpu, 1 worker): 37.73 ms. Throughput: 26.50 iter/sec. Timings for 5120K FFT length (8 cpus, 1 worker): 5.53 ms. Throughput: 180.79 iter/sec. Timings for 5120K FFT length (8 cpus, 8 workers): 43.10, 42.08, 42.24, 41.99, 41.81, 41.76, 41.92, 42.09 ms. Throughput: 189.93 iter/sec. Timings for 6144K FFT length (1 cpu, 1 worker): 44.95 ms. Throughput: 22.25 iter/sec. Timings for 6144K FFT length (8 cpus, 1 worker): 6.86 ms. Throughput: 145.67 iter/sec. Timings for 6144K FFT length (8 cpus, 8 workers): 51.06, 50.97, 50.29, 50.15, 50.18, 50.24, 50.41, 50.41 ms. Throughput: 158.54 iter/sec. Timings for 7168K FFT length (1 cpu, 1 worker): 53.47 ms. Throughput: 18.70 iter/sec. Timings for 7168K FFT length (8 cpus, 1 worker): 8.10 ms. Throughput: 123.49 iter/sec. Timings for 7168K FFT length (8 cpus, 8 workers): 61.59, 60.51, 59.82, 60.19, 60.22, 59.97, 60.62, 60.58 ms. Throughput: 132.37 iter/sec. Timings for 8192K FFT length (1 cpu, 1 worker): 60.89 ms. Throughput: 16.42 iter/sec. Timings for 8192K FFT length (8 cpus, 1 worker): 9.13 ms. Throughput: 109.52 iter/sec. Timings for 8192K FFT length (8 cpus, 8 workers): 70.80, 69.62, 69.27, 70.24, 69.69, 68.67, 69.21, 69.33 ms. Throughput: 114.95 iter/sec. Last fiddled with by Dresdenboy on 2017-05-28 at 19:58 |
![]() |
![]() |
![]() |
#767 | ||
Apr 2003
Berlin, Germany
1011010012 Posts |
![]()
With new data at hand, we could do some more comparisons.
Quote:
Code:
Timings for 8192K FFT length (8 cpus, 8 workers): 70.80, 69.62, 69.27, 70.24, 69.69, 68.67, 69.21, 69.33 ms. Throughput: 114.95 iter/sec. Quote:
Code:
Timing FFTs using 8 threads on 8 cores. <snip> Best time for 8192K FFT length: 9.107 ms., avg: 9.352 ms. Last fiddled with by Dresdenboy on 2017-05-28 at 21:44 |
||
![]() |
![]() |
![]() |
#768 | |
Random Account
Aug 2009
19·101 Posts |
![]()
Below are the timings for my new build. The hardware specs are as follows:
Intel i7-7700, 3.6 GHZ, Turbo 4.2 GHz. RAM: Kingston Hyper Fury DDR-4 2400 2*4GB Main Board: Asus Prime B250M-A CPU Cooler: Arctic i11 (Recommended by Mark Rose.) Prime95 v29.1 Build 14 Quote:
|
|
![]() |
![]() |
![]() |
#769 | |
"/X\(‘-‘)/X\"
Jan 2013
29·101 Posts |
![]() Quote:
If you want to save a little power, you can go into your BIOS and tweak the 4-core Turbo speed and lower it to 3.6 GHz. You should still get almost the same throughput. That way if you're running mprime the power usage and heat will be lower, but when running anything that doesn't use all four 4 cores it will get the full turbo speed. |
|
![]() |
![]() |
![]() |
#770 | |
Random Account
Aug 2009
191910 Posts |
![]() Quote:
I've made several trips back into the BIOS since. I loaded the default settings and everything started running much cooler. The power pull now is 142 watts and core temps are running in the low to mid 60's. Again, with Prime95 only, running a P-1. I don't know what settings the BIOS originally had. The first time I powered up, I just sat here and looked at it for a while. There is a lot in there. The CPU is hovering between 3.96 and 4.02 GHz. However, and this is something I will probably need to put in the Prime95 area. It's only using four, of the eight, threads available, and will not allow me to use more than one worker. Curious! |
|
![]() |
![]() |
![]() |
Thread Tools | |
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Perpetual "interesting video" thread... | Xyzzy | Lounge | 39 | 2021-03-12 14:19 |
LLR benchmark thread | Oddball | Riesel Prime Search | 5 | 2010-08-02 00:11 |
Perpetual I'm pi**ed off thread | rogue | Soap Box | 19 | 2009-10-28 19:17 |
Perpetual autostereogram thread... | Xyzzy | Lounge | 10 | 2006-09-28 00:36 |
Perpetual ECM factoring challenge thread... | Xyzzy | Factoring | 65 | 2005-09-05 08:16 |