![]() |
![]() |
#782 |
Aug 2013
23×11 Posts |
![]()
Also flies on this mobo with this RAM. Getting 3.75ms for a 2^80,000,000-1 LL test, which is amazing considering it consumes 1/2 the power of my 7820X which iterates a similar test at 2.8ms per iteration.
Code:
Intel(R) Core(TM) i9-9900K CPU @ 3.60GHz CPU speed: 4726.39 MHz, 8 hyperthreaded cores CPU features: Prefetchw, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 32 KB L2 cache size: 256 KB, L3 cache size: 16 MB L1 cache line size: 64 bytes L2 cache line size: 64 bytes TLBS: 64 Machine topology as determined by hwloc library: Machine#0 (total=31511272KB, Backend=Windows, hwlocVersion=1.11.9, ProcessName=prime95.exe) NUMANode#0 (local=31511272KB, total=31511272KB) Package#0 (CPUVendor=GenuineIntel, CPUFamilyNumber=6, CPUModelNumber=158, CPUModel="Intel(R) Core(TM) i9-9900K CPU @ 3.60GHz", CPUStepping=12) L3 (size=16384KB, linesize=64, ways=16, Inclusive=1) L2 (size=256KB, linesize=64, ways=4, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000003) PU#0 (cpuset: 0x00000001) PU#1 (cpuset: 0x00000002) L2 (size=256KB, linesize=64, ways=4, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x0000000c) PU#2 (cpuset: 0x00000004) PU#3 (cpuset: 0x00000008) L2 (size=256KB, linesize=64, ways=4, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000030) PU#4 (cpuset: 0x00000010) PU#5 (cpuset: 0x00000020) L2 (size=256KB, linesize=64, ways=4, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x000000c0) PU#6 (cpuset: 0x00000040) PU#7 (cpuset: 0x00000080) L2 (size=256KB, linesize=64, ways=4, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000300) PU#8 (cpuset: 0x00000100) PU#9 (cpuset: 0x00000200) L2 (size=256KB, linesize=64, ways=4, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000c00) PU#10 (cpuset: 0x00000400) PU#11 (cpuset: 0x00000800) L2 (size=256KB, linesize=64, ways=4, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00003000) PU#12 (cpuset: 0x00001000) PU#13 (cpuset: 0x00002000) L2 (size=256KB, linesize=64, ways=4, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x0000c000) PU#14 (cpuset: 0x00004000) PU#15 (cpuset: 0x00008000) Prime95 64-bit version 29.4, RdtscTiming=1 Timings for 2048K FFT length (8 cores, 1 worker): 0.96 ms. Throughput: 1036.60 iter/sec. Timings for 2048K FFT length (8 cores, 8 workers): 14.73, 14.52, 14.76, 14.67, 14.67, 14.70, 14.65, 14.70 ms. Throughput: 545.27 iter/sec. Timings for 2304K FFT length (8 cores, 1 worker): 1.15 ms. Throughput: 870.29 iter/sec. Timings for 2304K FFT length (8 cores, 8 workers): 16.38, 16.63, 16.38, 16.49, 16.47, 16.42, 16.54, 16.77 ms. Throughput: 484.55 iter/sec. Timings for 2400K FFT length (8 cores, 1 worker): 1.28 ms. Throughput: 784.01 iter/sec. Timings for 2400K FFT length (8 cores, 8 workers): 17.36, 17.35, 17.20, 17.33, 17.23, 17.44, 17.36, 17.49 ms. Throughput: 461.21 iter/sec. Timings for 2560K FFT length (8 cores, 1 worker): 1.43 ms. Throughput: 696.89 iter/sec. Timings for 2560K FFT length (8 cores, 8 workers): 18.41, 18.56, 18.42, 18.34, 18.42, 18.56, 18.48, 18.50 ms. Throughput: 433.34 iter/sec. Timings for 2688K FFT length (8 cores, 1 worker): 1.67 ms. Throughput: 597.61 iter/sec. Timings for 2688K FFT length (8 cores, 8 workers): 19.34, 19.31, 19.33, 19.62, 19.35, 19.38, 19.36, 19.40 ms. Throughput: 412.69 iter/sec. Timings for 2880K FFT length (8 cores, 1 worker): 1.84 ms. Throughput: 544.23 iter/sec. Timings for 2880K FFT length (8 cores, 8 workers): 20.91, 20.98, 20.76, 20.69, 20.64, 21.02, 20.69, 21.04 ms. Throughput: 383.87 iter/sec. Timings for 3072K FFT length (8 cores, 1 worker): 2.05 ms. Throughput: 487.94 iter/sec. Timings for 3072K FFT length (8 cores, 8 workers): 22.40, 22.16, 22.14, 22.43, 22.16, 22.10, 22.01, 22.36 ms. Throughput: 360.06 iter/sec. Timings for 3200K FFT length (8 cores, 1 worker): 2.24 ms. Throughput: 445.78 iter/sec. [Sat Dec 08 20:08:28 2018] Timings for 3200K FFT length (8 cores, 8 workers): 23.90, 23.59, 23.49, 23.54, 23.79, 23.57, 23.59, 23.66 ms. Throughput: 338.39 iter/sec. Timings for 3360K FFT length (8 cores, 1 worker): 2.46 ms. Throughput: 406.74 iter/sec. Timings for 3360K FFT length (8 cores, 8 workers): 24.61, 25.10, 24.56, 24.42, 24.27, 24.42, 24.51, 27.90 ms. Throughput: 320.95 iter/sec. Timings for 3456K FFT length (8 cores, 1 worker): 2.53 ms. Throughput: 395.58 iter/sec. Timings for 3456K FFT length (8 cores, 8 workers): 25.16, 25.16, 25.18, 25.13, 25.06, 25.25, 25.20, 25.48 ms. Throughput: 317.41 iter/sec. Timings for 3584K FFT length (8 cores, 1 worker): 2.71 ms. Throughput: 368.74 iter/sec. Timings for 3584K FFT length (8 cores, 8 workers): 25.79, 26.01, 26.02, 26.02, 26.02, 25.96, 26.03, 25.96 ms. Throughput: 307.98 iter/sec. Timings for 3840K FFT length (8 cores, 1 worker): 2.93 ms. Throughput: 341.31 iter/sec. Timings for 3840K FFT length (8 cores, 8 workers): 28.31, 28.36, 28.01, 28.34, 27.82, 28.33, 28.48, 28.44 ms. Throughput: 283.09 iter/sec. Timings for 4096K FFT length (8 cores, 1 worker): 3.17 ms. Throughput: 315.36 iter/sec. Timings for 4096K FFT length (8 cores, 8 workers): 29.30, 29.99, 30.19, 29.44, 29.62, 30.00, 29.71, 30.14 ms. Throughput: 268.49 iter/sec. Timings for 4480K FFT length (8 cores, 1 worker): 3.62 ms. Throughput: 276.38 iter/sec. Timings for 4480K FFT length (8 cores, 8 workers): 33.41, 33.05, 33.12, 32.98, 32.88, 33.52, 33.13, 33.25 ms. Throughput: 241.21 iter/sec. Timings for 4608K FFT length (8 cores, 1 worker): 3.68 ms. Throughput: 272.09 iter/sec. Timings for 4608K FFT length (8 cores, 8 workers): 33.85, 33.93, 33.77, 34.00, 33.78, 33.92, 33.60, 33.76 ms. Throughput: 236.51 iter/sec. Timings for 4800K FFT length (8 cores, 1 worker): 4.03 ms. Throughput: 248.35 iter/sec. Timings for 4800K FFT length (8 cores, 8 workers): 35.77, 35.43, 35.40, 35.56, 35.17, 35.15, 35.28, 35.19 ms. Throughput: 226.20 iter/sec. Timings for 5120K FFT length (8 cores, 1 worker): 4.23 ms. Throughput: 236.34 iter/sec. Timings for 5120K FFT length (8 cores, 8 workers): 37.17, 37.35, 37.13, 36.61, 37.15, 37.09, 37.23, 37.17 ms. Throughput: 215.57 iter/sec. [Sat Dec 08 20:13:32 2018] Timings for 5376K FFT length (8 cores, 1 worker): 4.52 ms. Throughput: 221.09 iter/sec. Timings for 5376K FFT length (8 cores, 8 workers): 38.98, 39.02, 39.32, 39.12, 39.03, 40.72, 39.60, 39.37 ms. Throughput: 203.11 iter/sec. Timings for 5760K FFT length (8 cores, 1 worker): 4.80 ms. Throughput: 208.24 iter/sec. Timings for 5760K FFT length (8 cores, 8 workers): 43.67, 42.70, 41.27, 41.90, 41.93, 41.31, 42.79, 42.66 ms. Throughput: 189.29 iter/sec. Timings for 6144K FFT length (8 cores, 1 worker): 5.15 ms. Throughput: 194.36 iter/sec. Timings for 6144K FFT length (8 cores, 8 workers): 45.19, 45.31, 44.79, 45.39, 44.42, 45.32, 45.00, 44.56 ms. Throughput: 177.80 iter/sec. Timings for 6400K FFT length (8 cores, 1 worker): 5.63 ms. Throughput: 177.48 iter/sec. Timings for 6400K FFT length (8 cores, 8 workers): 47.85, 48.36, 48.03, 47.93, 47.79, 48.01, 48.31, 48.07 ms. Throughput: 166.52 iter/sec. Timings for 6720K FFT length (8 cores, 1 worker): 5.65 ms. Throughput: 177.03 iter/sec. Timings for 6720K FFT length (8 cores, 8 workers): 49.33, 48.69, 49.38, 48.53, 48.40, 48.27, 48.60, 48.84 ms. Throughput: 164.09 iter/sec. Timings for 6912K FFT length (8 cores, 1 worker): 6.03 ms. Throughput: 165.73 iter/sec. Timings for 6912K FFT length (8 cores, 8 workers): 53.00, 50.55, 50.83, 50.25, 50.37, 52.23, 50.99, 51.21 ms. Throughput: 156.36 iter/sec. Timings for 7168K FFT length (8 cores, 1 worker): 6.07 ms. Throughput: 164.80 iter/sec. Timings for 7168K FFT length (8 cores, 8 workers): 52.21, 51.85, 51.73, 52.16, 52.03, 51.91, 51.57, 52.26 ms. Throughput: 153.96 iter/sec. Timings for 7680K FFT length (8 cores, 1 worker): 6.68 ms. Throughput: 149.59 iter/sec. Timings for 7680K FFT length (8 cores, 8 workers): 56.78, 56.50, 56.31, 55.16, 56.39, 55.90, 56.19, 56.42 ms. Throughput: 142.34 iter/sec. Timings for 8064K FFT length (8 cores, 1 worker): 7.04 ms. Throughput: 142.08 iter/sec. Timings for 8064K FFT length (8 cores, 8 workers): 58.85, 58.56, 58.47, 59.16, 58.93, 59.04, 58.63, 58.60 ms. Throughput: 136.11 iter/sec. Timings for 8192K FFT length (8 cores, 1 worker): 7.06 ms. Throughput: 141.62 iter/sec. [Sat Dec 08 20:18:46 2018] Timings for 8192K FFT length (8 cores, 8 workers): 59.48, 59.73, 59.74, 60.57, 59.29, 59.66, 59.70, 61.10 ms. Throughput: 133.55 iter/sec. |
![]() |
![]() |
![]() |
#783 |
Aug 2013
1308 Posts |
![]()
Seems like it can complete a 2^88,000,000-1 test in about 2.6 days
Code:
Intel(R) Core(TM) i7-9800X CPU @ 3.80GHz CPU speed: 3792.01 MHz, 8 hyperthreaded cores CPU features: Prefetchw, SSE, SSE2, SSE4, AVX, AVX2, FMA, AVX512F L1 cache size: 32 KB L2 cache size: 256 KB, L3 cache size: 16896 KB L1 cache line size: 64 bytes L2 cache line size: 64 bytes TLBS: 64 Machine topology as determined by hwloc library: Machine#0 (total=64533332KB, Backend=Windows, hwlocVersion=1.11.9, ProcessName=prime95.exe) NUMANode#0 (local=64533332KB, total=64533332KB) Package#0 (CPUVendor=GenuineIntel, CPUFamilyNumber=6, CPUModelNumber=85, CPUModel="Intel(R) Core(TM) i7-9800X CPU @ 3.80GHz", CPUStepping=4) L3 (size=16896KB, linesize=64, ways=11, Inclusive=0) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000003) PU#0 (cpuset: 0x00000001) PU#1 (cpuset: 0x00000002) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x0000000c) PU#2 (cpuset: 0x00000004) PU#3 (cpuset: 0x00000008) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000030) PU#4 (cpuset: 0x00000010) PU#5 (cpuset: 0x00000020) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x000000c0) PU#6 (cpuset: 0x00000040) PU#7 (cpuset: 0x00000080) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000300) PU#8 (cpuset: 0x00000100) PU#9 (cpuset: 0x00000200) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000c00) PU#10 (cpuset: 0x00000400) PU#11 (cpuset: 0x00000800) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00003000) PU#12 (cpuset: 0x00001000) PU#13 (cpuset: 0x00002000) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x0000c000) PU#14 (cpuset: 0x00004000) PU#15 (cpuset: 0x00008000) Prime95 64-bit version 29.4, RdtscTiming=1 Timing FFTs using 8 threads on 8 cores. Best time for 2048K FFT length: 1.033 ms., avg: 1.042 ms. Best time for 2304K FFT length: 1.094 ms., avg: 1.109 ms. Best time for 2400K FFT length: 1.197 ms., avg: 1.209 ms. Best time for 2560K FFT length: 1.290 ms., avg: 1.355 ms. Best time for 2688K FFT length: 1.305 ms., avg: 1.320 ms. Best time for 2880K FFT length: 1.432 ms., avg: 1.449 ms. Best time for 3072K FFT length: 1.533 ms., avg: 1.542 ms. Best time for 3200K FFT length: 1.605 ms., avg: 1.617 ms. Best time for 3360K FFT length: 1.809 ms., avg: 1.826 ms. Best time for 3456K FFT length: 1.791 ms., avg: 1.805 ms. Best time for 3584K FFT length: 1.826 ms., avg: 1.844 ms. Best time for 3840K FFT length: 1.931 ms., avg: 1.946 ms. Best time for 4096K FFT length: 2.093 ms., avg: 2.119 ms. Best time for 4480K FFT length: 2.267 ms., avg: 2.292 ms. Best time for 4608K FFT length: 2.345 ms., avg: 2.361 ms. Best time for 4800K FFT length: 2.529 ms., avg: 2.545 ms. Best time for 5120K FFT length: 2.655 ms., avg: 2.678 ms. Best time for 5376K FFT length: 2.746 ms., avg: 2.770 ms. Best time for 5760K FFT length: 3.097 ms., avg: 3.118 ms. Best time for 6144K FFT length: 3.222 ms., avg: 3.250 ms. Best time for 6400K FFT length: 3.422 ms., avg: 3.442 ms. Best time for 6720K FFT length: 3.744 ms., avg: 3.776 ms. Best time for 6912K FFT length: 3.792 ms., avg: 3.825 ms. Best time for 7168K FFT length: 3.872 ms., avg: 3.892 ms. Best time for 7680K FFT length: 4.122 ms., avg: 4.139 ms. Best time for 8064K FFT length: 4.537 ms., avg: 4.569 ms. Best time for 8192K FFT length: 4.514 ms., avg: 4.544 ms. Compare your results to other computers at http://www.mersenne.org/report_benchmarks Intel(R) Core(TM) i7-9800X CPU @ 3.80GHz CPU speed: 3792.00 MHz, 8 hyperthreaded cores CPU features: Prefetchw, SSE, SSE2, SSE4, AVX, AVX2, FMA, AVX512F L1 cache size: 32 KB L2 cache size: 256 KB, L3 cache size: 16896 KB L1 cache line size: 64 bytes L2 cache line size: 64 bytes TLBS: 64 Machine topology as determined by hwloc library: Machine#0 (total=64533332KB, Backend=Windows, hwlocVersion=1.11.9, ProcessName=prime95.exe) NUMANode#0 (local=64533332KB, total=64533332KB) Package#0 (CPUVendor=GenuineIntel, CPUFamilyNumber=6, CPUModelNumber=85, CPUModel="Intel(R) Core(TM) i7-9800X CPU @ 3.80GHz", CPUStepping=4) L3 (size=16896KB, linesize=64, ways=11, Inclusive=0) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000003) PU#0 (cpuset: 0x00000001) PU#1 (cpuset: 0x00000002) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x0000000c) PU#2 (cpuset: 0x00000004) PU#3 (cpuset: 0x00000008) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000030) PU#4 (cpuset: 0x00000010) PU#5 (cpuset: 0x00000020) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x000000c0) PU#6 (cpuset: 0x00000040) PU#7 (cpuset: 0x00000080) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000300) PU#8 (cpuset: 0x00000100) PU#9 (cpuset: 0x00000200) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000c00) PU#10 (cpuset: 0x00000400) PU#11 (cpuset: 0x00000800) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00003000) PU#12 (cpuset: 0x00001000) PU#13 (cpuset: 0x00002000) L2 (size=1024KB, linesize=64, ways=16, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x0000c000) PU#14 (cpuset: 0x00004000) PU#15 (cpuset: 0x00008000) Prime95 64-bit version 29.4, RdtscTiming=1 Timings for 2048K FFT length (8 cores, 1 worker): 1.04 ms. Throughput: 958.23 iter/sec. Timings for 2304K FFT length (8 cores, 1 worker): 1.11 ms. Throughput: 899.33 iter/sec. Timings for 2400K FFT length (8 cores, 1 worker): 1.22 ms. Throughput: 816.48 iter/sec. Timings for 2560K FFT length (8 cores, 1 worker): 1.32 ms. Throughput: 760.26 iter/sec. Timings for 2688K FFT length (8 cores, 1 worker): 1.33 ms. Throughput: 751.84 iter/sec. Timings for 2880K FFT length (8 cores, 1 worker): 1.46 ms. Throughput: 683.17 iter/sec. Timings for 3072K FFT length (8 cores, 1 worker): 1.54 ms. Throughput: 649.60 iter/sec. Timings for 3200K FFT length (8 cores, 1 worker): 1.62 ms. Throughput: 617.88 iter/sec. Timings for 3360K FFT length (8 cores, 1 worker): 1.80 ms. Throughput: 554.46 iter/sec. Timings for 3456K FFT length (8 cores, 1 worker): 1.79 ms. Throughput: 557.51 iter/sec. Timings for 3584K FFT length (8 cores, 1 worker): 1.85 ms. Throughput: 539.60 iter/sec. Timings for 3840K FFT length (8 cores, 1 worker): 1.95 ms. Throughput: 511.78 iter/sec. Timings for 4096K FFT length (8 cores, 1 worker): 2.11 ms. Throughput: 475.04 iter/sec. Timings for 4480K FFT length (8 cores, 1 worker): 2.30 ms. Throughput: 434.97 iter/sec. [Mon Dec 10 09:32:49 2018] Timings for 4608K FFT length (8 cores, 1 worker): 2.36 ms. Throughput: 423.04 iter/sec. Timings for 4800K FFT length (8 cores, 1 worker): 2.56 ms. Throughput: 389.99 iter/sec. Timings for 5120K FFT length (8 cores, 1 worker): 2.72 ms. Throughput: 367.83 iter/sec. Timings for 5376K FFT length (8 cores, 1 worker): 2.77 ms. Throughput: 360.57 iter/sec. Timings for 5760K FFT length (8 cores, 1 worker): 3.11 ms. Throughput: 321.75 iter/sec. Timings for 6144K FFT length (8 cores, 1 worker): 3.25 ms. Throughput: 308.12 iter/sec. Timings for 6400K FFT length (8 cores, 1 worker): 3.45 ms. Throughput: 290.16 iter/sec. Timings for 6720K FFT length (8 cores, 1 worker): 3.73 ms. Throughput: 268.19 iter/sec. Timings for 6912K FFT length (8 cores, 1 worker): 3.81 ms. Throughput: 262.29 iter/sec. Timings for 7168K FFT length (8 cores, 1 worker): 3.93 ms. Throughput: 254.74 iter/sec. Timings for 7680K FFT length (8 cores, 1 worker): 4.15 ms. Throughput: 241.06 iter/sec. Timings for 8064K FFT length (8 cores, 1 worker): 4.57 ms. Throughput: 218.77 iter/sec. Timings for 8192K FFT length (8 cores, 1 worker): 4.57 ms. Throughput: 218.81 iter/sec. |
![]() |
![]() |
![]() |
#784 |
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
163138 Posts |
![]()
Quadro NVS295, 1.74Ghz-D/day. This is on a 1x/16x PCIE extender to make use of the pcie 1x slot in the motherboard. Requires registry edit to lengthen TdrDelay.
Code:
mfaktc v0.21 (64bit built) Compiletime options THREADS_PER_BLOCK 256 SIEVE_SIZE_LIMIT 32kiB SIEVE_SIZE 193154bits SIEVE_SPLIT 250 MORE_CLASSES enabled Runtime options SievePrimes 25000 SievePrimesAdjust 1 SievePrimesMin 5000 SievePrimesMax 100000 NumStreams 3 CPUStreams 3 GridSize 3 GPU Sieving enabled GPUSievePrimes 82486 GPUSieveSize 64Mi bits GPUSieveProcessSize 16Ki bits Checkpoints enabled CheckpointDelay 900s WorkFileAddDelay 3600s Stages enabled StopAfterFactor bitlevel PrintMode full V5UserID kriesel ComputerID eaglet-nvs295 AllowSleep no TimeStampInResults yes CUDA version info binary compiled for CUDA 6.50 CUDA runtime version 6.50 CUDA driver version 6.50 CUDA device info name Quadro NVS 295 compute capability 1.1 max threads per block 512 max shared memory per MP 16384 byte number of multiprocessors 1 CUDA cores per MP 8 CUDA cores - total 8 clock rate (CUDA cores) 1300MHz memory clock rate: 695MHz memory bus width: 64 bit Automatic parameters threads per grid 1048576 GPUSievePrimes (adjusted) 82486 GPUsieve minimum exponent 1055144 running a simple selftest... Selftest statistics number of tests 107 successfull tests 107 selftest PASSED! got assignment: exp=119998999 bit_min=72 bit_max=73 (7.97 GHz-days) Starting trial factoring M119998999 from 2^72 to 2^73 (7.97 GHz-days) k_min = 19676691147960 k_max = 39353382296711 Using GPU kernel "barrett76_mul32_gs" Date Time | class Pct | time ETA | GHz-d/day Sieve Wait Jul 02 21:05 | 0 0.1% | 411.97 4d13h | 1.74 82485 n.a.% Jul 02 21:11 | 5 0.2% | 411.85 4d13h | 1.74 82485 n.a.% Jul 02 21:18 | 9 0.3% | 411.49 4d13h | 1.74 82485 n.a.% Jul 02 21:25 | 12 0.4% | 411.39 4d13h | 1.74 82485 n.a.% Last fiddled with by kriesel on 2019-07-03 at 02:59 |
![]() |
![]() |
![]() |
#785 |
"Sam"
Jun 2019
California, USA
468 Posts |
![]()
Good energy efficiency at this throughput with the CPU consuming 35W.
|
![]() |
![]() |
![]() |
#786 |
Nov 2008
916 Posts |
![]()
Seems like AMD has a quite OK CPU for mprime now...
Throughput-Test: Code:
AMD Ryzen 9 3900X 12-Core Processor CPU speed: 4239.72 MHz, 12 hyperthreaded cores CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 12x32 KB, L2 cache size: 12x512 KB, L3 cache size: 4x16 MB L1 cache line size: 64 bytes, L2 cache line size: 64 bytes Machine topology as determined by hwloc library: Machine#0 (total=29788576KB, Backend=Windows, hwlocVersion=2.0.4, ProcessName=prime95.exe) Package (total=29788576KB, CPUVendor=AuthenticAMD, CPUFamilyNumber=23, CPUModelNumber=113, CPUModel="AMD Ryzen 9 3900X 12-Core Processor ", CPUStepping=0) L3 (size=16384KB, linesize=64, ways=16, Inclusive=0) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000003) PU#0 (cpuset: 0x00000001) PU#1 (cpuset: 0x00000002) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x0000000c) PU#2 (cpuset: 0x00000004) PU#3 (cpuset: 0x00000008) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000030) PU#4 (cpuset: 0x00000010) PU#5 (cpuset: 0x00000020) L3 (size=16384KB, linesize=64, ways=16, Inclusive=0) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x000000c0) PU#6 (cpuset: 0x00000040) PU#7 (cpuset: 0x00000080) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000300) PU#8 (cpuset: 0x00000100) PU#9 (cpuset: 0x00000200) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000c00) PU#10 (cpuset: 0x00000400) PU#11 (cpuset: 0x00000800) L3 (size=16384KB, linesize=64, ways=16, Inclusive=0) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00003000) PU#12 (cpuset: 0x00001000) PU#13 (cpuset: 0x00002000) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x0000c000) PU#14 (cpuset: 0x00004000) PU#15 (cpuset: 0x00008000) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00030000) PU#16 (cpuset: 0x00010000) PU#17 (cpuset: 0x00020000) L3 (size=16384KB, linesize=64, ways=16, Inclusive=0) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x000c0000) PU#18 (cpuset: 0x00040000) PU#19 (cpuset: 0x00080000) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00300000) PU#20 (cpuset: 0x00100000) PU#21 (cpuset: 0x00200000) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00c00000) PU#22 (cpuset: 0x00400000) PU#23 (cpuset: 0x00800000) Prime95 64-bit version 29.8, RdtscTiming=1 Timings for 2048K FFT length (12 cores, 1 worker): 1.12 ms. Throughput: 893.57 iter/sec. Timings for 2048K FFT length (12 cores, 4 workers): 2.75, 2.72, 2.74, 2.82 ms. Throughput: 1450.51 iter/sec. Timings for 2048K FFT length (12 cores, 12 workers): 21.92, 21.87, 22.31, 22.07, 22.08, 22.06, 21.93, 21.93, 21.94, 22.68, 21.97, 21.73 ms. Throughput: 544.49 iter/sec. Timings for 2240K FFT length (12 cores, 1 worker): 1.04 ms. Throughput: 957.66 iter/sec. Timings for 2240K FFT length (12 cores, 4 workers): 4.32, 4.25, 4.37, 4.41 ms. Throughput: 922.99 iter/sec. Timings for 2240K FFT length (12 cores, 12 workers): 24.77, 24.56, 24.65, 24.49, 24.73, 24.55, 24.51, 24.48, 24.45, 25.31, 24.73, 24.47 ms. Throughput: 487.01 iter/sec. Timings for 2304K FFT length (12 cores, 1 worker): 1.06 ms. Throughput: 942.24 iter/sec. [Mon Dec 02 19:13:17 2019] Timings for 2304K FFT length (12 cores, 4 workers): 4.82, 4.70, 4.73, 4.90 ms. Throughput: 835.64 iter/sec. Timings for 2304K FFT length (12 cores, 12 workers): 25.45, 25.22, 25.44, 25.41, 25.43, 25.48, 25.15, 25.17, 25.18, 26.10, 25.52, 25.16 ms. Throughput: 472.62 iter/sec. Timings for 2400K FFT length (12 cores, 1 worker): 1.13 ms. Throughput: 881.23 iter/sec. Timings for 2400K FFT length (12 cores, 4 workers): 5.70, 5.61, 5.69, 5.82 ms. Throughput: 700.99 iter/sec. Timings for 2400K FFT length (12 cores, 12 workers): 26.79, 26.72, 26.79, 26.84, 26.80, 26.78, 26.42, 26.39, 26.42, 27.35, 26.67, 26.64 ms. Throughput: 449.18 iter/sec. Timings for 2560K FFT length (12 cores, 1 worker): 1.20 ms. Throughput: 835.60 iter/sec. Timings for 2560K FFT length (12 cores, 4 workers): 6.19, 5.98, 6.16, 6.19 ms. Throughput: 652.72 iter/sec. Timings for 2560K FFT length (12 cores, 12 workers): 28.10, 27.90, 28.05, 28.04, 28.20, 28.10, 27.92, 27.91, 27.91, 28.74, 27.85, 27.80 ms. Throughput: 427.97 iter/sec. Timings for 2688K FFT length (12 cores, 1 worker): 1.22 ms. Throughput: 817.77 iter/sec. Timings for 2688K FFT length (12 cores, 4 workers): 7.51, 7.05, 7.13, 7.35 ms. Throughput: 551.39 iter/sec. [Mon Dec 02 19:18:23 2019] Timings for 2688K FFT length (12 cores, 12 workers): 29.97, 29.81, 30.32, 29.98, 30.07, 29.95, 29.71, 29.68, 29.68, 30.92, 29.87, 29.76 ms. Throughput: 400.36 iter/sec. Timings for 2800K FFT length (12 cores, 1 worker): 1.31 ms. Throughput: 763.36 iter/sec. Timings for 2800K FFT length (12 cores, 4 workers): 8.00, 7.96, 7.97, 8.08 ms. Throughput: 499.88 iter/sec. Timings for 2800K FFT length (12 cores, 12 workers): 31.66, 31.52, 31.62, 31.62, 31.66, 31.67, 31.44, 31.31, 31.40, 32.27, 31.56, 31.35 ms. Throughput: 379.87 iter/sec. Timings for 2880K FFT length (12 cores, 1 worker): 1.36 ms. Throughput: 733.59 iter/sec. Timings for 2880K FFT length (12 cores, 4 workers): 8.44, 8.24, 8.17, 8.47 ms. Throughput: 480.35 iter/sec. Timings for 2880K FFT length (12 cores, 12 workers): 32.23, 32.10, 32.36, 32.28, 32.48, 32.38, 31.98, 32.20, 31.97, 33.02, 32.38, 31.98 ms. Throughput: 371.77 iter/sec. Timings for 3072K FFT length (12 cores, 1 worker): 1.49 ms. Throughput: 669.26 iter/sec. Timings for 3072K FFT length (12 cores, 4 workers): 9.20, 9.09, 9.10, 9.22 ms. Throughput: 437.06 iter/sec. Timings for 3072K FFT length (12 cores, 12 workers): 34.09, 33.95, 34.37, 34.06, 34.13, 34.11, 33.83, 33.65, 33.81, 35.26, 34.33, 33.92 ms. Throughput: 351.68 iter/sec. [Mon Dec 02 19:23:30 2019] Timings for 3200K FFT length (12 cores, 1 worker): 1.49 ms. Throughput: 672.05 iter/sec. Timings for 3200K FFT length (12 cores, 4 workers): 9.60, 9.39, 9.53, 9.61 ms. Throughput: 419.61 iter/sec. Timings for 3200K FFT length (12 cores, 12 workers): 35.56, 35.38, 35.53, 35.49, 35.58, 35.56, 35.42, 35.39, 35.31, 35.96, 35.62, 35.03 ms. Throughput: 338.18 iter/sec. Timings for 3360K FFT length (12 cores, 1 worker): 1.49 ms. Throughput: 669.62 iter/sec. Timings for 3360K FFT length (12 cores, 4 workers): 10.70, 10.57, 10.56, 10.64 ms. Throughput: 376.76 iter/sec. Timings for 3360K FFT length (12 cores, 12 workers): 37.94, 37.78, 38.10, 37.91, 38.13, 37.94, 37.65, 37.58, 37.63, 38.72, 38.25, 37.66 ms. Throughput: 316.31 iter/sec. Timings for 3584K FFT length (12 cores, 1 worker): 1.69 ms. Throughput: 591.59 iter/sec. Timings for 3584K FFT length (12 cores, 4 workers): 11.35, 11.43, 11.30, 11.46 ms. Throughput: 351.35 iter/sec. Timings for 3584K FFT length (12 cores, 12 workers): 40.37, 39.68, 39.90, 40.04, 39.86, 39.87, 39.73, 39.53, 39.73, 40.92, 39.82, 39.79 ms. Throughput: 300.50 iter/sec. Timings for 3840K FFT length (12 cores, 1 worker): 1.74 ms. Throughput: 576.01 iter/sec. [Mon Dec 02 19:28:38 2019] Timings for 3840K FFT length (12 cores, 4 workers): 12.57, 12.50, 12.41, 12.56 ms. Throughput: 319.71 iter/sec. Timings for 3840K FFT length (12 cores, 12 workers): 43.17, 42.85, 43.46, 43.19, 43.60, 43.30, 43.12, 43.13, 43.09, 44.42, 43.16, 42.47 ms. Throughput: 277.51 iter/sec. Timings for 4096K FFT length (12 cores, 1 worker): 1.93 ms. Throughput: 519.20 iter/sec. Timings for 4096K FFT length (12 cores, 4 workers): 13.82, 13.72, 13.63, 13.89 ms. Throughput: 290.64 iter/sec. Timings for 4096K FFT length (12 cores, 12 workers): 45.88, 45.30, 45.93, 45.72, 45.81, 45.81, 45.46, 45.42, 45.52, 46.72, 45.88, 45.34 ms. Throughput: 262.41 iter/sec. Timings for 4480K FFT length (12 cores, 1 worker): 2.06 ms. Throughput: 485.48 iter/sec. Timings for 4480K FFT length (12 cores, 4 workers): 15.46, 15.40, 15.36, 15.69 ms. Throughput: 258.44 iter/sec. Timings for 4480K FFT length (12 cores, 12 workers): 51.14, 50.93, 51.53, 51.09, 51.28, 51.14, 50.73, 50.61, 50.73, 52.62, 51.07, 50.62 ms. Throughput: 234.75 iter/sec. Timings for 4608K FFT length (12 cores, 1 worker): 2.08 ms. Throughput: 480.39 iter/sec. Timings for 4608K FFT length (12 cores, 4 workers): 15.81, 15.79, 15.66, 15.97 ms. Throughput: 253.09 iter/sec. [Mon Dec 02 19:33:48 2019] Timings for 4608K FFT length (12 cores, 12 workers): 52.06, 51.76, 52.24, 52.10, 52.30, 52.30, 52.14, 51.92, 51.98, 53.46, 51.77, 51.47 ms. Throughput: 230.23 iter/sec. Timings for 4800K FFT length (12 cores, 1 worker): 2.24 ms. Throughput: 446.97 iter/sec. Timings for 4800K FFT length (12 cores, 4 workers): 16.47, 16.41, 16.45, 16.45 ms. Throughput: 243.21 iter/sec. Timings for 4800K FFT length (12 cores, 12 workers): 54.42, 53.86, 54.01, 54.13, 54.65, 54.19, 54.08, 54.11, 54.04, 55.34, 53.77, 53.52 ms. Throughput: 221.51 iter/sec. Timings for 5120K FFT length (12 cores, 1 worker): 2.28 ms. Throughput: 439.02 iter/sec. Timings for 5120K FFT length (12 cores, 4 workers): 17.74, 17.72, 17.74, 17.74 ms. Throughput: 225.57 iter/sec. Timings for 5120K FFT length (12 cores, 12 workers): 57.75, 57.34, 57.90, 57.71, 58.00, 57.86, 57.53, 57.61, 57.55, 58.87, 57.57, 56.88 ms. Throughput: 207.93 iter/sec. Timings for 5376K FFT length (12 cores, 1 worker): 2.43 ms. Throughput: 412.23 iter/sec. Timings for 5376K FFT length (12 cores, 4 workers): 18.74, 18.72, 18.73, 18.73 ms. Throughput: 213.55 iter/sec. Timings for 5376K FFT length (12 cores, 12 workers): 60.16, 59.70, 60.45, 60.12, 60.29, 60.44, 60.11, 60.27, 60.21, 61.21, 60.17, 59.42 ms. Throughput: 199.30 iter/sec. [Mon Dec 02 19:38:59 2019] Timings for 5600K FFT length (12 cores, 1 worker): 2.55 ms. Throughput: 391.78 iter/sec. Timings for 5600K FFT length (12 cores, 4 workers): 19.84, 19.85, 19.85, 19.88 ms. Throughput: 201.44 iter/sec. Timings for 5600K FFT length (12 cores, 12 workers): 63.53, 62.96, 63.67, 63.29, 63.39, 63.18, 62.95, 62.71, 62.97, 64.61, 63.14, 62.97 ms. Throughput: 189.64 iter/sec. Timings for 5760K FFT length (12 cores, 1 worker): 2.69 ms. Throughput: 371.66 iter/sec. Timings for 5760K FFT length (12 cores, 4 workers): 21.13, 20.93, 20.91, 21.07 ms. Throughput: 190.37 iter/sec. Timings for 5760K FFT length (12 cores, 12 workers): 65.81, 65.36, 66.20, 65.83, 65.75, 65.72, 65.32, 65.54, 64.88, 67.88, 65.54, 65.59 ms. Throughput: 182.43 iter/sec. Timings for 6144K FFT length (12 cores, 1 worker): 2.87 ms. Throughput: 348.96 iter/sec. Timings for 6144K FFT length (12 cores, 4 workers): 21.77, 21.76, 21.76, 21.76 ms. Throughput: 183.78 iter/sec. Timings for 6144K FFT length (12 cores, 12 workers): 69.00, 68.32, 68.96, 69.09, 69.06, 69.15, 68.57, 68.86, 69.11, 70.14, 68.17, 67.99 ms. Throughput: 174.26 iter/sec. Timings for 6400K FFT length (12 cores, 1 worker): 2.98 ms. Throughput: 336.13 iter/sec. [Mon Dec 02 19:44:12 2019] Timings for 6400K FFT length (12 cores, 4 workers): 23.01, 23.02, 22.97, 22.91 ms. Throughput: 174.09 iter/sec. Timings for 6400K FFT length (12 cores, 12 workers): 72.24, 71.64, 72.34, 72.18, 72.03, 72.29, 71.89, 71.82, 72.03, 72.80, 72.07, 71.39 ms. Throughput: 166.53 iter/sec. Timings for 6720K FFT length (12 cores, 1 worker): 3.34 ms. Throughput: 299.21 iter/sec. Timings for 6720K FFT length (12 cores, 4 workers): 24.48, 24.63, 24.43, 24.44 ms. Throughput: 163.30 iter/sec. Timings for 6720K FFT length (12 cores, 12 workers): 75.94, 75.65, 76.91, 76.22, 76.27, 76.24, 75.89, 75.45, 75.92, 77.83, 76.30, 75.74 ms. Throughput: 157.50 iter/sec. Timings for 7168K FFT length (12 cores, 1 worker): 3.80 ms. Throughput: 262.89 iter/sec. Timings for 7168K FFT length (12 cores, 4 workers): 25.84, 25.86, 25.86, 25.84 ms. Throughput: 154.74 iter/sec. Timings for 7168K FFT length (12 cores, 12 workers): 80.85, 79.94, 80.52, 80.73, 80.37, 80.54, 80.18, 80.20, 80.30, 82.08, 80.22, 79.41 ms. Throughput: 149.18 iter/sec. Timings for 7680K FFT length (12 cores, 1 worker): 4.80 ms. Throughput: 208.34 iter/sec. Timings for 7680K FFT length (12 cores, 4 workers): 28.67, 28.69, 28.65, 28.67 ms. Throughput: 139.52 iter/sec. [Mon Dec 02 19:49:28 2019] Timings for 7680K FFT length (12 cores, 12 workers): 88.35, 87.48, 88.29, 88.06, 88.40, 88.15, 87.51, 87.06, 87.58, 90.22, 87.87, 87.14 ms. Throughput: 136.36 iter/sec. Timings for 8000K FFT length (12 cores, 1 worker): 4.77 ms. Throughput: 209.53 iter/sec. Timings for 8000K FFT length (12 cores, 4 workers): 29.32, 29.42, 29.49, 29.26 ms. Throughput: 136.18 iter/sec. Timings for 8000K FFT length (12 cores, 12 workers): 90.34, 89.92, 91.53, 90.77, 91.25, 90.92, 90.71, 90.60, 90.98, 92.36, 90.38, 89.06 ms. Throughput: 132.26 iter/sec. Timings for 8064K FFT length (12 cores, 1 worker): 4.82 ms. Throughput: 207.44 iter/sec. Timings for 8064K FFT length (12 cores, 4 workers): 29.42, 29.50, 29.47, 29.48 ms. Throughput: 135.73 iter/sec. Timings for 8064K FFT length (12 cores, 12 workers): 90.63, 89.95, 91.74, 91.02, 91.00, 91.26, 90.67, 90.72, 90.87, 91.64, 89.91, 89.87 ms. Throughput: 132.21 iter/sec. Timings for 8192K FFT length (12 cores, 1 worker): 5.23 ms. Throughput: 191.24 iter/sec. Timings for 8192K FFT length (12 cores, 4 workers): 30.05, 30.39, 30.18, 30.38 ms. Throughput: 132.25 iter/sec. Timings for 8192K FFT length (12 cores, 12 workers): 92.85, 91.51, 92.40, 92.79, 92.76, 92.82, 92.59, 92.26, 92.58, 93.38, 91.43, 91.16 ms. Throughput: 129.91 iter/sec. Code:
AMD Ryzen 9 3900X 12-Core Processor CPU speed: 4217.00 MHz, 12 hyperthreaded cores CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 12x32 KB, L2 cache size: 12x512 KB, L3 cache size: 4x16 MB L1 cache line size: 64 bytes, L2 cache line size: 64 bytes Machine topology as determined by hwloc library: Machine#0 (total=29788576KB, Backend=Windows, hwlocVersion=2.0.4, ProcessName=prime95.exe) Package (total=29788576KB, CPUVendor=AuthenticAMD, CPUFamilyNumber=23, CPUModelNumber=113, CPUModel="AMD Ryzen 9 3900X 12-Core Processor ", CPUStepping=0) L3 (size=16384KB, linesize=64, ways=16, Inclusive=0) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000003) PU#0 (cpuset: 0x00000001) PU#1 (cpuset: 0x00000002) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x0000000c) PU#2 (cpuset: 0x00000004) PU#3 (cpuset: 0x00000008) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000030) PU#4 (cpuset: 0x00000010) PU#5 (cpuset: 0x00000020) L3 (size=16384KB, linesize=64, ways=16, Inclusive=0) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x000000c0) PU#6 (cpuset: 0x00000040) PU#7 (cpuset: 0x00000080) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000300) PU#8 (cpuset: 0x00000100) PU#9 (cpuset: 0x00000200) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000c00) PU#10 (cpuset: 0x00000400) PU#11 (cpuset: 0x00000800) L3 (size=16384KB, linesize=64, ways=16, Inclusive=0) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00003000) PU#12 (cpuset: 0x00001000) PU#13 (cpuset: 0x00002000) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x0000c000) PU#14 (cpuset: 0x00004000) PU#15 (cpuset: 0x00008000) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00030000) PU#16 (cpuset: 0x00010000) PU#17 (cpuset: 0x00020000) L3 (size=16384KB, linesize=64, ways=16, Inclusive=0) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x000c0000) PU#18 (cpuset: 0x00040000) PU#19 (cpuset: 0x00080000) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00300000) PU#20 (cpuset: 0x00100000) PU#21 (cpuset: 0x00200000) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00c00000) PU#22 (cpuset: 0x00400000) PU#23 (cpuset: 0x00800000) Prime95 64-bit version 29.8, RdtscTiming=1 Timing FFTs using 12 threads on 12 cores. Best time for 2048K FFT length: 1.262 ms., avg: 1.309 ms. Best time for 2240K FFT length: 1.025 ms., avg: 1.042 ms. Best time for 2304K FFT length: 1.107 ms., avg: 1.122 ms. Best time for 2400K FFT length: 1.171 ms., avg: 1.180 ms. Best time for 2560K FFT length: 1.163 ms., avg: 1.178 ms. Best time for 2688K FFT length: 1.225 ms., avg: 1.253 ms. Best time for 2800K FFT length: 1.323 ms., avg: 1.340 ms. Best time for 2880K FFT length: 1.416 ms., avg: 1.443 ms. Best time for 3072K FFT length: 1.483 ms., avg: 1.524 ms. Best time for 3200K FFT length: 1.463 ms., avg: 1.478 ms. Best time for 3360K FFT length: 1.461 ms., avg: 1.478 ms. Best time for 3584K FFT length: 1.665 ms., avg: 1.680 ms. Best time for 3840K FFT length: 1.706 ms., avg: 1.743 ms. Best time for 4096K FFT length: 1.887 ms., avg: 1.927 ms. Best time for 4480K FFT length: 2.074 ms., avg: 2.108 ms. Best time for 4608K FFT length: 2.061 ms., avg: 2.087 ms. Best time for 4800K FFT length: 2.136 ms., avg: 2.161 ms. Best time for 5120K FFT length: 2.215 ms., avg: 2.251 ms. Best time for 5376K FFT length: 2.411 ms., avg: 2.447 ms. Best time for 5600K FFT length: 2.566 ms., avg: 2.604 ms. Best time for 5760K FFT length: 2.620 ms., avg: 2.741 ms. Best time for 6144K FFT length: 2.788 ms., avg: 2.853 ms. Best time for 6400K FFT length: 3.047 ms., avg: 3.214 ms. Best time for 6720K FFT length: 3.212 ms., avg: 3.355 ms. Best time for 7168K FFT length: 3.660 ms., avg: 3.755 ms. Best time for 7680K FFT length: 4.364 ms., avg: 4.562 ms. Best time for 8000K FFT length: 4.483 ms., avg: 4.617 ms. Best time for 8064K FFT length: 4.731 ms., avg: 4.860 ms. Best time for 8192K FFT length: 4.974 ms., avg: 5.100 ms. Code:
AMD Ryzen 9 3900X 12-Core Processor CPU speed: 4216.22 MHz, 12 hyperthreaded cores CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 12x32 KB, L2 cache size: 12x512 KB, L3 cache size: 4x16 MB L1 cache line size: 64 bytes, L2 cache line size: 64 bytes Machine topology as determined by hwloc library: Machine#0 (total=29788576KB, Backend=Windows, hwlocVersion=2.0.4, ProcessName=prime95.exe) Package (total=29788576KB, CPUVendor=AuthenticAMD, CPUFamilyNumber=23, CPUModelNumber=113, CPUModel="AMD Ryzen 9 3900X 12-Core Processor ", CPUStepping=0) L3 (size=16384KB, linesize=64, ways=16, Inclusive=0) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000003) PU#0 (cpuset: 0x00000001) PU#1 (cpuset: 0x00000002) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x0000000c) PU#2 (cpuset: 0x00000004) PU#3 (cpuset: 0x00000008) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000030) PU#4 (cpuset: 0x00000010) PU#5 (cpuset: 0x00000020) L3 (size=16384KB, linesize=64, ways=16, Inclusive=0) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x000000c0) PU#6 (cpuset: 0x00000040) PU#7 (cpuset: 0x00000080) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000300) PU#8 (cpuset: 0x00000100) PU#9 (cpuset: 0x00000200) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000c00) PU#10 (cpuset: 0x00000400) PU#11 (cpuset: 0x00000800) L3 (size=16384KB, linesize=64, ways=16, Inclusive=0) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00003000) PU#12 (cpuset: 0x00001000) PU#13 (cpuset: 0x00002000) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x0000c000) PU#14 (cpuset: 0x00004000) PU#15 (cpuset: 0x00008000) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00030000) PU#16 (cpuset: 0x00010000) PU#17 (cpuset: 0x00020000) L3 (size=16384KB, linesize=64, ways=16, Inclusive=0) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x000c0000) PU#18 (cpuset: 0x00040000) PU#19 (cpuset: 0x00080000) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00300000) PU#20 (cpuset: 0x00100000) PU#21 (cpuset: 0x00200000) L2 (size=512KB, linesize=64, ways=8, Inclusive=1) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00c00000) PU#22 (cpuset: 0x00400000) PU#23 (cpuset: 0x00800000) Prime95 64-bit version 29.8, RdtscTiming=1 Best time for 61 bit trial factors: 0.686 ms. Best time for 62 bit trial factors: 0.711 ms. Best time for 63 bit trial factors: 0.705 ms. Best time for 64 bit trial factors: 0.708 ms. Best time for 65 bit trial factors: 0.706 ms. Best time for 66 bit trial factors: 0.698 ms. Best time for 67 bit trial factors: 0.695 ms. Best time for 75 bit trial factors: 0.694 ms. Best time for 76 bit trial factors: 0.686 ms. Best time for 77 bit trial factors: 0.693 ms. |
![]() |
![]() |
![]() |
#787 |
"Oliver"
Mar 2005
Germany
21328 Posts |
![]()
Hi,
you might want to include benchmarks for 2 workers and it is even better for certain ranges (FFT data fits twice into L3 cache(s) but not 4 times). Current LL Doublecheck fall into this range! Stock Ryzen 9 3900X with dual DDR4-3200 (dual rank): Code:
Prime95 64-bit version 29.8, RdtscTiming=1 Timings for 2880K FFT length (12 cores, 1 worker): 1.33 ms. Throughput: 750.80 iter/sec. Timings for 2880K FFT length (12 cores, 2 workers): 2.10, 2.10 ms. Throughput: 954.10 iter/sec. |
![]() |
![]() |
![]() |
#788 |
"Oliver"
Mar 2005
Germany
2×557 Posts |
![]()
Hi, some fun with my Ryzen 9 3900X, I think most impressive is part 3!
BIOS defaults (142 W PPT), dual channel DDR4-2400 (dual rank): 2048K, 5760K, 6144K and 6400K flawed by some background processes (e.g. Windows Update?) Code:
Prime95 64-bit version 29.8, RdtscTiming=1 Timings for 2048K FFT length (12 cores, 1 worker): 1.73 ms. Throughput: 579.64 iter/sec. Timings for 2048K FFT length (12 cores, 2 workers): 1.77, 1.77 ms. Throughput: 1127.98 iter/sec. Timings for 2240K FFT length (12 cores, 1 worker): 1.38 ms. Throughput: 724.20 iter/sec. Timings for 2240K FFT length (12 cores, 2 workers): 1.88, 1.89 ms. Throughput: 1061.54 iter/sec. Timings for 2304K FFT length (12 cores, 1 worker): 1.45 ms. Throughput: 689.66 iter/sec. Timings for 2304K FFT length (12 cores, 2 workers): 1.95, 1.95 ms. Throughput: 1024.97 iter/sec. Timings for 2400K FFT length (12 cores, 1 worker): 1.51 ms. Throughput: 663.27 iter/sec. Timings for 2400K FFT length (12 cores, 2 workers): 2.12, 2.06 ms. Throughput: 957.64 iter/sec. Timings for 2560K FFT length (12 cores, 1 worker): 1.53 ms. Throughput: 654.30 iter/sec. Timings for 2560K FFT length (12 cores, 2 workers): 2.12, 2.13 ms. Throughput: 940.81 iter/sec. Timings for 2688K FFT length (12 cores, 1 worker): 1.65 ms. Throughput: 605.34 iter/sec. Timings for 2688K FFT length (12 cores, 2 workers): 2.31, 2.27 ms. Throughput: 873.05 iter/sec. Timings for 2800K FFT length (12 cores, 1 worker): 2.36 ms. Throughput: 423.44 iter/sec. Timings for 2800K FFT length (12 cores, 2 workers): 2.38, 2.38 ms. Throughput: 839.83 iter/sec. Timings for 2880K FFT length (12 cores, 1 worker): 1.76 ms. Throughput: 567.54 iter/sec. Timings for 2880K FFT length (12 cores, 2 workers): 2.50, 2.53 ms. Throughput: 795.85 iter/sec. Timings for 3072K FFT length (12 cores, 1 worker): 2.24 ms. Throughput: 446.77 iter/sec. Timings for 3072K FFT length (12 cores, 2 workers): 3.37, 3.22 ms. Throughput: 607.02 iter/sec. Timings for 3200K FFT length (12 cores, 1 worker): 2.09 ms. Throughput: 478.34 iter/sec. Timings for 3200K FFT length (12 cores, 2 workers): 3.43, 3.38 ms. Throughput: 586.73 iter/sec. Timings for 3360K FFT length (12 cores, 1 worker): 2.21 ms. Throughput: 452.61 iter/sec. Timings for 3360K FFT length (12 cores, 2 workers): 3.52, 3.74 ms. Throughput: 551.65 iter/sec. Timings for 3584K FFT length (12 cores, 1 worker): 2.35 ms. Throughput: 425.02 iter/sec. Timings for 3584K FFT length (12 cores, 2 workers): 3.96, 4.31 ms. Throughput: 484.73 iter/sec. Timings for 3840K FFT length (12 cores, 1 worker): 2.39 ms. Throughput: 418.29 iter/sec. Timings for 3840K FFT length (12 cores, 2 workers): 4.65, 5.26 ms. Throughput: 405.50 iter/sec. Timings for 4096K FFT length (12 cores, 1 worker): 2.69 ms. Throughput: 372.10 iter/sec. Timings for 4096K FFT length (12 cores, 2 workers): 5.38, 5.87 ms. Throughput: 356.12 iter/sec. Timings for 4480K FFT length (12 cores, 1 worker): 3.06 ms. Throughput: 326.65 iter/sec. Timings for 4480K FFT length (12 cores, 2 workers): 7.02, 7.73 ms. Throughput: 271.71 iter/sec. Timings for 4608K FFT length (12 cores, 1 worker): 2.83 ms. Throughput: 353.26 iter/sec. Timings for 4608K FFT length (12 cores, 2 workers): 7.43, 7.13 ms. Throughput: 274.79 iter/sec. Timings for 4800K FFT length (12 cores, 1 worker): 2.87 ms. Throughput: 347.90 iter/sec. Timings for 4800K FFT length (12 cores, 2 workers): 7.08, 7.92 ms. Throughput: 267.43 iter/sec. Timings for 5120K FFT length (12 cores, 1 worker): 3.00 ms. Throughput: 332.95 iter/sec. Timings for 5120K FFT length (12 cores, 2 workers): 8.08, 9.03 ms. Throughput: 234.51 iter/sec. Timings for 5376K FFT length (12 cores, 1 worker): 3.00 ms. Throughput: 333.79 iter/sec. Timings for 5376K FFT length (12 cores, 2 workers): 8.87, 9.06 ms. Throughput: 223.13 iter/sec. Timings for 5600K FFT length (12 cores, 1 worker): 3.36 ms. Throughput: 297.95 iter/sec. Timings for 5600K FFT length (12 cores, 2 workers): 9.97, 9.96 ms. Throughput: 200.63 iter/sec. Timings for 5760K FFT length (12 cores, 1 worker): 3.49 ms. Throughput: 286.59 iter/sec. Timings for 5760K FFT length (12 cores, 2 workers): 11.58, 10.94 ms. Throughput: 177.71 iter/sec. Timings for 6144K FFT length (12 cores, 1 worker): 3.56 ms. Throughput: 280.75 iter/sec. Timings for 6144K FFT length (12 cores, 2 workers): 20.44, 20.29 ms. Throughput: 98.21 iter/sec. Timings for 6400K FFT length (12 cores, 1 worker): 15.05 ms. Throughput: 66.43 iter/sec. Timings for 6400K FFT length (12 cores, 2 workers): 45.33, 19.30 ms. Throughput: 73.89 iter/sec. Timings for 6720K FFT length (12 cores, 1 worker): 3.92 ms. Throughput: 254.78 iter/sec. Timings for 6720K FFT length (12 cores, 2 workers): 13.35, 13.36 ms. Throughput: 149.78 iter/sec. Timings for 7168K FFT length (12 cores, 1 worker): 4.50 ms. Throughput: 222.41 iter/sec. Timings for 7168K FFT length (12 cores, 2 workers): 14.06, 14.19 ms. Throughput: 141.58 iter/sec. Timings for 7680K FFT length (12 cores, 1 worker): 5.37 ms. Throughput: 186.05 iter/sec. Timings for 7680K FFT length (12 cores, 2 workers): 16.17, 16.17 ms. Throughput: 123.71 iter/sec. Timings for 8000K FFT length (12 cores, 1 worker): 5.22 ms. Throughput: 191.54 iter/sec. Timings for 8000K FFT length (12 cores, 2 workers): 16.20, 16.32 ms. Throughput: 122.99 iter/sec. Timings for 8064K FFT length (12 cores, 1 worker): 5.49 ms. Throughput: 182.26 iter/sec. Timings for 8064K FFT length (12 cores, 2 workers): 16.50, 16.69 ms. Throughput: 120.53 iter/sec. Timings for 8192K FFT length (12 cores, 1 worker): 5.89 ms. Throughput: 169.83 iter/sec. Timings for 8192K FFT length (12 cores, 2 workers): 16.95, 17.09 ms. Throughput: 117.51 iter/sec. Code:
Prime95 64-bit version 29.8, RdtscTiming=1 Timings for 2048K FFT length (12 cores, 1 worker): 1.07 ms. Throughput: 934.93 iter/sec. Timings for 2048K FFT length (12 cores, 2 workers): 1.53, 1.53 ms. Throughput: 1307.78 iter/sec. Timings for 2240K FFT length (12 cores, 1 worker): 1.03 ms. Throughput: 968.92 iter/sec. Timings for 2240K FFT length (12 cores, 2 workers): 1.66, 1.71 ms. Throughput: 1187.84 iter/sec. Timings for 2304K FFT length (12 cores, 1 worker): 1.08 ms. Throughput: 925.87 iter/sec. Timings for 2304K FFT length (12 cores, 2 workers): 1.63, 1.67 ms. Throughput: 1212.94 iter/sec. Timings for 2400K FFT length (12 cores, 1 worker): 1.18 ms. Throughput: 845.48 iter/sec. Timings for 2400K FFT length (12 cores, 2 workers): 1.80, 1.77 ms. Throughput: 1121.96 iter/sec. Timings for 2560K FFT length (12 cores, 1 worker): 1.18 ms. Throughput: 846.87 iter/sec. Timings for 2560K FFT length (12 cores, 2 workers): 2.01, 1.96 ms. Throughput: 1007.45 iter/sec. Timings for 2688K FFT length (12 cores, 1 worker): 1.22 ms. Throughput: 817.39 iter/sec. Timings for 2688K FFT length (12 cores, 2 workers): 1.97, 1.96 ms. Throughput: 1015.97 iter/sec. Timings for 2800K FFT length (12 cores, 1 worker): 1.31 ms. Throughput: 764.73 iter/sec. Timings for 2800K FFT length (12 cores, 2 workers): 2.07, 2.03 ms. Throughput: 975.56 iter/sec. Timings for 2880K FFT length (12 cores, 1 worker): 1.34 ms. Throughput: 747.34 iter/sec. Timings for 2880K FFT length (12 cores, 2 workers): 2.08, 2.12 ms. Throughput: 952.38 iter/sec. Timings for 3072K FFT length (12 cores, 1 worker): 1.44 ms. Throughput: 692.45 iter/sec. Timings for 3072K FFT length (12 cores, 2 workers): 2.25, 2.27 ms. Throughput: 884.15 iter/sec. Timings for 3200K FFT length (12 cores, 1 worker): 1.50 ms. Throughput: 668.50 iter/sec. Timings for 3200K FFT length (12 cores, 2 workers): 2.40, 2.40 ms. Throughput: 833.02 iter/sec. Timings for 3360K FFT length (12 cores, 1 worker): 1.50 ms. Throughput: 665.21 iter/sec. Timings for 3360K FFT length (12 cores, 2 workers): 2.76, 2.75 ms. Throughput: 725.20 iter/sec. Timings for 3584K FFT length (12 cores, 1 worker): 1.68 ms. Throughput: 595.78 iter/sec. Timings for 3584K FFT length (12 cores, 2 workers): 2.97, 2.98 ms. Throughput: 671.48 iter/sec. Timings for 3840K FFT length (12 cores, 1 worker): 1.71 ms. Throughput: 584.73 iter/sec. Timings for 3840K FFT length (12 cores, 2 workers): 3.35, 3.36 ms. Throughput: 596.38 iter/sec. Timings for 4096K FFT length (12 cores, 1 worker): 1.88 ms. Throughput: 532.71 iter/sec. Timings for 4096K FFT length (12 cores, 2 workers): 4.07, 4.06 ms. Throughput: 492.05 iter/sec. Timings for 4480K FFT length (12 cores, 1 worker): 2.09 ms. Throughput: 478.71 iter/sec. Timings for 4480K FFT length (12 cores, 2 workers): 5.39, 5.32 ms. Throughput: 373.51 iter/sec. Timings for 4608K FFT length (12 cores, 1 worker): 2.05 ms. Throughput: 488.26 iter/sec. Timings for 4608K FFT length (12 cores, 2 workers): 5.24, 5.23 ms. Throughput: 382.13 iter/sec. Timings for 4800K FFT length (12 cores, 1 worker): 2.13 ms. Throughput: 470.50 iter/sec. Timings for 4800K FFT length (12 cores, 2 workers): 5.76, 5.76 ms. Throughput: 347.27 iter/sec. Timings for 5120K FFT length (12 cores, 1 worker): 2.21 ms. Throughput: 452.76 iter/sec. Timings for 5120K FFT length (12 cores, 2 workers): 6.52, 6.53 ms. Throughput: 306.55 iter/sec. Timings for 5376K FFT length (12 cores, 1 worker): 2.39 ms. Throughput: 418.74 iter/sec. Timings for 5376K FFT length (12 cores, 2 workers): 7.23, 7.37 ms. Throughput: 273.98 iter/sec. Timings for 5600K FFT length (12 cores, 1 worker): 2.54 ms. Throughput: 393.36 iter/sec. Timings for 5600K FFT length (12 cores, 2 workers): 8.02, 8.02 ms. Throughput: 249.24 iter/sec. Timings for 5760K FFT length (12 cores, 1 worker): 2.63 ms. Throughput: 380.26 iter/sec. Timings for 5760K FFT length (12 cores, 2 workers): 8.79, 8.64 ms. Throughput: 229.51 iter/sec. Timings for 6144K FFT length (12 cores, 1 worker): 2.78 ms. Throughput: 359.64 iter/sec. Timings for 6144K FFT length (12 cores, 2 workers): 9.16, 9.13 ms. Throughput: 218.77 iter/sec. Timings for 6400K FFT length (12 cores, 1 worker): 2.84 ms. Throughput: 352.44 iter/sec. Timings for 6400K FFT length (12 cores, 2 workers): 9.85, 9.85 ms. Throughput: 203.12 iter/sec. Timings for 6720K FFT length (12 cores, 1 worker): 3.23 ms. Throughput: 309.43 iter/sec. Timings for 6720K FFT length (12 cores, 2 workers): 10.81, 10.64 ms. Throughput: 186.49 iter/sec. Timings for 7168K FFT length (12 cores, 1 worker): 3.65 ms. Throughput: 274.28 iter/sec. Timings for 7168K FFT length (12 cores, 2 workers): 11.48, 11.48 ms. Throughput: 174.26 iter/sec. Timings for 7680K FFT length (12 cores, 1 worker): 4.40 ms. Throughput: 227.38 iter/sec. Timings for 7680K FFT length (12 cores, 2 workers): 13.00, 13.02 ms. Throughput: 153.75 iter/sec. Timings for 8000K FFT length (12 cores, 1 worker): 4.46 ms. Throughput: 224.21 iter/sec. Timings for 8000K FFT length (12 cores, 2 workers): 13.24, 13.24 ms. Throughput: 151.08 iter/sec. Timings for 8064K FFT length (12 cores, 1 worker): 4.65 ms. Throughput: 214.92 iter/sec. Timings for 8064K FFT length (12 cores, 2 workers): 13.42, 13.42 ms. Throughput: 149.06 iter/sec. Timings for 8192K FFT length (12 cores, 1 worker): 4.85 ms. Throughput: 205.99 iter/sec. Timings for 8192K FFT length (12 cores, 2 workers): 13.79, 13.78 ms. Throughput: 145.07 iter/sec. Code:
Prime95 64-bit version 29.8, RdtscTiming=1 Timings for 2048K FFT length (12 cores, 1 worker): 1.32 ms. Throughput: 759.28 iter/sec. Timings for 2048K FFT length (12 cores, 2 workers): 1.60, 1.59 ms. Throughput: 1254.81 iter/sec. Timings for 2240K FFT length (12 cores, 1 worker): 1.07 ms. Throughput: 935.16 iter/sec. Timings for 2240K FFT length (12 cores, 2 workers): 1.75, 1.76 ms. Throughput: 1139.24 iter/sec. Timings for 2304K FFT length (12 cores, 1 worker): 1.09 ms. Throughput: 917.06 iter/sec. Timings for 2304K FFT length (12 cores, 2 workers): 1.71, 1.73 ms. Throughput: 1162.16 iter/sec. Timings for 2400K FFT length (12 cores, 1 worker): 1.15 ms. Throughput: 870.42 iter/sec. Timings for 2400K FFT length (12 cores, 2 workers): 1.84, 1.85 ms. Throughput: 1086.44 iter/sec. Timings for 2560K FFT length (12 cores, 1 worker): 1.23 ms. Throughput: 814.20 iter/sec. Timings for 2560K FFT length (12 cores, 2 workers): 2.01, 2.01 ms. Throughput: 994.44 iter/sec. Timings for 2688K FFT length (12 cores, 1 worker): 1.24 ms. Throughput: 809.06 iter/sec. Timings for 2688K FFT length (12 cores, 2 workers): 2.05, 2.05 ms. Throughput: 975.72 iter/sec. Timings for 2800K FFT length (12 cores, 1 worker): 1.87 ms. Throughput: 534.27 iter/sec. Timings for 2800K FFT length (12 cores, 2 workers): 2.16, 2.16 ms. Throughput: 925.65 iter/sec. Timings for 2880K FFT length (12 cores, 1 worker): 1.37 ms. Throughput: 731.17 iter/sec. Timings for 2880K FFT length (12 cores, 2 workers): 2.20, 2.19 ms. Throughput: 911.47 iter/sec. Timings for 3072K FFT length (12 cores, 1 worker): 1.49 ms. Throughput: 670.10 iter/sec. Timings for 3072K FFT length (12 cores, 2 workers): 2.35, 2.27 ms. Throughput: 865.36 iter/sec. Timings for 3200K FFT length (12 cores, 1 worker): 1.52 ms. Throughput: 658.08 iter/sec. Timings for 3200K FFT length (12 cores, 2 workers): 2.61, 2.51 ms. Throughput: 781.95 iter/sec. Timings for 3360K FFT length (12 cores, 1 worker): 1.53 ms. Throughput: 652.05 iter/sec. Timings for 3360K FFT length (12 cores, 2 workers): 2.67, 2.69 ms. Throughput: 747.01 iter/sec. Timings for 3584K FFT length (12 cores, 1 worker): 1.70 ms. Throughput: 587.06 iter/sec. Timings for 3584K FFT length (12 cores, 2 workers): 2.99, 3.02 ms. Throughput: 665.09 iter/sec. Timings for 3840K FFT length (12 cores, 1 worker): 1.75 ms. Throughput: 569.94 iter/sec. Timings for 3840K FFT length (12 cores, 2 workers): 3.37, 3.38 ms. Throughput: 592.54 iter/sec. Timings for 4096K FFT length (12 cores, 1 worker): 1.92 ms. Throughput: 520.63 iter/sec. Timings for 4096K FFT length (12 cores, 2 workers): 4.13, 4.05 ms. Throughput: 489.30 iter/sec. Timings for 4480K FFT length (12 cores, 1 worker): 2.10 ms. Throughput: 477.04 iter/sec. Timings for 4480K FFT length (12 cores, 2 workers): 5.47, 5.32 ms. Throughput: 370.75 iter/sec. Timings for 4608K FFT length (12 cores, 1 worker): 2.09 ms. Throughput: 478.53 iter/sec. Timings for 4608K FFT length (12 cores, 2 workers): 5.31, 5.38 ms. Throughput: 374.50 iter/sec. Timings for 4800K FFT length (12 cores, 1 worker): 2.15 ms. Throughput: 464.74 iter/sec. Timings for 4800K FFT length (12 cores, 2 workers): 5.80, 5.76 ms. Throughput: 346.07 iter/sec. Timings for 5120K FFT length (12 cores, 1 worker): 2.25 ms. Throughput: 445.05 iter/sec. Timings for 5120K FFT length (12 cores, 2 workers): 6.67, 6.62 ms. Throughput: 301.14 iter/sec. Timings for 5376K FFT length (12 cores, 1 worker): 2.45 ms. Throughput: 407.43 iter/sec. Timings for 5376K FFT length (12 cores, 2 workers): 7.29, 7.31 ms. Throughput: 274.12 iter/sec. Timings for 5600K FFT length (12 cores, 1 worker): 2.56 ms. Throughput: 391.23 iter/sec. Timings for 5600K FFT length (12 cores, 2 workers): 8.10, 8.11 ms. Throughput: 246.73 iter/sec. Timings for 5760K FFT length (12 cores, 1 worker): 2.69 ms. Throughput: 371.30 iter/sec. Timings for 5760K FFT length (12 cores, 2 workers): 8.75, 8.84 ms. Throughput: 227.40 iter/sec. Timings for 6144K FFT length (12 cores, 1 worker): 2.83 ms. Throughput: 353.38 iter/sec. Timings for 6144K FFT length (12 cores, 2 workers): 9.36, 9.20 ms. Throughput: 215.52 iter/sec. Timings for 6400K FFT length (12 cores, 1 worker): 2.90 ms. Throughput: 344.26 iter/sec. Timings for 6400K FFT length (12 cores, 2 workers): 9.84, 9.85 ms. Throughput: 203.16 iter/sec. Timings for 6720K FFT length (12 cores, 1 worker): 3.34 ms. Throughput: 299.58 iter/sec. Timings for 6720K FFT length (12 cores, 2 workers): 10.83, 10.85 ms. Throughput: 184.52 iter/sec. Timings for 7168K FFT length (12 cores, 1 worker): 3.71 ms. Throughput: 269.23 iter/sec. Timings for 7168K FFT length (12 cores, 2 workers): 11.47, 11.57 ms. Throughput: 173.60 iter/sec. Timings for 7680K FFT length (12 cores, 1 worker): 4.42 ms. Throughput: 226.09 iter/sec. Timings for 7680K FFT length (12 cores, 2 workers): 13.11, 12.99 ms. Throughput: 153.28 iter/sec. Timings for 8000K FFT length (12 cores, 1 worker): 4.52 ms. Throughput: 221.24 iter/sec. Timings for 8000K FFT length (12 cores, 2 workers): 13.17, 13.15 ms. Throughput: 152.00 iter/sec. Timings for 8064K FFT length (12 cores, 1 worker): 4.64 ms. Throughput: 215.55 iter/sec. Timings for 8064K FFT length (12 cores, 2 workers): 13.43, 13.51 ms. Throughput: 148.48 iter/sec. Timings for 8192K FFT length (12 cores, 1 worker): 4.96 ms. Throughput: 201.73 iter/sec. Timings for 8192K FFT length (12 cores, 2 workers): 13.93, 13.93 ms. Throughput: 143.56 iter/sec. |
![]() |
![]() |
![]() |
#789 |
"Jorge Coveiro"
Nov 2006
Moura, Portugal
4810 Posts |
![]()
Here are some AMD 3950x benchmarks:
Stock + RamCache III + Memory Clock: 1600 + Fabric Clock: 1600 (Throughput + FFT + Trial) Benchmarks: Code:
######################################################### Throughput Benchmark: AMD Ryzen 9 3950X 16-Core Processor CPU speed: 4239.86 MHz, 16 hyperthreaded cores CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB L1 cache line size: 64 bytes, L2 cache line size: 64 bytes Prime95 64-bit version 29.8, RdtscTiming=1 Timings for 2048K FFT length (16 cores, 1 worker): 1.52 ms. Throughput: 659.51 iter/sec. Timings for 2048K FFT length (16 cores, 2 workers): 1.34, 1.34 ms. Throughput: 1491.01 iter/sec. Timings for 2240K FFT length (16 cores, 1 worker): 1.04 ms. Throughput: 963.01 iter/sec. Timings for 2240K FFT length (16 cores, 2 workers): 1.42, 1.42 ms. Throughput: 1405.76 iter/sec. Timings for 2304K FFT length (16 cores, 1 worker): 1.07 ms. Throughput: 934.74 iter/sec. Timings for 2304K FFT length (16 cores, 2 workers): 1.42, 1.42 ms. Throughput: 1409.12 iter/sec. Timings for 2400K FFT length (16 cores, 1 worker): 1.17 ms. Throughput: 851.33 iter/sec. Timings for 2400K FFT length (16 cores, 2 workers): 1.58, 1.54 ms. Throughput: 1282.51 iter/sec. Timings for 2560K FFT length (16 cores, 1 worker): 1.17 ms. Throughput: 857.44 iter/sec. Timings for 2560K FFT length (16 cores, 2 workers): 1.61, 1.63 ms. Throughput: 1233.47 iter/sec. Timings for 2688K FFT length (16 cores, 1 worker): 1.21 ms. Throughput: 828.15 iter/sec. Timings for 2688K FFT length (16 cores, 2 workers): 1.69, 1.70 ms. Throughput: 1177.94 iter/sec. Timings for 2800K FFT length (16 cores, 1 worker): 1.94 ms. Throughput: 515.93 iter/sec. Timings for 2800K FFT length (16 cores, 2 workers): 1.79, 1.81 ms. Throughput: 1108.87 iter/sec. Timings for 2880K FFT length (16 cores, 1 worker): 1.34 ms. Throughput: 749.01 iter/sec. Timings for 2880K FFT length (16 cores, 2 workers): 1.80, 1.86 ms. Throughput: 1091.24 iter/sec. Timings for 3072K FFT length (16 cores, 1 worker): 1.45 ms. Throughput: 689.52 iter/sec. Timings for 3072K FFT length (16 cores, 2 workers): 2.17, 2.17 ms. Throughput: 921.70 iter/sec. Timings for 3200K FFT length (16 cores, 1 worker): 1.41 ms. Throughput: 708.09 iter/sec. Timings for 3200K FFT length (16 cores, 2 workers): 2.18, 2.18 ms. Throughput: 918.07 iter/sec. Timings for 3360K FFT length (16 cores, 1 worker): 1.53 ms. Throughput: 655.59 iter/sec. Timings for 3360K FFT length (16 cores, 2 workers): 2.96, 2.93 ms. Throughput: 679.02 iter/sec. Timings for 3584K FFT length (16 cores, 1 worker): 1.65 ms. Throughput: 604.66 iter/sec. Timings for 3584K FFT length (16 cores, 2 workers): 2.99, 3.03 ms. Throughput: 664.45 iter/sec. Timings for 3840K FFT length (16 cores, 1 worker): 1.63 ms. Throughput: 614.97 iter/sec. Timings for 3840K FFT length (16 cores, 2 workers): 3.58, 3.39 ms. Throughput: 574.03 iter/sec. Timings for 4096K FFT length (16 cores, 1 worker): 1.89 ms. Throughput: 528.07 iter/sec. Timings for 4096K FFT length (16 cores, 2 workers): 4.27, 4.27 ms. Throughput: 468.29 iter/sec. Timings for 4480K FFT length (16 cores, 1 worker): 2.08 ms. Throughput: 480.12 iter/sec. Timings for 4480K FFT length (16 cores, 2 workers): 5.33, 5.32 ms. Throughput: 375.61 iter/sec. Timings for 4608K FFT length (16 cores, 1 worker): 1.94 ms. Throughput: 514.56 iter/sec. Timings for 4608K FFT length (16 cores, 2 workers): 5.42, 5.57 ms. Throughput: 363.87 iter/sec. Timings for 4800K FFT length (16 cores, 1 worker): 2.01 ms. Throughput: 498.65 iter/sec. Timings for 4800K FFT length (16 cores, 2 workers): 5.68, 5.71 ms. Throughput: 351.21 iter/sec. Timings for 5120K FFT length (16 cores, 1 worker): 2.12 ms. Throughput: 470.89 iter/sec. Timings for 5120K FFT length (16 cores, 2 workers): 6.44, 6.51 ms. Throughput: 309.04 iter/sec. Timings for 5376K FFT length (16 cores, 1 worker): 2.32 ms. Throughput: 430.39 iter/sec. Timings for 5376K FFT length (16 cores, 2 workers): 7.05, 7.11 ms. Throughput: 282.55 iter/sec. Timings for 5600K FFT length (16 cores, 1 worker): 2.39 ms. Throughput: 418.58 iter/sec. Timings for 5600K FFT length (16 cores, 2 workers): 7.77, 7.75 ms. Throughput: 257.70 iter/sec. Timings for 5760K FFT length (16 cores, 1 worker): 2.59 ms. Throughput: 386.79 iter/sec. Timings for 5760K FFT length (16 cores, 2 workers): 8.52, 8.53 ms. Throughput: 234.56 iter/sec. Timings for 6144K FFT length (16 cores, 1 worker): 2.66 ms. Throughput: 375.77 iter/sec. Timings for 6144K FFT length (16 cores, 2 workers): 8.89, 8.88 ms. Throughput: 225.04 iter/sec. Timings for 6400K FFT length (16 cores, 1 worker): 2.80 ms. Throughput: 356.80 iter/sec. Timings for 6400K FFT length (16 cores, 2 workers): 9.41, 9.47 ms. Throughput: 211.91 iter/sec. Timings for 6720K FFT length (16 cores, 1 worker): 3.43 ms. Throughput: 291.55 iter/sec. Timings for 6720K FFT length (16 cores, 2 workers): 10.74, 10.65 ms. Throughput: 187.03 iter/sec. Timings for 7168K FFT length (16 cores, 1 worker): 3.57 ms. Throughput: 280.15 iter/sec. Timings for 7168K FFT length (16 cores, 2 workers): 11.22, 11.26 ms. Throughput: 177.94 iter/sec. Timings for 7680K FFT length (16 cores, 1 worker): 4.46 ms. Throughput: 224.28 iter/sec. Timings for 7680K FFT length (16 cores, 2 workers): 12.94, 12.94 ms. Throughput: 154.59 iter/sec. Timings for 8000K FFT length (16 cores, 1 worker): 4.35 ms. Throughput: 229.82 iter/sec. Timings for 8000K FFT length (16 cores, 2 workers): 13.07, 13.03 ms. Throughput: 153.31 iter/sec. Timings for 8064K FFT length (16 cores, 1 worker): 4.55 ms. Throughput: 219.93 iter/sec. Timings for 8064K FFT length (16 cores, 2 workers): 13.25, 13.20 ms. Throughput: 151.25 iter/sec. Timings for 8192K FFT length (16 cores, 1 worker): 4.81 ms. Throughput: 208.04 iter/sec. Timings for 8192K FFT length (16 cores, 2 workers): 13.61, 13.53 ms. Throughput: 147.36 iter/sec. ######################################################### FFT Timings Benchmark: AMD Ryzen 9 3950X 16-Core Processor CPU speed: 4239.35 MHz, 16 hyperthreaded cores CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB L1 cache line size: 64 bytes, L2 cache line size: 64 bytes Prime95 64-bit version 29.8, RdtscTiming=1 Timing FFTs using 16 threads on 16 cores. Best time for 2048K FFT length: 1.470 ms., avg: 1.505 ms. Best time for 2240K FFT length: 1.043 ms., avg: 1.071 ms. Best time for 2304K FFT length: 1.025 ms., avg: 1.068 ms. Best time for 2400K FFT length: 1.107 ms., avg: 1.139 ms. Best time for 2560K FFT length: 1.103 ms., avg: 1.139 ms. Best time for 2688K FFT length: 1.178 ms., avg: 1.212 ms. Best time for 2800K FFT length: 1.437 ms., avg: 1.490 ms. Best time for 2880K FFT length: 1.369 ms., avg: 1.439 ms. Best time for 3072K FFT length: 1.418 ms., avg: 1.441 ms. Best time for 3200K FFT length: 1.373 ms., avg: 1.412 ms. Best time for 3360K FFT length: 1.459 ms., avg: 1.500 ms. Best time for 3584K FFT length: 1.627 ms., avg: 1.647 ms. Best time for 3840K FFT length: 1.590 ms., avg: 1.622 ms. Best time for 4096K FFT length: 1.825 ms., avg: 1.851 ms. Best time for 4480K FFT length: 2.056 ms., avg: 2.081 ms. Best time for 4608K FFT length: 1.916 ms., avg: 1.978 ms. Best time for 4800K FFT length: 1.980 ms., avg: 2.014 ms. Best time for 5120K FFT length: 2.061 ms., avg: 2.091 ms. Best time for 5376K FFT length: 2.248 ms., avg: 2.301 ms. Best time for 5600K FFT length: 2.348 ms., avg: 2.434 ms. Best time for 5760K FFT length: 2.507 ms., avg: 2.669 ms. Best time for 6144K FFT length: 2.605 ms., avg: 2.646 ms. Best time for 6400K FFT length: 2.746 ms., avg: 2.862 ms. Best time for 6720K FFT length: 3.049 ms., avg: 3.141 ms. Best time for 7168K FFT length: 3.429 ms., avg: 3.610 ms. Best time for 7680K FFT length: 4.324 ms., avg: 4.512 ms. Best time for 8000K FFT length: 4.246 ms., avg: 4.360 ms. Best time for 8064K FFT length: 4.450 ms., avg: 4.538 ms. Best time for 8192K FFT length: 4.640 ms., avg: 4.816 ms. ######################################################### Trial Factoring Benchmark: AMD Ryzen 9 3950X 16-Core Processor CPU speed: 4239.79 MHz, 16 hyperthreaded cores CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB L1 cache line size: 64 bytes, L2 cache line size: 64 bytes Prime95 64-bit version 29.8, RdtscTiming=1 Best time for 61 bit trial factors: 0.697 ms. Best time for 62 bit trial factors: 0.720 ms. Best time for 63 bit trial factors: 0.718 ms. Best time for 64 bit trial factors: 0.729 ms. Best time for 65 bit trial factors: 0.721 ms. Best time for 66 bit trial factors: 0.680 ms. Best time for 67 bit trial factors: 0.662 ms. Best time for 75 bit trial factors: 0.942 ms. Best time for 76 bit trial factors: 0.715 ms. Best time for 77 bit trial factors: 0.702 ms. (Throughput + FFT + Trial) Benchmarks: Code:
######################################################### Throughput Benchmark: AMD Ryzen 9 3950X 16-Core Processor CPU speed: 4240.84 MHz, 16 hyperthreaded cores CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB L1 cache line size: 64 bytes, L2 cache line size: 64 bytes Prime95 64-bit version 29.8, RdtscTiming=1 Timings for 2048K FFT length (16 cores, 1 worker): 1.38 ms. Throughput: 723.23 iter/sec. Timings for 2048K FFT length (16 cores, 2 workers): 1.27, 1.26 ms. Throughput: 1579.14 iter/sec. Timings for 2240K FFT length (16 cores, 1 worker): 0.91 ms. Throughput: 1104.29 iter/sec. Timings for 2240K FFT length (16 cores, 2 workers): 1.33, 1.34 ms. Throughput: 1500.80 iter/sec. Timings for 2304K FFT length (16 cores, 1 worker): 1.19 ms. Throughput: 837.94 iter/sec. Timings for 2304K FFT length (16 cores, 2 workers): 1.33, 1.38 ms. Throughput: 1477.86 iter/sec. Timings for 2400K FFT length (16 cores, 1 worker): 1.40 ms. Throughput: 713.60 iter/sec. Timings for 2400K FFT length (16 cores, 2 workers): 1.42, 1.43 ms. Throughput: 1403.26 iter/sec. Timings for 2560K FFT length (16 cores, 1 worker): 1.01 ms. Throughput: 990.78 iter/sec. Timings for 2560K FFT length (16 cores, 2 workers): 1.54, 1.53 ms. Throughput: 1302.31 iter/sec. Timings for 2688K FFT length (16 cores, 1 worker): 1.07 ms. Throughput: 938.33 iter/sec. Timings for 2688K FFT length (16 cores, 2 workers): 2.10, 2.16 ms. Throughput: 938.32 iter/sec. Timings for 2800K FFT length (16 cores, 1 worker): 1.42 ms. Throughput: 702.78 iter/sec. Timings for 2800K FFT length (16 cores, 2 workers): 1.67, 1.68 ms. Throughput: 1193.15 iter/sec. Timings for 2880K FFT length (16 cores, 1 worker): 1.16 ms. Throughput: 861.84 iter/sec. Timings for 2880K FFT length (16 cores, 2 workers): 1.67, 1.68 ms. Throughput: 1196.04 iter/sec. Timings for 3072K FFT length (16 cores, 1 worker): 1.28 ms. Throughput: 782.42 iter/sec. Timings for 3072K FFT length (16 cores, 2 workers): 1.81, 1.83 ms. Throughput: 1101.26 iter/sec. Timings for 3200K FFT length (16 cores, 1 worker): 1.26 ms. Throughput: 794.87 iter/sec. Timings for 3200K FFT length (16 cores, 2 workers): 3.09, 3.12 ms. Throughput: 644.20 iter/sec. Timings for 3360K FFT length (16 cores, 1 worker): 1.29 ms. Throughput: 775.47 iter/sec. Timings for 3360K FFT length (16 cores, 2 workers): 2.33, 2.38 ms. Throughput: 850.57 iter/sec. Timings for 3584K FFT length (16 cores, 1 worker): 1.47 ms. Throughput: 682.57 iter/sec. Timings for 3584K FFT length (16 cores, 2 workers): 2.74, 2.71 ms. Throughput: 733.36 iter/sec. Timings for 3840K FFT length (16 cores, 1 worker): 1.46 ms. Throughput: 686.72 iter/sec. Timings for 3840K FFT length (16 cores, 2 workers): 2.86, 2.99 ms. Throughput: 684.17 iter/sec. Timings for 4096K FFT length (16 cores, 1 worker): 1.67 ms. Throughput: 599.08 iter/sec. Timings for 4096K FFT length (16 cores, 2 workers): 3.60, 3.67 ms. Throughput: 550.61 iter/sec. Timings for 4480K FFT length (16 cores, 1 worker): 1.83 ms. Throughput: 546.01 iter/sec. Timings for 4480K FFT length (16 cores, 2 workers): 4.77, 4.87 ms. Throughput: 415.00 iter/sec. Timings for 4608K FFT length (16 cores, 1 worker): 1.78 ms. Throughput: 561.78 iter/sec. Timings for 4608K FFT length (16 cores, 2 workers): 4.55, 4.65 ms. Throughput: 434.67 iter/sec. Timings for 4800K FFT length (16 cores, 1 worker): 1.81 ms. Throughput: 551.30 iter/sec. Timings for 4800K FFT length (16 cores, 2 workers): 5.08, 5.17 ms. Throughput: 390.10 iter/sec. Timings for 5120K FFT length (16 cores, 1 worker): 1.89 ms. Throughput: 529.66 iter/sec. Timings for 5120K FFT length (16 cores, 2 workers): 5.82, 5.93 ms. Throughput: 340.44 iter/sec. Timings for 5376K FFT length (16 cores, 1 worker): 2.07 ms. Throughput: 483.40 iter/sec. Timings for 5376K FFT length (16 cores, 2 workers): 6.74, 6.66 ms. Throughput: 298.58 iter/sec. Timings for 5600K FFT length (16 cores, 1 worker): 2.16 ms. Throughput: 462.27 iter/sec. Timings for 5600K FFT length (16 cores, 2 workers): 7.26, 7.33 ms. Throughput: 274.15 iter/sec. Timings for 5760K FFT length (16 cores, 1 worker): 2.32 ms. Throughput: 430.87 iter/sec. Timings for 5760K FFT length (16 cores, 2 workers): 7.86, 7.98 ms. Throughput: 252.51 iter/sec. Timings for 6144K FFT length (16 cores, 1 worker): 2.39 ms. Throughput: 418.61 iter/sec. Timings for 6144K FFT length (16 cores, 2 workers): 7.76, 7.89 ms. Throughput: 255.60 iter/sec. Timings for 6400K FFT length (16 cores, 1 worker): 2.53 ms. Throughput: 395.61 iter/sec. Timings for 6400K FFT length (16 cores, 2 workers): 8.54, 8.67 ms. Throughput: 232.40 iter/sec. Timings for 6720K FFT length (16 cores, 1 worker): 2.88 ms. Throughput: 347.72 iter/sec. Timings for 6720K FFT length (16 cores, 2 workers): 9.67, 9.70 ms. Throughput: 206.46 iter/sec. Timings for 7168K FFT length (16 cores, 1 worker): 3.18 ms. Throughput: 314.60 iter/sec. Timings for 7168K FFT length (16 cores, 2 workers): 10.22, 10.22 ms. Throughput: 195.70 iter/sec. Timings for 7680K FFT length (16 cores, 1 worker): 4.04 ms. Throughput: 247.63 iter/sec. Timings for 7680K FFT length (16 cores, 2 workers): 11.46, 11.81 ms. Throughput: 171.99 iter/sec. Timings for 8000K FFT length (16 cores, 1 worker): 4.01 ms. Throughput: 249.18 iter/sec. Timings for 8000K FFT length (16 cores, 2 workers): 11.65, 11.84 ms. Throughput: 170.29 iter/sec. Timings for 8064K FFT length (16 cores, 1 worker): 4.13 ms. Throughput: 242.19 iter/sec. Timings for 8064K FFT length (16 cores, 2 workers): 11.89, 11.88 ms. Throughput: 168.27 iter/sec. Timings for 8192K FFT length (16 cores, 1 worker): 4.40 ms. Throughput: 227.32 iter/sec. Timings for 8192K FFT length (16 cores, 2 workers): 12.20, 12.21 ms. Throughput: 163.84 iter/sec. ######################################################### FFT Timings Benchmark: AMD Ryzen 9 3950X 16-Core Processor CPU speed: 4212.31 MHz, 16 hyperthreaded cores CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB L1 cache line size: 64 bytes, L2 cache line size: 64 bytes Prime95 64-bit version 29.8, RdtscTiming=1 Timing FFTs using 16 threads on 16 cores. Best time for 2048K FFT length: 1.327 ms., avg: 1.372 ms. Best time for 2240K FFT length: 0.893 ms., avg: 0.922 ms. Best time for 2304K FFT length: 1.140 ms., avg: 1.172 ms. Best time for 2400K FFT length: 1.001 ms., avg: 1.019 ms. Best time for 2560K FFT length: 1.024 ms., avg: 1.046 ms. Best time for 2688K FFT length: 1.040 ms., avg: 1.055 ms. Best time for 2800K FFT length: 1.452 ms., avg: 1.493 ms. Best time for 2880K FFT length: 1.126 ms., avg: 1.173 ms. Best time for 3072K FFT length: 1.274 ms., avg: 1.291 ms. Best time for 3200K FFT length: 1.224 ms., avg: 1.251 ms. Best time for 3360K FFT length: 1.319 ms., avg: 1.334 ms. Best time for 3584K FFT length: 1.460 ms., avg: 1.477 ms. Best time for 3840K FFT length: 1.459 ms., avg: 1.474 ms. Best time for 4096K FFT length: 1.619 ms., avg: 1.643 ms. Best time for 4480K FFT length: 1.792 ms., avg: 1.831 ms. Best time for 4608K FFT length: 1.718 ms., avg: 1.736 ms. Best time for 4800K FFT length: 1.793 ms., avg: 1.817 ms. Best time for 5120K FFT length: 1.860 ms., avg: 1.882 ms. Best time for 5376K FFT length: 2.026 ms., avg: 2.051 ms. Best time for 5600K FFT length: 2.120 ms., avg: 2.189 ms. Best time for 5760K FFT length: 2.244 ms., avg: 2.288 ms. Best time for 6144K FFT length: 2.310 ms., avg: 2.368 ms. Best time for 6400K FFT length: 2.463 ms., avg: 2.527 ms. Best time for 6720K FFT length: 2.790 ms., avg: 2.867 ms. Best time for 7168K FFT length: 3.072 ms., avg: 3.203 ms. Best time for 7680K FFT length: 3.927 ms., avg: 4.069 ms. Best time for 8000K FFT length: 3.864 ms., avg: 3.961 ms. Best time for 8064K FFT length: 4.018 ms., avg: 4.177 ms. Best time for 8192K FFT length: 4.306 ms., avg: 4.464 ms. ######################################################### Trial Factoring Benchmark: AMD Ryzen 9 3950X 16-Core Processor CPU speed: 4212.78 MHz, 16 hyperthreaded cores CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB L1 cache line size: 64 bytes, L2 cache line size: 64 bytes Prime95 64-bit version 29.8, RdtscTiming=1 Best time for 61 bit trial factors: 0.718 ms. Best time for 62 bit trial factors: 0.730 ms. Best time for 63 bit trial factors: 0.670 ms. Best time for 64 bit trial factors: 0.721 ms. Best time for 65 bit trial factors: 0.674 ms. Best time for 66 bit trial factors: 0.708 ms. Best time for 67 bit trial factors: 0.715 ms. Best time for 75 bit trial factors: 0.711 ms. Best time for 76 bit trial factors: 0.703 ms. Best time for 77 bit trial factors: 0.699 ms. |
![]() |
![]() |
![]() |
#790 |
"Jorge Coveiro"
Nov 2006
Moura, Portugal
24·3 Posts |
![]()
I also decided to test with "manual overclocking".
Manual Overclocking - 4.2GHz@1.25v + Memory Overclock: 1800 + Fabric Overclock: 1800 + RamCache III Here are the benchmarks: Code:
################################################# Throughput Benchmark: AMD Ryzen 9 3950X 16-Core Processor CPU speed: 4190.69 MHz, 16 hyperthreaded cores CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB L1 cache line size: 64 bytes, L2 cache line size: 64 bytes Prime95 64-bit version 29.8, RdtscTiming=1 Timings for 2048K FFT length (16 cores, 1 worker): 1.03 ms. Throughput: 974.93 iter/sec. Timings for 2048K FFT length (16 cores, 2 workers): 1.24, 1.21 ms. Throughput: 1637.97 iter/sec. Timings for 2240K FFT length (16 cores, 1 worker): 0.95 ms. Throughput: 1047.98 iter/sec. Timings for 2240K FFT length (16 cores, 2 workers): 1.30, 1.30 ms. Throughput: 1539.05 iter/sec. Timings for 2304K FFT length (16 cores, 1 worker): 0.95 ms. Throughput: 1056.92 iter/sec. Timings for 2304K FFT length (16 cores, 2 workers): 1.31, 1.33 ms. Throughput: 1519.39 iter/sec. Timings for 2400K FFT length (16 cores, 1 worker): 1.47 ms. Throughput: 681.38 iter/sec. Timings for 2400K FFT length (16 cores, 2 workers): 1.41, 1.43 ms. Throughput: 1408.85 iter/sec. Timings for 2560K FFT length (16 cores, 1 worker): 1.00 ms. Throughput: 1002.11 iter/sec. Timings for 2560K FFT length (16 cores, 2 workers): 1.46, 1.48 ms. Throughput: 1363.25 iter/sec. Timings for 2688K FFT length (16 cores, 1 worker): 1.05 ms. Throughput: 948.17 iter/sec. Timings for 2688K FFT length (16 cores, 2 workers): 1.52, 1.54 ms. Throughput: 1308.20 iter/sec. Timings for 2800K FFT length (16 cores, 1 worker): 1.42 ms. Throughput: 704.31 iter/sec. Timings for 2800K FFT length (16 cores, 2 workers): 1.66, 1.67 ms. Throughput: 1203.02 iter/sec. Timings for 2880K FFT length (16 cores, 1 worker): 1.46 ms. Throughput: 686.80 iter/sec. Timings for 2880K FFT length (16 cores, 2 workers): 1.64, 1.66 ms. Throughput: 1215.08 iter/sec. Timings for 3072K FFT length (16 cores, 1 worker): 1.29 ms. Throughput: 776.18 iter/sec. Timings for 3072K FFT length (16 cores, 2 workers): 1.80, 1.81 ms. Throughput: 1106.82 iter/sec. Timings for 3200K FFT length (16 cores, 1 worker): 1.24 ms. Throughput: 809.10 iter/sec. Timings for 3200K FFT length (16 cores, 2 workers): 2.29, 2.40 ms. Throughput: 852.48 iter/sec. Timings for 3360K FFT length (16 cores, 1 worker): 1.28 ms. Throughput: 783.01 iter/sec. Timings for 3360K FFT length (16 cores, 2 workers): 2.08, 2.10 ms. Throughput: 957.96 iter/sec. Timings for 3584K FFT length (16 cores, 1 worker): 1.47 ms. Throughput: 680.75 iter/sec. Timings for 3584K FFT length (16 cores, 2 workers): 2.58, 2.65 ms. Throughput: 764.92 iter/sec. Timings for 3840K FFT length (16 cores, 1 worker): 1.43 ms. Throughput: 698.16 iter/sec. Timings for 3840K FFT length (16 cores, 2 workers): 2.99, 2.98 ms. Throughput: 669.79 iter/sec. Timings for 4096K FFT length (16 cores, 1 worker): 1.67 ms. Throughput: 598.28 iter/sec. Timings for 4096K FFT length (16 cores, 2 workers): 3.58, 3.73 ms. Throughput: 547.76 iter/sec. Timings for 4480K FFT length (16 cores, 1 worker): 1.84 ms. Throughput: 542.99 iter/sec. Timings for 4480K FFT length (16 cores, 2 workers): 5.13, 4.99 ms. Throughput: 395.23 iter/sec. Timings for 4608K FFT length (16 cores, 1 worker): 1.77 ms. Throughput: 566.01 iter/sec. Timings for 4608K FFT length (16 cores, 2 workers): 5.80, 5.69 ms. Throughput: 348.11 iter/sec. Timings for 4800K FFT length (16 cores, 1 worker): 1.78 ms. Throughput: 562.41 iter/sec. Timings for 4800K FFT length (16 cores, 2 workers): 5.60, 5.56 ms. Throughput: 358.40 iter/sec. Timings for 5120K FFT length (16 cores, 1 worker): 1.90 ms. Throughput: 525.25 iter/sec. Timings for 5120K FFT length (16 cores, 2 workers): 5.87, 5.89 ms. Throughput: 340.13 iter/sec. Timings for 5376K FFT length (16 cores, 1 worker): 2.04 ms. Throughput: 491.12 iter/sec. Timings for 5376K FFT length (16 cores, 2 workers): 6.46, 6.46 ms. Throughput: 309.57 iter/sec. Timings for 5600K FFT length (16 cores, 1 worker): 2.15 ms. Throughput: 465.32 iter/sec. Timings for 5600K FFT length (16 cores, 2 workers): 7.29, 7.29 ms. Throughput: 274.39 iter/sec. Timings for 5760K FFT length (16 cores, 1 worker): 2.30 ms. Throughput: 434.51 iter/sec. Timings for 5760K FFT length (16 cores, 2 workers): 7.95, 7.95 ms. Throughput: 251.48 iter/sec. Timings for 6144K FFT length (16 cores, 1 worker): 2.37 ms. Throughput: 421.43 iter/sec. Timings for 6144K FFT length (16 cores, 2 workers): 8.16, 8.16 ms. Throughput: 245.05 iter/sec. Timings for 6400K FFT length (16 cores, 1 worker): 2.61 ms. Throughput: 383.66 iter/sec. Timings for 6400K FFT length (16 cores, 2 workers): 8.45, 8.76 ms. Throughput: 232.49 iter/sec. Timings for 6720K FFT length (16 cores, 1 worker): 2.90 ms. Throughput: 345.24 iter/sec. Timings for 6720K FFT length (16 cores, 2 workers): 9.65, 9.61 ms. Throughput: 207.74 iter/sec. Timings for 7168K FFT length (16 cores, 1 worker): 3.18 ms. Throughput: 314.10 iter/sec. Timings for 7168K FFT length (16 cores, 2 workers): 10.23, 10.15 ms. Throughput: 196.29 iter/sec. Timings for 7680K FFT length (16 cores, 1 worker): 4.09 ms. Throughput: 244.50 iter/sec. Timings for 7680K FFT length (16 cores, 2 workers): 11.65, 11.61 ms. Throughput: 171.92 iter/sec. Timings for 8000K FFT length (16 cores, 1 worker): 4.02 ms. Throughput: 248.51 iter/sec. Timings for 8000K FFT length (16 cores, 2 workers): 11.67, 11.68 ms. Throughput: 171.31 iter/sec. Timings for 8064K FFT length (16 cores, 1 worker): 4.13 ms. Throughput: 241.98 iter/sec. Timings for 8064K FFT length (16 cores, 2 workers): 11.97, 11.89 ms. Throughput: 167.65 iter/sec. Timings for 8192K FFT length (16 cores, 1 worker): 4.38 ms. Throughput: 228.46 iter/sec. Timings for 8192K FFT length (16 cores, 2 workers): 11.98, 12.11 ms. Throughput: 166.05 iter/sec. #################################################### FFT Timings Benchmark: AMD Ryzen 9 3950X 16-Core Processor CPU speed: 4185.26 MHz, 16 hyperthreaded cores CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB L1 cache line size: 64 bytes, L2 cache line size: 64 bytes Prime95 64-bit version 29.8, RdtscTiming=1 Timing FFTs using 16 threads on 16 cores. Best time for 2048K FFT length: 1.013 ms., avg: 1.044 ms. Best time for 2240K FFT length: 0.877 ms., avg: 0.897 ms. Best time for 2304K FFT length: 1.168 ms., avg: 1.208 ms. Best time for 2400K FFT length: 1.017 ms., avg: 1.031 ms. Best time for 2560K FFT length: 1.013 ms., avg: 1.047 ms. Best time for 2688K FFT length: 1.035 ms., avg: 1.054 ms. Best time for 2800K FFT length: 1.945 ms., avg: 1.988 ms. Best time for 2880K FFT length: 1.134 ms., avg: 1.160 ms. Best time for 3072K FFT length: 1.287 ms., avg: 1.298 ms. Best time for 3200K FFT length: 1.248 ms., avg: 1.271 ms. Best time for 3360K FFT length: 1.276 ms., avg: 1.300 ms. Best time for 3584K FFT length: 1.466 ms., avg: 1.480 ms. Best time for 3840K FFT length: 1.412 ms., avg: 1.443 ms. Best time for 4096K FFT length: 1.653 ms., avg: 1.682 ms. Best time for 4480K FFT length: 1.840 ms., avg: 1.877 ms. Best time for 4608K FFT length: 1.704 ms., avg: 1.724 ms. Best time for 4800K FFT length: 1.764 ms., avg: 1.779 ms. Best time for 5120K FFT length: 1.867 ms., avg: 1.884 ms. Best time for 5376K FFT length: 2.021 ms., avg: 2.042 ms. Best time for 5600K FFT length: 2.115 ms., avg: 2.138 ms. Best time for 5760K FFT length: 2.271 ms., avg: 2.311 ms. Best time for 6144K FFT length: 2.359 ms., avg: 2.405 ms. Best time for 6400K FFT length: 2.460 ms., avg: 2.518 ms. Best time for 6720K FFT length: 2.804 ms., avg: 2.887 ms. Best time for 7168K FFT length: 3.096 ms., avg: 3.152 ms. Best time for 7680K FFT length: 3.958 ms., avg: 4.032 ms. Best time for 8000K FFT length: 3.967 ms., avg: 4.045 ms. Best time for 8064K FFT length: 4.016 ms., avg: 4.134 ms. Best time for 8192K FFT length: 4.271 ms., avg: 4.396 ms. #################################################### Trial Factoring Benchmark: AMD Ryzen 9 3950X 16-Core Processor CPU speed: 4185.88 MHz, 16 hyperthreaded cores CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 16x32 KB, L2 cache size: 16x512 KB, L3 cache size: 4x16 MB L1 cache line size: 64 bytes, L2 cache line size: 64 bytes Prime95 64-bit version 29.8, RdtscTiming=1 Best time for 61 bit trial factors: 0.710 ms. Best time for 62 bit trial factors: 0.714 ms. Best time for 63 bit trial factors: 0.710 ms. Best time for 64 bit trial factors: 0.713 ms. Best time for 65 bit trial factors: 0.710 ms. Best time for 66 bit trial factors: 0.694 ms. Best time for 67 bit trial factors: 0.692 ms. Best time for 75 bit trial factors: 0.688 ms. Best time for 76 bit trial factors: 0.689 ms. Best time for 77 bit trial factors: 0.690 ms. Last fiddled with by JCoveiro on 2020-02-02 at 17:46 |
![]() |
![]() |
![]() |
#791 |
"Kieren"
Jul 2011
In My Own Galaxy!
2×3×1,693 Posts |
![]()
Here are some interesting comparisons. I ran benchmarks for 5760K FFT, on a 6700K CPU at 4400, 4200, and 4000 MHz. 32 GiB dual rank DDR4-3000 RAM. By narrow margins 4200 MHz came out best, but all three sets top out within a few it/sec of each other.
4400 was the first run. After that, I dropped the 4-core-1-worker test because 2 workers were coming out better. Last fiddled with by kladner on 2020-03-25 at 04:27 |
![]() |
![]() |
![]() |
#792 | |
"Viliam Furík"
Jul 2018
Martin, Slovakia
31916 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
Thread Tools | |
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Perpetual "interesting video" thread... | Xyzzy | Lounge | 51 | 2022-10-06 11:28 |
LLR benchmark thread | Oddball | Riesel Prime Search | 5 | 2010-08-02 00:11 |
Perpetual I'm pi**ed off thread | rogue | Soap Box | 19 | 2009-10-28 19:17 |
Perpetual autostereogram thread... | Xyzzy | Lounge | 10 | 2006-09-28 00:36 |
Perpetual ECM factoring challenge thread... | Xyzzy | Factoring | 65 | 2005-09-05 08:16 |