![]() |
![]() |
#199 |
Random Account
Aug 2009
Not U. + S.A.
42048 Posts |
![]() |
![]() |
![]() |
![]() |
#200 | |
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
32·733 Posts |
![]() Quote:
Check for other applications using lots of cycles. Firefox can be very CPU and memory intensive. That can really distort both prime95 benchmark results and what Task Manager CPU monitoring show. Best benchmarking results will be obtained when all other processes practical are idle or absent. Last fiddled with by kriesel on 2022-05-01 at 00:08 |
|
![]() |
![]() |
![]() |
#201 |
"James Heinrich"
May 2004
ex-Northern Ontario
1110100110102 Posts |
![]() |
![]() |
![]() |
![]() |
#202 |
Nov 2012
Japan
1000112 Posts |
![]()
I run some old versions.
From the taskmanger or the results of benchmarks, I can distinguish the issue. p95v306b4.win64 OK (single core) Timings for 2048K FFT length (1 core, 1 worker): 5.01 ms. Throughput: 199.74 iter/sec. p95v307b1.win64 OK (single core), but fail to complete p95v307b2.win64 NG (all cores) p95v307b3.win64 NG (all cores) p95v307b4.win64 fail to run, immediately stop p95v307b5.win64 NG (all cores) p95v307b7.win64 NG (all cores) p95v307b8.win64 NG (all cores) p95v307b9.win64 NG (all cores) p95v308b13.win64 NG (all cores) Timings for 2048K FFT length (1 core, 1 worker): 0.57 ms. Throughput: 1741.37 iter/sec. |
![]() |
![]() |
![]() |
#203 |
Nov 2012
Japan
5·7 Posts |
![]()
It seems that AlderLake has the issue.
The result of 1 core shows too fast, even if AlderLake is running over 5GHz. 12900k: https://mersenneforum.org/showpost.p...8&postcount=64 Timings for 2048K FFT length (8 cores, 1 worker): 0.62 ms. Throughput: 1602.22 iter/sec. https://mersenneforum.org/showpost.p...4&postcount=65 FFTlen=2048K all-complex, Type=3, Arch=8, Pass1=128, Pass2=16384, clm=4 (1 core, 1 worker): 0.62 ms. Throughput: 1624.29 iter/sec. 12700K: https://mersenneforum.org/showpost.p...7&postcount=69 Timings for 2048K FFT length (1 core, 1 worker): 4.57 ms. Throughput: 218.94 iter/sec. Timings for 2048K FFT length (8 cores, 1 worker): 0.64 ms. Throughput: 1564.59 iter/sec. It appears that this is normal. |
![]() |
![]() |
![]() |
#204 | |
"University student"
May 2021
Beijing, China
32×29 Posts |
![]() Quote:
FFT uses complex numbers, if we compute a complex number directly instead of computing the real and imaginary part respectively, we could get an over 2x speedup. Last fiddled with by Zhangrc on 2022-05-02 at 10:32 |
|
![]() |
![]() |
![]() |
#205 |
Nov 2012
Japan
5×7 Posts |
![]()
"Benchmark all-complex FFTs" option is not much faster as normal in my machine with AVX512.
Timings for 2048K FFT length (1 core, 1 worker): 0.65 ms. Throughput: 1535.94 iter/sec. Timings for 2048K all-complex FFT length (1 core, 1 worker): 0.65 ms. Throughput: 1546.01 iter/sec. And both have the issue.(running using all cores) |
![]() |
![]() |
![]() |
#206 |
"James Heinrich"
May 2004
ex-Northern Ontario
2·3·7·89 Posts |
![]() |
![]() |
![]() |
![]() |
#207 |
Nov 2012
Japan
3510 Posts |
![]()
No.
Prime95.exe uses all cores, even if I enter 1 in "Number of CPU cores to benchmark" textbox. Timings for 2048K FFT length (1 core, 1 worker): 0.61 ms. Throughput: 1627.02 iter/sec. Timings for 2100K FFT length (1 core, 1 worker): 0.75 ms. Throughput: 1338.92 iter/sec. Timings for 2160K FFT length (1 core, 1 worker): 0.80 ms. Throughput: 1243.79 iter/sec. Throughput for 1 core is expected to around 100-200 iter/sec for such FFT length |
![]() |
![]() |
![]() |
#208 |
Nov 2012
Japan
5·7 Posts |
![]()
"FFT timings benchmark" does not have the issue.
I can see the multi-core scaling. Timing FFTs using 1 core: Best time for 2048K FFT length: 4.987 ms., avg: 5.001 ms. Timing FFTs using 2 cores: Best time for 2048K FFT length: 2.606 ms., avg: 2.891 ms. Timing FFTs using 3 cores: Best time for 2048K FFT length: 1.768 ms., avg: 2.066 ms. Timing FFTs using 4 cores: Best time for 2048K FFT length: 1.328 ms., avg: 1.853 ms. |
![]() |
![]() |
![]() |
#209 |
Nov 2012
Japan
5·7 Posts |
![]()
I try reduceing the active cores in BIOS setup.
Even if I activate only 4 cores (disabling 14 cores) and Hyperthread is off , Prime95.exe still uses all cores. Intel(R) Core(TM) i9-7980XE CPU @ 2.60GHz CPU speed: 3286.47 MHz, 4 cores Timings for 2048K FFT length (1 core, 1 worker): 1.42 ms. Throughput: 703.46 iter/sec. |
![]() |
![]() |