mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   Hardware (https://www.mersenneforum.org/forumdisplay.php?f=9)
-   -   PFGW benchmarking (https://www.mersenneforum.org/showthread.php?t=24801)

carpetpool 2019-09-30 00:29

PFGW benchmarking
 
I've been using PFGW 4.0.0 to find new random PRPs (and primes) of large sizes. Of the two machines I use to test numbers, one is an Intel Core i5-3230M CPU (Windows 10) and the other is an AMD FX-4300 CPU (Ubuntu).

Just now I was running one of my 160K digit PRP canditiates when I got times:

(757.9948s+3.4911s) with i5-3230M machine
(1079.1039s+10.0399s) with AMD FX-4300 CPU

What is strange about this is the i5 machine has a clock speed of 2.60 GHz and the AMD machine has a clock speed of 3.80 GHz. Not to mention the i5 only has 2 physical cores compared to the AMD with 4 physical cores.

So my question would be why the running time would be shorter for the Windows 10 machine compared to the Linux Machine? I think it has something to do with the CPU brand performance but I'm not sure.

Secondly, I'm assuming PFGW is NOT multithreaded. Weather it is or not, would there be time difference on two machines with the same operating system, processor type, and clock speed, but one machine has significantly more cores than the other (For example, a 3.5 Ghz @ 4 cores v.s. 3.5 Ghz @ 8 cores)?

Please let me know if you have any information that might be helpful. Thanks.

kuratkull 2019-09-30 04:34

PFGW isn't multithreaded and Intel has AVX special instructions. At least that is what seems to decide a large chunk of the performance for me.

Happy5214 2019-09-30 06:19

According to Wikipedia, AMD FX-4300 doesn't have AVX2 support, but it does support AVX and FMA3.

mackerel 2019-09-30 08:02

Per-core/per-clock the Intel CPUs of that era is a LOT faster than AMD CPUs. Having instruction support isn't enough, there has to be adequate hardware implemented to back it up, which is lacking in the AMD CPU. I'm not too familiar with that era, is that the ones where there was one FPU shared between two cores?

At 160k FFT it would probably have poor scaling even if you were to run multi-threaded, compared to running one task per core. I don't recall if PFGW now has support for multithead, but it was talked about in the past. Also give LLR a try (which I know has multi-thread support via -t option). Sometimes one or the other might be faster.

If you start running multiple tasks on a CPU, be aware you might hit bottlenecks in ram bandwidth, and also check thermals especially on mobile devices if that's what the Intel CPU is in.

kuratkull 2019-09-30 20:06

My mistake. It seems at least the newest(v4) PFGW supports threads with the -T switch.


All times are UTC. The time now is 00:44.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.