mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Software

Reply
 
Thread Tools
Old 2017-04-28, 02:21   #1
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

11100001110012 Posts
Default 29.2 benchmark help

Background: Prime95 selects pre-determined FFT implementations based on benchmarks from various machines I've owned over the years. The problem is that the pre-determined selection may not be best for your machine -- it may have faster memory, more or fewer cores, etc.

Here's the plan. I've added code that writes benchmark timings to gwnum.txt as well as code to read that data in. Next up is writing code that selects the best FFT implementation based on the data. I hope this results in a tiny performance boost on machines that the default FFT implementation is non-optimal.

If I create an executable with every possible FFT implementation, I'll end up with a very large executable. So, I need help selecting which FFT implementations to include and which ones can safely be excluded.

To start the process, I'd like to gather some data from 64-bit linux machines with FMA3 support using this executable: https://www.dropbox.com/s/9n3cfkumuqykbbp/mprime?dl=0

Download the executable, add the line "AllBench=1" to prime.txt. Do a throughput benchmark with settings 16 to 32768 sizes, "every FFT size", "all CPU cores", "one worker" and "maximum workers", 8 seconds should be sufficient.

This will be time consuming
Your machine must support FMA3
Unusual machines are what I'm looking for. Especially Ryzen, OC'd machines, fast mem, 2 cores, 6 cores, fancy Xeons

When done email results.txt to me. Delete the executable - it is not for production work.

Thanks!
Prime95 is offline   Reply With Quote
Old 2017-04-28, 04:25   #2
Jayder
 
Jayder's Avatar
 
Dec 2012

2·139 Posts
Default

If I ran this on a Windows machine in an Ubuntu VM, would that affect anything, or would that be okay?
Jayder is offline   Reply With Quote
Old 2017-04-28, 08:35   #3
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

2,917 Posts
Default

Quote:
Originally Posted by Jayder View Post
If I ran this on a Windows machine in an Ubuntu VM, would that affect anything, or would that be okay?
It would certainly affect things.

You could boot the Ubuntu Desktop install USB to Desktop and run it from there?
Mark Rose is online now   Reply With Quote
Old 2017-04-28, 09:20   #4
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

2,917 Posts
Default

Would you like us to benchmark hyperthreads?
Mark Rose is online now   Reply With Quote
Old 2017-04-28, 11:50   #5
mackerel
 
mackerel's Avatar
 
Feb 2016
UK

5·79 Posts
Default

I'd love to help but the linux requirement adds enough friction that I may be selective and test limited systems. If it ran on Windows I could easily throw it on everything.

I could do any of following if others aren't doing it, and are of specific interest.
Ryzen 1700 stock and OC
OC'd machines - Broadwell (with 128MB L4 cache) and Skylake quad cores
Fast ram machines: Skylake quad cores
Fancy Xeons: 14 core Haswell
2 cores: i3 Haswell, Skylake
mackerel is offline   Reply With Quote
Old 2017-04-28, 12:28   #6
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

2,917 Posts
Default

I'm running it on:

i3-2120 @ 3.3
i3-4710 @ 3.7 with single rank DDR3-1600
i3-4710 @ 3.7 with dual rank DDR3-1600
i7-4770k @ 3.9 (all core, so overclocked) with dual rank DDR3-1600
i5-6600 @ 3.3 (underclocked) with dual rank DDR4-2133
Mark Rose is online now   Reply With Quote
Old 2017-04-28, 13:05   #7
mackerel
 
mackerel's Avatar
 
Feb 2016
UK

5×79 Posts
Default

Mark, does i3-2120 support required FMA3? Also the other i3 is 4170 not 4710?
mackerel is offline   Reply With Quote
Old 2017-04-28, 13:52   #8
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

52×172 Posts
Default

Quote:
Originally Posted by Mark Rose View Post
Would you like us to benchmark hyperthreads?
no
Prime95 is offline   Reply With Quote
Old 2017-04-28, 13:54   #9
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

11100001110012 Posts
Default

I'll create a Windows build tonight
Prime95 is offline   Reply With Quote
Old 2017-04-28, 14:13   #10
VictordeHolland
 
VictordeHolland's Avatar
 
"Victor de Hollander"
Aug 2011
the Netherlands

23·3·72 Posts
Default

Quote:
Originally Posted by Mark Rose View Post
I'm running it on:

i3-2120 @ 3.3
i3-4710 @ 3.7 with single rank DDR3-1600
i3-4710 @ 3.7 with dual rank DDR3-1600
i3-2120 is SandyBridge based, so has AVX, but doesn't have AVX2/FMA
i3-4170 is Haswell, which does have AVX, AVX2, FMA (it mostly Pentium and Celeron parts that have AVX disabled for market segmentation)
VictordeHolland is offline   Reply With Quote
Old 2017-04-28, 14:16   #11
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

2,917 Posts
Default

Quote:
Originally Posted by mackerel View Post
Mark, does i3-2120 support required FMA3? Also the other i3 is 4170 not 4710?
Hmm.. I did this in the middle of the night, and you're right, i3-2120 doesn't support FMA3. I'll put it back to normal work.

And the two other i3's are 4170.
Mark Rose is online now   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
29.2 benchmark help #2 (Ryzen only) Prime95 Software 10 2017-05-08 13:24
Benchmark Variances Fred Software 5 2016-04-01 18:15
LLR benchmark thread Oddball Riesel Prime Search 5 2010-08-02 00:11
Does anyone have i7 920? for Benchmark? cipher Twin Prime Search 2 2009-04-14 20:16
Benchmark Weirdness R.D. Silverman Hardware 2 2007-07-25 12:16

All times are UTC. The time now is 14:52.

Fri Dec 4 14:52:03 UTC 2020 up 1 day, 11:03, 0 users, load averages: 2.09, 2.58, 2.51

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2020, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.