Old 2020-08-18, 13:15   #23
bsquared
 

Quote:
Originally Posted by frmky
I know this is late, but if you still have this data set up, try
mpirun -np 2 msieve -nc2 1,2 -v -t 20

After 1% elapsed, the ETA is:

Code:
-np 2 1x2 -t 20: 3 hrs 9 min
-np 4 1x4 -t 10: 2 hrs 48 min
-np 5 1x5 -t 8: 3 hrs 49 min
-np 8 1x8 -t 5: 2 hrs 50 min
The 1x5 time is not surprising as one of the processes is split across sockets. Of the others that split evenly, more processes with fewer threads each appear to be a bit better.
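To make the comparison above easier to read, here is a minimal Python sketch that converts the four posted ETAs to minutes and ranks them. The numbers are copied from the list above; the helper name `to_minutes` is only illustrative. Note that every configuration uses 40 threads in total (2×20, 4×10, 5×8, 8×5), and that these are ETAs taken after 1% of the run, not final timings.

```python
# ETAs reported above, keyed by (MPI processes, threads per process).
etas = {
    (2, 20): "3 hrs 9 min",
    (4, 10): "2 hrs 48 min",
    (5, 8): "3 hrs 49 min",
    (8, 5): "2 hrs 50 min",
}

def to_minutes(text):
    """Convert a string such as '3 hrs 9 min' to whole minutes."""
    parts = text.split()
    return int(parts[0]) * 60 + int(parts[2])

minutes = {cfg: to_minutes(t) for cfg, t in etas.items()}
best = min(minutes, key=minutes.get)

# Print the configurations from fastest to slowest ETA.
for (procs, threads), m in sorted(minutes.items(), key=lambda kv: kv[1]):
    print(f"-np {procs} -t {threads}: {m} min ({m - minutes[best]:+d} vs best)")
```

The 1x4 and 1x8 layouts land within a couple of minutes of each other, so the difference between them is probably within the noise of an early ETA.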
Old 2020-08-30, 09:03   #24
Xyzzy
 

Here are binaries for 64-bit Linux with various "VBITS" flags set.

Attached Files
File Type: gz msieve-1.54-VBITS64.x86_64.gz (324.4 KB, 253 views)
File Type: gz msieve-1.54-VBITS128.x86_64.gz (326.1 KB, 245 views)
File Type: gz msieve-1.54-VBITS256.x86_64.gz (334.2 KB, 253 views)
Old 2020-10-30, 10:53   #25
Xyzzy
 

CPU = i5-10600K
RAM = 2×8GB DDR4-3200
CMD = ./msieve -v -nc -t 6
LA = 21988s


Attached Files
File Type: gz msieve.log.gz (2.8 KB, 178 views)
Old 2020-10-30, 11:08   #26
Xyzzy
 

Given that the 1920X and 3950X are pretty serious CPUs, does the result for the i5 seem abnormally fast?

CPU = 1920X
RAM = 4×16GB DDR4-2666
CMD = ./msieve -v -nc -t 24
LA = 7h 58m 53s

CPU = 3950X
RAM = 2×8GB DDR4-3666
CMD = ./msieve -v -nc -t 16
LA = 7h 33m 00s

CPU = i5-10600K
RAM = 2×8GB DDR4-3200
CMD = ./msieve -v -nc -t 6
LA = 6h 06m 28s
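
As a rough way to put the three results on one scale, the sketch below parses the posted LA times into seconds and expresses each as a multiple of the i5-10600K time. The values are copied from the list above; the function name `to_seconds` is only illustrative, and the ratios say nothing about why the machines differ.

```python
import re

# LA times exactly as posted above.
results = {
    "1920X": "7h 58m 53s",
    "3950X": "7h 33m 00s",
    "i5-10600K": "6h 06m 28s",
}

def to_seconds(text):
    """Parse a duration such as '7h 58m 53s' into seconds."""
    h, m, s = map(int, re.fullmatch(r"(\d+)h (\d+)m (\d+)s", text).groups())
    return h * 3600 + m * 60 + s

seconds = {cpu: to_seconds(t) for cpu, t in results.items()}
baseline = seconds["i5-10600K"]

# Show each run relative to the i5 result.
for cpu, secs in seconds.items():
    print(f"{cpu}: {secs} s, {secs / baseline:.2f}x the i5 time")
```

The i5 time parses to 21988 s, which matches the raw LA figure posted for that run.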
Old 2020-10-30, 12:38   #27
Gimarel
 

Quote:
Originally Posted by Xyzzy
CPU = 3950X
RAM = 2×8GB DDR4-3666
CMD = ./msieve -v -nc -t 16
LA = 7h 33m 00s

My timings for an AMD Ryzen 9 3950X with 2×32GB DDR4-3600:

-nc1: ~0h 43m 18s
-nc2: ~0h 5m 15s until the multithreaded LA starts


Timings for the multithreaded part:

-nc2: estimated 3h 24m msieve compiled with gcc-9.3
-nc2: estimated 3h 25m msieve compiled with gcc-10.0
-nc2: estimated 3h 22m msieve compiled with clang-9
-nc2: estimated 3h 24m msieve compiled with clang-10

Fastest total without -nc3: ~4h 21m

All runs used VBITS=256 and 32 threads. All other versions were slower.
I tried the objects for each compiler twice, to ensure that the clang-9 one is indeed the fastest.

Last fiddled with by Gimarel on 2020-10-30 at 12:39
Old 2020-11-17, 14:47   #28
Xyzzy
 

CPU = 5600X
RAM = 2×16GB DDR4-3200
CMD = ./msieve -v -nc -t 12
LA = 14805s


Attached Files
File Type: gz msieve.log.gz (2.7 KB, 155 views)
Old 2020-11-17, 14:56   #29
Xyzzy
 

CPU = 1920X
RAM = 4×16GB DDR4-2666
CMD = ./msieve -v -nc -t 24
LA = 7h 58m 53s


CPU = 3950X
RAM = 2×8GB DDR4-3666
CMD = ./msieve -v -nc -t 16
LA = 7h 33m 00s


CPU = 5600X
RAM = 2×16GB DDR4-3200
CMD = ./msieve -v -nc -t 12
LA = 4h 6m 45s


We have used the same binary and the same setup/method for every benchmark we have posted.

This 5600X result just doesn't seem right unless we had the 1920X and 3950X set up wrong or something.

Old 2020-11-17, 15:19   #30
axn
 

Quote:
Originally Posted by Xyzzy
CPU = 3950X
RAM = 2×8GB DDR4-3666
CMD = ./msieve -v -nc -t 16
LA = 7h 33m 00s


CPU = 5600X
RAM = 2×16GB DDR4-3200
CMD = ./msieve -v -nc -t 12
LA = 4h 6m 45s


We have used the same binary and the same setup/method for every benchmark we have posted.

This 5600X result just doesn't seem right unless we had the 1920X and 3950X set up wrong or something.

Is it possible that the 8GB modules in the 3950X are single-rank while the 16GB modules in the 5600X are dual-rank? Try swapping the RAM between the systems.
Old 2020-11-17, 17:13   #31
Xyzzy
 

Quote:
Originally Posted by axn
Is it possible that the 8GB modules in the 3950X are single-rank while the 16GB modules in the 5600X are dual-rank? Try swapping the RAM between the systems.

They were single-rank sticks.

We don't have the 3950X anymore so we can't retest it.

Old 2021-04-13, 22:36   #32
Xyzzy
 

CPU = 10980XE (165W)
RAM = 8×32GB DDR4-3200
CMD = ./msieve -v -nc -t 18
LA = 16343s


CPU = 10980XE (165W)
RAM = 8×32GB DDR4-3200
CMD = ./msieve -v -nc -t 36
LA = 14709s
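
For comparison with the h/m/s figures used elsewhere in the thread, this small Python sketch converts the two raw LA times and estimates how much faster the 36-thread run was. The 10980XE is an 18-core, 36-thread part, so `-t 36` uses both hardware threads per core; the helper name `hms` is only illustrative.

```python
def hms(seconds):
    """Format a number of seconds as 'Hh MMm SSs'."""
    hours, rem = divmod(seconds, 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours}h {minutes:02d}m {secs:02d}s"

# LA times in seconds from the two runs above, keyed by thread count.
la_seconds = {18: 16343, 36: 14709}

for threads, secs in la_seconds.items():
    print(f"-t {threads}: {hms(secs)}")

# Relative throughput gain of 36 threads over 18 threads.
gain = la_seconds[18] / la_seconds[36] - 1
print(f"-t 36 is about {gain:.1%} faster than -t 18")
```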


Old 2021-05-09, 08:57   #33
frmky
 

Each node has a Fujitsu A64FX 64-bit ARM processor with 48 cores and 32 GB HBM memory divided into 4 NUMA regions.

VBITS = 128
1 node 3h 30m
2 nodes 1h 58m
4 nodes 1h 10m
8 nodes 0h 41m
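
The node-scaling figures above can be turned into speedup and parallel efficiency with a few lines of Python. This is only a sketch based on the four posted times, with efficiency defined in the usual way as speedup divided by node count.

```python
# Wall-clock times in minutes for VBITS = 128, as posted above.
minutes = {1: 3 * 60 + 30, 2: 1 * 60 + 58, 4: 1 * 60 + 10, 8: 41}

base = minutes[1]
scaling = {}
for nodes, t in minutes.items():
    speedup = base / t            # how many times faster than 1 node
    efficiency = speedup / nodes  # fraction of ideal linear scaling
    scaling[nodes] = (speedup, efficiency)
    print(f"{nodes} node(s): {t} min, speedup {speedup:.2f}x, "
          f"efficiency {efficiency:.0%}")
```

Efficiency falls as nodes are added (4 nodes give exactly 3x, i.e. 75%), which is the expected pattern when communication between nodes takes up a growing share of the run.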

VBITS makes a big difference for this processor. On 1 node:
VBITS = 64: 4h 5m
VBITS = 128: 3h 30m
VBITS = 256: 5h 40m

Two notes about compiling: The cache size must be set in the source since msieve doesn't detect it for ARM processors and the default is quite small. And removing the manual loop unrolling in the files in common/lanczos/cpu/ gives a small but consistent 1.5-2% improvement on this processor.

Last fiddled with by frmky on 2021-05-09 at 09:26

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.