![]() |
![]() |
#12 |
"Carlos Pinho"
Oct 2011
Milton Keynes, UK
2·2,423 Posts |
![]()
HT helps a lot on LA, at least for me.
|
![]() |
![]() |
![]() |
#13 | |
"Curtis"
Feb 2005
Riverside, CA
22×1,151 Posts |
![]() Quote:
VBITS=128 on otherwise idle machine. ETA after 1% of job: 6-threaded 14hr 34 min 12-threads 8 hr 26 min 18-threads 9 hr 15 min 24-threads 8 hr 27 min These times look rather slow; I just installed the extra 32GB memory today, so perhaps filling all 8 slots slows memory access a bunch. Some time I'll remove the original 16GB and see if 4 sticks is faster than 8. |
|
![]() |
![]() |
![]() |
#14 | |
"Mike"
Aug 2002
22·5·397 Posts |
![]() Quote:
Note that we only count the LA phase in our calculations. ![]() |
|
![]() |
![]() |
![]() |
#15 | |
"Mike"
Aug 2002
22×5×397 Posts |
![]() Quote:
12 = 8h04m50s 20 = 8h32m31s 24 = 7h59m33s ![]() |
|
![]() |
![]() |
![]() |
#16 |
"Mike"
Aug 2002
22×5×397 Posts |
![]()
CPU = i7-8565U
RAM = 2×16GB DDR4-2400 CMD = ./msieve -v -nc -t 8 LA = 47884s ![]() |
![]() |
![]() |
![]() |
#17 |
"Mike"
Aug 2002
22·5·397 Posts |
![]()
CPU = i7-8565U
RAM = 2×16GB DDR4-2400 CMD = ./msieve -v -nc -t 4 LA = 51662s ![]() |
![]() |
![]() |
![]() |
#18 |
"Mike"
Aug 2002
22×5×397 Posts |
![]()
CPU = 3950X
RAM = 2×8GB DDR4-3666 CMD = ./msieve -v -nc -t 16 LA = 27180s ![]() |
![]() |
![]() |
![]() |
#19 |
"Beschorner Kurt"
Jul 2016
Germany
22·5 Posts |
![]()
In my experience, the throughput depends on an additionally running program (e.g. gmp-ecm)
machine: i7-7820X - 8 cores + HT matrix: 49M * 49M memory: 64 GB msieve .... -t16 solo ~ 55% (power according task manager) msieve .... -t16 and gmp-ecm (prior: low) ~ 78% -"- With msieve + mprime/Prime95 the effectiveness is a litle lower Kurt |
![]() |
![]() |
![]() |
#20 | |
Jul 2003
So Cal
2·13·79 Posts |
![]() Quote:
mpirun -np 2 msieve -nc2 1,2 -v -t 20 |
|
![]() |
![]() |
![]() |
#21 |
Jul 2003
So Cal
40068 Posts |
![]()
Here's a bench using compute nodes with one Xeon E5-2650 v4 Broadwell cpu with 12-cores, 24 threads.
1 node 7h 40m 2 nodes 2h 45m 4 nodes 1h 35m 8 nodes 1h 10m Not sure why the time for one node is so high compared to the others? Perhaps something fitting into the cache with the smaller matrices on each node? |
![]() |
![]() |
![]() |
#22 | |
"Curtis"
Feb 2005
Riverside, CA
22×1,151 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
Thread Tools | |
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
PFGW benchmarking | carpetpool | Hardware | 4 | 2019-09-30 20:06 |
Looking for benchmarking help with a Phenom or PhenomII X6 | mrolle | Software | 25 | 2012-03-14 14:15 |
GMP 5.0.1 vs GMP 4.1.4 benchmarking | unconnected | GMP-ECM | 5 | 2011-04-03 16:16 |
Benchmarking dual-CPU machines | garo | Software | 2 | 2010-09-27 20:33 |
Benchmarking challenge! | Xyzzy | Software | 17 | 2003-08-26 15:43 |