![]() |
|
|
#1 |
|
23×36 Posts |
|
|
|
|
#2 |
|
Just call me Henry
"David"
Sep 2007
Cambridge (GMT/BST)
2·33·109 Posts |
How does the performance compare to mlucas?
|
|
|
|
|
|
#3 | |
|
Banned
"Luigi"
Aug 2002
Team Italia
481810 Posts |
Quote:
BTW, Ernst Mayer ported his multithreading asm-aware code to Arm as well. It works like a charm on Raspberries and Ernst is working on implementing prp testing as well... Last fiddled with by ET_ on 2018-07-17 at 11:55 |
|
|
|
|
|
|
#4 |
|
Jan 2008
France
22616 Posts |
|
|
|
|
|
|
#5 |
|
Sep 2003
5·11·47 Posts |
Look at the dates, the files are mostly from 5 years ago, a few from 3 years ago.
Seems to be abandonware, unfortunately. |
|
|
|
|
|
#6 | |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
2×32×7×43 Posts |
Quote:
CUDALucas 2.05.1 Feb 2015 CUDAPm1 0.20 Nov 2013 mfaktc 0.21 Oct 2014 mfakto 0.14 Nov 2014 gpulucas 0.9.4 Feb 2012 and almost clLucas at Jan 2016. Some of these are in heavy current use, because they adequately fulfill the function. Iprime unfortunately seems not to have save files implemented, per the road map in the read.md. Last fiddled with by kriesel on 2018-07-17 at 23:28 |
|
|
|
|
|
|
#7 |
|
∂2ω=0
Sep 2002
República de California
2×32×647 Posts |
It actually would be useful to compare timings between Nick's non-SIMD integer code vs Mlucas on ARMv8, if someone has bandwidth to test them both on the same system, I would be appreciative.
|
|
|
|
|
|
#8 |
|
I moo ablest echo power!
May 2013
13×137 Posts |
|
|
|
|
|
|
#9 | |
|
∂2ω=0
Sep 2002
República de California
2×32×647 Posts |
Quote:
1. If the integer checker supports multithreading, run Mlucas self-tests on all 4 cores via './Mlucas -s m -iters 100 -cpu 0:3', afterward have a look at the mlucas.cfg file and run the integer code at the nearest FFT lengths to those in whatever short-length timing mode it supports; 2. If the integer checker does not support multithreading, do as in [1] but run Mlucas self-tests on just 2 core via './Mlucas -s m -iters 100'. Since the 2 codes will likely permit appreciably different max-exponents at any given transform length, the timing comparisons will need to be interepreted in that light, i.e. in "timing for comparable exponent" fashion. Thanks! Last fiddled with by ewmayer on 2018-07-25 at 20:54 |
|
|
|
|
|
|
#10 |
|
I moo ablest echo power!
May 2013
13·137 Posts |
Looks like the iprime code is single-threaded, so I ran mlucas single-thread as well. I only did one test of iprime, and here's why:
iprime: Testing 2**20000047-1 with fft size 2**20 for 100 iterations Residue 0xDD61B3E031F1E0BA That took 4m23.108015142s for 100 iterations which is 2.631080151s per iteration mlucas: 1024 msec/iter = 159.98 (used the exponent here for the iprime test) So mlucas is ~10x faster than iprime for the same FFT size and exponent. And iprime crashed trying to run p = 49005071. Last fiddled with by wombatman on 2018-07-27 at 04:06 |
|
|
|
|
|
#11 |
|
∂2ω=0
Sep 2002
República de California
101101011111102 Posts |
Thanks, Wombatman! A quick determination of "it's hopeless" is arguably as useful as the "this looks promising" ones.
I got only a modest 1.5x per-cycle boost from SIMD-assembly-versus-not on ARMv8 (I suspect because, unlike x86, the ARM is designed to share as many underlying functional units between SIMD and non-SIMD instructions as possible, i.e. both SIMD and basic C-code access the same number of hardware resources, but careful SIMD coding makes better use of same), so the non-SIMD using nature of the integer code would seem to serve as only a small mitigation. |
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| (M48) NEW MERSENNE PRIME! LARGEST PRIME NUMBER DISCOVERED! | dabaichi | News | 571 | 2020-10-26 11:02 |
| How does one prove that a mersenne prime found with CUDALucas is really prime? | ICWiener | Software | 38 | 2018-06-09 13:59 |
| Prime numbers Grid, to test an odd integer on 44 | Zarck | Math | 5 | 2012-03-06 14:43 |
| The 40th known Mersenne prime, 220996011-1 is not PRIME! | illman-q | Miscellaneous Math | 33 | 2004-09-19 05:02 |
| Primes for a mersenne integer DWT FNT | gbvalor | Math | 1 | 2003-09-08 16:05 |