![]() |
![]() |
#12 |
∂2ω=0
Sep 2002
República de California
2DEB16 Posts |
![]()
Patch Alert: Some recent code changes to clean up the messaging and file-writing left a few dangling fclose() calls in the two *_mod_square.c source files, potentially leading to a null-pointer fclose crash following emission of a roundoff error warning. Fixed. Also a few help.txt file changes to improve coherence of the how-to-kill text.
The md5 value in the OP has been updated to match this upload (1686232 bytes, md5 = dc5487e984196a32b47a8066ec9a6803). |
![]() |
![]() |
![]() |
#13 | |
"Teal Dulcet"
Jun 2018
22·19 Posts |
![]() Quote:
It will now automatically add lines to the bench.txt file for future reference in the same format as those added by Prime95/MPrime to the respective results.bench.txt file when running a throughput benchmark. This is in addition to the benchmark summery table I added to that file in my previous update of the script (see post above). Here is an example of these lines on a 4 core ARM system: Code:
Timings for 2048K FFT length (4 cores, 1 threads, 4 workers): 40.41, 41.10, 40.81, 40.32 ms. Throughput: 98.381 iter/sec. Timings for 2304K FFT length (4 cores, 1 threads, 4 workers): 48.30, 48.96, 48.34, 49.33 ms. Throughput: 82.083 iter/sec. Timings for 2560K FFT length (4 cores, 1 threads, 4 workers): 52.98, 53.24, 53.09, 53.06 ms. Throughput: 75.340 iter/sec. Timings for 2816K FFT length (4 cores, 1 threads, 4 workers): 58.45, 58.60, 58.59, 58.59 ms. Throughput: 68.306 iter/sec. Timings for 3072K FFT length (4 cores, 1 threads, 4 workers): 64.35, 64.80, 64.45, 64.69 ms. Throughput: 61.947 iter/sec. Timings for 3328K FFT length (4 cores, 1 threads, 4 workers): 69.29, 69.48, 69.34, 69.57 ms. Throughput: 57.621 iter/sec. Timings for 3584K FFT length (4 cores, 1 threads, 4 workers): 75.27, 75.47, 75.24, 75.52 ms. Throughput: 53.067 iter/sec. Timings for 3840K FFT length (4 cores, 1 threads, 4 workers): 81.52, 81.82, 81.66, 81.73 ms. Throughput: 48.970 iter/sec. Timings for 4096K FFT length (4 cores, 1 threads, 4 workers): 78.06, 78.70, 78.06, 78.87 ms. Throughput: 51.007 iter/sec. Timings for 4608K FFT length (4 cores, 1 threads, 4 workers): 97.03, 97.62, 97.09, 97.78 ms. Throughput: 41.076 iter/sec. Timings for 5120K FFT length (4 cores, 1 threads, 4 workers): 107.23, 107.67, 107.00, 107.60 ms. Throughput: 37.253 iter/sec. Timings for 5632K FFT length (4 cores, 1 threads, 4 workers): 118.54, 119.28, 118.63, 119.31 ms. Throughput: 33.630 iter/sec. Timings for 6144K FFT length (4 cores, 1 threads, 4 workers): 129.88, 130.56, 128.54, 129.97 ms. Throughput: 30.832 iter/sec. Timings for 6656K FFT length (4 cores, 1 threads, 4 workers): 141.49, 142.41, 141.57, 142.44 ms. Throughput: 28.174 iter/sec. Timings for 7168K FFT length (4 cores, 1 threads, 4 workers): 153.81, 155.76, 155.40, 155.35 ms. Throughput: 25.794 iter/sec. Timings for 7680K FFT length (4 cores, 1 threads, 4 workers): 167.57, 169.11, 168.69, 168.74 ms. Throughput: 23.736 iter/sec. |
|
![]() |
![]() |
![]() |
#14 |
∂2ω=0
Sep 2002
República de California
5·2,351 Posts |
![]()
Patch Alert:Due to a user report of bad behavior of regular (non '-9') kill with his multithreaded run, signal handling has been changed to immediate-exit without savefile write. Suggest users 'killall -9 Mlucas' any ongoing jobs at their earliest convenience and switch to the updated code; using that, regular 'kill' should work. Also, workfile assignments are now echoed to the per-exponent log (.stat) file, not just to stderr (e.g. to the nohup.out file).
The md5 value in the OP has been updated to match this upload (1688188 bytes, md5 = 970c4dde58417bd7f6be0e4af4b59b4e). |
![]() |
![]() |
![]() |
#15 | |
"Teal Dulcet"
Jun 2018
7610 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
#16 | |
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
112·61 Posts |
![]() Quote:
I think the limit situation may improve somewhat with the next release. Last fiddled with by kriesel on 2022-02-05 at 22:37 |
|
![]() |
![]() |
![]() |
#17 | |
∂2ω=0
Sep 2002
República de California
1175510 Posts |
![]() Quote:
I've been running p-1 on F33, exponent ~8.6 billion, for around 6 months now, max ROEs are a little under 0.10 for that. |
|
![]() |
![]() |
![]() |
#18 | |
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
112×61 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
#19 | ||
∂2ω=0
Sep 2002
República de California
1175510 Posts |
![]()
Ken PMed me with some questions and examples of issues he hit for M(p) with p > 2^32 - for the benefit of any other users wanting to play with such stuff, copy of my reply to him:
Quote:
|
||
![]() |
![]() |
![]() |
#20 | |
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
738110 Posts |
![]() Quote:
Yeah, you're pretty safe there, since at an estimated 88.5 years to 232 iterations on 256M fft length on one of my "faster" test systems, we'd potentially need to leave instructions for our heirs, and theirs! (Other than your source code that is.) Last fiddled with by kriesel on 2022-02-09 at 10:30 |
|
![]() |
![]() |
![]() |
#21 |
Mar 2019
3×109 Posts |
![]()
I am trying to get the pm1 standalone binary to build on Ubuntu. Getting the following errors:
Code:
$ clang -c -DPM1_STANDALONE -O3 pm1.c pm1.c:34:3: warning: Building pm1.c in PM1_STANDALONE mode. [-W#warnings] #warning Building pm1.c in PM1_STANDALONE mode. ^ pm1.c:898:4: warning: Building pm1_stage2() in standalone (modmul-counting) mode! [-W#warnings] #warning Building pm1_stage2() in standalone (modmul-counting) mode! ^ pm1.c:1004:2: error: use of undeclared identifier 'dtmp'; did you mean 'tmp'? dtmp = mlucas_getOptVal(MLUCAS_INI_FILE,"InterimGCD"); // Any failure-to-find-or-parse can be checked for via isNaN(dtmp) ^~~~ tmp pm1.c:956:9: note: 'tmp' declared here uint64 tmp,q,q0,q1,q2, qlo = 0ull,qhi, reloc_start, pinv64 = 0ull; ^ pm1.c:1005:5: error: use of undeclared identifier 'dtmp'; did you mean 'tmp'? if(dtmp == 0) { ^~~~ tmp pm1.c:956:9: note: 'tmp' declared here uint64 tmp,q,q0,q1,q2, qlo = 0ull,qhi, reloc_start, pinv64 = 0ull; ^ pm1.c:1008:3: error: use of undeclared identifier 'interim_gcd' interim_gcd = 0; ^ pm1.c:1715:2: error: use of undeclared identifier 'q_old_10M' q_old_10M = (uint32)(q0 * inv10m); ^ pm1.c:1715:28: error: use of undeclared identifier 'inv10m' q_old_10M = (uint32)(q0 * inv10m); Code:
$ clang --version Debian clang version 13.0.1-+rc1-1~exp4 Target: x86_64-pc-linux-gnu Thread model: posix InstalledDir: /usr/bin |
![]() |
![]() |
![]() |
#22 | |
∂2ω=0
Sep 2002
República de California
1175510 Posts |
![]() Quote:
There is no "p-1 standalone" mode for the actual user build - if p-1 is all you want to do, you use the regular build as described on the README webpage ('bash makemake.sh' inside the unzipped source tarball) and simply restrict the assignment types to Pminus1 and/or Pfactor ones. Last fiddled with by ewmayer on 2022-03-07 at 20:00 |
|
![]() |
![]() |
![]() |
Thread Tools | |
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Mlucas v20.1 available | ewmayer | Mlucas | 7 | 2021-09-19 17:39 |
Mlucas v20 available | ewmayer | Mlucas | 9 | 2021-09-02 20:36 |
Mlucas v19.1 available | ewmayer | Mlucas | 46 | 2021-07-06 19:40 |
Mlucas v19 available | ewmayer | Mlucas | 89 | 2021-02-01 20:37 |
mlucas on sun | delta_t | Mlucas | 14 | 2007-10-04 05:45 |