mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Software > Mlucas

Reply
 
Thread Tools
Old 2017-12-30, 08:04   #67
heliosh
 
Oct 2017
++41

53 Posts
Default

That did the job, thank you!
heliosh is offline   Reply With Quote
Old 2017-12-30, 22:55   #68
Lorenzo
 
Lorenzo's Avatar
 
Aug 2010
Republic of Belarus

2×89 Posts
Default

Hello!) Not sure ... bu looks like something wrong with txz archive. I see here only src folder. Nothing else for compile
Lorenzo is offline   Reply With Quote
Old 2017-12-30, 23:16   #69
ewmayer
2ω=0
 
ewmayer's Avatar
 
Sep 2002
República de California

101101111011002 Posts
Default

Quote:
Originally Posted by Lorenzo View Post
Hello!) Not sure ... bu looks like something wrong with txz archive. I see here only src folder. Nothing else for compile
Lorenzo, you need to look *inside* the src folder.
ewmayer is offline   Reply With Quote
Old 2017-12-31, 09:24   #70
Lorenzo
 
Lorenzo's Avatar
 
Aug 2010
Republic of Belarus

2·89 Posts
Default

ahhh, sorry. Just i have read instruction again. Got it how to install.
Lorenzo is offline   Reply With Quote
Old 2017-12-31, 11:52   #71
heliosh
 
Oct 2017
++41

1758 Posts
Default

Another day, another problem

mlucas -s m -cpu 0:3 resulting in an error with both, the self-compiled 17.1 as well as the raspbian precompiled 14.1-2.
It runs fine until:
Code:
NTHREADS = 4
M32156581: using FFT length 1664K = 1703936 8-byte floats.
 this gives an average   18.871941786545975 bits per digit
Using complex FFT radices        26         8        16        16        16
mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance.
pthread_create:: Cannot allocate memory
pthread_join: : Cannot allocate memory
pthread_join: : Cannot allocate memory
pthread_join: : Cannot allocate memory
ERROR: at line 1473 of file ../src/mers_mod_square.c
Assertion failed: threadpool_init failed!
Full log: https://pastebin.com/raw/XvNsn2wD
There was plenty of free RAM at that time.

Single-threaded is running fine so far.
And a minor issue: The URL printed at startup isn't working.
heliosh is offline   Reply With Quote
Old 2017-12-31, 12:05   #72
ewmayer
2ω=0
 
ewmayer's Avatar
 
Sep 2002
República de California

22·2,939 Posts
Default

Quote:
Originally Posted by heliosh View Post
Another day, another problem

mlucas -s m -cpu 0:3 resulting in an error with both, the self-compiled 17.1 as well as the raspbian precompiled 14.1-2.
It runs fine until:
Code:
NTHREADS = 4
M32156581: using FFT length 1664K = 1703936 8-byte floats.
 this gives an average   18.871941786545975 bits per digit
Using complex FFT radices        26         8        16        16        16
mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance.
pthread_create:: Cannot allocate memory
pthread_join: : Cannot allocate memory
pthread_join: : Cannot allocate memory
pthread_join: : Cannot allocate memory
ERROR: at line 1473 of file ../src/mers_mod_square.c
Assertion failed: threadpool_init failed!
Full log: https://pastebin.com/raw/XvNsn2wD
There was plenty of free RAM at that time.

Single-threaded is running fine so far.
And a minor issue: The URL printed at startup isn't working.
I haven't seen this particlular alloc error before, but have encountered not-dissimilar errors during self-tests on some systems, apparently due to the OS not being able to properly recover memory freed up by completed tests. Does re-running just the 1664K self-test (./Mlucas -fftlen 1664 -cpu 0:3) work? If so, on your system you may simply have to complete the self-tests in this one-length-at-a-time fashion, e.g. after 1664 finishes, paste the block below into your shell:

./Mlucas -fftlen 1792 -cpu 0:3
./Mlucas -fftlen 1920 -cpu 0:3
./Mlucas -fftlen 2048 -cpu 0:3
./Mlucas -fftlen 2304 -cpu 0:3
./Mlucas -fftlen 2560 -cpu 0:3
./Mlucas -fftlen 2816 -cpu 0:3
./Mlucas -fftlen 3072 -cpu 0:3
./Mlucas -fftlen 3328 -cpu 0:3
./Mlucas -fftlen 3584 -cpu 0:3
./Mlucas -fftlen 3840 -cpu 0:3
./Mlucas -fftlen 4096 -cpu 0:3
./Mlucas -fftlen 4608 -cpu 0:3
./Mlucas -fftlen 5120 -cpu 0:3
./Mlucas -fftlen 5632 -cpu 0:3
./Mlucas -fftlen 6144 -cpu 0:3
./Mlucas -fftlen 6656 -cpu 0:3
./Mlucas -fftlen 7168 -cpu 0:3
./Mlucas -fftlen 7680 -cpu 0:3

Each should add 1 line to your mlucas.cfg file.
ewmayer is offline   Reply With Quote
Old 2017-12-31, 12:23   #73
heliosh
 
Oct 2017
++41

53 Posts
Default

Yep. Manually running the individual tests seems to work. Thanks again.
heliosh is offline   Reply With Quote
Old 2017-12-31, 22:12   #74
ewmayer
2ω=0
 
ewmayer's Avatar
 
Sep 2002
República de California

22×2,939 Posts
Default

Quote:
Originally Posted by heliosh View Post
Yep. Manually running the individual tests seems to work. Thanks again.
Can you post your final cfg file and basic system details here? Note in single-fft-length-selftest mode the code appends a bunch of stuff to right of the usual cfg-file line as shown below, suggest stripping off the non-bolded fields of each line created via single-length-test:
Code:
      4608  msec/iter =    3.07  ROE[avg,max] = [0.238615989, 0.312500000]  radices = 144 16 32 32  0  0  0  0  0  0	10000-iteration Res mod 2^64, 2^35-1, 2^36-1 = 13BB5C9DDF0CD3D6, 15982066709, 51703797107
ewmayer is offline   Reply With Quote
Old 2018-01-01, 00:35   #75
heliosh
 
Oct 2017
++41

53 Posts
Default

Voilà:
https://pastebin.com/raw/tMesEMUi

It's a Raspberry Pi 3 running raspbian 32 with a steady background load of about 80% on one core that I don't want to stop right now.
mlucas was compiled with -O3 -mcpu=cortex-a53 -mfpu=neon-fp-armv8
heliosh is offline   Reply With Quote
Old 2018-01-01, 02:05   #76
ewmayer
2ω=0
 
ewmayer's Avatar
 
Sep 2002
República de California

22·2,939 Posts
Default

Quote:
Originally Posted by heliosh View Post
Voilà:
https://pastebin.com/raw/tMesEMUi

It's a Raspberry Pi 3 running raspbian 32 with a steady background load of about 80% on one core that I don't want to stop right now.
mlucas was compiled with -O3 -mcpu=cortex-a53 -mfpu=neon-fp-armv8
Thanks - your 1664K timing looks highly anomalous, could you redo that one and see if you get a significantly different result on attempt 2?

[Details: 1664K uses a front-end-radix (52 or 208) based on a prime-radix-13 DFT macro, but the opcount for that is actually decently low - and the next-larger radix-13-based FFT length, 3328K, only shows a modestly worse runtime than the 2 lengths bracketing it. Also, 3328K should give a runtime >= 2x that of 1664K, but the ratio is quite a bit less than 2, again pointing to a re-do of the 1664K timing.]

Edit: Just as a point of comparison, my A53 (odroid c2) is DCing right now, 120 ms/iter @2560K using all 4 cores. So if at all possible, I urge you to switch to a 64-bit version of Raspbian and a SIMD build at your earliest convenience.

Last fiddled with by ewmayer on 2018-01-01 at 04:02
ewmayer is offline   Reply With Quote
Old 2018-01-01, 08:58   #77
heliosh
 
Oct 2017
++41

53 Posts
Default

Yep, this one seems more reasonable.
Code:
1664  msec/iter =  541.97  ROE[avg,max] = [0.237590681, 0.312500000]  radices = 208 16 16 16  0  0  0  0  0  0
heliosh is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Mlucas v18 available ewmayer Mlucas 48 2019-11-28 02:53
Mlucas on ubuntu Damian Mlucas 17 2017-11-13 18:12
Mlucas version 17 ewmayer Mlucas 3 2017-06-17 11:18
MLucas on IBM Mainframe Lorenzo Mlucas 52 2016-03-13 08:45
mlucas on sun delta_t Mlucas 14 2007-10-04 05:45

All times are UTC. The time now is 04:26.


Fri Jul 7 04:26:33 UTC 2023 up 323 days, 1:55, 0 users, load averages: 2.58, 1.96, 1.68

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔