mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Software

Reply
 
Thread Tools
Old 2022-08-18, 21:41   #683
pepi37
 
pepi37's Avatar
 
Dec 2011
After milion nines:)

157510 Posts
Default

I5 -9600K 24 MB RAM win 10


With W 6 option


Code:
e:\MTSIEVE\MTSIEVE-2-3-3>srsieve2 -W 6 -P 1e12 -n 3e6 -N 10e6   -o ferm81_3M_10M.txt -s "81*2^n+1"
srsieve2 v1.6.3, a program to find factors of k*b^n+c numbers for fixed b and variable k and n
(b2) Removed 1750000 algebraic factors for 81*2^n+1 of the form (3^2)*2^(n/2)-3*2^((n+2)/4))+1 when n%4=2
Sieving with generic logic for p >= 3
Sieve started: 3 < p < 1e12 with 5250001 terms (3000000 < n < 10000000, k*2^n+1) (expecting 5041260 factors)
Sieving with single sequence c=1 logic for p >= 257
BASE_MULTIPLE = 30, POWER_RESIDUE_LCM = 720, LIMIT_BASE = 720
Split 1 base 2 sequence into 384 base 2^720 sequences.
Legendre summary:  Approximately 2 B needed for Legendre tables
         1 total sequences
         1 are eligible for Legendre tables
         0 are not eligible for Legendre tables
         1 have Legendre tables in memory
         0 cannot have Legendre tables in memory
         0 have Legendre tables loaded from files
         1 required building of the Legendre tables
518400 bytes used for congruent q and ladder indices
295200 bytes used for congruent qs and ladders
Increasing worksize to 256000 since each chunk is tested in less than a second
  p=1589282789, 1.319M p/sec, 4524325 factors found at 4.361K f/sec (last 1 min), 0.2% done. ETC 2022-08-19 10:12       
CTRL-C accepted.  Threads will stop after sieving to 2020583611
Sieve interrupted at p=2020583611.
CPU time: 416.06 sec. (0.47 sieving) (5.59 cores)
717815 terms written to ferm81_3M_10M.txt
Primes tested: 99184016.  Factors found: 4532186.  Remaining terms: 717815.  Time: 74.44 seconds.
Linux Ryzen 3900x


w 6


Code:
srsieve2 v1.6.3, a program to find factors of k*b^n+c numbers for fixed b and variable k and n
(b2) Removed 1750000 algebraic factors for 81*2^n+1 of the form (3^2)*2^(n/2)-3*2^((n+2)/4))+1 when n%4=2
Sieving with generic logic for p >= 3
Sieve started: 3 < p < 1e12 with 5250001 terms (3000000 < n < 10000000, k*2^n+1) (expecting 5041260 factors)
Sieving with single sequence c=1 logic for p >= 257
BASE_MULTIPLE = 30, POWER_RESIDUE_LCM = 720, LIMIT_BASE = 720
Split 1 base 2 sequence into 384 base 2^720 sequences.
Legendre summary:  Approximately 2 B needed for Legendre tables
         1 total sequences
         1 are eligible for Legendre tables
         0 are not eligible for Legendre tables
         1 have Legendre tables in memory
         0 cannot have Legendre tables in memory
         0 have Legendre tables loaded from files
         1 required building of the Legendre tables
518400 bytes used for congruent q and ladder indices
295200 bytes used for congruent qs and ladders
Decreasing worksize to 8000 since each chunk needs more than 5 seconds to test
Increasing worksize to 128000 since each chunk is tested in less than a second
corrupted double-linked list (not small)
 ./a: line 1:  9550 Aborted                 ./srsieve2 -W 6 -P 1e12 -n 3e6 -N 10e6 -o ferm81_3M_10M.txt -s "81*2^n+1"
W 4 works ok!

Last fiddled with by pepi37 on 2022-08-18 at 22:01
pepi37 is offline   Reply With Quote
Old 2022-08-19, 12:47   #684
rogue
 
rogue's Avatar
 
"Mark"
Apr 2003
Between here and the

53×127 Posts
Default

There are no double linked lists in the code so that is a mystery to me
rogue is offline   Reply With Quote
Old 2022-08-19, 12:54   #685
kruoli
 
kruoli's Avatar
 
"Oliver"
Sep 2017
Porta Westfalica, DE

22·3·5·19 Posts
Default

That's an error message from GLIBC itself, it uses double-linked lists internally.
kruoli is offline   Reply With Quote
Old 2022-08-20, 01:04   #686
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

2·33·197 Posts
Default

Quote:
Originally Posted by kruoli View Post
That's an error message from GLIBC itself, it uses double-linked lists internally.
Why? A sparse matrix?

Sincere question.

Last fiddled with by chalsall on 2022-08-20 at 01:07 Reason: s/Sparce/A sparse/; # It never fails...
chalsall is offline   Reply With Quote
Old 2022-08-20, 17:48   #687
kruoli
 
kruoli's Avatar
 
"Oliver"
Sep 2017
Porta Westfalica, DE

22·3·5·19 Posts
Default

Why they use it, I cannot say for sure (mostly because I do not enough about the interna). But I can link you to the GLIBC code where the error message comes from.
kruoli is offline   Reply With Quote
Old 2022-08-22, 16:22   #688
rogue
 
rogue's Avatar
 
"Mark"
Apr 2003
Between here and the

151138 Posts
Default

I do not know when I will get around to resolving the issue with srsieve2. I have a more pressing issue with srsieve2cl that is perplexing me. It was introduced when I started making the changes for Metal support. For some reason CL_KERNEL_PRIVATE_MEM_SIZE has increased dramatically even though the kernel itself has barely changed. Due to this increase the generic kernel won't run at all when one has a thousands of sequences. Once that is resolved, I will look into the other issue with srsieve2.
rogue is offline   Reply With Quote
Old 2022-08-23, 20:10   #689
rogue
 
rogue's Avatar
 
"Mark"
Apr 2003
Between here and the

151138 Posts
Default

srsieve2 and srsieve2cl code has been updated. I have not posted new builds yet.

The segfault with srsieve2 should be fixed. The issues with srsieve2cl should also be fixed. My test with srsieve2cl is sieving over 170,000 sequences at a time on the GPU. It is not super fast, but much faster than srsieve2.
rogue is offline   Reply With Quote
Old 2022-08-23, 20:58   #690
pepi37
 
pepi37's Avatar
 
Dec 2011
After milion nines:)

30478 Posts
Default

Quote:
Originally Posted by rogue View Post
srsieve2 and srsieve2cl code has been updated. I have not posted new builds yet.

The segfault with srsieve2 should be fixed. The issues with srsieve2cl should also be fixed. My test with srsieve2cl is sieving over 170,000 sequences at a time on the GPU. It is not super fast, but much faster than srsieve2.

If you can please post command line as reference for further tuning on GPU
Thanks
pepi37 is offline   Reply With Quote
Old 2022-08-23, 21:34   #691
rogue
 
rogue's Avatar
 
"Mark"
Apr 2003
Between here and the

11010010010112 Posts
Default

Quote:
Originally Posted by pepi37 View Post
If you can please post command line as reference for further tuning on GPU
Thanks
On a GPU you will want to tune mainly with -K, -g, and -M. If you have too many sequences to sieve, use -K to break them up. So if you have 6000 sequences and use -K2, it will make two GPU calls with 3000 sequences each. If so many factors are found that it fills the buffer, use -M to adjust the buffer size. You will want to play around with -g to determine if changing it from the default value improves performance. Use srsieve2cl -h to get the defaults for -g and -M.

If starting to sieve a set of sequences, I suggest sieving to 1e6 (which will be all CPU), then starting from the .abcd file so that you don't lose the progress of sieving to 1e6. I had to use -M1000 for what was testing. You might not need to change it at all.

-H will provide details around memory utilization. If srsieve2cl won't run in the GPU, you will likely need to adjust -K higher or -g smaller. I suggest adjusting -K first. On various GPUs CL_KERNEL_PRIVATE_MEM_SIZE seems to top out at 500000. Above that gives an error.
rogue is offline   Reply With Quote
Old 2022-08-23, 22:20   #692
pepi37
 
pepi37's Avatar
 
Dec 2011
After milion nines:)

62716 Posts
Default

Quote:
Originally Posted by rogue View Post
On a GPU you will want to tune mainly with -K, -g, and -M. If you have too many sequences to sieve, use -K to break them up. So if you have 6000 sequences and use -K2, it will make two GPU calls with 3000 sequences each. If so many factors are found that it fills the buffer, use -M to adjust the buffer size. You will want to play around with -g to determine if changing it from the default value improves performance. Use srsieve2cl -h to get the defaults for -g and -M.

If starting to sieve a set of sequences, I suggest sieving to 1e6 (which will be all CPU), then starting from the .abcd file so that you don't lose the progress of sieving to 1e6. I had to use -M1000 for what was testing. You might not need to change it at all.

-H will provide details around memory utilization. If srsieve2cl won't run in the GPU, you will likely need to adjust -K higher or -g smaller. I suggest adjusting -K first. On various GPUs CL_KERNEL_PRIVATE_MEM_SIZE seems to top out at 500000. Above that gives an error.

What is your command line? Or it is top secret one? :)
pepi37 is offline   Reply With Quote
Old 2022-08-24, 01:07   #693
rogue
 
rogue's Avatar
 
"Mark"
Apr 2003
Between here and the

673110 Posts
Default

Quote:
Originally Posted by pepi37 View Post
What is your command line? Or it is top secret one? :)
For this it was: srsieve2cl -ir63.abcd -K6 -M1000

This was a file presieved to 1e6.
rogue is offline   Reply With Quote
Reply

Thread Tools


All times are UTC. The time now is 08:17.


Sun Sep 25 08:17:53 UTC 2022 up 38 days, 5:46, 0 users, load averages: 1.13, 1.24, 1.21

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2022, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔