mersenneforum.org  

Go Back   mersenneforum.org > Prime Search Projects > And now for something completely different

Reply
 
Thread Tools
Old 2017-01-27, 05:38   #78
paulunderwood
 
paulunderwood's Avatar
 
Sep 2002
Database er0rr

3,739 Posts
Default

Quote:
Originally Posted by Batalov View Post
Perhaps it's time to hack the multithreaded llr binary.
Go for it -- you have 4-5 days to achieve it.
paulunderwood is offline   Reply With Quote
Old 2017-01-27, 06:14   #79
axn
 
axn's Avatar
 
Jun 2003

505110 Posts
Default

Quote:
Originally Posted by paulunderwood View Post
Any better sleuths out there?
Yes. It will be @ #12 (see the TOP12th bit?), so #digits > 4053946
axn is online now   Reply With Quote
Old 2017-01-27, 06:19   #80
axn
 
axn's Avatar
 
Jun 2003

5,051 Posts
Default

Quote:
Originally Posted by Batalov View Post
P.S. Cyclo is not written for this form: 393216 is not a power of 2, but 3 times a power of 2
Have you checked with Yves whether it is theoretically possible to modify Cyclo to efficiently support these extensions?
axn is online now   Reply With Quote
Old 2017-01-27, 06:51   #81
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

36×13 Posts
Default

Yes, but the progress was slow. He wrote that in his mainstream research:
Quote:
For the time being, I’m working on optimizing my [GFN OCL] transform on GTX 1080.
As for extending for Phi(2^u*3^v,b), he wrote much earlier
Quote:
first I would like to build a basic (theoretical) program able to test any 2^u.3^v, i.e. generic radix 2 and 3 stages.

If a radix 3 stage is available, any 2^u.3^v can be tested.

But optimized versions can certainly not be generic and I may not write all of them.
Batalov is offline   Reply With Quote
Old 2017-01-28, 06:10   #82
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

947710 Posts
Default

Quote:
Originally Posted by paulunderwood View Post
Go for it -- you have 4-5 days to achieve it.
I went for it...
...and now the ETA is only 30 hrs.

The initial patch is sent to Jean. He can make consistent llr code changes all over llr and provide API hooks for the ThreadsPerTest, like :
extra options to command-line (e.g. llr -t 4) and to llr.ini (e.g. ThreadsPerTest=4 )
Batalov is offline   Reply With Quote
Old 2017-01-28, 17:25   #83
VBCurtis
 
VBCurtis's Avatar
 
"Curtis"
Feb 2005
Riverside, CA

4,861 Posts
Default

Your patch comes as my LLR efforts nearly all exceed 1M digits, and I wish for two-threaded LLR frequently. Thank you very very much, sir!
VBCurtis is offline   Reply With Quote
Old 2017-01-28, 17:57   #84
paulunderwood
 
paulunderwood's Avatar
 
Sep 2002
Database er0rr

3,739 Posts
Default

Do you think that a multi-threaded LLR will improve throughput, considering the impact on cache of tests on large numbers? Would muti-socket boxes need to use utilities such as taskset?
paulunderwood is offline   Reply With Quote
Old 2017-01-29, 00:35   #85
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

947710 Posts
Default

For some very large candidates, 2- or 4- threaded LLR may become handy even in the search process (like prime95), not just validation runs. Once this change is deployed, a bit of benchmarking and timing will be useful. Note that LLR has many modes in addition to Riesel and Proth; many of them may have different scaling behaviors.

A trivial disclaimer (so that no one expects miracles): The scaling for very large number of threads (e.g. 16 or 32) should not be expected to very impressive. The scaling factor still goes up but almost level off at about 5x, maybe 6x. However, for 2 or 4 threads, this is likely going to be a contender to a conventional run.
Batalov is offline   Reply With Quote
Old 2017-01-29, 04:02   #86
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

224058 Posts
Lightbulb

A small tweak about affinities:

I have rented one of my trusted instances (c4.8xlarge) because though it has less cores than m4.16xlarge - they are faster. And because scaling is almost flat between 18 and 32 cores.

One experiment that I now ran in real/time was: while observing effective speed of the test (ms / iter), I played two scenarios against each other:
1. assign affinities 0-17 (that's all physical cores used once, and leave h/t functionality idle)
2. assign affinities 0-8, 18-26 (that's physical and hyper cores on only one of the two 9-core chips; there are two 9-core Xeon(R) CPU E5-2666 v3 @ 2.90GHz in this instance)

Scenario 2 wins at about 12% premium. Which is specific for a 2-chip board like this and signals us that memory transport between chips is less effective than keeping the data in the cache of 1 CPU. More testing will be needed when the new LLR binary is released.
Batalov is offline   Reply With Quote
Old 2017-01-30, 12:04   #87
paulunderwood
 
paulunderwood's Avatar
 
Sep 2002
Database er0rr

3,739 Posts
Default

Quote:
Originally Posted by axn View Post
Yes. It will be @ #12 (see the TOP12th bit?), so #digits > 4053946
The cat is out of the bag: It is 4,055,114 digits. Phi(3, - 143332^393216)


Congrats to Ryan and Serge. And well done for the multi-threaded proof, Serge.


Last fiddled with by paulunderwood on 2017-01-30 at 12:05
paulunderwood is offline   Reply With Quote
Old 2017-01-31, 00:34   #88
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

36·13 Posts
Default

I ended up running two 18-threaded N-1 tests: one that started from a=3, then continued with a=5... and the other started from scratch with a=11 (in another folder and with affinities set to the 18 ht cores of the second 9-core chip). Both ran with 7.6 ms / iter speed.

The a=11 won the jackpot in one go, the a=5 run again didn't get the proof (one of the three a^((N-1)/f)-1 was not coprime to N) and continued into a=7 and I killed it and released the instance.
Batalov is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Prime 95 and internet connection issue Jwb52z Software 10 2013-01-30 01:09
Twin prime search? MooooMoo Twin Prime Search 115 2010-08-29 17:38
Prime Search at School Unregistered Information & Answers 5 2009-10-15 22:44
Prime Search on PS-3? Kosmaj Riesel Prime Search 6 2006-11-21 15:19
Running prime on PC without internet-connection Ferdy Software 3 2006-04-25 08:53

All times are UTC. The time now is 17:17.


Fri Jul 16 17:17:28 UTC 2021 up 49 days, 15:04, 1 user, load averages: 1.59, 1.78, 1.68

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.