mersenneforum.org  

Go Back   mersenneforum.org > Prime Search Projects > Sierpinski/Riesel Base 5

Reply
 
Thread Tools
Old 2007-06-28, 02:51   #155
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

F616 Posts
Default

That fixed it. I ran sr5check with all 3 values for EXPERIMENTAL, and they all matched the checked-factors file (the only "oddity" is that in every case they find one factor, "146876293 | 128432*5^814315+1" in a different order. In checked-factors.txt it is the 9995th, while in my factors.txt files it is the 9996th).

I haven't done any speed testing beside the sr5check runs, and the results are:
EXP 0: 96.097 cpu sec
EXP 1: 87.831 cpu sec
EXP 2: 87.896 cpu sec
BlisteringSheep is offline   Reply With Quote
Old 2007-06-28, 03:07   #156
geoff
 
geoff's Avatar
 
Mar 2003
New Zealand

13·89 Posts
Default

Quote:
Originally Posted by BlisteringSheep View Post
That fixed it. I ran sr5check with all 3 values for EXPERIMENTAL, and they all matched the checked-factors file (the only "oddity" is that in every case they find one factor, "146876293 | 128432*5^814315+1" in a different order. In checked-factors.txt it is the 9995th, while in my factors.txt files it is the 9996th).
You will find that 146876293 divides two different terms. That is normal, if a factor divides two terms then the order in which the terms are reported may vary from version to version, or may vary depending on the order the terms were listed in the input file.

It would also be a good idea to do some tests on factors larger than 2^32. (But not too much larger, you need plenty of factors for comparison). You could use the sr5check.txt as input but use a range starting at 5e9, and compare the results to the non-experimantal version of sr5sieve.

If you have GMP installed then building srtest will do some tests on numbers up to the 2^63 limit.
geoff is offline   Reply With Quote
Old 2007-06-28, 04:07   #157
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

2×3×41 Posts
Default

Quote:
Originally Posted by geoff View Post
If you have GMP installed then building srtest will do some tests on numbers up to the 2^63 limit.
I had forgotten that I had gmp installed. I'm running srtest now on it, and also doing some timing tests on another (faster) machine.

Do you have some recommendations for input values to srtest? It completed the default with 0 errors.

Also, the new timing code seems to be working well. It now has enough differentiation to unequivocably choose between the different methods (it uses the /4 variant of each). With older versions, it couldn't differentiate at all between the giant steps methods on faster machines (> 2 GHz).

Last fiddled with by BlisteringSheep on 2007-06-28 at 04:11 Reason: added timing praise
BlisteringSheep is offline   Reply With Quote
Old 2007-06-28, 04:30   #158
geoff
 
geoff's Avatar
 
Mar 2003
New Zealand

13×89 Posts
Default

Quote:
Originally Posted by BlisteringSheep View Post
Do you have some recommendations for input values to srtest? It completed the default with 0 errors.
If you do a few million iterations that should be enough. On average about half of the iterations will have a modulus between 2^62 and 2^63.

I have to sign off now, I look forward to the timing data tomorrow :-)
geoff is offline   Reply With Quote
Old 2007-06-28, 04:46   #159
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

2·3·41 Posts
Default

Quote:
Originally Posted by geoff View Post
I have to sign off now, I look forward to the timing data tomorrow :-)
Slacker.

I ran sr5check with -p 5000000000 -P 5200000000 on all 3 variants which found 437 factors. All factors matched. Execution times on a 2.0 GHz PowerMac are:
EXP 0: 161.500 cpu sec
EXP 1: 147.473 cpu sec
EXP 2: 147.434 cpu sec


Running with "real" data on a 2.2 GHz 970FX, using the RPS 5th drive Riesel.dat (14 small k's) with p values in the 20650000000000 range yields rates of:
sr2sieve-1.5.6 with vec mods: 792 kp/s
sr2sieve-1.5.10 with EXP 1: 862 kp/s
sr2sieve-1.5.10 with EXP 2: 862 kp/s

I'll take it!
BlisteringSheep is offline   Reply With Quote
Old 2007-06-28, 16:58   #160
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

F616 Posts
Default

More sr5check timings with 1.5.10 on a 2.5 GHz 970MP.

-p 100e6 -P 150e6
EXP 0: 76.546 cpu sec
EXP 1: 69.857 cpu sec
EXP 2: 69.813 cpu sec

-p 5000000000 -P 5100000000
EXP 0: 128.634
EXP 1: 117.334
EXP 2: 117.226
BlisteringSheep is offline   Reply With Quote
Old 2007-06-28, 20:03   #161
Cruelty
 
Cruelty's Avatar
 
May 2005

23×7×29 Posts
Default

Could you send me the test file your are working on (5-th drive)? I'd like to check how does PPC version compare to x86-64 on C2D
You will find my e-mail here.
Cruelty is offline   Reply With Quote
Old 2007-06-29, 03:15   #162
geoff
 
geoff's Avatar
 
Mar 2003
New Zealand

13·89 Posts
Default

Quote:
Originally Posted by BlisteringSheep View Post
Running with "real" data on a 2.2 GHz 970FX, using the RPS 5th drive Riesel.dat (14 small k's) with p values in the 20650000000000 range yields rates of:
sr2sieve-1.5.6 with vec mods: 792 kp/s
sr2sieve-1.5.10 with EXP 1: 862 kp/s
sr2sieve-1.5.10 with EXP 2: 862 kp/s
That is a good result. Since there is no noticable difference between the two new variants, I'll make the EXPERIMENTAL=1 code the default in the next version.

There may still be some gains to be had from refining the assembly, but it seems clear that the basic ideas are faster, although I don't know how much is due to pre-computing b*pMagic and how much from interleaving the mulmods.
geoff is offline   Reply With Quote
Old 2007-06-29, 03:43   #163
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

F616 Posts
Default

geoff, are there any other specific tests you'd like me to run? I'm happy to run anything that is useful for you.
BlisteringSheep is offline   Reply With Quote
Old 2007-06-29, 04:04   #164
geoff
 
geoff's Avatar
 
Mar 2003
New Zealand

13×89 Posts
Default

Quote:
Originally Posted by BlisteringSheep View Post
geoff, are there any other specific tests you'd like me to run? I'm happy to run anything that is useful for you.
It would be interesting to see the results from running with the -vv switch on a large input like riesel.dat or SoB.dat, to get an idea of the difference in speed between doing two or four mulmods in parallel (gen/2 and gen/4 methods).

Thanks for all your help, I'll let you know if I think of anything else to try out.
geoff is offline   Reply With Quote
Old 2007-06-29, 07:03   #165
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

2·3·41 Posts
Default

Quote:
Originally Posted by geoff View Post
It would be interesting to see the results from running with the -vv switch on a large input like riesel.dat or SoB.dat, to get an idea of the difference in speed between doing two or four mulmods in parallel (gen/2 and gen/4 methods).

Thanks for all your help, I'll let you know if I think of anything else to try out.
Attached is a full capture of running both gcc-4.1.1 and gcc-4.1.2 versions with all EXPERIMENTAL variants on both riesel.dat and SoB.dat. These were executed sequentially on one core of 2.5 GHz 970MP. They were run with "time" to also capture actual wall-clock execution times.
Attached Files
File Type: zip test_1.5.10.experimental.txt.zip (2.7 KB, 116 views)
BlisteringSheep is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
srsieve/sr2sieve enhancements rogue Software 300 2021-03-18 20:31
32-bit of sr1sieve and sr2sieve for Win pepi37 Software 5 2013-08-09 22:31
sr2sieve question SaneMur Information & Answers 2 2011-08-21 22:04
sr2sieve client mgpower0 Prime Sierpinski Project 54 2008-07-15 16:50
How to use sr2sieve nuggetprime Riesel Prime Search 40 2007-12-03 06:01

All times are UTC. The time now is 09:31.


Sat Jul 17 09:31:03 UTC 2021 up 50 days, 7:18, 1 user, load averages: 1.09, 1.31, 1.47

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.