mersenneforum.org  

Go Back   mersenneforum.org > Prime Search Projects > Sierpinski/Riesel Base 5

Reply
 
Thread Tools
Old 2006-12-01, 17:18   #34
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

2×3×41 Posts
Talking

With the bugfix, plus changing CONST_EMPTY_SLOT to 0 and HASH_MAX_DENSITY to 0.32, compiling with gcc4, my 2.5 GHz 970MPs are up from ~292000 p/sec to ~308000 p/sec.
BlisteringSheep is offline   Reply With Quote
Old 2006-12-01, 17:42   #35
Greenbank
 
Greenbank's Avatar
 
Jul 2005

18216 Posts
Default

Quote:
Originally Posted by rogue View Post
I have isolated and resolved the issue. Apparently r2 must be saved and restored in this environment. Under OS X, r2 can be used without such a precaution. I've send the updated code to geoff.
Rogue, In which function? Will it affect the expmod and mulmod code you wrote for the G5 PPC proth_sieve? I need to make sure the source is ok so I can give Sheep a linux build of proth_sieve for G5 PPC.

I've been away with work and won't have access to the proth_sieve source until Monday. Plus I'm doing a 220km cycle ride on Sunday so I'll probably be a daze all day Monday. Sorry for the long delay on this Sheep.

Last fiddled with by Greenbank on 2006-12-01 at 17:43
Greenbank is offline   Reply With Quote
Old 2006-12-01, 17:47   #36
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

2×3×41 Posts
Default

Quote:
Originally Posted by Greenbank View Post
Rogue, In which function? Will it affect the expmod and mulmod code you wrote for the G5 PPC proth_sieve? I need to make sure the source is ok so I can give Sheep a linux build of proth_sieve for G5 PPC.

I've been away with work and won't have access to the proth_sieve source until Monday. Plus I'm doing a 220km cycle ride on Sunday so I'll probably be a daze all day Monday. Sorry for the long delay on this Sheep.
This was in expmod.

Enyoy your ride

BlisteringSheep is offline   Reply With Quote
Old 2006-12-01, 19:19   #37
rogue
 
rogue's Avatar
 
"Mark"
Apr 2003
Between here and the

11·577 Posts
Default

Quote:
Originally Posted by Greenbank View Post
Rogue, In which function? Will it affect the expmod and mulmod code you wrote for the G5 PPC proth_sieve? I need to make sure the source is ok so I can give Sheep a linux build of proth_sieve for G5 PPC.

I've been away with work and won't have access to the proth_sieve source until Monday. Plus I'm doing a 220km cycle ride on Sunday so I'll probably be a daze all day Monday. Sorry for the long delay on this Sheep.
It only affects expmod and will have a negligable affect on OS X builds. You can grab the source directly from srsieve as it was (originally) the same source I sent to you.
rogue is offline   Reply With Quote
Old 2006-12-02, 00:25   #38
geoff
 
geoff's Avatar
 
Mar 2003
New Zealand

13·89 Posts
Default

Quote:
Originally Posted by BlisteringSheep View Post
With the bugfix, plus changing CONST_EMPTY_SLOT to 0 and HASH_MAX_DENSITY to 0.32, compiling with gcc4, my 2.5 GHz 970MPs are up from ~292000 p/sec to ~308000 p/sec.
Thanks, I'll make those the default for ppc64 in future.
geoff is offline   Reply With Quote
Old 2006-12-04, 12:58   #39
Greenbank
 
Greenbank's Avatar
 
Jul 2005

1100000102 Posts
Default

Thanks, I'll grab the latest source and see if I can get it done today.

Not feeling as knackered as expected as I had to abandon at 80km (not bad by midday) for various reasons.
Greenbank is offline   Reply With Quote
Old 2006-12-05, 06:22   #40
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

F616 Posts
Default

Geoff,
I got the new 1.4.7 tarball. Is it possible to create the cache file once and then link to it? I have a ton of machines all working out off of a NFS tree. I'm currently hard-linking all of the executables and SoB.dat, and would like to link the sr2cache.bin file as well (I prefer hard links to symlinks for things on the same file system).

I should be able to run some benchmarks & report back.

Thanks,
BlisteringSheep is offline   Reply With Quote
Old 2006-12-05, 16:22   #41
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

2×3×41 Posts
Default 1.4.7 results

All from a 2.2 GHz PPC970FX, with a sr2work.txt and inital checkpoint.txt of
Code:
sr2work.txt: 273020, 273021
checkpoint.txt: pmin=273020049854500,factors_count=0
  • Default compile with gcc3
    Code:
    p=273020986686673, 261113 p/sec, 2 factors, 98.67% done, ETA 05 Dec 10:45
    3639.176u 0.164s 1:00:40.15 99.9%
  • Compile with gcc4
    Code:
    p=273020988990671, 263521 p/sec, 2 factors, 98.90% done, ETA 05 Dec 10:44
    3600.868u 0.140s 1:00:01.75 99.9%
  • Compile with gcc4 plus changing HASH_MAX_DENSITY to 0.32
    Code:
    p=273020991038581, 265975 p/sec, 2 factors, 99.10% done, ETA 05 Dec 10:44
    3571.896u 0.155s 59:33.42 99.9%
  • Version 1.4.3 with gcc4, HASH_MAX_DENSITY 0.32 and CONST_EMPTY_SLOT 0
    Code:
    p=273020991166519, 266097 p/sec, 2 factors, 99.12% done, ETA 05 Dec 10:44
    3570.397u 0.160s 59:31.38 99.9%
BlisteringSheep is offline   Reply With Quote
Old 2006-12-05, 16:54   #42
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

111101102 Posts
Default USE_INLINE_MULMOD

Note that USE_INLINE_MULMOD makes it run really fast (over 461000 p/sec), but it finds no factors at all (duplicates or new).
BlisteringSheep is offline   Reply With Quote
Old 2006-12-05, 18:30   #43
axn
 
axn's Avatar
 
Jun 2003

10011101111012 Posts
Default

Quote:
Originally Posted by BlisteringSheep View Post
  • Compile with gcc4 plus changing HASH_MAX_DENSITY to 0.32
    Code:
    p=273020991038581, 265975 p/sec, 2 factors, 99.10% done, ETA 05 Dec 10:44
    3571.896u 0.155s 59:33.42 99.9%
  • Version 1.4.3 with gcc4, HASH_MAX_DENSITY 0.32 and CONST_EMPTY_SLOT 0
    Code:
    p=273020991166519, 266097 p/sec, 2 factors, 99.12% done, ETA 05 Dec 10:44
    3570.397u 0.160s 59:31.38 99.9%
There are no speed improvements between 1.4.3 and 1.4.7 -- just some minor improvements in handling of srwork.txt and usage of a cache file for lookup tables. So you'll find that the speeds are virtually identical.
axn is online now   Reply With Quote
Old 2006-12-05, 19:27   #44
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

2×3×41 Posts
Default

Quote:
Originally Posted by axn1 View Post
There are no speed improvements between 1.4.3 and 1.4.7 -- just some minor improvements in handling of srwork.txt and usage of a cache file for lookup tables. So you'll find that the speeds are virtually identical.
I expected as much looking at the code. I just like to be thorough while also being helpful.

For PPC64, there was the improvement of making CONST_EMPTY_SLOT be 0. I think it also shows that it's worthwhile to have MAX_HASH_DENSITY reduced. I haven't experimented with different values; I've just been using Geoff's recommendation to halve it. I'll try some others.
BlisteringSheep is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
srsieve/sr2sieve enhancements rogue Software 300 2021-03-18 20:31
32-bit of sr1sieve and sr2sieve for Win pepi37 Software 5 2013-08-09 22:31
sr2sieve question SaneMur Information & Answers 2 2011-08-21 22:04
sr2sieve client mgpower0 Prime Sierpinski Project 54 2008-07-15 16:50
How to use sr2sieve nuggetprime Riesel Prime Search 40 2007-12-03 06:01

All times are UTC. The time now is 09:32.


Sat Jul 17 09:32:47 UTC 2021 up 50 days, 7:20, 1 user, load averages: 1.14, 1.34, 1.46

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.