mersenneforum.org  

Go Back   mersenneforum.org > Prime Search Projects > Sierpinski/Riesel Base 5

Reply
 
Thread Tools
Old 2006-12-01, 17:18   #34
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

F616 Posts
Talking

With the bugfix, plus changing CONST_EMPTY_SLOT to 0 and HASH_MAX_DENSITY to 0.32, compiling with gcc4, my 2.5 GHz 970MPs are up from ~292000 p/sec to ~308000 p/sec.
BlisteringSheep is offline   Reply With Quote
Old 2006-12-01, 17:42   #35
Greenbank
 
Greenbank's Avatar
 
Jul 2005

2×193 Posts
Default

Quote:
Originally Posted by rogue View Post
I have isolated and resolved the issue. Apparently r2 must be saved and restored in this environment. Under OS X, r2 can be used without such a precaution. I've send the updated code to geoff.
Rogue, In which function? Will it affect the expmod and mulmod code you wrote for the G5 PPC proth_sieve? I need to make sure the source is ok so I can give Sheep a linux build of proth_sieve for G5 PPC.

I've been away with work and won't have access to the proth_sieve source until Monday. Plus I'm doing a 220km cycle ride on Sunday so I'll probably be a daze all day Monday. Sorry for the long delay on this Sheep.

Last fiddled with by Greenbank on 2006-12-01 at 17:43
Greenbank is offline   Reply With Quote
Old 2006-12-01, 17:47   #36
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

2·3·41 Posts
Default

Quote:
Originally Posted by Greenbank View Post
Rogue, In which function? Will it affect the expmod and mulmod code you wrote for the G5 PPC proth_sieve? I need to make sure the source is ok so I can give Sheep a linux build of proth_sieve for G5 PPC.

I've been away with work and won't have access to the proth_sieve source until Monday. Plus I'm doing a 220km cycle ride on Sunday so I'll probably be a daze all day Monday. Sorry for the long delay on this Sheep.
This was in expmod.

Enyoy your ride

BlisteringSheep is offline   Reply With Quote
Old 2006-12-01, 19:19   #37
rogue
 
rogue's Avatar
 
"Mark"
Apr 2003
Between here and the

634710 Posts
Default

Quote:
Originally Posted by Greenbank View Post
Rogue, In which function? Will it affect the expmod and mulmod code you wrote for the G5 PPC proth_sieve? I need to make sure the source is ok so I can give Sheep a linux build of proth_sieve for G5 PPC.

I've been away with work and won't have access to the proth_sieve source until Monday. Plus I'm doing a 220km cycle ride on Sunday so I'll probably be a daze all day Monday. Sorry for the long delay on this Sheep.
It only affects expmod and will have a negligable affect on OS X builds. You can grab the source directly from srsieve as it was (originally) the same source I sent to you.
rogue is offline   Reply With Quote
Old 2006-12-02, 00:25   #38
geoff
 
geoff's Avatar
 
Mar 2003
New Zealand

13·89 Posts
Default

Quote:
Originally Posted by BlisteringSheep View Post
With the bugfix, plus changing CONST_EMPTY_SLOT to 0 and HASH_MAX_DENSITY to 0.32, compiling with gcc4, my 2.5 GHz 970MPs are up from ~292000 p/sec to ~308000 p/sec.
Thanks, I'll make those the default for ppc64 in future.
geoff is offline   Reply With Quote
Old 2006-12-04, 12:58   #39
Greenbank
 
Greenbank's Avatar
 
Jul 2005

1100000102 Posts
Default

Thanks, I'll grab the latest source and see if I can get it done today.

Not feeling as knackered as expected as I had to abandon at 80km (not bad by midday) for various reasons.
Greenbank is offline   Reply With Quote
Old 2006-12-05, 06:22   #40
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

2×3×41 Posts
Default

Geoff,
I got the new 1.4.7 tarball. Is it possible to create the cache file once and then link to it? I have a ton of machines all working out off of a NFS tree. I'm currently hard-linking all of the executables and SoB.dat, and would like to link the sr2cache.bin file as well (I prefer hard links to symlinks for things on the same file system).

I should be able to run some benchmarks & report back.

Thanks,
BlisteringSheep is offline   Reply With Quote
Old 2006-12-05, 16:22   #41
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

2·3·41 Posts
Default 1.4.7 results

All from a 2.2 GHz PPC970FX, with a sr2work.txt and inital checkpoint.txt of
Code:
sr2work.txt: 273020, 273021
checkpoint.txt: pmin=273020049854500,factors_count=0
  • Default compile with gcc3
    Code:
    p=273020986686673, 261113 p/sec, 2 factors, 98.67% done, ETA 05 Dec 10:45
    3639.176u 0.164s 1:00:40.15 99.9%
  • Compile with gcc4
    Code:
    p=273020988990671, 263521 p/sec, 2 factors, 98.90% done, ETA 05 Dec 10:44
    3600.868u 0.140s 1:00:01.75 99.9%
  • Compile with gcc4 plus changing HASH_MAX_DENSITY to 0.32
    Code:
    p=273020991038581, 265975 p/sec, 2 factors, 99.10% done, ETA 05 Dec 10:44
    3571.896u 0.155s 59:33.42 99.9%
  • Version 1.4.3 with gcc4, HASH_MAX_DENSITY 0.32 and CONST_EMPTY_SLOT 0
    Code:
    p=273020991166519, 266097 p/sec, 2 factors, 99.12% done, ETA 05 Dec 10:44
    3570.397u 0.160s 59:31.38 99.9%
BlisteringSheep is offline   Reply With Quote
Old 2006-12-05, 16:54   #42
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

111101102 Posts
Default USE_INLINE_MULMOD

Note that USE_INLINE_MULMOD makes it run really fast (over 461000 p/sec), but it finds no factors at all (duplicates or new).
BlisteringSheep is offline   Reply With Quote
Old 2006-12-05, 18:30   #43
axn
 
axn's Avatar
 
Jun 2003

13BD16 Posts
Default

Quote:
Originally Posted by BlisteringSheep View Post
  • Compile with gcc4 plus changing HASH_MAX_DENSITY to 0.32
    Code:
    p=273020991038581, 265975 p/sec, 2 factors, 99.10% done, ETA 05 Dec 10:44
    3571.896u 0.155s 59:33.42 99.9%
  • Version 1.4.3 with gcc4, HASH_MAX_DENSITY 0.32 and CONST_EMPTY_SLOT 0
    Code:
    p=273020991166519, 266097 p/sec, 2 factors, 99.12% done, ETA 05 Dec 10:44
    3570.397u 0.160s 59:31.38 99.9%
There are no speed improvements between 1.4.3 and 1.4.7 -- just some minor improvements in handling of srwork.txt and usage of a cache file for lookup tables. So you'll find that the speeds are virtually identical.
axn is online now   Reply With Quote
Old 2006-12-05, 19:27   #44
BlisteringSheep
 
BlisteringSheep's Avatar
 
Oct 2006
On a Suzuki Boulevard C90

F616 Posts
Default

Quote:
Originally Posted by axn1 View Post
There are no speed improvements between 1.4.3 and 1.4.7 -- just some minor improvements in handling of srwork.txt and usage of a cache file for lookup tables. So you'll find that the speeds are virtually identical.
I expected as much looking at the code. I just like to be thorough while also being helpful.

For PPC64, there was the improvement of making CONST_EMPTY_SLOT be 0. I think it also shows that it's worthwhile to have MAX_HASH_DENSITY reduced. I haven't experimented with different values; I've just been using Geoff's recommendation to halve it. I'll try some others.
BlisteringSheep is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
srsieve/sr2sieve enhancements rogue Software 300 2021-03-18 20:31
32-bit of sr1sieve and sr2sieve for Win pepi37 Software 5 2013-08-09 22:31
sr2sieve question SaneMur Information & Answers 2 2011-08-21 22:04
sr2sieve client mgpower0 Prime Sierpinski Project 54 2008-07-15 16:50
How to use sr2sieve nuggetprime Riesel Prime Search 40 2007-12-03 06:01

All times are UTC. The time now is 09:32.


Sat Jul 17 09:32:55 UTC 2021 up 50 days, 7:20, 1 user, load averages: 1.11, 1.32, 1.45

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.