mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   Sierpinski/Riesel Base 5 (https://www.mersenneforum.org/forumdisplay.php?f=54)
-   -   A multiple k/c sieve for Sierpinski/Riesel problems (https://www.mersenneforum.org/showthread.php?t=5785)

Xentar 2006-12-28 21:44

[QUOTE=geoff;94899]Sorry, version 1.4.12 had a bug which could have caused 1 factor in 240 to be missed. If anyone is using it then please upgrade to version 1.4.13.[/QUOTE]

I used 1.4.12 for about 6 hours - do you think, it is necessary to redo this range?

jasong 2006-12-28 21:50

[QUOTE=geoff;94899]Sorry, version 1.4.12 had a bug which could have caused 1 factor in 240 to be missed. If anyone is using it then please upgrade to version 1.4.13.[/QUOTE]
111,100 k/sec. Awesome.

geoff 2006-12-29 21:40

[QUOTE=Xentar;94908]I used 1.4.12 for about 6 hours - do you think, it is necessary to redo this range?[/QUOTE]

No, but if you let me know the range concerned I can find any missed factors faster than fully resieving it -- only the factors p where p=1 (mod 120) have to be checked.

Version 1.4.14 fixes another bug that would have slowed the PPC build down a bit, but probably didn't affect the i386 builds.

Xentar 2006-12-29 23:59

[QUOTE=geoff;94967]No, but if you let me know the range concerned I can find any missed factors faster than fully resieving it -- only the factors p where p=1 (mod 120) have to be checked.[/QUOTE]

The ranges are 4200-4300 and 4300-4400. I think, I tried version 1.4.12 somewhere between 75 and 85%.. sorry, dont know exactly :(

Btw, could you please try to explain the math to me? I know, that for example 5 mod 3 = 2, but p = 1 mod 120 doesnt make any sense? Or I just dont understand it..

geoff 2006-12-30 02:28

[QUOTE=Xentar;94973]The ranges are 4200-4300 and 4300-4400. I think, I tried version 1.4.12 somewhere between 75 and 85%.. sorry, dont know exactly :([/QUOTE]

I'll test those 10% ranges tonight. It takes only 1/32 of the work of the full sieve to check, and I am interested to see how many if any were missed.

[QUOTE]Btw, could you please try to explain the math to me? I know, that for example 5 mod 3 = 2, but p = 1 mod 120 doesnt make any sense? Or I just dont understand it..[/QUOTE]

5 = 2 (mod 3) [or properly, [tex]5\equiv2\pmod3[/tex] means the same as 5 mod 3 = 2 in this case. The factors p of interest are all of the form 120x+1 for some x.

Xentar 2006-12-30 14:33

[QUOTE=geoff;94980]I'll test those 10% ranges tonight. It takes only 1/32 of the work of the full sieve to check, and I am interested to see how many if any were missed.[/QUOTE]
OK, could you then please send them as PM to me, so I can check, they are already in the factors-list?

[QUOTE]
5 = 2 (mod 3) [or properly, [tex]5\equiv2\pmod3[/tex] means the same as 5 mod 3 = 2 in this case. The factors p of interest are all of the form 120x+1 for some x.[/QUOTE]
Ok, I understand. So I was just confused by the kind of writing p = 1 (mod 120) :D
So, for any reason, the version 1.4.12 didnt check p = 120x+1 and if this p is a factor, it was missed, right?
Ok, thank you :)

geoff 2006-12-31 02:10

It appears that the 1.4.12 bug is in code that is never used, so there are no missed factors. Once I work out exactly why that piece of code is not needed then I can remove it, and probably some other pieces too, and make the program faster :-)

[QUOTE=Xentar;95014]OK, could you then please send them as PM to me, so I can check, they are already in the factors-list?[/quote]

Done, just in case.

Xentar 2006-12-31 13:55

[QUOTE=geoff;95045]It appears that the 1.4.12 bug is in code that is never used, so there are no missed factors. Once I work out exactly why that piece of code is not needed then I can remove it, and probably some other pieces too, and make the program faster :-)
[/QUOTE]

Just checked the factors, you send to me. All of them are already in my factors file, so no one was missed.

geoff 2007-01-05 22:34

sr5sieve 1.4.16
 
The main change is the addition of a SSE2 mulmod function for 32-bit machines. The FPU is still used for the division (so it still works mod primes up to 2^62), but uses SSE2 to perform the integer multiplications. It runs about 9% faster on my P4/Celeron. (That was the machine I tuned the code for, it may not do as well on AMD/Core2, all benchmarks welcome.)

tnerual 2007-01-07 12:14

[QUOTE=geoff;95392]The main change is the addition of a SSE2 mulmod function for 32-bit machines. The FPU is still used for the division (so it still works mod primes up to 2^62), but uses SSE2 to perform the integer multiplications. It runs about 9% faster on my P4/Celeron. (That was the machine I tuned the code for, it may not do as well on AMD/Core2, all benchmarks welcome.)[/QUOTE]

i don't know if it's normal but i have a good speed improve on my amd x2 4200

i use one core, sieving for one k (285728), speed was 2600000 and now is 4650000 :geek: it's a 78% improve ... and factor density is the same after 1 day running so it may be true.

if geoff can confirm it's not missing factors, it can give a good boost ...

Xentar 2007-01-07 15:01

[QUOTE=geoff;95392]The main change is the addition of a SSE2 mulmod function for 32-bit machines. The FPU is still used for the division (so it still works mod primes up to 2^62), but uses SSE2 to perform the integer multiplications. It runs about 9% faster on my P4/Celeron. (That was the machine I tuned the code for, it may not do as well on AMD/Core2, all benchmarks welcome.)[/QUOTE]

about 5 % improve on Core2. (just tested for a few minutes)


All times are UTC. The time now is 22:20.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.