mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Software

Reply
 
Thread Tools
Old 2011-11-29, 14:31   #12
IBethune
 
Nov 2010

318 Posts
Default

Quote:
Originally Posted by nuggetprime View Post
Bug: segfaults when testing 1525*2^700391-1, before starting LL loop.
Seems to fail for every large-k/large-n combination.

Nugget
Most likely this is caused by compiling LLR for 64 bit system - try adding -m32 to your compiler flags and this should fix it.

Cheers

- Iain
IBethune is offline   Reply With Quote
Old 2011-12-08, 11:49   #13
IBethune
 
Nov 2010

52 Posts
Default

Quote:
Originally Posted by stevenj View Post
A couple of suggestions regarding performance:
...
Regards,
Steven G. Johnson
I've been playing with the code along the lines suggested above, and have made the following performance improvements (all for N=12.5K FFT length)
  • Modified code to allow compiling for 64-bit targets (~60% speedup)
  • Use FFTW r2c instead of r2r interface (~80% speedup)
  • Use FFTW_PATIENT instead of FFTW_MEASURE (~2% speedup)

Overall, for test of 5719*2^100002+1 (LLR N=8K, LLRP N=12.5K)
LLRP (original, 32 bit) = 109.958s
LLRP (new, 64 bit) = 37.891s (2.9x speedup)
LLR (native, 32 bit) = 8.645s

For a larger test 40311639203955*2^666666-1 (LLR N=72K, LLRP N=88K)
LLRP (old) = 48.749 ms/bit
LLRP (new) = 11.04 ms/bit (4.4x speedup)
LLR = 0.95 ms/bit

Clearly native LLR is a lot faster still (and is relatively even faster for the larger case. This may be because it requires a generic reduction, I'm not sure...), but though the above might be of use to anyone else who is using LLRP.

I attach a modified gwpnum.c with these changes.

Cheers

- Iain
Attached Files
File Type: zip gwpnum.c.zip (12.4 KB, 84 views)
IBethune is offline   Reply With Quote
Old 2011-12-15, 12:52   #14
Jean Penné
 
Jean Penné's Avatar
 
May 2004
FRANCE

58010 Posts
Default Nice work!

Many congrats, Iain and Steven, for this nice work!
I will include this new gwpnum.c in the next release of llrp.
Indeed, r2c usage is better than r2r one in the real input case.
Thank you again,and Best regards,
Jean
Jean Penné is offline   Reply With Quote
Old 2011-12-17, 12:45   #15
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2×5×61 Posts
Default

Hi ,IBethune
Quote:
Originally Posted by IBethune View Post
  • Use FFTW r2c instead of r2r interface (~80% speedup)
CUFFT support r2c function.
Thank you,
msft is offline   Reply With Quote
Old 2011-12-17, 17:13   #16
Jean Penné
 
Jean Penné's Avatar
 
May 2004
FRANCE

22×5×29 Posts
Default llrp development version 3.8.1 is available.

Hi,

I uploaded now the 32 bit Windows and Linux binaries, and also the complete source of the llrp 3.8.1 version on my Development directory :

http://jpenne.free.fr/Development/

No new feature in this version ; I only updated it according to Steven and Iain's suggestions.

Regards,
Jean
Jean Penné is offline   Reply With Quote
Old 2012-04-08, 13:43   #17
danholle
 
Apr 2012
Weybridge, Surrey (UK)

1 Posts
Default

Hmmm...

Just testing my installation.

./sllrp -q"1*2^19937-1"

returns composite. 11213 and 21701 are correctly found to be prime.

Tried this on i5 based Ubuntu machine using llrp 3.8.0 and 3.8.1.

--- Dan
danholle is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Question about FFTW vs. gwnum? hangxanh Software 2 2014-10-24 15:48
FFTW vs. gwnum? ixfd64 Software 7 2011-09-20 16:05
llrpi : a full portable version of LLR Jean Penné Software 8 2011-01-29 21:44
idea: a portable version of Prime95 ixfd64 Software 3 2010-10-07 14:18
Prime95 on Playstation Portable jasong Software 4 2007-03-12 07:15

All times are UTC. The time now is 09:12.


Sat Jul 17 09:12:05 UTC 2021 up 50 days, 6:59, 1 user, load averages: 2.18, 1.83, 1.66

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.