mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2012-03-28, 21:47   #1156
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

27AE16 Posts
Default

Quote:
Originally Posted by kladner View Post
But at the moment, it ignores -f for some reason. I'll get back to it in a bit.
I just had to move the check files out of the folder. -f 474560 does run faster than the default 1572864.
Code:
474560
err = 0.09766 (0:50 real, 4.9958 ms/iter
1572864
err = 0.02148 , 5.3635 ms/iter
It seems the fft could go smaller, but I'll have to read the part of the thread that's been posted since I started experimenting and writing about it.
kladner is offline   Reply With Quote
Old 2012-03-28, 23:51   #1157
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

236568 Posts
Default

Quote:
Originally Posted by kladner View Post
-f 474560 does run faster than the default 1572864.
Code:
474560
err = 0.09766 (0:50 real, 4.9958 ms/iter
1572864
err = 0.02148 , 5.3635 ms/iter
It seems the fft could go smaller, but I'll have to read the part of the thread that's been posted since I started experimenting and writing about it.

Oops. That is 1474560. So far the smallest that doesn't terminate on a GTX 460 with a 26M exponent.
kladner is offline   Reply With Quote
Old 2012-03-29, 01:50   #1158
flashjh
 
flashjh's Avatar
 
"Jerry"
Nov 2011
Vancouver, WA

1,123 Posts
Default

Quote:
Originally Posted by kladner View Post
Oops. That is 1474560. So far the smallest that doesn't terminate on a GTX 460 with a 26M exponent.
That was the same for me on a 580
flashjh is offline   Reply With Quote
Old 2012-03-29, 02:26   #1159
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
"name field"
Jun 2011
Thailand

41×251 Posts
Default

Quote:
Originally Posted by kladner View Post
EDIT: This is weird. <snitp>
But CuLu continues to start with 1572864.
You have to delete the checkpoint file "cXXXXX" and "tXXXX". If there is a checkpoint file it will always resume from where it left, and the checkpoint files are not interchangeable, they have the size of the fft used. So, if a file with old fft-size size exists, it will use THAT size regardless of what -f you use. So, you can appreciate, if you job is done more then 10-15-20% or so, it would be faster to let it finish with old 1572864 (=32768*48), then use 1474560 (=32768*45) starting with the new exponent. Both sizes work well for 26-27M range, the shortest one is faster between 10% and 30% depending on your card. Use fftbench option as explained before to check exactly for your card.
LaurV is offline   Reply With Quote
Old 2012-03-29, 02:26   #1160
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

2×3×1,693 Posts
Default

Quote:
Originally Posted by flashjh View Post
That was the same for me on a 580
Thanks again for throwing that out there. It made a difference for me.
kladner is offline   Reply With Quote
Old 2012-03-30, 14:45   #1161
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2·5·61 Posts
Default

Code:
Processing result: M( 26768243 )C, 0x3280d4e28ef0b188, n = 1474560, CUDALucas v2.00
LL test successfully completes double-check of M26768243
msft is offline   Reply With Quote
Old 2012-03-31, 16:59   #1162
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

236568 Posts
Default

Successful LLDC with CuLu:

Code:
26158007No factors below2^69 
P-1B1=390000 
Verified LLB50D7F090E32331F by "David Triggerson" 
Verified LLB50D7F090E32331F by "ktony" on 2012-03-31 
Historyno factor for M26158007 from 2^67 to 2^68 [mfaktc 0.17-Win barrett79_mul32] by "lalera" on 2011-12-05 
Historyno factor for M26158007 from 2^68 to 2^69 [mfaktc 0.18-pre7 71bit_mul24] by "Luigi Morelli" on 2011-12-06 
Historyb50d7f090e3233__ by "ktony" on 2012-03-31
kladner is offline   Reply With Quote
Old 2012-04-02, 15:49   #1163
apsen
 
Jun 2011

131 Posts
Default

I've been assigned triple check and got mismatch with the first two checks for 28982959.

I've run the check twice with different FFT lengths (and -t both times) and got all residues match.

Could someone run it through P95?

Thanks,
Andriy
apsen is offline   Reply With Quote
Old 2012-04-02, 16:59   #1164
flashjh
 
flashjh's Avatar
 
"Jerry"
Nov 2011
Vancouver, WA

1,123 Posts
Default

Quote:
Originally Posted by apsen View Post
I've been assigned triple check and got mismatch with the first two checks for 28982959.

I've run the check twice with different FFT lengths (and -t both times) and got all residues match.

Could someone run it through P95?

Thanks,
Andriy
I'll run it. Will take a few days.
flashjh is offline   Reply With Quote
Old 2012-04-02, 20:57   #1165
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3·29·83 Posts
Default

Does anyone have any Linux (64bit) binaries?

If not, what SDK version do I need, and how exactly do I use the makefile? (Just 'make'? I've sometimes seen fancier things, like 'make; 'makeall && install' or some such, so I want to be sure.)

Last fiddled with by Dubslow on 2012-04-02 at 21:04 Reason: ()
Dubslow is offline   Reply With Quote
Old 2012-04-03, 01:49   #1166
msft
 
msft's Avatar
 
Jul 2009
Tokyo

10011000102 Posts
Default

Quote:
Originally Posted by Dubslow View Post
Does anyone have any Linux (64bit) binaries?

If not, what SDK version do I need, and how exactly do I use the makefile? (Just 'make'? I've sometimes seen fancier things, like 'make; 'makeall && install' or some such, so I want to be sure.)
You not need SDK.
Code:
$ cat Makefile
CUDALucas: CUDALucas.o
        g++ -O2 -fPIC -o CUDALucas CUDALucas.o -L/usr/local/cuda/lib64 -L/usr/local/cuda/lib64 -lcufft  -lm
CUDALucas.o: CUDALucas.cu
        /usr/local/cuda/bin/nvcc -O2 -arch=sm_13 -I/usr/local/include  CUDALucas.cu  -c
clean:
        -rm *.o CUDALucas
$ make
/usr/local/cuda/bin/nvcc -O2 -arch=sm_13 -I/usr/local/include  CUDALucas.cu  -c
g++ -O2 -fPIC -o CUDALucas CUDALucas.o -L/usr/local/cuda/lib64 -L/usr/local/cuda/lib64 -lcufft  -lm
$ ./CUDALucas -r
Iteration 10000 M( 86243 )C, 0x23992ccd735a03d9, n = 4608, CUDALucas v2.00 err = 0.01074 (0:20 real, 2.0263 ms/iter, ETA 2:21)
msft is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Don't DC/LL them with CudaLucas LaurV Data 131 2017-05-02 18:41
CUDALucas / cuFFT Performance on CUDA 7 / 7.5 / 8 Brain GPU Computing 13 2016-02-19 15:53
CUDALucas: which binary to use? Karl M Johnson GPU Computing 15 2015-10-13 04:44
settings for cudaLucas fairsky GPU Computing 11 2013-11-03 02:08
Trying to run CUDALucas on Windows 8 CP Rodrigo GPU Computing 12 2012-03-07 23:20

All times are UTC. The time now is 14:47.


Fri Jul 7 14:47:37 UTC 2023 up 323 days, 12:16, 0 users, load averages: 1.82, 1.48, 1.23

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔