mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2012-03-18, 01:32   #1002
msft
 
msft's Avatar
 
Jul 2009
Tokyo

10011000102 Posts
Default

Hi ,flashjh
Quote:
Originally Posted by flashjh View Post
I'm sure I'm missing something, but what is the method to choose the best FFT size? Where did you get these values?
Attached Files
File Type: bz2 cufftbench.tar.bz2 (1.0 KB, 131 views)
msft is offline   Reply With Quote
Old 2012-03-18, 02:25   #1003
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2×5×61 Posts
Default

Hi ,flashjh
Quote:
Originally Posted by flashjh View Post
aspen/msft,

1.2b was the last build that included a win32 makefile. I modified my current makefile for win32, but it does not compile. Lots of errors during nvcc processing CUDALucas.cu. Has 32 bit compatability been removed or do I need some extra includes?
Please test with 32bit windows.
Attached Files
File Type: bz2 CUDALucas.test.tar.bz2 (10.6 KB, 117 views)
msft is offline   Reply With Quote
Old 2012-03-18, 03:36   #1004
flashjh
 
flashjh's Avatar
 
"Jerry"
Nov 2011
Vancouver, WA

1,123 Posts
Default

Quote:
Originally Posted by msft View Post
Hi ,flashjh

Please test with 32bit windows.
msft,

Compiled. I didn't see the version, so I labeled it 'test'.

Included MAKEFILE and compile output.

I have no way to actually test with WIN32. I'll see if I can throw something together... if anyone else can test, let us know.
Attached Files
File Type: zip CUDALucas.test.win32.zip (74.3 KB, 119 views)
flashjh is offline   Reply With Quote
Old 2012-03-18, 03:59   #1005
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2·5·61 Posts
Default

Quote:
Originally Posted by flashjh View Post
I have no way to actually test with WIN32. I'll see if I can throw something together... if anyone else can test, let us know.
Thank you for your notice.
msft is offline   Reply With Quote
Old 2012-03-18, 11:29   #1006
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
"name field"
Jun 2011
Thailand

41·251 Posts
Default

Match for 26077459.
LaurV is offline   Reply With Quote
Old 2012-03-18, 14:46   #1007
apsen
 
Jun 2011

131 Posts
Default

Quote:
Originally Posted by apsen View Post
The first one is also me - I just did not realize I was not logged in when I reserved it.
Never mind, since the second one matched I've just submitted the first one and let it triple check...

Last fiddled with by apsen on 2012-03-18 at 14:46
apsen is offline   Reply With Quote
Old 2012-03-18, 21:37   #1008
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

205716 Posts
Default

For my first foray into CUDA, I've tweaked CudaLucas 1.66.

I built this in a Visual Studio project so I don't know if I've got all the right nvcc switches set. That said, I did get about a 7% improvement.


The changes were:

1) Rewrote normalize kernel to do most of its work with integers
2) Two inline macros for rounding to integer.
3) Changed error from double to float
4) Minor change to rdft to save two negations.
5) Less memory used during normalize (no g_inv and g_ttmpp arrays).
6) The -2 was moved to normalize2

It isn't fully cleaned up -- normalize2 should be upgraded.

Can some else build a version and do some comparison timings?
Attached Files
File Type: txt CUDALucas.cu.txt (34.6 KB, 242 views)
Prime95 is offline   Reply With Quote
Old 2012-03-18, 22:35   #1009
msft
 
msft's Avatar
 
Jul 2009
Tokyo

61010 Posts
Default

Quote:
Originally Posted by Prime95 View Post
For my first foray into CUDA, I've tweaked CudaLucas 1.66.

I built this in a Visual Studio project so I don't know if I've got all the right nvcc switches set. That said, I did get about a 7% improvement.


The changes were:

1) Rewrote normalize kernel to do most of its work with integers
2) Two inline macros for rounding to integer.
3) Changed error from double to float
4) Minor change to rdft to save two negations.
5) Less memory used during normalize (no g_inv and g_ttmpp arrays).
6) The -2 was moved to normalize2

It isn't fully cleaned up -- normalize2 should be upgraded.

Can some else build a version and do some comparison timings?
Great !!!

Ver1.67
1) Marge Prime95's code.
2) 32bit Windows support.
Attached Files
File Type: bz2 CUDALucas.1.67.tar.bz2 (11.0 KB, 143 views)
msft is offline   Reply With Quote
Old 2012-03-18, 23:19   #1010
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2×5×61 Posts
Default

Hi ,flashjh
Can you make cuda3.2 version?
Cuda3.2 CUFFT 5% faster than Cuda4.x.
msft is offline   Reply With Quote
Old 2012-03-18, 23:30   #1011
flashjh
 
flashjh's Avatar
 
"Jerry"
Nov 2011
Vancouver, WA

1,123 Posts
Default

Quote:
Originally Posted by msft View Post
Hi ,flashjh
Can you make cuda3.2 version?
Cuda3.2 CUFFT 5% faster than Cuda4.x.
Quote:
Originally Posted by msft View Post
Great !!!

Ver1.67
1) Marge Prime95's code.
2) 32bit Windows support.

I'll post updates in a bit

EDIT: I'm getting an unresolved error LNK2001: unresolved external symbol getting timeofday. I'll have to look at it later...

Last fiddled with by flashjh on 2012-03-19 at 00:24
flashjh is offline   Reply With Quote
Old 2012-03-19, 02:39   #1012
msft
 
msft's Avatar
 
Jul 2009
Tokyo

11428 Posts
Default

Quote:
Originally Posted by flashjh View Post
EDIT: I'm getting an unresolved error LNK2001: unresolved external symbol getting timeofday. I'll have to look at it later...
Code:
#ifdef _MSC_VER
#include <winsock2.h>
extern "C" int gettimeofday(struct timeval *tv, struct timezone *tz);
#else
#include <sys/time.h>
#include <unistd.h>
#endif
to
Code:
#ifdef _MSC_VER
typedef struct timeval
{
  long tv_sec;
  long tv_usec;
} timeval;
int gettimeofday (struct timeval *tv, struct timezone *);
#else
#include <sys/time.h>
#include <unistd.h>
#endif
I guess fix.
Thanks.
msft is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Don't DC/LL them with CudaLucas LaurV Data 131 2017-05-02 18:41
CUDALucas / cuFFT Performance on CUDA 7 / 7.5 / 8 Brain GPU Computing 13 2016-02-19 15:53
CUDALucas: which binary to use? Karl M Johnson GPU Computing 15 2015-10-13 04:44
settings for cudaLucas fairsky GPU Computing 11 2013-11-03 02:08
Trying to run CUDALucas on Windows 8 CP Rodrigo GPU Computing 12 2012-03-07 23:20

All times are UTC. The time now is 14:47.


Fri Jul 7 14:47:32 UTC 2023 up 323 days, 12:16, 0 users, load averages: 1.68, 1.45, 1.21

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔