|
|
#1002 |
|
Jul 2009
Tokyo
1001100010₂ Posts |
Hi, flashjh
|
|
|
|
|
|
#1003 |
|
Jul 2009
Tokyo
2×5×61 Posts |
Hi, flashjh
Please test with 32-bit Windows. |
|
|
|
|
|
#1004 |
|
"Jerry"
Nov 2011
Vancouver, WA
1,123 Posts |
msft,
Compiled. I didn't see the version, so I labeled it 'test'. Included MAKEFILE and compile output. I have no way to actually test with WIN32. I'll see if I can throw something together... if anyone else can test, let us know. |
|
|
|
|
|
#1005 |
|
Jul 2009
Tokyo
2·5·61 Posts |
|
|
|
|
|
|
#1006 |
|
Romulan Interpreter
"name field"
Jun 2011
Thailand
41·251 Posts |
Match for 26077459.
|
|
|
|
|
|
#1007 |
|
Jun 2011
131 Posts |
Never mind, since the second one matched I've just submitted the first one and let it triple check...
Last fiddled with by apsen on 2012-03-18 at 14:46 |
|
|
|
|
|
#1008 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
2057₁₆ Posts |
For my first foray into CUDA, I've tweaked CudaLucas 1.66.
I built this in a Visual Studio project, so I don't know if I've got all the right nvcc switches set. That said, I did get about a 7% improvement. The changes were:
1) Rewrote the normalize kernel to do most of its work with integers.
2) Two inline macros for rounding to integer.
3) Changed error from double to float.
4) Minor change to rdft to save two negations.
5) Less memory used during normalize (no g_inv and g_ttmpp arrays).
6) The -2 was moved to normalize2.
It isn't fully cleaned up; normalize2 should be upgraded. Can someone else build a version and do some comparison timings? |
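To make changes 2) and 3) concrete, here is a minimal host-side C sketch of rounding to integer while tracking the round-off error in single precision. It is not the code from the tweaked CudaLucas build: the names `ROUND_MAGIC`, `round_to_integer`, and `round_and_track_error` are hypothetical, and the add-and-subtract constant trick is just one common way to round, assumed here for illustration. It also assumes the default round-to-nearest floating-point mode and inputs with magnitude below 2^51.

```c
#include <assert.h>
#include <stddef.h>

/* 1.5 * 2^52. Adding and then subtracting this constant rounds a double
   to the nearest integer (ties to even) when |x| < 2^51 and the FPU is
   in the default round-to-nearest mode. Hypothetical name. */
#define ROUND_MAGIC 6755399441055744.0

/* Round x to the nearest integer without calling into libm. */
static double round_to_integer(double x)
{
    /* volatile forces the sum to be stored as a 64-bit double, so the
       compiler cannot fold the add/subtract pair away. */
    volatile double shifted = x + ROUND_MAGIC;
    return shifted - ROUND_MAGIC;
}

/* Round every element in place and return the largest distance from an
   integer that was seen. The error is kept as a float, since it only
   needs a few significant digits. Hypothetical helper. */
static float round_and_track_error(double *data, size_t n)
{
    float max_err = 0.0f;
    size_t i;

    assert(data != NULL || n == 0);
    for (i = 0; i < n; i++) {
        double r = round_to_integer(data[i]);
        double d = data[i] - r;
        float err = (float)(d < 0.0 ? -d : d);  /* absolute value */
        if (err > max_err)
            max_err = err;
        data[i] = r;
    }
    return max_err;
}
```

In a Lucas-Lehmer style multiplication the values coming out of the inverse FFT should sit close to integers, so a maximum error approaching 0.5 is the usual sign that the FFT length is too small for the exponent.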
|
|
|
|
|
#1009 | |
|
Jul 2009
Tokyo
610₁₀ Posts |
Quote:
Ver1.67
1) Merge Prime95's code.
2) 32-bit Windows support. |
|
|
|
|
|
|
#1010 |
|
Jul 2009
Tokyo
2×5×61 Posts |
Hi, flashjh
Can you make a CUDA 3.2 version? CUDA 3.2 CUFFT is 5% faster than CUDA 4.x. |
|
|
|
|
|
#1011 | ||
|
"Jerry"
Nov 2011
Vancouver, WA
1,123 Posts |
Quote:
Quote:
I'll post updates in a bit.
EDIT: I'm getting an unresolved error, LNK2001: unresolved external symbol gettimeofday. I'll have to look at it later...
Last fiddled with by flashjh on 2012-03-19 at 00:24 |
||
|
|
|
|
|
#1012 | |
|
Jul 2009
Tokyo
1142₈ Posts |
Quote:
Code:
#ifdef _MSC_VER
#include <winsock2.h>
extern "C" int gettimeofday(struct timeval *tv, struct timezone *tz);
#else
#include <sys/time.h>
#include <unistd.h>
#endif
Code:
#ifdef _MSC_VER
typedef struct timeval
{
long tv_sec;
long tv_usec;
} timeval;
int gettimeofday (struct timeval *tv, struct timezone *);
#else
#include <sys/time.h>
#include <unistd.h>
#endif
Thanks. |
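LNK2001 means the linker saw a declaration of gettimeofday but found no definition, and the MSVC runtime does not ship one. A minimal sketch of one way to supply it follows, assuming the Win32 call GetSystemTimeAsFileTime is acceptable as the clock source. It is not the code that went into CUDALucas; the `void *` timezone parameter and the helper `elapsed_seconds` are illustrative assumptions.

```c
#include <assert.h>
#include <stddef.h>

#ifdef _MSC_VER
#include <winsock2.h>   /* defines struct timeval on Windows */
#include <windows.h>

/* Sketch of a gettimeofday replacement for MSVC builds. The timezone
   argument is ignored, so it is declared as void * here (assumption). */
static int gettimeofday(struct timeval *tv, void *tz)
{
    FILETIME ft;
    ULARGE_INTEGER t;

    (void)tz;
    if (tv == NULL)
        return -1;

    /* FILETIME counts 100 ns ticks since 1601-01-01 (UTC). */
    GetSystemTimeAsFileTime(&ft);
    t.LowPart = ft.dwLowDateTime;
    t.HighPart = ft.dwHighDateTime;

    /* Shift to the Unix epoch: 11644473600 s expressed in 100 ns ticks. */
    t.QuadPart -= 116444736000000000ULL;

    tv->tv_sec = (long)(t.QuadPart / 10000000ULL);
    tv->tv_usec = (long)((t.QuadPart % 10000000ULL) / 10ULL);
    return 0;
}
#else
#include <sys/time.h>   /* the real gettimeofday on POSIX systems */
#endif

/* Hypothetical helper: seconds between two timestamps, as used for
   per-iteration timing output. */
static double elapsed_seconds(const struct timeval *start,
                              const struct timeval *end)
{
    assert(start != NULL && end != NULL);
    return (double)(end->tv_sec - start->tv_sec)
         + (double)(end->tv_usec - start->tv_usec) / 1e6;
}
```

A caller would take one timestamp before the timed section and one after, then pass both to `elapsed_seconds`. Including `winsock2.h` before `windows.h` avoids the redefinition clashes with the older `winsock.h` that `windows.h` pulls in by default.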
|
|
|
|