mersenneforum.org

CUDALucas (a.k.a. MaclucasFFTW/CUDA 2.3/CUFFTW) (https://www.mersenneforum.org/showthread.php?t=12576)

ET_ 2011-12-22 13:35

[QUOTE=msft;283156]Hi,
Ver 1.4
Supports non-power-of-two FFT lengths. :lol:
Timings with GTX-550Ti:
n = 786432: 5.4 msec/Iter
n = 1310720: 8.9 msec/Iter
n = 1835008: 12.5 msec/Iter
n = 2359296: 16 msec/Iter
n = 3670016: 25.7 msec/Iter
n = 5242880: 37.7 msec/Iter
Thank you,[/QUOTE]

Is it safe to switch versions (1.3 -> 1.4) during an LL test, or is it better to finish the test and then switch?

Luigi
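
The FFT lengths in the quoted timings are all a small odd number times a power of two, which is what "non power of two" amounts to here. A minimal Python sketch to see this (the helper name `split_fft_length` is made up for illustration and is not part of CUDALucas):

```python
def split_fft_length(n):
    """Split n into (odd, exp) with n == odd * 2**exp and odd not divisible by 2."""
    if n <= 0:
        raise ValueError("FFT length must be positive")
    exp = 0
    while n % 2 == 0:
        n //= 2
        exp += 1
    return n, exp


# msec/Iter per FFT length, copied from the quoted GTX-550Ti timings.
TIMINGS = {
    786432: 5.4,
    1310720: 8.9,
    1835008: 12.5,
    2359296: 16.0,
    3670016: 25.7,
    5242880: 37.7,
}

for n, ms in TIMINGS.items():
    odd, exp = split_fft_length(n)
    print(f"n = {n} = {odd} * 2^{exp}: {ms} msec/Iter")
```

A pure power of two would come out with an odd factor of 1; the lengths above have odd factors of 3, 5, 7 and 9.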

msft 2011-12-22 13:50

Hi, ET_
[QUOTE=ET_;283160]Is it safe to switch versions (1.3 -> 1.4) during an LL test, or is it better to finish the test and then switch?

Luigi[/QUOTE]
In any case, you cannot change the FFT length.

ET_ 2011-12-22 15:32

[QUOTE=msft;283165]Hi, ET_

In any case, you cannot change the FFT length.[/QUOTE]

Yep :redface: Thank you.

Luigi

Brain 2011-12-22 18:16

Two wishes

First, could anybody post pre-compiled 1.4 binaries for both Linux and Windows (64-bit)? This would help a lot, and I could integrate the URLs into the GPU guide.

Second, could a forum admin rename the thread or add "CUDALucas" to its title? That would help a lot, too.

Thanks!

msft 2011-12-23 00:38

1 Attachment(s)
Hi, Brain
[QUOTE=Brain;283198]First, could anybody post pre-compiled 1.4 binaries for both Linux and Windows (64-bit)?[/QUOTE]
1.4 binary file for Linux64.

moebius 2011-12-23 03:57

[QUOTE=msft;283250]Hi, Brain

1.4 binary file for Linux64.[/QUOTE]

Is it runnable on CUDA 4.0?

msft 2011-12-23 08:46

Hi, moebius
[QUOTE=moebius;283275]Is it runnable on CUDA 4.0?[/QUOTE]
I have no information.
Do you have any comments?

TheJudger 2011-12-23 11:21

[QUOTE=moebius;283275]Is it runnable on CUDA 4.0?[/QUOTE]

According to CUDA_C_Programming_Guide.pdf, the CUDA runtime version must be the same as the version used for compiling. The CUDA driver must support the same or a newer version of CUDA.

Oliver
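
The rule above can be written down as a small check. This is only a sketch in Python: the function names are hypothetical, and it assumes versions are encoded as integers the way the CUDA headers do it (1000 * major + 10 * minor, so 4.0 becomes 4000). It encodes the rule exactly as stated in this post and nothing more.

```python
def decode_cuda_version(v):
    """Turn an integer such as 4010 into a (major, minor) tuple such as (4, 1)."""
    return v // 1000, (v % 1000) // 10


def can_run(built_with, runtime, driver):
    """Rule from the post: runtime must match the build, driver must be the same or newer."""
    return runtime == built_with and driver >= runtime


# Hypothetical example: a binary built with CUDA 3.2, a 3.2 runtime and a 4.1 driver.
print(decode_cuda_version(4010), can_run(3020, 3020, 4010))
```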

Brain 2011-12-23 12:56

[QUOTE=msft;283250]Hi, Brain

1.4 binary file for Linux64.[/QUOTE]
Thanks a lot. I thought of compiling the Windows version myself, but I don't have MS Visual Studio installed. Last time I did, I almost wrecked my Windows installation. So I'm still looking for a volunteer (kjaget, apsen, ethan_eo?)... At some point I will set up a VM for MS VS and CUDALucas compilation.

Makefile.win says Visual Studio 9.0 is required. Is this correct? If anybody could post a short how-to for compiling on Windows, that would be nice: nvcc.exe <compile me>

Makefile.win:
[CODE]# Adjust these paths to match the local CUDA toolkit and SDK installation.
NVIDIA_SDK = "C:/ProgramData/NVIDIA Corporation/NVIDIA GPU Computing SDK"
LIBS = c:\\cuda\\lib64\\cudart.lib c:\\cuda\\lib64\\cufft.lib

# -ccbin points nvcc at the Visual Studio 9.0 host compiler.
CUFLAGS = -m64 --ptxas-options=-v -ccbin="C:\Program Files (x86)\Microsoft Visual Studio 9.0\VC\bin" -DWIN64 -Xcompiler /EHsc,/W3,/nologo,/Ox,/Oy,/GL -arch=sm_13 -DMERS_PACKAGE -DBIT_SIEVE -DTESTING_SMALL_EXPONENTS -DSIEVE_SIZE_IN_BYTES=32 -DNUM_SMALL_PRIMES=32768 -DDO_NOT_USE_LONG_DOUBLE -IC:/cuda/include -Ic:/cuda/include/cudart -I$(NVIDIA_SDK)/C/common/inc -D__x86_64__ -O3

LINK = link
LFLAGS = /nologo /LTCG #/ltcg:pgo

CUSRC = CUDALucas.cu setup.cu rw.cu balance.cu zero.cu

CUOBJS = $(CUSRC:.cu=.obj)

# Link all object files into the executable (recipe lines must start with a tab).
CUDALucas.exe: $(CUOBJS)
	$(LINK) $(LFLAGS) $^ $(LIBS) /out:$@

# Compile each .cu source to an object file with nvcc.
%.obj: %.cu
	nvcc -c $< -o $@ $(CUFLAGS)
[/CODE]

frmky 2011-12-27 07:59

Using CUDALucas 1.4, there's an interesting speed regression going from CUDA 3.2 to CUDA 4.1RC2 on my GTX 480 under 64-bit Linux:

CUDA 3.2:
Iteration 20000 6.306 msec/Iter M( 45002497 )C, 0x6827d44a3efced00, n = 2621440, CUDALucas v1.4

CUDA 4.1RC2:
Iteration 20000 7.126 msec/Iter M( 45002497 )C, 0x6827d44a3efced00, n = 2621440, CUDALucas v1.4

Note that both of these runs used the 4.1RC2 driver. Only the compiler & libraries differ.
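
To put the two numbers in perspective, here is a rough back-of-the-envelope calculation in Python. It assumes the msec/Iter figure stays constant for the whole run and uses the fact that an LL test of M(p) needs p - 2 iterations; the function names are just for illustration.

```python
def slowdown_percent(old_ms, new_ms):
    """Relative increase in time per iteration, in percent."""
    return (new_ms / old_ms - 1.0) * 100.0


def ll_test_hours(p, ms_per_iter):
    """Estimated wall-clock hours for a full LL test of M(p) at a constant iteration time."""
    iterations = p - 2
    return iterations * ms_per_iter / 1000.0 / 3600.0


p = 45002497
for label, ms in (("CUDA 3.2", 6.306), ("CUDA 4.1RC2", 7.126)):
    print(f"{label}: {ll_test_hours(p, ms):.1f} hours")
print(f"slowdown: {slowdown_percent(6.306, 7.126):.1f}%")
```

With these inputs the slowdown comes to about 13%, or roughly 79 versus 89 hours for the full test.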

msft 2011-12-27 09:04

Hi, frmky
[QUOTE=frmky;283635]CUDA 3.2:
Iteration 20000 6.306 msec/Iter M( 45002497 )C, 0x6827d44a3efced00, n = 2621440, CUDALucas v1.4

CUDA 4.1RC2:
Iteration 20000 7.126 msec/Iter M( 45002497 )C, 0x6827d44a3efced00, n = 2621440, CUDALucas v1.4[/QUOTE]
This is a CUFFT issue.
In addition, CUDALucas 1.4 is slower than 1.3 at the same FFT length.

