mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   GPU Computing (https://www.mersenneforum.org/forumdisplay.php?f=92)
-   -   CUDALucas (a.k.a. MaclucasFFTW/CUDA 2.3/CUFFTW) (https://www.mersenneforum.org/showthread.php?t=12576)

Dubslow 2011-10-23 04:43

[QUOTE=fivemack;275365]You're measuring the size of the memory required to write down the candidate number N; what's relevant for CUDALucas is the amount of memory required to perform an FFT-based multiply of numbers the size of N, which is a good deal bigger (firstly because you can only fit about eighteen bits of N into each 64-bit floating-point number, and secondly because the CUDA FFT uses a lot of scratch space to allow it to use hundreds of threads).[/QUOTE]

Ah, okay. Makes much more sense, especially the threadedness. Thanks.

aaronhaviland 2011-10-23 13:07

[QUOTE=Lorenzo;275350]All done works!

[CODE]Iteration 10000 M( 290000033 )C, 0x9e2181b8405a2511, n = 16777216, CUDALucas v1.2[/CODE]FFT size 16M![/QUOTE]

Err, actually not: the cufft size is initialiased as only half of n:

[CODE]cufftPlan1d(&plan, [B]n/2[/B], CUFFT_Z2Z, 1)[/CODE]

Lorenzo 2011-10-23 18:49

[QUOTE=aaronhaviland;275402]Err, actually not: the cufft size is initialiased as only half of n:

[CODE]cufftPlan1d(&plan, [B]n/2[/B], CUFFT_Z2Z, 1)[/CODE][/QUOTE]

:drama: Too bad.

Oops, i find in FAQ:
[CODE]Exponents: 2<=Exponent< 151,150,000
Exponent< 39,800,00  2MB FFT size
Exponent< 79,600,00  4MB FFT size
Exponent<159,200,00  8MB FFT size[/CODE]

No more question :redface:

Brain 2011-10-23 20:09

[QUOTE=Lorenzo;275430]No more question :redface:[/QUOTE]
No, there are still basic questions (for me):
1. what is the max expo possible? how much GPU memory is needed for each FFT size?
2. what are the fft size borders beyond 4m?
i only tried up to 4m on my own.

I will try later but anybody must already have tested it...

msft 2011-10-25 13:08

Hi ,aaronhaviland
[QUOTE=aaronhaviland;275402]Err, actually not: the cufft size is initialiased as only half of n:

[CODE]cufftPlan1d(&plan, [B]n/2[/B], CUFFT_Z2Z, 1)[/CODE][/QUOTE]
GIMPS(Prime95,GIMPS Web,MLucas,GLucas,...) use real data size.

aaronhaviland 2011-10-26 23:22

[QUOTE=msft;275673]Hi ,aaronhaviland

GIMPS(Prime95,GIMPS Web,MLucas,GLucas,...) use real data size.[/QUOTE]

Right, but I thought the question was about the limitations of CUFFT's transform sizes.

msft 2011-10-27 03:17

[QUOTE=aaronhaviland;275874]Right, but I thought the question was about the limitations of CUFFT's transform sizes.[/QUOTE]
NVIDIA not announce CUFFT's work area size and limitations ,we can only do the experiments.

Wizzard 2011-11-02 20:33

Sorry people, want try CUDALucas too, I downloaded CUDALucas.1.2b.zip, extracted, installed CUDA Toolkit 3.0 from nvidia website, wanted to compile CUDALucas using make and got an error: CUDALucas.cu:87:26: fatal error: cutil_inline.h: Directory or file does not exist. What is wrong? Do I need some CUDA patch? Which one? I am lost... I am using Ubuntu64 on Core2 Duo and GTS250.

Dubslow 2011-11-02 22:05

Check your Toolkit version. I'm pretty sure the current release is either 4.1 or 4.2. There's also some instructions you have to follow during installation regarding various *PATH variables.

Wizzard 2011-11-02 22:48

Okay, I replaced Toolkit with version 4.0, downloaded also GPU Computing SDK. Now I moved a little farther, and stopped at the error:

gcc: setup.cu: Directory or file does not exist
gcc: warning: '-x c++' after last input file has no effect
gcc: no input files
make: *** [setup.o] Error 1

Dubslow 2011-11-02 23:56

Unless CUDA threw an error about the gcc version being >4.5, I can't help. Sorry.


All times are UTC. The time now is 23:05.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.