mersenneforum.org  

Go Back   mersenneforum.org > Search Forums

Showing results 1 to 8 of 8
Search took 0.01 seconds.
Search: Posts Made By: Andrew Thall
Forum: GPU Computing 2011-02-11, 17:26
Replies: 2,817
Views: 217,903
Posted By Andrew Thall
Update your CUDA library and CUFFT. The most...

Update your CUDA library and CUFFT. The most recent version no longer has the 8M element limit. It's also much more numerically accurate, particularly the non-power-of-two transforms.
Forum: GPU Computing 2011-02-04, 16:27
Replies: 2,817
Views: 217,903
Posted By Andrew Thall
There are wide variations in the time similar...

There are wide variations in the time similar sized transforms based on their factorization: CUFFT (CUDA 3.2) on Fermi supports 2^a * 3^b * 5^c * 7^d transforms, with pure powers of 2 and 3 being...
Forum: GPU Computing 2011-02-03, 02:38
Replies: 6
Views: 2,169
Posted By Andrew Thall
Thanks, all. I've had a few volunteers by email...

Thanks, all. I've had a few volunteers by email already...it is mainly the CUDALucas timings I need, but I think we've got it covered for now. Just looking at msec per Lucas iteration for given FFT...
Forum: GPU Computing 2011-02-01, 18:37
Replies: 6
Views: 2,169
Posted By Andrew Thall
Talk on gpuLucas at GPGPU-4 Workshop in March

I'll be presenting a short paper describing the GPU Lucas-Lehmer code I reported on last month. If anyone is in the LA area, the workshop is in conjunction with ASPLOS XVI conference; GPGPU-4 will...
Forum: GPU Computing 2010-12-13, 17:38
Replies: 109
Views: 16,964
Posted By Andrew Thall
@Brain: Fact #1 is true but irrelevant. LL...

@Brain: Fact #1 is true but irrelevant. LL needs the double precision only for the FFT squaring; as I mentioned before, I get better timings from the Tesla 2050 over the GTX 480 only if I overclock...
Forum: GPU Computing 2010-12-09, 15:09
Replies: 109
Views: 16,964
Posted By Andrew Thall
With regard the GPU LLR work; haven't looked at...

With regard the GPU LLR work; haven't looked at the sequential algorithms; based on George W.'s description, use of straightline in place of circular convolution and shift-add for modular...
Forum: GPU Computing 2010-12-08, 14:30
Replies: 109
Views: 16,964
Posted By Andrew Thall
Certainly no intention of pwning anyone; this is...

Certainly no intention of pwning anyone; this is purely research code, I was working from Crandall's original paper and with the understanding that other's had gotten it to work with non-powers of...
Forum: GPU Computing 2010-12-07, 15:15
Replies: 109
Views: 16,964
Posted By Andrew Thall
Fast Mersenne Testing on the GPU using CUDA

I'd like to announce the implementation of a Lucas-Lehmer tester, gpuLucas, written in CUDA and running on Fermi-class NVidia cards. It's a full implementation of Crandall's IBDWT method and uses...
Showing results 1 to 8 of 8

 
All times are UTC. The time now is 04:47.

Tue Aug 11 04:47:12 UTC 2020 up 25 days, 33 mins, 1 user, load averages: 2.82, 2.92, 2.75

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2020, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.