mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   GPU Computing (https://www.mersenneforum.org/forumdisplay.php?f=92)
-   -   CUDALucas (a.k.a. MaclucasFFTW/CUDA 2.3/CUFFTW) (https://www.mersenneforum.org/showthread.php?t=12576)

MooMoo2 2010-08-27 23:50

[quote=nucleon;227337]Has there been any work done on the competition?

On ATI's cards - any work done on them?
[/quote]
Probably not. It doesn't really matter anyway, since an average dual-core CPU should be able to outperform either an Nvidia card or an ATI card:

[URL]http://www.mersenneforum.org/showpost.php?p=195982&postcount=1[/URL]

[quote][B]Q: So I can look for prime numbers on a GPU now?

[/B]Indeed you can.

[B]Q: So how fast does it go?[/B]

It's a work in progress, but with a top-of-the-line card the current speed seems to be around what one core of a high-end PC can achieve.
[/quote]

TheJudger 2010-08-28 00:17

[QUOTE=MooMoo2;227338]Probably not. It doesn't really matter anyway, since an average dual-core CPU should be able to outperform either an Nvidia card or an ATI card:
[/QUOTE]

This might be a bit outdated. msft has made some nice improvements!

[url]http://www.mersenneforum.org/showpost.php?p=222637&postcount=206[/url]

A Core i7 980X (top-end 6-core) takes around 7.5 ms per iteration for a 2M FFT using all 6 cores.

Oliver

nucleon 2010-08-28 10:06

GTX 480 with the ver. R code: 4.39 ms for a 2M FFT.

As TheJudger says, the FAQ was written before the Fermi architecture. A top-end GTX 480 is streaks ahead in raw numbers, in output per initial cost, and in output per ongoing (running) cost.

The second-fastest computer on the planet uses GPGPUs based on Fermi:

[url]www.top500.org[/url]

I'm sorry, but the time has come where we can no longer dismiss GPGPUs.

-- Craig

The Carnivore 2010-08-28 17:49

Can someone tell me what the FFT length requirements are for CPUs vs. GPUs? For example, if you wanted to check M24036583, you'd need an FFT length of 1280K if you were using a CPU. What FFT length would you need if you were using a GPU? 1280K? 2048K? 4096K?

I remember reading somewhere that starting a prime searching project without GPU support was like going to war without a piano. Some GPUs may match an overclocked quad- or hex-core CPU if you compare them at the same FFT length, but they won't be of much use if they need FFT lengths that are much longer than the ones CPUs need.

BTW, my friend has a high-end GPU, and he says it adds nearly 500 watts to his system at full load. Yuck. :yucky:

frmky 2010-08-28 18:11

[QUOTE=The Carnivore;227428]For example, if you wanted to check M24036583, you'd need an FFT length of 1280K if you were using a CPU. What FFT length would you need if you were using a GPU? 1280K? 2048K? 4096K?

BTW, my friend has a high-end GPU, and he says it adds nearly 500 watts to his system at full load. Yuck. :yucky:[/QUOTE]
Round up to the next power of 2. 1280K > 1024K, so you have to use a 2048K FFT. But remember that a GTX 480 runs at under 4.5 ms/iteration using the 2048K FFT. Compare that with your CPU's speed at 1280K.
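
The round-up rule can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not code from CUDALucas; the function name `gpu_fft_length_k` is made up for this example, and lengths are given in units of K as in the posts above.

```python
def gpu_fft_length_k(cpu_fft_length_k: int) -> int:
    """Return the smallest power of two (in K) that is >= the given FFT length (in K).

    Hypothetical helper: it only models the "power-of-2 lengths only" rule
    described in this thread.
    """
    if cpu_fft_length_k < 1:
        raise ValueError("FFT length must be a positive number of K")
    # (n - 1).bit_length() is the exponent of the next power of two >= n.
    return 1 << (cpu_fft_length_k - 1).bit_length()


if __name__ == "__main__":
    for length_k in (1024, 1280, 2048):
        print(f"{length_k}K on the CPU -> {gpu_fft_length_k(length_k)}K on the GPU")
```

A length that is already a power of 2 (1024K, 2048K) stays as it is; only the in-between lengths such as 1280K get bumped up.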

A PCIe card is limited to drawing 300W at full load by spec, but the cards are coming close to that limit.

The Carnivore 2010-08-28 18:37

[quote=frmky;227430]Round up to the next power of 2. 1280K > 1024K, so you have to use a 2048K FFT. But remember that a GTX 480 runs at under 4.5 ms/iteration using the 2048K FFT. Compare that with your CPU's speed at 1280K.
[/quote]
My Core i7 920 overclocked to 3.2 GHz gets about 20 ms/iteration, but that's with only one core running. With all cores running one LL test each, the total output matches the GTX 480.

Surpassing the GTX 480 should be possible if you get an i7 980X or use more aggressive overclocks.
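
For anyone who wants to check the arithmetic, here is a rough sketch of the comparison. It assumes the i7 920's four cores each hold 20 ms/iteration while all of them run at once (in practice memory bandwidth may slow them down), and it takes 4.5 ms/iteration as an upper bound for the GTX 480. The function name is hypothetical.

```python
def iterations_per_second(ms_per_iteration: float, parallel_tests: int = 1) -> float:
    """Total LL iterations per second across all tests running in parallel."""
    return parallel_tests * 1000.0 / ms_per_iteration


# Assumed figures from the posts above: four CPU cores at 20 ms each
# (1280K FFT) versus one GPU at 4.5 ms (2048K FFT) on the same exponent.
cpu_rate = iterations_per_second(20.0, parallel_tests=4)
gpu_rate = iterations_per_second(4.5)

if __name__ == "__main__":
    print(f"CPU, 4 tests in parallel: {cpu_rate:.1f} iterations/s in total")
    print(f"GPU, 1 test:              {gpu_rate:.1f} iterations/s")
```

Note the difference in kind: the CPU figure is total throughput spread over four separate tests, while the GPU figure is for a single test, which therefore finishes about four times sooner.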

Mini-Geek 2010-08-28 18:50

[QUOTE=The Carnivore;227431]My Core i7 920 overclocked to 3.2 GHz gets about 20 ms/iteration, but that's with only one core running. With all cores running one LL test each, the total output matches the GTX 480.[/QUOTE]

But keep in mind that the GPU needs to go to the next higher power of 2. Going from 1280K to 2048K is approximately the worst-case scenario: 1280K is just a step above 1024K, so the GPU takes a huge performance hit (compared to a 1024K number) to run this number, while the CPU takes only a small one. Of course, the best case would be where both use a power-of-2 length. So even in a worst-case (for the GPU) scenario, you still need all cores of your i7 working together to match its output! In a best-case scenario, it's closer to twice as fast as your CPU.
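
To put a number on the padding penalty, here is a small sketch that compares the FFT length the GPU ends up using with the length actually required. The sample lengths other than 1024K, 1280K and 2048K are assumptions picked for illustration, and the function names are made up for this example.

```python
def next_power_of_two(n: int) -> int:
    """Smallest power of two that is >= n (n must be positive)."""
    if n < 1:
        raise ValueError("n must be positive")
    return 1 << (n - 1).bit_length()


def padding_factor(required_length_k: int) -> float:
    """Ratio of the power-of-2 FFT length to the length actually required."""
    return next_power_of_two(required_length_k) / required_length_k


if __name__ == "__main__":
    # 1536K and 1792K are illustrative intermediate lengths, not taken from the thread.
    for length_k in (1024, 1280, 1536, 1792, 2048):
        print(f"{length_k}K -> {next_power_of_two(length_k)}K, "
              f"padding factor {padding_factor(length_k):.2f}")
```

A factor of 1.00 is the best case (the length is already a power of 2). The factor is largest just above a power of 2, which is why 1280K is close to the worst case.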

ET_ 2010-08-28 20:48

[QUOTE=Mini-Geek;227433]But keep in mind that the GPU needs to go to the next higher power of 2. Going from 1280K to 2048K is approximately the worst-case scenario: 1280K is just a step above 1024K, so the GPU takes a huge performance hit (compared to a 1024K number) to run this number, while the CPU takes only a small one. Of course, the best case would be where both use a power-of-2 length. So even in a worst-case (for the GPU) scenario, you still need all cores of your i7 working together to match its output! In a best-case scenario, it's closer to twice as fast as your CPU.[/QUOTE]

I'd add that with his computer AND the GPU, he could run 2 jobs in parallel...

Luigi

sanzo 2010-08-29 21:51

Hello everyone,

I'm a new member of this forum, but I've been a GIMPS member for a year.

I have two graphics cards:
- an ATI Radeon HD 5870
- a GeForce 210

I found this discussion because I want to run the Lucas-Lehmer test on a GPU.

I'd like to know whether there is an OpenCL program that can run on ATI cards, or whether there is only a CUDA version (for my 210 card).

Thanks all
sanzo

henryzz 2010-08-30 08:24

[quote=sanzo;227575]Hello everyone,

I'm a new member of this forum, but I've been a GIMPS member for a year.

I have two graphics cards:
- an ATI Radeon HD 5870
- a GeForce 210

I found this discussion because I want to run the Lucas-Lehmer test on a GPU.

I'd like to know whether there is an OpenCL program that can run on ATI cards, or whether there is only a CUDA version (for my 210 card).

Thanks all
sanzo[/quote]
Only CUDA so far.

sanzo 2010-08-30 09:07

Thanks [URL="http://www.mersenneforum.org/member.php?u=7654"][COLOR=green]henryzz[/COLOR][/URL],

Can I test this program on my GeForce 210?
I have Windows 7 64-bit. How can I try it? Is there a precompiled Windows version?

sanzo


All times are UTC.