mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2012-03-08, 19:34   #89
flashjh
 
flashjh's Avatar
 
"Jerry"
Nov 2011
Vancouver, WA

112310 Posts
Default

Quote:
Originally Posted by aaronhaviland View Post
I started using one of these to move the middle card of my 3-GPU system for better airflow: http://www.amazon.com/gp/product/B0058UVVX2/
I might use another, to make it a 4-GPU system (it's a 4-slot mobo, in a too-small tower case)
Can you post a picture so I can see how it works with final setup? What PS do you use on that system?
flashjh is offline   Reply With Quote
Old 2012-04-06, 20:08   #90
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3·29·83 Posts
Default

Whoa dang, the PDF is way out of date, we're up to CuLu 2.0 now, not 1.4.2 ;)

(Also, GMP-ECM has been ported at least sort of to the GPU, that should be removed.)
Dubslow is offline   Reply With Quote
Old 2012-04-07, 10:07   #91
Brain
 
Brain's Avatar
 
Dec 2009
Peine, Germany

331 Posts
Default GPU Computing Guide Update

I will probably do this soon. It's kind of useless to update the pdf when CL changes versions so quickly as it did the last weeks.
I monitor the CL thread and will release the update when I think that CL2 is ready for production work. I guess I will be convinced in the next few days.
Additionally, I dislike old versions wandering around in the stickies. Two options: Stop publishing the guide in the sticky and use a new thread or get mod rights. I prefer the former.
I am not convinced that CUDA-ECM should be integrated yet as it is expert and development level at the moment.
Thoughts? Is it useful at all?

Last fiddled with by Brain on 2012-04-07 at 10:08
Brain is offline   Reply With Quote
Old 2012-04-07, 17:46   #92
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3·29·83 Posts
Default

I wasn't saying it should be integrated, just remove the sentence saying it needs to be done.
Dubslow is offline   Reply With Quote
Old 2012-04-08, 07:18   #93
xilman
Bamboozled!
 
xilman's Avatar
 
"๐’‰บ๐’ŒŒ๐’‡ท๐’†ท๐’€ญ"
May 2003
Down not across

270268 Posts
Default

Quote:
Originally Posted by Brain View Post
I am not convinced that CUDA-ECM should be integrated yet as it is expert and development level at the moment.
Thoughts? Is it useful at all?
IMO, it is very useful. I've already found three non-trivial factors with it and another day-long run is about 80% completed at the moment.

Paul
xilman is offline   Reply With Quote
Old 2012-04-08, 08:57   #94
debrouxl
 
debrouxl's Avatar
 
Sep 2009

22×5×72 Posts
Default

Yeah, I've found some small factors as well, and I see that the ECMNET records page ( http://www.loria.fr/~zimmerma/records/ecmnet.html ) shows several factors whose stage 1 was performed by GPU-ECM (A = ... instead of sigma, and Cyril Bouvier being the discoverer).

My low-end GT 540M running GPU-ECM beta beats the associated Core i7-2670QM running GMP-ECM by a wide margin for some numbers, and other persons' tests show that the ratio is even more damning for CPUs if high-end GT(X) 4xx and 5xx are used.
debrouxl is offline   Reply With Quote
Old 2012-04-08, 10:57   #95
xilman
Bamboozled!
 
xilman's Avatar
 
"๐’‰บ๐’ŒŒ๐’‡ท๐’†ท๐’€ญ"
May 2003
Down not across

2·17·347 Posts
Default

Quote:
Originally Posted by debrouxl View Post
Yeah, I've found some small factors as well, and I see that the ECMNET records page ( http://www.loria.fr/~zimmerma/records/ecmnet.html ) shows several factors whose stage 1 was performed by GPU-ECM (A = ... instead of sigma, and Cyril Bouvier being the discoverer
That might be the case but I'm far from certain. Try running gmp-ecm with the -batch parameter to see what I mean.

Paul
xilman is offline   Reply With Quote
Old 2012-04-08, 17:26   #96
Brain
 
Brain's Avatar
 
Dec 2009
Peine, Germany

331 Posts
Default

Quote:
Originally Posted by xilman View Post
IMO, it is very useful. I've already found three non-trivial factors with it and another day-long run is about 80% completed at the moment.

Paul
My fault: The "Is it useful" was referring to the pdf guide. I will mention GMP-ECM thread in the next version. Just waiting for more news from mfakto - new version likely to come soon.
Brain is offline   Reply With Quote
Old 2012-07-13, 21:31   #97
ixfd64
Bemusing Prompter
 
ixfd64's Avatar
 
"Danny"
Dec 2002
California

23·313 Posts
Default

This is probably a stupid question, but would running multiple LL tests on one GPU yield a higher throughput?

For example, the GeForce GTX 570 has 480 CUDA cores, and I imagine that a lot of threads would get in each other's way. In this case, would it be better to run, say, ten parallel tests using 48 cores each, instead of doing one test with all 480 cores? I don't think memory bandwidth would be an issue as GPUs normally have much higher bandwidths than CPUs.
ixfd64 is offline   Reply With Quote
Old 2012-07-13, 21:41   #98
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3×29×83 Posts
Default

I think the problem is that much of the code is actually CPU based, but it's lightweight-enough code that CUDALucas doesn't appear to use any CPU time. The point is, a lot of transfers between host and device occur each iteration, and adding any more transfers would significantly slow both tests running side by side. Furthermore, it simply isn't possible to run the test only on the GPU.

You'll have to wait for msft to verify/clarify these statements, since he's the only one (plus George) to write the math code, but this should provide at least a basic answer.
Dubslow is offline   Reply With Quote
Old 2012-07-13, 21:52   #99
henryzz
Just call me Henry
 
henryzz's Avatar
 
"David"
Sep 2007
Liverpool (GMT/BST)

17FD16 Posts
Default

Quote:
Originally Posted by Dubslow View Post
I think the problem is that much of the code is actually CPU based, but it's lightweight-enough code that CUDALucas doesn't appear to use any CPU time. The point is, a lot of transfers between host and device occur each iteration, and adding any more transfers would significantly slow both tests running side by side. Furthermore, it simply isn't possible to run the test only on the GPU.

You'll have to wait for msft to verify/clarify these statements, since he's the only one (plus George) to write the math code, but this should provide at least a basic answer.
Wouldn't this mean that if we got some sort of GPU LL test running on a ivy bridge it would run really fast?
henryzz is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Anti-poverty drug testing vs "high" tax deduction testing kladner Soap Box 3 2016-10-14 18:43
What am I testing? GARYP166 Information & Answers 9 2009-02-18 22:41
k=243 testing ?? gd_barnes Riesel Prime Search 20 2007-11-08 21:13
Testing grobie Marin's Mersenne-aries 1 2006-05-15 12:26
Speed of P-1 testing vs. Trial Factoring testing eepiccolo Math 6 2006-03-28 20:53

All times are UTC. The time now is 14:46.


Fri Jul 7 14:46:19 UTC 2023 up 323 days, 12:14, 0 users, load averages: 1.43, 1.28, 1.13

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

โ‰  ยฑ โˆ“ รท ร— ยท โˆ’ โˆš โ€ฐ โŠ— โŠ• โŠ– โŠ˜ โŠ™ โ‰ค โ‰ฅ โ‰ฆ โ‰ง โ‰จ โ‰ฉ โ‰บ โ‰ป โ‰ผ โ‰ฝ โŠ โА โŠ‘ โŠ’ ยฒ ยณ ยฐ
โˆ  โˆŸ ยฐ โ‰… ~ โ€– โŸ‚ โซ›
โ‰ก โ‰œ โ‰ˆ โˆ โˆž โ‰ช โ‰ซ โŒŠโŒ‹ โŒˆโŒ‰ โˆ˜ โˆ โˆ โˆ‘ โˆง โˆจ โˆฉ โˆช โจ€ โŠ• โŠ— ๐–• ๐–– ๐–— โŠฒ โŠณ
โˆ… โˆ– โˆ โ†ฆ โ†ฃ โˆฉ โˆช โІ โŠ‚ โŠ„ โŠŠ โЇ โŠƒ โŠ… โŠ‹ โŠ– โˆˆ โˆ‰ โˆ‹ โˆŒ โ„• โ„ค โ„š โ„ โ„‚ โ„ต โ„ถ โ„ท โ„ธ ๐“Ÿ
ยฌ โˆจ โˆง โŠ• โ†’ โ† โ‡’ โ‡ โ‡” โˆ€ โˆƒ โˆ„ โˆด โˆต โŠค โŠฅ โŠข โŠจ โซค โŠฃ โ€ฆ โ‹ฏ โ‹ฎ โ‹ฐ โ‹ฑ
โˆซ โˆฌ โˆญ โˆฎ โˆฏ โˆฐ โˆ‡ โˆ† ฮด โˆ‚ โ„ฑ โ„’ โ„“
๐›ข๐›ผ ๐›ฃ๐›ฝ ๐›ค๐›พ ๐›ฅ๐›ฟ ๐›ฆ๐œ€๐œ– ๐›ง๐œ ๐›จ๐œ‚ ๐›ฉ๐œƒ๐œ— ๐›ช๐œ„ ๐›ซ๐œ… ๐›ฌ๐œ† ๐›ญ๐œ‡ ๐›ฎ๐œˆ ๐›ฏ๐œ‰ ๐›ฐ๐œŠ ๐›ฑ๐œ‹ ๐›ฒ๐œŒ ๐›ด๐œŽ๐œ ๐›ต๐œ ๐›ถ๐œ ๐›ท๐œ™๐œ‘ ๐›ธ๐œ’ ๐›น๐œ“ ๐›บ๐œ”