mersenneforum.org  

Old 2011-11-14, 04:37   #23
bcp19
 
 
Oct 2011

7×97 Posts

Quote:
Originally Posted by LaurV View Post
Thanks for the small tutorial. My babe came on last Saturday and I spent the weekend installing stuff on it. I will definitely go for CudaLucas on DC exponents for a while, until I am convinced that all residues match, then I will switch to some other jobs, like LL-front or so-much-debated-TF-front. That is my choice for now, so I don't want to hear any argument.

So, CudaLucas is installed and running. So far so good. I use the 64-bit version, on Win7. Just as a small observation, the -c[xxx] switch does not work: no matter what I put there, it still outputs every 10k iterations on screen (has anyone tried a value other than the default?). This is a minor problem, and it is just FYI; of course I can live with it.

My biggest problem is that I don't know how to convince CudaLucas (or a second, third, etc. copy of it) to run on the second GPU. Can anyone help? I have carefully read all 36 pages of the GPU thread on the forum (and related ones) but did not find much.

If I start one copy of CudaLucas, about 75-80% of the first GPU is busy, and I get about 3.5 ms per iteration (~25-30M range). If I start a second copy, the same first GPU goes to 99%, and each CL process slows to about 4.5 ms per iteration. Still reasonable. If I continue to launch copies of CL, they all fight for the same GPU (and the time per iteration gets worse accordingly). The other GPU sits completely idle.

Tried also CL 64 with 4.0, same result. Also, the -t switch does not seem to work for any of them. Cuda capability is 2.0. Any switch I am missing for CL?

The -c switch is for how many iterations between outputs to the checkpoint file, not the screen output.

the -t switch only seems to be working on the 1.2b version

open a command prompt, change to your cudalucas directory and type cudalucas /? to get a list of switches

Unfortunately, I have no clue on getting cudalucas to work on the 2nd GPU.
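
To picture the -c behaviour described above, here is a toy sketch in Python. It is not taken from the CUDALucas source; the function name and the loop are made up for illustration. The only things borrowed from the thread are the observed 10k screen interval and the idea that -c changes the checkpoint interval alone.

```python
# Illustrative only: this is NOT CUDALucas source code.
SCREEN_INTERVAL = 10_000  # fixed screen-report interval, as observed in the thread


def run_iterations(total_iters, checkpoint_interval):
    """Return (screen_reports, checkpoint_writes) as lists of iteration numbers.

    checkpoint_interval plays the role of the -c value (hypothetical model):
    it changes how often a checkpoint would be written, and nothing else.
    """
    screen_reports = []
    checkpoint_writes = []
    for i in range(1, total_iters + 1):
        # ... one Lucas-Lehmer squaring step would happen here ...
        if i % SCREEN_INTERVAL == 0:
            screen_reports.append(i)       # screen output ignores -c
        if i % checkpoint_interval == 0:
            checkpoint_writes.append(i)    # checkpoint file follows -c
    return screen_reports, checkpoint_writes


if __name__ == "__main__":
    screen, ckpt = run_iterations(100_000, checkpoint_interval=50_000)
    print(len(screen), "screen reports,", len(ckpt), "checkpoint writes")
```

Changing checkpoint_interval in this model changes only the second list, which matches what was seen on screen: the 10k reports keep coming whatever value is passed.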

Last fiddled with by bcp19 on 2011-11-14 at 04:39
Old 2011-11-14, 06:30   #24
Dubslow
Basketry That Evening!
 
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

1C35₁₆ Posts

Quote:
Originally Posted by LaurV View Post
Wait, 1.2 TFLOPS? What the hell is on that thing?
Old 2011-11-14, 06:38   #25
LaurV
Romulan Interpreter
 
 
"name field"
Jun 2011
Thailand

10291₁₀ Posts

Quote:
Originally Posted by bcp19 View Post
The -c switch is for how many iterations between outputs to the checkpoint file, not the screen output.

the -t switch only seems to be working on the 1.2b version

open a command prompt, change to your cudalucas directory and type cudalucas /? to get a list of switches

Unfortunately, I have no clue on getting cudalucas to work on the 2nd GPU.

Thanks. The /? I figured out at the very beginning; it is the first thing one does when he gets a new toy, he writes "toy /?" at the command prompt :D
About -t, I found it on the forum just before reading your post. Eager to go home in the evening to try it. About -c, I did not know. Thanks for telling me. Somehow I think that the "printf" used there is about as slow as a disk write, especially when you have an SSD, and I wonder why -c does not work for the screen too. I mean, if I redirect the output to a file, that is writing to disk anyhow. So -c should affect both the screen output and the checkpoint file. Output to screen every 100k iterations, or even less often for a bigger exponent, is ok. Whatever...

It seems I still can't find out how to run CudaLucas on both GPUs, and up to now the only profitable way to keep the second GPU from sleeping is to run one CudaLucas and one mfaktc. I am aware of the -d switch of mfaktc, which selects the GPU, but I have not tried mfaktc yet; I would still prefer to run more CudaLucas instances, as that would leave the CPUs free to do P-1. I am also aware that SLI should be disabled for that to work, as someone said in another thread here. For CudaLucas I have tried both SLI and no-SLI, and I cannot trick it into running different copies on different GPUs.

Conspiracy theory: I am sure someone knows the answer, but they refuse to tell me, to make me run mfaktc (and therefore TF, see the big debate around) :P:P
Old 2011-11-14, 07:35   #26
LaurV
Romulan Interpreter
 
 
"name field"
Jun 2011
Thailand

24063₈ Posts

Quote:
Originally Posted by Dubslow View Post
Wait, 1.2 TFLOPS? What the hell is on that thing?

You are right! The hell is in that thing! And it is (theoretical) 1.3, not 1.2. I will post a photo when I get home, if you tell me how to run cudalucas on both GPUs.
Old 2011-11-14, 07:42   #27
Dubslow
Basketry That Evening!
 
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3·29·83 Posts

Erm... sorry, no se.
What's the hardware?
Old 2011-11-14, 08:30   #28
frmky
 
 
Jul 2003
So Cal

A67₁₆ Posts

A quick look at the source indicates that the unadvertised -D switch selects the GPU. GPU numbering starts at 0, so with two GPU's use -D0 and -D1.
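
A minimal sketch of how one copy per GPU might be started with that switch, written in Python. Only the -D switch and the 0-based numbering come from the post above. The executable name, passing the exponent as a positional argument, and the two exponents themselves are assumptions or placeholders, so check them against your own build before switching off the dry run.

```python
import shlex
import subprocess

EXECUTABLE = "cudalucas"  # assumed name/path of the binary; adjust for your install


def build_commands(num_gpus, extra_args_per_gpu):
    """Build one command line per GPU, using -D<n> with numbering from 0."""
    if len(extra_args_per_gpu) != num_gpus:
        raise ValueError("need one argument list per GPU")
    return [
        [EXECUTABLE, f"-D{gpu}", *extra_args_per_gpu[gpu]]
        for gpu in range(num_gpus)
    ]


def launch(commands, dry_run=True):
    """Print each command; start it as a separate process unless dry_run."""
    procs = []
    for cmd in commands:
        print(shlex.join(cmd))
        if not dry_run:
            procs.append(subprocess.Popen(cmd))
    return procs


if __name__ == "__main__":
    # Passing the exponent as a positional argument is an assumption here,
    # and the exponents are placeholders.
    cmds = build_commands(2, [["27777653"], ["27863639"]])
    launch(cmds, dry_run=True)
```

With dry_run=True the script only prints the two command lines, so the arguments can be checked by eye before anything is started.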

Last fiddled with by frmky on 2011-11-14 at 08:30
Old 2011-11-14, 13:23   #29
ET_
Banned
 
 
"Luigi"
Aug 2002
Team Italia

1001100000001₂ Posts

Quote:
Originally Posted by frmky View Post
A quick look at the source indicates that the unadvertised -D switch selects the GPU. GPU numbering starts at 0, so with two GPU's use -D0 and -D1.
Now we all will eagerly wait for LaurV and his photos...

Luigi
Old 2011-11-14, 14:06   #30
LaurV
Romulan Interpreter
 
 
"name field"
Jun 2011
Thailand

41×251 Posts

Quote:
Originally Posted by frmky View Post
A quick look at the source indicates that the unadvertised -D switch selects the GPU. GPU numbering starts at 0, so with two GPU's use -D0 and -D1.
Wow! Amazing! That is working! And there is no need to disable SLI. I used uppercase D (did not try lowercase d).

Iteration 9650000/27777653, ETA 18 hours, and (the one started later) Iteration 2630000/27863639, ETA 24 hours.

Thanks a billion! If we meet in RL, you have a beer from me!

(edit: this is in parallel with 4 P-1 runs on P95, another 8 waiting in the queue, and splitting the terms of aliquot 585000 with 4 threads of yafu! There is no noticeable delay, nothing except a lot of heat coming from under the desk...)
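
As a rough sanity check on the figures above, the per-iteration time implied by each ETA can be worked out from the remaining iterations. The Python sketch below does that, and also compares throughput using the single-GPU timings quoted earlier in the thread (3.5 ms for one copy, about 4.5 ms each for two copies). The function names are made up for illustration, and the ETAs are rounded to whole hours, so the results are only approximate.

```python
def implied_ms_per_iter(done, total, eta_hours):
    """Milliseconds per iteration implied by an ETA for the remaining work."""
    remaining = total - done
    return eta_hours * 3600 * 1000 / remaining


def combined_iters_per_sec(ms_per_iter, copies):
    """Total iterations per second across `copies` processes at this speed."""
    return copies * 1000 / ms_per_iter


if __name__ == "__main__":
    # Progress figures and ETAs as reported in the post.
    first = implied_ms_per_iter(9_650_000, 27_777_653, 18)
    second = implied_ms_per_iter(2_630_000, 27_863_639, 24)
    print(f"first run:  {first:.2f} ms/iter")
    print(f"second run: {second:.2f} ms/iter")

    # Single-GPU timings quoted earlier: one copy alone vs. two copies sharing.
    alone = combined_iters_per_sec(3.5, copies=1)
    shared = combined_iters_per_sec(4.5, copies=2)
    print(f"one copy on one GPU:   {alone:.0f} iter/s")
    print(f"two copies on one GPU: {shared:.0f} iter/s in total")
```

Both runs come out close to the 3.5 ms per iteration seen with a single copy on one GPU, which fits each copy now having a GPU to itself.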

Last fiddled with by LaurV on 2011-11-14 at 14:10
Old 2011-11-14, 20:13   #31
bcp19
 
bcp19's Avatar
 
Oct 2011

7×97 Posts

No pictures?
Old 2011-11-14, 22:07   #32
Dubslow
Basketry That Evening!
 
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

1C35₁₆ Posts

At least 8 threads... but good performance, so not a Bulldozer?
Old 2011-11-15, 03:32   #33
LaurV
Romulan Interpreter
 
 
"name field"
Jun 2011
Thailand

10291₁₀ Posts

Quote:
Originally Posted by Dubslow View Post
so not a Bulldozer?
Definitely not. I have read bad things about AMD, right here on this forum :D
About the photos, I really tried, but the forum's 240 kB limit pissed me off: I have to use either low resolution or heavy JPG compression, and in either case you can't see anything clearly... I will try again tonight.

Last fiddled with by LaurV on 2011-11-15 at 03:32

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.
