mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2013-02-07, 14:47   #1
Aramis Wyler
 
Aramis Wyler's Avatar
 
"Bill Staffen"
Jan 2013
Pittsburgh, PA, USA

1101010002 Posts
Default GPU RAM.

I upgraded my video card recently to a gtx580, and was appalled to discover that the thing has 3 gigs of ram on it. Do any of the gpu program types (ll, p-1, tf, etc) take better/more advantage of on card memory than others?
Aramis Wyler is offline   Reply With Quote
Old 2013-02-07, 14:49   #2
firejuggler
 
firejuggler's Avatar
 
"Vincent"
Apr 2010
Over the rainbow

23×5×73 Posts
Default

Unfortunatly, only P-1 might, and there is no software written for it (involving GPU) yet.
firejuggler is offline   Reply With Quote
Old 2013-02-07, 16:26   #3
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

37×59 Posts
Default

LL does, if the range is high.

EDIT:CudaLucas of course

Last fiddled with by kracker on 2013-02-07 at 16:31
kracker is offline   Reply With Quote
Old 2013-02-07, 16:36   #4
Aramis Wyler
 
Aramis Wyler's Avatar
 
"Bill Staffen"
Jan 2013
Pittsburgh, PA, USA

42410 Posts
Default

What constitutes 'high'? on p95 the memory needed never goes above 200 megs even when ll'ing ion the 60M block. Does CUDALucas use more memory on the card (not shared memory) than p95 does on the mainboard, or by high ranges are you talking the 332M range?

Last fiddled with by Aramis Wyler on 2013-02-07 at 16:36
Aramis Wyler is offline   Reply With Quote
Old 2013-02-07, 20:31   #5
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3·29·83 Posts
Default

Quote:
Originally Posted by Aramis Wyler View Post
What constitutes 'high'? on p95 the memory needed never goes above 200 megs even when ll'ing ion the 60M block. Does CUDALucas use more memory on the card (not shared memory) than p95 does on the mainboard, or by high ranges are you talking the 332M range?
Both, I think. CUDALucas' memory footprint is larger than Prime95's footprint, but not by an order of magnitude. So you'd still only notice differences in the (e.g.) 332M range.
Dubslow is offline   Reply With Quote
Old 2013-02-07, 20:36   #6
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

101100011011102 Posts
Default

Quote:
Originally Posted by Dubslow View Post
Both, I think. CUDALucas' memory footprint is larger than Prime95's footprint, but not by an order of magnitude. So you'd still only notice differences in the (e.g.) 332M range.
Just putting this out there...

There are many problem spaces which require a lot of near RAM.

Video games aside, having a GPU with a lot of RAM available is rarely a bad thing....
chalsall is offline   Reply With Quote
Old 2013-02-08, 03:05   #7
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
"name field"
Jun 2011
Thailand

41·251 Posts
Default

@OP: can you try running cudalucas -cufftbench with some very large FFT sizes and see where it stops? That is the limit for your exponents. For a "normal" GTX580 (the one with 1536MB) the "crash point" is somewhere at 12M FFT size, IIRC, and I was never able to run LMH on those (332M expo would need at least 18M FFT size).

Last fiddled with by LaurV on 2013-02-08 at 03:08
LaurV is offline   Reply With Quote
Old 2013-02-08, 13:51   #8
Aramis Wyler
 
Aramis Wyler's Avatar
 
"Bill Staffen"
Jan 2013
Pittsburgh, PA, USA

23×53 Posts
Default

Sure thing, I'll run it when I get home from work, in about 9 hours. Any particular increment? 1k?
Aramis Wyler is offline   Reply With Quote
Old 2013-02-08, 14:27   #9
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
"name field"
Jun 2011
Thailand

101000001100112 Posts
Default

Quote:
Originally Posted by Aramis Wyler View Post
Sure thing, I'll run it when I get home from work, in about 9 hours. Any particular increment? 1k?
You will cry near the computer Those large FFT need ages to go, I would suggest something more like a binary search, start with a 100k, or so, see where it crashes, do that range with 10k, etc. Or, depends on your time...

Last fiddled with by LaurV on 2013-02-08 at 14:27
LaurV is offline   Reply With Quote
Old 2013-02-08, 15:30   #10
Aramis Wyler
 
Aramis Wyler's Avatar
 
"Bill Staffen"
Jan 2013
Pittsburgh, PA, USA

23×53 Posts
Default

Ha! Fair enough. I'll run 11534336 20971520 1048576 just to see if it's even feasible to work in that range and if so work on something more in the 18-19M range at smaller increments (maybe 32k if I don't collapse into a river of tears).

I'll post the first search raw, and post the second one after I filter it down to remove the lower increments w/higher runtimes.
Aramis Wyler is offline   Reply With Quote
Old 2013-02-08, 15:46   #11
Redarm
 
Redarm's Avatar
 
Apr 2012
Berlin Germany

3316 Posts
Default

.

Last fiddled with by Redarm on 2013-02-08 at 16:08
Redarm is offline   Reply With Quote
Reply



All times are UTC. The time now is 14:52.


Fri Jul 7 14:52:23 UTC 2023 up 323 days, 12:20, 0 users, load averages: 1.24, 1.15, 1.12

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔