mersenneforum.org  

Old 2017-01-09, 07:08   #1
VBCurtis
 
"Curtis"
Feb 2005
Riverside, CA

5854₁₀ Posts
Which forum CUDA programs are DP vs SP?

I am considering buying a secondhand workstation with a Quadro 5000 GPU, whose DP performance is 1/2 of its SP performance. If I buy this machine, I do not want to waste the GPU on single-precision work when choosing among the programs at mersenneforum.

Which of our CUDA-enabled programs require DP calculations? Which are SP? I already use msieve poly select, ECM, and cudaLLR; I would consider trial factoring, CUDALucas, and perhaps something I'm not thinking of (a sieve for a project I already like, say).
Old 2017-01-09, 07:47   #2
Mark Rose
 
"/X\(‘-‘)/X\"
Jan 2013
https://pedan.tech/

2⁴·199 Posts

Keep in mind the Quadro 5000 is about as fast as a GTX 960 for LL. It is still reasonably power efficient when doing DP though.
Old 2017-04-13, 07:29   #3
preda
 
"Mihai Preda"
Apr 2015

2654₈ Posts

TF (mfaktc, mfakto) is SP.
LL (CUDALucas, clLucas) is DP.

LL is also likely limited by memory bandwidth, not only DP performance.
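
One rough way to reason about the bandwidth point is the roofline model: attainable throughput is the smaller of the peak arithmetic rate and memory bandwidth times arithmetic intensity (FLOPs per byte moved). The sketch below is only an illustration. The function names are made up for this example, and the card and kernel figures are hypothetical placeholders, not measurements of any real GPU or of CUDALucas.

```python
def attainable_gflops(peak_gflops, bandwidth_gbs, flops_per_byte):
    """Roofline estimate: throughput is capped by compute or by memory traffic."""
    return min(peak_gflops, bandwidth_gbs * flops_per_byte)


def limiting_factor(peak_gflops, bandwidth_gbs, flops_per_byte):
    """Name whichever ceiling is lower for the given kernel and card."""
    memory_ceiling = bandwidth_gbs * flops_per_byte
    return "memory bandwidth" if memory_ceiling < peak_gflops else "compute"


# Hypothetical card: 400 GFLOP/s DP peak, 100 GB/s memory bandwidth.
# Hypothetical kernel: 2 FLOPs for every byte read or written.
print(attainable_gflops(400.0, 100.0, 2.0))  # 200.0
print(limiting_factor(400.0, 100.0, 2.0))    # memory bandwidth
```

With these placeholder numbers the kernel reaches only half the DP peak, so a card with a better DP ratio but the same memory bandwidth would not run it any faster.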
Old 2017-12-26, 18:50   #4
kriesel
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

1E90₁₆ Posts
P-1

Quote:
Originally Posted by preda
TF (mfaktc, mfakto) is SP.
LL (CUDALucas, clLucas) is DP.

LL is also likely limited by memory bandwidth, not only DP performance.
P-1 (CUDAPM1) is also DP. A brief look at the v0.20 source shows many variables typed double; floats are rare by comparison. GPU memory occupancy, as I've observed it via GPU-Z, ranks P-1 > LL > TF. Occupancy limits the maximum exponent for a given GPU's memory size, while bandwidth affects execution speed. Still, it seems reasonable to expect bandwidth and occupancy to be related. Even a GPU with 1 GB of memory (a Quadro 2000, for example) can run exponents well above the current PrimeNet wavefronts.
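
To make the occupancy-versus-exponent relationship concrete, here is a back-of-the-envelope sketch. It is not taken from the CUDAPM1 or CUDALucas source, and everything in it is an assumption made for illustration: roughly 18 bits of the number stored per double-precision FFT word, FFT lengths rounded up to a power of two (real programs also use other lengths), and a caller-chosen count of FFT-sized buffers resident on the GPU.

```python
BYTES_PER_DOUBLE = 8


def estimate_fft_length(exponent, bits_per_word=18.0):
    """Smallest power-of-two FFT length that holds `exponent` bits
    at the assumed bits-per-word density."""
    words_needed = exponent / bits_per_word
    fft_len = 1
    while fft_len < words_needed:
        fft_len *= 2
    return fft_len


def estimate_memory_mib(exponent, n_buffers, bits_per_word=18.0):
    """Rough GPU memory footprint in MiB for `n_buffers` FFT-sized arrays of doubles."""
    fft_len = estimate_fft_length(exponent, bits_per_word)
    return fft_len * BYTES_PER_DOUBLE * n_buffers / 2**20


def fits_in_gpu(exponent, n_buffers, gpu_mib):
    """True if the estimated footprint is within the given GPU memory size."""
    return estimate_memory_mib(exponent, n_buffers) <= gpu_mib


# Illustrative exponent and buffer count, not measured values.
print(estimate_fft_length(80_000_000))      # 8388608
print(estimate_memory_mib(80_000_000, 4))   # 256.0
print(fits_in_gpu(80_000_000, 4, 1024))     # True
```

Under these assumptions, a program that keeps more FFT-sized buffers resident needs proportionally more memory, so the largest exponent that fits on a given card shrinks as the buffer count grows.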

Last fiddled with by kriesel on 2017-12-26 at 19:02

All times are UTC.



Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.
