mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2013-10-03, 09:27   #34
Bdot
 
Bdot's Avatar
 
Nov 2010
Germany

59710 Posts
Default

Quote:
Originally Posted by kracker View Post
-d 11 output. Hmm.
Cool, we already got everything prepared, just the OpenCL code does not fit the Intel compiler.

Please use the two attached barrett files to replace the ones in the mfakto folder. I hope I added enough type-casts to satisfy the Intel compiler. If not, then feel free to add more (to reduce the turnaround times).

Once we got that running, it might be useful to check, if that platform runs mad24() at least at the same speed as mad() - I know that NVIDIA needs to add extra operations to mask out the upper 8 bits of the operands ...
Attached Files
File Type: zip barretts.zip (24.9 KB, 325 views)

Last fiddled with by Bdot on 2013-10-03 at 09:28 Reason: typo
Bdot is offline   Reply With Quote
Old 2013-10-03, 15:19   #35
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

37·59 Posts
Default

Done. (Hmm..)
Attached Files
File Type: txt output.txt (10.2 KB, 410 views)
kracker is offline   Reply With Quote
Old 2013-10-03, 18:05   #36
Bdot
 
Bdot's Avatar
 
Nov 2010
Germany

3·199 Posts
Default

Quote:
Originally Posted by kracker View Post
Done. (Hmm..)
Interesting. 4 tests succeeded!

So now it loads and compiles fine, just the code is failing

To see if it's the sieve or the tf, could you please run with
SieveOnGPU=0

Another test:
VectorSize=1

And I'd like to see the output of 2-3 minutes of mfakto -st (best would be with CPU sieving).
Bdot is offline   Reply With Quote
Old 2013-10-03, 18:15   #37
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

37·59 Posts
Default

Quote:
Originally Posted by Bdot View Post
Interesting. 4 tests succeeded!

So now it loads and compiles fine, just the code is failing

To see if it's the sieve or the tf, could you please run with
SieveOnGPU=0

Another test:
VectorSize=1

And I'd like to see the output of 2-3 minutes of mfakto -st (best would be with CPU sieving).
Will do. Unfortunately later this evening.
kracker is offline   Reply With Quote
Old 2013-10-03, 19:26   #38
Bdot
 
Bdot's Avatar
 
Nov 2010
Germany

3·199 Posts
Default

Quote:
Originally Posted by kracker View Post
Will do. Unfortunately later this evening.
Assuming it is the TF that actually fails, the next step is to edit mfakto_Kernels.cl, line 45

#define TRACE_KERNEL 0

Change that to 2 at first - later 3 or 4 will be needed.

Run that with
SieveOnGPU=1
VectorSize=1

Use the same settings for your AMD GPU and compare the output. There should be no differences at TRACE level 2. Higher levels also trace the intermediate results which may differ due to different rounding.

I guess that at some point, TRACE level 2 also shows differences. These need to be examined in the higher-level traces ...

If you send me the Intel-output of level 2, 3 and 4 (just 1 minute each), I can do the comparing and searching myself.


The fact that 4 test cases were successful makes me think that not all is lost
Bdot is offline   Reply With Quote
Old 2013-10-03, 22:52   #39
TheMawn
 
TheMawn's Avatar
 
May 2013
East. Always East.

11·157 Posts
Default

I don't suppose there would be an advantage to delegating a certain part of the whole process to the iGPU and the rest to a discrete GPU? Or to the actual cores of the processor?
TheMawn is offline   Reply With Quote
Old 2013-10-03, 22:56   #40
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

218310 Posts
Default

Quote:
Originally Posted by TheMawn View Post
I don't suppose there would be an advantage to delegating a certain part of the whole process to the iGPU and the rest to a discrete GPU? Or to the actual cores of the processor?
Sorry, don't quite get it. Care to expand?
kracker is offline   Reply With Quote
Old 2013-10-03, 23:18   #41
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

218310 Posts
Default

Hmmmm.
With GPU sieving off and VectorSize1:
Code:
number of tests          117
successful tests         53
no factor found          64
kracker is offline   Reply With Quote
Old 2013-10-04, 00:59   #42
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

37·59 Posts
Default

Sorry, triple post
some self test.
http://www.filedropper.com/output
kracker is offline   Reply With Quote
Old 2013-10-04, 02:27   #43
TheMawn
 
TheMawn's Avatar
 
May 2013
East. Always East.

11·157 Posts
Default

Quote:
Originally Posted by kracker View Post
Sorry, don't quite get it. Care to expand?
What I meant was a while back the CPU did part of the work (sieving if I recall) while the GPU did the rest. Later, it was found that the GPU could do that as well. Is it possible that the iGPU might do some part of the work better or worse than another part which could be delegated to a different piece of hardware?

I.e. sieve on iGPU and TF on proper GPU if there is one available?
TheMawn is offline   Reply With Quote
Old 2013-10-04, 18:57   #44
Bdot
 
Bdot's Avatar
 
Nov 2010
Germany

3×199 Posts
Default

Quote:
Originally Posted by TheMawn View Post
What I meant was a while back the CPU did part of the work (sieving if I recall) while the GPU did the rest. Later, it was found that the GPU could do that as well. Is it possible that the iGPU might do some part of the work better or worse than another part which could be delegated to a different piece of hardware?

I.e. sieve on iGPU and TF on proper GPU if there is one available?
CPU sieving is still an option, also once we get the IntelHD to work with mfakto. This has never been removed from mfakt[co], it's just not the default configuration anymore.

Edit: sieving on one GPU and TF on the other is less likely to be efficient as the speeds would need to adjust to each other, leading to the same issues that CPU sieving has: the slower part may not keep up, wasting the faster one's resources.

Last fiddled with by Bdot on 2013-10-04 at 18:59 Reason: forgot one part
Bdot is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Can I run my CPU's integrated GPU along with my discrete GPU? Red Raven GPU Computing 9 2014-10-24 02:01
New integrated CPU-GPU programming paradigm Dubslow GPU Computing 1 2012-02-15 08:45
Ivy Bridge integrated GPU? Dubslow GPU Computing 7 2011-11-18 23:36
Can I use integrated graphics alongside a GPU? mdettweiler GPU Computing 9 2010-09-15 19:41
turn off your integrated Snd card in CMOS nngs Hardware 0 2005-05-20 01:31

All times are UTC. The time now is 15:20.


Fri Jul 7 15:20:11 UTC 2023 up 323 days, 12:48, 0 users, load averages: 1.09, 1.08, 1.10

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔