mersenneforum.org  

Old 2011-07-02, 18:32   #1046
Christenson
 
Dec 2010
Monticello

703₁₆ Posts

Oliver:
Exactly when the automated primenet interaction is released depends on when I finish it and on how much other work you do. It happens when it happens. I hope to get the necessary changes into the main part of mfaktc for 0.18, but we shall see. I have to write them, and Oliver has to fold them in along with the dummy version of mfaktc-primenet.

My separate file needs a termination function so that libcurl is also told to close its connections.
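
A minimal sketch of what such a termination function could look like, assuming a hypothetical pair `primenet_init()` / `primenet_terminate()` (these names are illustrative and not taken from mfaktc). The network calls are left as comments so the sketch builds without libcurl; in a libcurl-based build the terminate step would be the natural place for `curl_easy_cleanup()` and `curl_global_cleanup()`.

```c
#include <assert.h>

/* Hypothetical module state: 1 between a successful init and terminate. */
static int primenet_ready = 0;

/* Set up the network layer once. Returns 0 on success.
 * A libcurl-based build would call curl_global_init() here. */
int primenet_init(void)
{
    primenet_ready = 1;
    return 0;
}

/* Tear down the network layer. Returns 1 if cleanup actually ran and
 * 0 if there was nothing to clean up, so calling it twice is harmless.
 * A libcurl-based build would call curl_easy_cleanup() on any open
 * handle and then curl_global_cleanup() here. */
int primenet_terminate(void)
{
    if (!primenet_ready)
        return 0;
    primenet_ready = 0;
    assert(primenet_ready == 0);  /* state is consistent after teardown */
    return 1;
}
```

Making the terminate function idempotent means every exit path can call it without tracking whether an earlier path already did.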

**********
As for estimating assignments, I was thinking that if we ensured there were always 5 assignments in worktodo.txt, that would provide enough fodder for figuring out how much to get next time.
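
The top-up idea can be sketched as follows. This assumes that every pending assignment is a line starting with `Factor=` and that the target is the 5 mentioned above; the function names are hypothetical, and lines with leading whitespace are deliberately not counted in this simple version.

```c
#include <assert.h>
#include <string.h>

#define TARGET_ASSIGNMENTS 5  /* the figure suggested in the post */

/* Count the lines in a worktodo-style text buffer that begin with
 * "Factor=". Comments and other line types are ignored. */
int count_assignments(const char *text)
{
    int count = 0;
    const char *line = text;

    while (line != NULL && *line != '\0') {
        if (strncmp(line, "Factor=", 7) == 0)
            count++;
        line = strchr(line, '\n');
        if (line != NULL)
            line++;  /* step past the newline to the next line */
    }
    return count;
}

/* How many assignments to request so that the file is back at target. */
int assignments_to_request(int pending)
{
    assert(pending >= 0);
    return pending < TARGET_ASSIGNMENTS ? TARGET_ASSIGNMENTS - pending : 0;
}
```

With two pending lines, for example, `assignments_to_request(2)` asks for three more.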
********
Old 2011-07-03, 14:24   #1047
apsen
 
Jun 2011

10000011₂ Posts
Compute capability

I guess it does not happen often :-) but you should probably mention in the readme that mfaktc does not support compute capability 1.0. I went through getting the CUDA DLLs and installing new video drivers just to find out that it does not support v1.0 :-(

BTW, is there a real problem with using v1.0, or has it just not been considered?

Andriy
Old 2011-07-03, 17:58   #1048
TheJudger
 
"Oliver"
Mar 2005
Germany

11·101 Posts

Hi Andriy,

Sorry, I'll add the information to README.txt. I have just noticed that it is currently only available in Changelog.txt and in the mfaktc article on mersennewiki.org (http://mersennewiki.org/index.php/Mfaktc).

Early versions ran fine on my 8800 GTX (G80 GPU, the only CC 1.0 GPU) but newer versions don't. There are (at least) two reasons why it doesn't work on CC 1.0 GPUs:
  • use of atomic instructions for access to the results array (this needs CC >=1.1)
  • I don't know what the problem is but at some point it stopped working on CC 1.0.
From mfaktc Changelog.txt
Code:
- officially GPUs with compute capability 1.0 are not supported. AFAIK the
  only GPU affected is the G80 (8800 GTS 320, 8800 GTS 640, 8800 GTX, 8800
  Ultra and their Quadro/Tesla variants (but not a 8800 GTS 512, this one is
  a G92 GPU)). The issue seems to be the synchronisation of the writes to
  *d_RES.
  _PERHAPS_ I'm able to fix this in feature releases. BUT are there really
  many G80 GPUs out there? I think it is not worth the work (and yes,
  personally I own a 8800GTX).
I won't place my money on a fix for CC 1.0 GPUs...
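
To illustrate the first reason, here is a CPU-side analogue in C11 of reserving a slot in a results array with an atomic increment. It is a sketch only and not mfaktc's kernel code: the buffer layout and the names `result_buffer` and `report_factor` are assumptions. In a CUDA kernel the corresponding step would use an atomic function such as `atomicAdd()` or `atomicInc()` on global memory, which is what needs CC >= 1.1.

```c
#include <assert.h>
#include <stdatomic.h>

#define MAX_RESULTS 32  /* arbitrary capacity for this sketch */

/* Assumed layout: a shared counter plus one slot per reported factor. */
typedef struct {
    atomic_uint count;
    unsigned long long factor[MAX_RESULTS];
} result_buffer;

/* Reserve a unique slot with a single atomic increment, then write the
 * factor into it. Returns the slot index, or -1 if the buffer is full. */
int report_factor(result_buffer *res, unsigned long long f)
{
    /* fetch_add returns the old value, so no two callers get the same slot */
    unsigned int slot = atomic_fetch_add(&res->count, 1u);

    if (slot >= MAX_RESULTS)
        return -1;  /* the counter still records that something was found */
    res->factor[slot] = f;
    return (int)slot;
}
```

Because the read and the increment of the counter happen as one indivisible step, two threads that find factors at the same time cannot both be handed the same slot.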

Oliver
Old 2011-07-03, 20:10   #1049
Bdot
 
Nov 2010
Germany

3×199 Posts

Quote:
Originally Posted by TheJudger
  • use of atomic instructions for access to the results array (this needs CC >=1.1)
That is also the reason why my OpenCL version will not be able to run on ATI GPUs before HD5xxx - the older ones don't have atomics ...

I gave some thought to how to make this work without atomics, but implementing locks on GPUs is a little complicated. Without locks it will only work if we assume that we find at most one factor per grid. However, I found hints on how to implement the locking. So if there's a big vote for supporting HD4xxx or maybe CUDA CC 1.0, I could add something to mfakto (and suggest it for mfaktc).
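
Why the one-factor-per-grid assumption matters can be shown with a deterministic, single-threaded replay of one unlucky interleaving. This is a sketch in plain C with hypothetical names (`plain_results`, `simulate_two_reports`), not GPU code: two reporters both read the counter before either of them writes.

```c
#include <assert.h>

/* Results buffer with an ordinary (non-atomic) counter. */
typedef struct {
    unsigned int count;
    unsigned long long factor[4];
} plain_results;

/* Step 1 of a non-atomic report: read the next free slot. */
unsigned int read_slot(const plain_results *r)
{
    return r->count;
}

/* Step 2: store the factor and publish the new count. */
void write_slot(plain_results *r, unsigned int slot, unsigned long long f)
{
    assert(slot < 4);
    r->factor[slot] = f;
    r->count = slot + 1;
}

/* Replay the bad interleaving: both reporters read, then both write.
 * Returns how many factors the buffer claims to hold afterwards. */
unsigned int simulate_two_reports(plain_results *r,
                                  unsigned long long f1,
                                  unsigned long long f2)
{
    unsigned int s1 = read_slot(r);  /* reporter 1 sees slot 0 */
    unsigned int s2 = read_slot(r);  /* reporter 2 also sees slot 0 */

    write_slot(r, s1, f1);
    write_slot(r, s2, f2);           /* overwrites reporter 1's factor */
    return r->count;
}
```

Starting from an empty buffer, the replay ends with a count of 1 and only the second factor stored, so one result is silently lost. With at most one factor per grid this interleaving cannot occur, which is the assumption named above.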
Old 2011-07-03, 20:32   #1050
Christenson
 
Dec 2010
Monticello

5×359 Posts

Oliver:
Should/is Bdot on the distribution list for pre-release mfaktc?

Just to let you, Bdot, know, the plan for mfaktc is to add a few CPU-side calls to the current code for when factors are found or not or work is needed, and to use as much P95 code as possible (preferably, whole source files unmodified) to do the work.
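
One possible shape for those CPU-side calls is a small table of hooks with a do-nothing implementation behind it. Everything here is an assumption for illustration: the struct, the function names, and the argument lists are hypothetical and do not come from mfaktc or P95.

```c
#include <assert.h>

/* Hypothetical hook table: one entry per event named in the plan. */
typedef struct {
    void (*factor_found)(unsigned long exponent, const char *factor);
    void (*no_factor)(unsigned long exponent, int bit_min, int bit_max);
    int  (*work_needed)(void);  /* returns the number of assignments fetched */
} primenet_hooks;

/* Number of hook calls seen by the dummy implementation; exposed so a
 * test harness can inspect it. */
int dummy_calls = 0;

static void dummy_factor_found(unsigned long exponent, const char *factor)
{
    (void)exponent;
    assert(factor != 0);
    dummy_calls++;
}

static void dummy_no_factor(unsigned long exponent, int bit_min, int bit_max)
{
    (void)exponent;
    assert(bit_min < bit_max);
    dummy_calls++;
}

static int dummy_work_needed(void)
{
    dummy_calls++;
    return 0;  /* the dummy never fetches anything */
}

/* Dummy table: lets the main program call the hooks before any real
 * network code exists. */
const primenet_hooks dummy_hooks = {
    dummy_factor_found,
    dummy_no_factor,
    dummy_work_needed
};
```

The main program would only ever call through the table, so swapping the dummy for a real implementation would not touch the factoring code.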
Old 2011-07-03, 21:45   #1051
Christenson
 
Dec 2010
Monticello

5·359 Posts

Hey, what's the preferred exchange rate for spaces to tabs?
And are we using the one true style or the other true style for braces? [I like mine to line up, but I'll follow whatever convention].

Add to list: more robust parsing of worktodo.txt, in parse.c.
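
A sketch of what stricter parsing of a single line might look like. It assumes the two line shapes `Factor=<exponent>,<bit_min>,<bit_max>` and `Factor=<key>,<exponent>,<bit_min>,<bit_max>`; the names `assignment` and `parse_factor_line` and the upper bound of 128 bits are hypothetical choices for this example, not what parse.c does.

```c
#include <assert.h>
#include <errno.h>
#include <limits.h>
#include <stdlib.h>
#include <string.h>

typedef struct {
    unsigned long exponent;
    int bit_min;
    int bit_max;
} assignment;

/* Parse a whole field as a decimal number no larger than max. Trailing
 * spaces and line endings are tolerated. Returns 1 on success. */
static int parse_field(const char *s, unsigned long max, unsigned long *out)
{
    char *end;
    unsigned long v;

    if (*s < '0' || *s > '9')  /* rejects empty fields, signs, spaces */
        return 0;
    errno = 0;
    v = strtoul(s, &end, 10);
    if (errno == ERANGE || v > max)
        return 0;
    while (*end == ' ' || *end == '\r' || *end == '\n')
        end++;
    if (*end != '\0')          /* trailing junk such as "68abc" */
        return 0;
    *out = v;
    return 1;
}

/* Returns 1 and fills *out if the line is a well-formed assignment. */
int parse_factor_line(const char *line, assignment *out)
{
    char buf[256];
    char *field[4];
    char *s;
    int n = 0;
    unsigned long p, lo, hi;

    assert(line != NULL && out != NULL);
    if (strncmp(line, "Factor=", 7) != 0 || strlen(line + 7) >= sizeof buf)
        return 0;
    strcpy(buf, line + 7);

    /* Split on commas in place; more than four fields is an error. */
    field[n++] = buf;
    for (s = buf; *s != '\0'; s++) {
        if (*s == ',') {
            if (n == 4)
                return 0;
            *s = '\0';
            field[n++] = s + 1;
        }
    }
    if (n < 3)
        return 0;

    /* With four fields the first one is the key and is skipped. */
    if (!parse_field(field[n - 3], ULONG_MAX, &p) ||
        !parse_field(field[n - 2], 128, &lo) ||
        !parse_field(field[n - 1], 128, &hi))
        return 0;
    if (p < 2 || lo >= hi)
        return 0;

    out->exponent = p;
    out->bit_min = (int)lo;
    out->bit_max = (int)hi;
    return 1;
}
```

Splitting on commas first and then validating each field with `strtoul` and an end-pointer check rejects truncated lines, reversed bit ranges, and trailing junk, rather than accepting whatever prefix happens to parse.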
Old 2011-07-03, 22:25   #1052
apsen
 
Jun 2011

131 Posts

Quote:
Originally Posted by TheJudger

Early versions ran fine on my 8800 GTX (G80 GPU, the only CC 1.0 GPU) but newer versions don't.
Is it possible to get my hands on one of the earlier builds, or are they not good enough to use?

I have two 8800 GTX and one HD4550. I wonder how much work they could do?
Old 2011-07-04, 01:11   #1053
James Heinrich
 
"James Heinrich"
May 2004
ex-Northern Ontario

11×311 Posts

Quote:
Originally Posted by apsen
I have two 8800 GTX and one HD4550. I wonder how much work they could do?
The 8800 GTX should perform fairly close to my 8800 GT, which gets ~26 GHz-days/day. Assuming there were a stable version of mfaktc that performed similarly to recent builds, and that there is no performance penalty for a compute capability 1.0 implementation, I'd expect ~25-30 GHz-days/day from an 8800 GTX. That was quite a few "ifs", however...
Old 2011-07-04, 02:18   #1054
Christenson
 
Dec 2010
Monticello

1795₁₀ Posts

Quote:
Originally Posted by apsen
Is it possible to get my hands on one of the earlier builds, or are they not good enough to use?

I have two 8800 GTX and one HD4550. I wonder how much work they could do?
If you wade far enough back in this thread (it's 700 posts long!), you will find the early versions of mfaktc. You will also want to visit the "Putting it All Together" thread, as the necessary Windows DLL is very easy to find there. Finally, if you want to make a post on that thread with links to the posts with the early versions, I think Rodrigo will find that helpful.
Old 2011-07-04, 06:01   #1055
Rodrigo
 
Jun 2010
Pennsylvania

2·467 Posts

Quote:
Originally Posted by Christenson
If you wade far enough back in this thread (it's 700 posts long!), you will find the early versions of mfaktc. You will also want to visit the "Putting it All Together" thread, as the necessary Windows DLL is very easy to find there. Finally, if you want to make a post on that thread with links to the posts with the early versions, I think Rodrigo will find that helpful.
That would be great!!

Rodrigo
Old 2011-07-04, 11:01   #1056
Bdot
 
Nov 2010
Germany

3·199 Posts

Quote:
Originally Posted by Christenson
Oliver:
Should/is Bdot on the distribution list for pre-release mfaktc?

Just to let you, Bdot, know, the plan for mfaktc is to add a few CPU-side calls to the current code for when factors are found or not or work is needed, and to use as much P95 code as possible (preferably, whole source files unmodified) to do the work.
I received one pre-release version in order to get the signal handling into my code. However, mfakto is not yet ready to report results to primenet. Currently some GPUs do not execute some kernels correctly, and I may need a fix from AMD. Once it is ready, and George has agreed, I can diff-merge your and Oliver's latest changes into my code so that we have the same type of communication with primenet.

Quote:
Originally Posted by apsen
... and one HD4550. I wonder how much work they could do?
With my current kernels that would be about 8-10 GHz-days/day. And the code compiles fine for HD4xxx when I just skip the atomic_inc ...
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.