Oliver:
Exactly when the automated primenet interaction is released depends on me finishing it and how much other work you do. It happens when it happens. I hope to get the necessary changes into the main part of mfaktc on 0.18, but we shall see. I have to do them and Oliver has to fold them in along with the dummy version of mfaktc-primenet. My separate file needs a termination function so that we can ensure that libcurl is also informed to close connections. ********** As for estimating assignments, I was thinking that if we ensured there were 5 assignments in worktodo.txt, that would provide enough fodder for figuring out how much to get for next time. ******** |
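The "keep 5 assignments queued" idea can be sketched roughly as below. This is only an illustration, not code from mfaktc: the function names and `TARGET_QUEUE` are made up, and it assumes that each assignment in worktodo.txt is a line starting with `Factor=`.

```c
#include <string.h>

/* Hypothetical target: the "5 assignments" figure suggested in the post. */
#define TARGET_QUEUE 5

/* Count the lines in an in-memory copy of worktodo.txt that begin with
 * "Factor=" (assumed here to mark one trial-factoring assignment each). */
static int count_assignments(const char *text)
{
    int n = 0;
    const char *p = text;

    while (p && *p) {
        if (strncmp(p, "Factor=", 7) == 0)
            n++;
        p = strchr(p, '\n');   /* jump to the end of the current line */
        if (p)
            p++;               /* step over the newline */
    }
    return n;
}

/* How many assignments to ask for so that the queue is back at the target. */
static int assignments_to_request(int queued)
{
    return queued < TARGET_QUEUE ? TARGET_QUEUE - queued : 0;
}
```

With two assignments left in the file, `assignments_to_request(count_assignments(text))` would ask for three more.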
Compute capability
I guess it does not happen often :-) but you probably should mention in the readme that mfaktc does not support compute capability 1.0. I went through getting the CUDA DLLs and installing new video drivers just to find out that it does not support v1.0 :-(
BTW, is there a real problem with using v1.0, or has it just not been considered? Andriy |
Hi Andriy,
Sorry, I'll add the information to the README.txt. I have just noticed that this information is only available in Changelog.txt and in the mfaktc article on mersennewiki.org ([url]http://mersennewiki.org/index.php/Mfaktc[/url]). Early versions ran fine on my 8800 GTX (G80 GPU, the only CC 1.0 GPU) but newer versions don't. There are (at least) two reasons why it doesn't work on CC 1.0 GPUs:[LIST][*]use of atomic instructions for access to the results array (this needs CC >=1.1)[*]I don't know what the problem is but at some point it stopped working on CC 1.0. :sad:[/LIST]From mfaktc Changelog.txt [CODE]- officially GPUs with compute capability 1.0 are not supported. AFAIK the only GPU affected is the G80 (8800 GTS 320, 8800 GTS 640, 8800 GTX, 8800 Ultra and their Quadro/Tesla variants (but not a 8800 GTS 512, this one is a G92 GPU)). The issue seems to be the synchronisation of the writes to *d_RES. _PERHAPS_ I'm able to fix this in feature releases. BUT are there really many G80 GPUs out there? I think it is not worth the work (and yes, personally I own a 8800GTX). [/CODE] I won't place my money on a fix for CC 1.0 GPUs... Oliver |
[QUOTE=TheJudger;265318][LIST][*]use of atomic instructions for access to the results array (this needs CC >=1.1)[/LIST][/QUOTE] That is also the reason why my OpenCL version will not be able to run on ATI GPUs before HD5xxx - the older ones don't have atomics ...
I gave some thought to making this work without atomics, but implementing locks on GPUs is a little complicated. Without locks it will only work if we assume that at most one factor is found per grid. However, I found hints on how to implement the locking. So if there's a big vote for supporting HD4xxx or maybe CUDA CC 1.0, I could add something to mfakto (and suggest it for mfaktc :smile: ). |
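To make the atomics point concrete, here is a host-side sketch in plain C11, not the actual mfaktc or mfakto kernel code. `atomic_fetch_add` stands in for the GPU atomic increment discussed above, and the struct names, function names and `MAX_FACTORS` are hypothetical. The first reporter reserves a private slot per factor; the second shows the lock-free fallback, which is only correct under the assumption that at most one factor is found per grid.

```c
#include <stdatomic.h>
#include <stdint.h>

#define MAX_FACTORS 10   /* hypothetical capacity of the results array */

/* Hypothetical results buffer: a shared counter followed by factor slots. */
typedef struct {
    atomic_uint count;
    uint64_t    factors[MAX_FACTORS];
} result_buf;

static void result_init(result_buf *r)
{
    atomic_init(&r->count, 0);
}

/* With atomics: every reporter gets a unique slot index, so two threads
 * that find factors at the same time cannot overwrite each other.
 * count keeps growing past MAX_FACTORS; only the first MAX_FACTORS
 * factors are stored. */
static void report_factor_atomic(result_buf *r, uint64_t f)
{
    unsigned slot = atomic_fetch_add(&r->count, 1);
    if (slot < MAX_FACTORS)
        r->factors[slot] = f;
}

/* Without atomics or locks: everyone writes to the same fixed slot.
 * This is only safe if at most one factor is found per grid; a second
 * factor silently replaces the first. */
typedef struct {
    unsigned count;
    uint64_t factor;
} result_single;

static void report_factor_single(result_single *r, uint64_t f)
{
    r->factor = f;
    r->count  = 1;
}
```

Reporting two factors through `report_factor_atomic` leaves both in the buffer, while `report_factor_single` keeps only the last one, which is the limitation described in the post.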
Oliver:
Should/is Bdot on the distribution list for pre-release mfaktc? Just to let you, Bdot, know, the plan for mfaktc is to add a few CPU-side calls to the current code for when factors are found or not or work is needed, and to use as much P95 code as possible (preferably, whole source files unmodified) to do the work. |
Hey, what's the preferred exchange rate for spaces to tabs?
And are we using the one true style or the other true style for braces? [I like mine to line up, but I'll follow whatever convention]. Add to list: More robust parsing of worktodo.txt....in parse.c. |
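As a starting point for the "more robust parsing" item, a stricter line parser could look something like this. It is a sketch only, not the code in parse.c: the struct and function names are hypothetical, and it assumes lines of the form `Factor=<exponent>,<bit_min>,<bit_max>` with an optional assignment key as the first field.

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical container for one trial-factoring assignment. */
typedef struct {
    unsigned exponent;
    int      bit_min;
    int      bit_max;
} tf_assignment;

/* Parse one worktodo.txt line. Returns 1 and fills *out on success,
 * 0 for comments, malformed lines, trailing junk or bit_min >= bit_max. */
static int parse_factor_line(const char *line, tf_assignment *out)
{
    const char *p = line;
    const char *q;
    unsigned exponent;
    int lo, hi, commas = 0;
    char junk;

    while (*p == ' ' || *p == '\t')      /* tolerate leading blanks */
        p++;
    if (strncmp(p, "Factor=", 7) != 0)
        return 0;
    p += 7;

    for (q = p; *q; q++)                 /* 2 commas: no key, 3: with key */
        if (*q == ',')
            commas++;
    if (commas == 3)
        p = strchr(p, ',') + 1;          /* skip the assignment key field */
    else if (commas != 2)
        return 0;

    if (*p < '0' || *p > '9')            /* exponent must start with a digit */
        return 0;
    /* " %c" only matches if something other than whitespace follows,
     * so a return value of exactly 3 means "three numbers, nothing else". */
    if (sscanf(p, "%u,%d,%d %c", &exponent, &lo, &hi, &junk) != 3)
        return 0;
    if (exponent == 0 || lo <= 0 || lo >= hi)
        return 0;

    out->exponent = exponent;
    out->bit_min  = lo;
    out->bit_max  = hi;
    return 1;
}
```

Rejecting a line outright instead of guessing keeps a typo in worktodo.txt from turning into hours of work on the wrong exponent or bit range.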
[QUOTE=TheJudger;265318]
Early versions ran fine on my 8800 GTX (G80 GPU, the only CC 1.0 GPU) but newer versions don't. [/QUOTE] Is it possible to get my hands on one of the earlier builds or they are not good enough to use them? I have two 8800 GTX and one HD4550. I wonder how much work could they do?.. |
[QUOTE=apsen;265336]I have two 8800 GTX and one HD4550. I wonder how much work could they do?..[/QUOTE]The 8800GTX should perform fairly close to my 8800GT; I get ~26GHz-days/day; assuming there was a stable version of mfaktc that performed similarly to recent builds, [i]and[/i] there's no performance penalty for Compute v1.0 implementation, I'd expect ~25-30 GHz-days/day from an 8800GTX. That was quite a few "ifs", however...
|
[QUOTE=apsen;265336]Is it possible to get my hands on one of the earlier builds or they are not good enough to use them?
I have two 8800 GTX and one HD4550. I wonder how much work could they do?..[/QUOTE] If you wade far enough back in this thread (it's 700 posts long!), you will find the early versions of mfaktc. You will want to visit the "Putting it All Together" thread, as the necessary windows DLL is very easy to find there. Finally, if you want to make a post on that thread with links to the posts with the early versions, I think Rodrigo will find that helpful. |
[QUOTE=Christenson;265360]If you wade far enough back in this thread (it's 700 posts long!), you will find the early versions of mfaktc. You will want to visit the "Putting it All Together" thread, as the necessary windows DLL is very easy to find there. Finally, if you want to make a post on that thread with links to the posts with the early versions, I think Rodrigo will find that helpful.[/QUOTE]
That would be great!! :tu::tu: Rodrigo |
[QUOTE=Christenson;265328]Oliver:
Should/is Bdot on the distribution list for pre-release mfaktc? Just to let you, Bdot, know, the plan for mfaktc is to add a few CPU-side calls to the current code for when factors are found or not or work is needed, and to use as much P95 code as possible (preferably, whole source files unmodified) to do the work.[/QUOTE] I received one pre-release version in order to get the signal handling into my stuff. However, mfakto is not yet ready to report results to primenet. Currently some GPUs do not correctly execute some kernels and I may need a fix from AMD. Once it is ready, and George has agreed, I can diff-merge your and Oliver's latest changes into my code so that we have the same type of communication with primenet. [QUOTE=apsen] ... and one HD4550. I wonder how much work could they do?.. [/QUOTE] With my current kernels that would be about 8-10 GHz-days/day. And the code compiles fine for HD4xxx when I just skip the atomic_inc ... |