mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2013-06-25, 17:42   #826
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

23×271 Posts
Default

Quote:
Originally Posted by Rodrigo View Post
The exponents were in the 67M range (TF).

My 7770 is as it came, no adjustments made to it. The only tweak I've made to mfakto 0.13 is to change the VectorSize value from the default 4 down to 2, as suggested by the program itself the first time I ran it.

Have you made any other adjustments to increase the output?

Rodrigo
I pushed GPUSievePrimes around 10k up from default, GPUSieveProcessSize to 24 from 16.
kracker is offline   Reply With Quote
Old 2013-06-26, 02:32   #827
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
Jun 2011
Thailand

2×3×1,609 Posts
Default

Let's start by telling you that I have to go back 36 pages and read this topic from the start. I got my hands on a XFX HD 7970 and playing with it since last Friday. I may keep it for a while, but after running it till now, I am still remaining definitively a nVidia guy (sorry to say! reasons are multiple, not only related to performance or consumption, but goes also into scrypt mining, etc. not related to the current subject, we may discuss more in a different topic if someone is interested).

But my problem now is not with the card, but more with mfakto. Using default .ini file the performance was lousy, and the program advised me to change whatever variable from 4 to 2 (kudos to Bdot for not letting beginners like me scratch their head in vain!), which I did (something about magical-disappearing registers). With new settings, I can get about 390GHzD/D out of it with P95 running (GPU sieving) and about 420 with P95 stopped (GPU sieving too, there is no question about keeping my CPU busy with sieving, I would not try it if that was the case!). This for factoring in the 332M range of the expos (I took a bunch of them to 74 bits, founding 6 factors on the way, as one could check with the DB, in the last 5 days).

Everything wonderful up to here. Hip-hip hoorays go to Bdot.

But then my freaking fingers tickled and I run --perftest, and start playing with number of primes to sieve (default ~80k in the ini file). For any other values except the default one, the -sp test fails (always missing 2-3 factors from all the bunch). I tried numbers around 60k, 100k, 200k, and larger, close to the max 1M. It seems that the card would get better results for smaller value there, but what the hack, it always missing factors. I am thinking that the number I use there should be a multiple of something or have special properties? Otherwise I can't find an explanation. I tried also playing with the clock (like in "reducing it") and the card consumption (like "increasing it", give it more voltage), but the results are the same. In this status, my money are either on (1) some properties of the number which I fail to acknowledge (and therefore I am giving a bad parameter to the program, but I would wonder if Bdot missed to warn the user when rubbish is put in, I saw Bdot is very diligent with warning messages!) or either on a (2) software bug.

Therefore the current post... Till a reply, I will go back to read the topic from the beginning, to see what I missed.

Any idea?

Last fiddled with by LaurV on 2013-06-26 at 02:46
LaurV is online now   Reply With Quote
Old 2013-06-26, 02:42   #828
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

23×271 Posts
Default

Quote:
Originally Posted by LaurV View Post
~~~~
Hmm, I don't think I've tried -st on other SievePrime than the default.

By the way, what catalyst version are you using? it shouldn't drop to 390 from 420... Just curious what GHz do you get in the LL or DC range?

Last fiddled with by kracker on 2013-06-26 at 03:29
kracker is offline   Reply With Quote
Old 2013-06-26, 02:50   #829
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
Jun 2011
Thailand

965410 Posts
Default

Didn't try current LL range yet. Queued some work from Chris for the next days.

OTOH, I always edit my posts in the next 10-20 minutes after they were posted, to repair grammar, typos, accentuate ideas, etc. It is not a good habit of you to include my last posts into reply, so everybody can see my stupidity before I am fast enough to mask it . Please delete it! (anyhow, it is not good to include a quote to a full post, especially if you reply to the last post, that's what the netiquette says, beside of the fact that it takes futile space onthe servers).

Last fiddled with by LaurV on 2013-06-26 at 02:54
LaurV is online now   Reply With Quote
Old 2013-06-26, 02:59   #830
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
Jun 2011
Thailand

2·3·1,609 Posts
Default

Quote:
Originally Posted by Bdot View Post
Third, the AMD drivers. Sadly, the latest windows drivers (13.4) make mfakto consume almost one CPU core, even when GPU sieving. When other programs (like prime95) use a lot of CPU, then mfakto's high CPU load goes away, but at the cost of some 10-20% throughput.
That is the first "Bingo!" related to my problem. Thanks. I am glad I started to read the topic viceversa! (from the newest posts to the oldest). The discussion is not necessarily connected, but it can provide a faster solution, hehe.

What drivers are recommended? (Win7 64 bits)

Still reading.
LaurV is online now   Reply With Quote
Old 2013-06-26, 03:28   #831
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

23·271 Posts
Default

Quote:
Originally Posted by LaurV View Post
That is the first "Bingo!" related to my problem. Thanks. I am glad I started to read the topic viceversa! (from the newest posts to the oldest). The discussion is not necessarily connected, but it can provide a faster solution, hehe.

What drivers are recommended? (Win7 64 bits)

Still reading.
13.1. They aren't available on AMD's site because they are older, so techpowerup has them here. I also recommend using AMD's complete cleanup tool here to remove the driver completely.

Oh and sorry for the full quote, I really wasn't thinking..
kracker is offline   Reply With Quote
Old 2013-06-26, 05:18   #832
Rodrigo
 
Rodrigo's Avatar
 
Jun 2010
Pennsylvania

16468 Posts
Default

Quote:
Originally Posted by kracker View Post
Is this in the LL range ~65M? On my 7770 non OC'ed I am getting around 160 Ghz/days.
kracker, by any chance were you running Prime95 concurrently when you got this output?

The reason I ask has to do with Bdot's observation (just recently quoted by LaurV) that:
Quote:
Sadly, the latest windows drivers (13.4) make mfakto consume almost one CPU core, even when GPU sieving. When other programs (like prime95) use a lot of CPU, then mfakto's high CPU load goes away, but at the cost of some 10-20% throughput.
What I'm thinking is that if you take 10% from 160 GHz-days/day, you get 144 which is very close to what I'm getting with no tweaks beyond the VectorSize value, so could it be that it's not necessary to make further manual adjustments? (I was running Prime95 when I ran my mfakto tests.)

Just to confuse things further , FWIW I'm on "Catalyst Version" 12.10 according to AMD's Catalyst Control Center but Windows Device Manager says that the "driver version" is 9.2.0.0, dated 9/27/2012. I'm never sure which number is the "real" one, but if I had to bet on one I'd put my money on 12.10.

Rodrigo

Last fiddled with by Rodrigo on 2013-06-26 at 05:20 Reason: added emphasis to quote
Rodrigo is offline   Reply With Quote
Old 2013-06-26, 14:49   #833
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

87816 Posts
Default

Quote:
Originally Posted by Rodrigo View Post
kracker, by any chance were you running Prime95 concurrently when you got this output?
Nope.
Quote:
Just to confuse things further , FWIW I'm on "Catalyst Version" 12.10 according to AMD's Catalyst Control Center but Windows Device Manager says that the "driver version" is 9.2.0.0, dated 9/27/2012. I'm never sure which number is the "real" one, but if I had to bet on one I'd put my money on 12.10.

Rodrigo
the driver version is, the 9.2.0.0. the package version is 12.10, that's what everyone means when they say driver version. 12.10 is not affected by the cpu bug.
kracker is offline   Reply With Quote
Old 2013-06-26, 16:24   #834
Rodrigo
 
Rodrigo's Avatar
 
Jun 2010
Pennsylvania

2×467 Posts
Default

Quote:
Originally Posted by kracker View Post
the driver version is, the 9.2.0.0. the package version is 12.10, that's what everyone means when they say driver version. 12.10 is not affected by the cpu bug.
All right, I'll stay on 12.10 'til things get sorted out. Thanks.

Rodrigo

Last fiddled with by Rodrigo on 2013-06-26 at 16:24
Rodrigo is offline   Reply With Quote
Old 2013-06-26, 22:58   #835
Rodrigo
 
Rodrigo's Avatar
 
Jun 2010
Pennsylvania

16468 Posts
Default

More observations on version 0.13:

Often (not all the time), I've noticed a distinct lag when repositioning windows, opening websites, or even moving the mouse cursor around the screen -- especially just after sending a file to the printer. The mouse cursor lag is particularly pronounced as it seems to just sit there after I move the mouse, and then the cursor pops up way off somewhere else, making it difficult to select or click on things on the screen.

None of this happens with version 0.12.

The shine is coming off a bit from 0.13. As this is a production machine, I'm going back to 0.12 for the time being, even though Prime95 takes a hit and the TF yield is lower even with three mfakto instances running.

Anybody else getting this lag? If I can get rid of it, it'll make 0.13 viable again for me.

Rodrigo
Rodrigo is offline   Reply With Quote
Old 2013-06-26, 23:59   #836
Jayder
 
Jayder's Avatar
 
Dec 2012

11616 Posts
Default

Did you try lowering GPUSieveSize in mfakto.ini? That was what made the big difference for me.
Jayder is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
gpuOwL: an OpenCL program for Mersenne primality testing preda GpuOwl 2718 2021-07-06 18:30
mfaktc: a CUDA program for Mersenne prefactoring TheJudger GPU Computing 3497 2021-06-05 12:27
LL with OpenCL msft GPU Computing 433 2019-06-23 21:11
OpenCL for FPGAs TObject GPU Computing 2 2013-10-12 21:09
Program to TF Mersenne numbers with more than 1 sextillion digits? Stargate38 Factoring 24 2011-11-03 00:34

All times are UTC. The time now is 07:46.


Mon Aug 2 07:46:07 UTC 2021 up 10 days, 2:15, 0 users, load averages: 1.90, 1.52, 1.41

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.