mersenneforum.org  

Old 2011-01-14, 12:06   #518
TheJudger
 
"Oliver"
Mar 2005
Germany

Quote:
Originally Posted by aaronhaviland
I can't find 0.14 anywhere. Is it out?

No, not yet.

Quote:
Originally Posted by aaronhaviland
So true! Switched to a GTX 460 yesterday (clocked at 875MHz), still seems better without MORE_CLASSES, until I get to the 2^69-2^70 ranges.

Also, I've found that I need to set SIEVEPRIMES much lower...
I reduced the lower limit and found that it liked to hover around 4500, if I'm not also using the CPU for other intensive tasks. If I am, it likes it around 500.

Please don't change SIEVE_PRIMES_MIN/MAX unless you're really sure what the consequences are. E.g., increasing MAX above 100k will produce false results in some cases (a 24-bit overflow in k_tab[] becomes possible), and lowering MIN that far is usually just a waste of energy. Perhaps I'll lower MIN a little bit in future releases if this is really wanted. If a single core of your CPU isn't able to feed your GPU fast enough, just dedicate another core (another instance of mfaktc) or accept the fact that your GPU isn't 100% utilized.
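
To illustrate the kind of failure described here, the following is a minimal Python sketch of what happens when a value is written into a 24-bit field. It assumes only what the post states (that entries of k_tab[] have 24 bits available); the helper names and the sample values are hypothetical, and this is not mfaktc's actual code.

```python
# Largest value a 24-bit field can hold: 2^24 - 1 = 16,777,215.
MASK_24 = (1 << 24) - 1


def store_24bit(value: int) -> int:
    """Keep only the low 24 bits, as a 24-bit field would (hypothetical helper)."""
    return value & MASK_24


def fits_24bit(value: int) -> bool:
    """True if the value survives a round trip through a 24-bit field."""
    return 0 <= value <= MASK_24


# Hypothetical sample values, chosen only to sit on either side of 2^24.
small = 16_000_000
large = 17_000_000

print(fits_24bit(small), store_24bit(small))  # fits, stored unchanged
print(fits_24bit(large), store_24bit(large))  # does not fit, silently wraps
```

The wrapped value still looks like a perfectly ordinary number, so nothing crashes. A program relying on it would simply work with the wrong value, which is consistent with the "false results in some cases" mentioned above.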

Oliver
Old 2011-01-23, 18:19   #519
TheJudger
 
"Oliver"
Mar 2005
Germany

mfaktc 0.14

Hello,

Please find attached mfaktc 0.14.

Highlights of this version: The barrett79 kernel is a little bit faster and no longer limited to single bit levels. On my GTX 275 (compute capability 1.3) this kernel is 3-4% faster than before, and on my GTX 470 (compute capability 2.0) it is ~10% faster than before. The README is updated, too. It now includes instructions on how to get work from, and report results to, the PrimeNet server.
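
For readers unfamiliar with the term, here is a rough Python sketch of the idea the kernel name suggests: Barrett reduction, used to test whether a candidate q divides 2^p - 1 by checking 2^p mod q = 1. This is a textbook version using Python's arbitrary-size integers, not the multi-word GPU arithmetic of mfaktc. The function names are hypothetical, and reading "79" as the maximum factor size in bits is an assumption.

```python
def barrett_setup(q: int) -> tuple[int, int]:
    """Precompute the Barrett constant mu = floor(4^k / q), where k = bit length of q."""
    k = q.bit_length()
    mu = (1 << (2 * k)) // q
    return k, mu


def barrett_reduce(x: int, q: int, k: int, mu: int) -> int:
    """Return x mod q for 0 <= x < 4^k, using a multiply and shift instead of a division."""
    t = (x * mu) >> (2 * k)  # estimate of x // q, never too large
    r = x - t * q
    while r >= q:            # the estimate can be slightly low, so correct it
        r -= q
    return r


def divides_mersenne(p: int, q: int) -> bool:
    """True if q (> 1) divides 2^p - 1, via left-to-right square-and-multiply."""
    k, mu = barrett_setup(q)
    result = 1
    for bit in bin(p)[2:]:
        result = barrett_reduce(result * result, q, k, mu)
        if bit == "1":
            result = barrett_reduce(2 * result, q, k, mu)
    return result == 1


# 2^11 - 1 = 2047 = 23 * 89
print(divides_mersenne(11, 23), divides_mersenne(11, 89), divides_mersenne(11, 47))
```

The appeal of this approach on a GPU is that the per-step division by q is replaced by multiplications and shifts against a constant that is computed once per candidate.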

Fixed a potential application crash (exponents just above 1M and SievePrimes close to 100k) and the debug code received some changes, too.

The other issues which James has recently discovered have to wait until mfaktc 0.15, sorry (I've already started coding for 0.15).

Oliver

P.S. To the kind people who build Windows binaries and provide them to others here: please include the following files with your binaries:
- COPYING
- mfaktc.ini
- README.txt
(- Changelog.txt)
Attached Files: mfaktc-0.14.tar.gz (98.0 KB)
Old 2011-01-23, 18:41   #520
amphoria
 
"Dave"
Sep 2005
UK

A win-64 binary of mfaktc 0.14 (complete with all the files specified above) can be found here. The win-32 binary will follow.

http://www.sendspace.com/file/x0sijm
Old 2011-01-23, 18:52   #521
amphoria
 
"Dave"
Sep 2005
UK

... and the win-32 binary.

http://www.sendspace.com/file/48y6i3

I have also attached the win-32 Makefile that I use, as it is not yet included in Oliver's tarball.
Attached Files: Makefile.win32.txt (2.0 KB)
Old 2011-01-23, 21:37   #522
firejuggler
 
Apr 2010
Over the rainbow

mfaktc 0.14 benchmark on a GTX 460 (Gigabyte model GV-N460OC-1GI) with a Core 2 Duo 8300, on Win XP.

Should I run another instance? Both run at 80 M/s.
Attached thumbnail: speed.JPG (71.5 KB)

Last fiddled with by firejuggler on 2011-01-23 at 21:41
Old 2011-01-23, 21:57   #523
firejuggler
 
Apr 2010
Over the rainbow

Here it is with 2 instances running. Should I change the number of CPUStreams? (Currently 3; NumStreams too.)
Attached thumbnail: speed2.JPG (149.2 KB)
Old 2011-01-24, 10:00   #524
Karl M Johnson
 
Mar 2010

I can't execute Amphoria's latest x64 binary.
I get this error on start: cudaStreamCreate() failed for stream 0.
However, the 32-bit binary works like a charm!
Ideas?
They both use the same mfaktc.ini, and both were fed the same worktodo.txt file, so it's not the config that causes this behavior.
Also, I get the same error with and without any cudart DLLs in the folder, so it seems to crash before calling them.

Last fiddled with by Karl M Johnson on 2011-01-24 at 10:02 Reason: Yes
Old 2011-01-24, 10:34   #525
TheJudger
 
"Oliver"
Mar 2005
Germany

Hello,

Quote:
Originally Posted by firejuggler
Here it is with 2 instances running. Should I change the number of CPUStreams? (Currently 3; NumStreams too.)

2 instances look reasonable. You can try adjusting SievePrimes manually in mfaktc.ini (try ~6000 on your system) and setting SievePrimesAdjust to 0.
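
As a sketch, the two settings suggested above might look like this in mfaktc.ini. The key names and the value 6000 come from the post; the plain key=value syntax is an assumption, so check it against the mfaktc.ini shipped with your binary.

```ini
SievePrimes=6000
SievePrimesAdjust=0
```

With SievePrimesAdjust set to 0, the value of SievePrimes should stay fixed at what you entered, so you can compare throughput at a few different values by hand.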

It looks like the Win32 binary has some issues with the screen output: those lines should start with "xxxx/4620", not the big numbers you get. But this is just cosmetic...
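
For context on the "xxxx/4620" prefix: the 4620 is presumably the number of classes that candidates are split into when MORE_CLASSES is enabled (4·3·5·7·11 = 4620). That reading is an assumption, not something stated in this thread. The Python sketch below counts how many of those classes can contain a factor at all, using two known facts about factors q of 2^p - 1: q = 2kp + 1, and q ≡ ±1 (mod 8). The function name is hypothetical, and the exponent is just an example of a prime that shares no factor with 2·3·5·7·11.

```python
from math import prod


def surviving_classes(p: int, small_primes=(3, 5, 7, 11)) -> tuple[int, int]:
    """Return (total classes, classes that can still contain a factor) for exponent p."""
    total = 4 * prod(small_primes)
    surviving = 0
    for k in range(total):       # k stands for its whole residue class mod `total`
        q = 2 * k * p + 1        # every factor of 2^p - 1 has this form
        if q % 8 not in (1, 7):  # factors must be +1 or -1 mod 8
            continue
        if any(q % s == 0 for s in small_primes):
            continue             # every q in this class is divisible by a small prime
        surviving += 1
    return total, surviving


p = 43112609  # example exponent (prime)
print(surviving_classes(p))             # 11 included in the small primes
print(surviving_classes(p, (3, 5, 7)))  # 11 left out
```

Only about a fifth of the classes survive in either case, which is why a progress counter would run over classes rather than over raw candidate numbers.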

Quote:
Originally Posted by Karl M Johnson
I can't execute Amphoria's latest x64 binary.
I get this error on start: cudaStreamCreate() failed for stream 0.
However, the 32-bit binary works like a charm!
Ideas?
They both use the same mfaktc.ini, and both were fed the same worktodo.txt file, so it's not the config that causes this behavior.
Also, I get the same error with and without any cudart DLLs in the folder, so it seems to crash before calling them.

Ideas? Hmm, the usual: driver and CUDA version. What does the "CUDA version info" output of mfaktc say?
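
As a rough guide to reading such version numbers: the CUDA runtime API reports versions as integers (via cudaDriverGetVersion and cudaRuntimeGetVersion) encoded as 1000·major + 10·minor, and a binary generally needs a driver that supports at least the CUDA version of the runtime it was built against. The Python sketch below only decodes and compares such codes. The helper names are hypothetical, the sample codes are made up for illustration, and whether mfaktc prints exactly these values is an assumption.

```python
def decode_cuda_version(code: int) -> tuple[int, int]:
    """Split a CUDA version code (1000*major + 10*minor) into (major, minor)."""
    return code // 1000, (code % 1000) // 10


def driver_can_run(driver_code: int, runtime_code: int) -> bool:
    """General rule: the driver must support at least the runtime's CUDA version."""
    return driver_code >= runtime_code


# Made-up sample codes, not taken from anyone's machine in this thread.
driver_code = 3010   # driver supports up to CUDA 3.1
runtime_code = 3020  # binary built against the CUDA 3.2 runtime

print(decode_cuda_version(driver_code), decode_cuda_version(runtime_code))
print(driver_can_run(driver_code, runtime_code))
```

Whether a mismatch of this kind is behind the cudaStreamCreate() error reported here cannot be told from the thread alone; the version output requested above is what would settle it.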

Oliver
Old 2011-01-24, 11:17   #526
Karl M Johnson
 
Mar 2010

Amphoria, which driver version did you compile mfaktc under? The toolkit/SDK was 3.1, for sure.
Old 2011-01-24, 12:38   #527
James Heinrich
 
"James Heinrich"
May 2004
ex-Northern Ontario

No complaints: Amphoria's latest x64 binary works perfectly here.
Old 2011-01-24, 12:50   #528
Karl M Johnson
 
Mar 2010

Then Amphoria must be using ForceWare 26x.xx.
And it seems that 64-bit CUDA apps compiled on a machine with 26x.xx drivers (with any toolkit/SDK) can't be run properly on machines with drivers older than 26x.xx, even with cudart shenanigans.

Last fiddled with by Karl M Johnson on 2011-01-24 at 12:51

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.