mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   GPU Computing (https://www.mersenneforum.org/forumdisplay.php?f=92)
-   -   mfaktc: a CUDA program for Mersenne prefactoring (https://www.mersenneforum.org/showthread.php?t=12827)

James Heinrich 2011-03-13 19:00

[QUOTE=TheJudger;255085]limited by the speed of your CPU (average wait is very small)[/QUOTE]I'm happy(?) to report I don't have that problem... I wish I did :wink:[code] class | candidates | time | avg. rate | SievePrimes | ETA | avg. wait
1348/4620 | 18.79G | 589.96s | 31.85M/s | 100000 | 4d15h | 1640us[/code]

firejuggler 2011-03-13 19:46

I merely report the speed.Another more expansive solution would be to get a better CPU.. ;p

opyrt 2011-03-14 13:31

I get this error with 0.16:
running a simple selftest...
ERROR: selftest failed for M49635893
no factor found
ERROR: selftest failed for M51375383
no factor found
ERROR: selftest failed for M47644171
no factor found
ERROR: selftest failed for M51038681
no factor found
ERROR: selftest failed for M53076719

It's reproducable. It always fails on those 5 candidates. GPU is a bit old: GeForce 9800 GTX/9800 GTX+. Works great with 0.14.

Any ideas?

ET_ 2011-03-14 15:32

[QUOTE=opyrt;255127]I get this error with 0.16:
running a simple selftest...
ERROR: selftest failed for M49635893
no factor found
ERROR: selftest failed for M51375383
no factor found
ERROR: selftest failed for M47644171
no factor found
ERROR: selftest failed for M51038681
no factor found
ERROR: selftest failed for M53076719

It's reproducable. It always fails on those 5 candidates. GPU is a bit old: GeForce 9800 GTX/9800 GTX+. Works great with 0.14.

Any ideas?[/QUOTE]

The same is happening to me, GTX275, CUDA 3.0 and Linux 64.

We are investigating...

Luigi

TheJudger 2011-03-14 16:04

opyrt: which OS / driver version / CUDA version / ...? Precompiled binary or build from source?

Good to know it is reproduceable for you, this lowers the chance for a race condition...

Quicky on a crappy Quadro NVS 290 (G86 chip), similar to a Geforce 8400GS (1st rev) or 8500GT:

[CODE]mfaktc v0.16
...
CUDA device info
name Quadro NVS 290
compute capability 1.1
maximum threads per block 512
number of multiprocessors 2 (16 shader cores)
clock rate 918MHz

CUDA version info
binary compiled for CUDA 3.20
CUDA driver version 3.20
CUDA runtime version 3.20
...
running a simple selftest...

Selftest statistics
number of tests 31
successfull tests 31

selftest PASSED!
[/CODE]

To be honest: right now I've no clue whats wrong. During betatest James Heinrich reported no issues on his 8800GT (cc 1.1, too) using my precompiled binaries.

For now we've to collect data on which systems it works and on which not. :sad:

---
Edit: reproduced by downgrading the CUDA toolkit from 3.2 to 3.0 on Linux.

Oliver

James Heinrich 2011-03-14 16:59

[QUOTE=TheJudger;255136]During betatest James Heinrich reported no issues on his 8800GT (cc 1.1, too) using my precompiled binaries.[/QUOTE]
On Windows, mind you. This may be a *nix-specific issue? I haven't started running v0.16-final yet (93h left to go on current assignment before I can switch over), but I did fire it up once and it passed the self-test just fine.

opyrt 2011-03-14 17:00

This is my CUDA info:
[CODE]CUDA device info
name GeForce 9800 GTX/9800 GTX+
compute capability 1.1
maximum threads per block 512
number of multiprocessors 16 (128 shader cores)
clock rate 1850MHz

CUDA version info
binary compiled for CUDA 3.10
CUDA driver version 3.20
CUDA runtime version 3.20
[/CODE]OS is:
[CODE]Linux 2.6.32-29-server #58-Ubuntu SMP Fri Feb 11 21:06:51 UTC 2011 x86_64 GNU/Linux
Ubuntu 10.04.2 LTS
[/CODE]Let me know if you need me to test anything.

opyrt 2011-03-14 17:10

I think I got it.

I installed CUDA 3.2 on my computer, did a "make clean; make" to recompile mfaktc. Info at start-up:[CODE]CUDA device info
name GeForce 9800 GTX/9800 GTX+
compute capability 1.1
maximum threads per block 512
number of multiprocessors 16 (128 shader cores)
clock rate 1850MHz

CUDA version info
binary compiled for CUDA 3.20
CUDA driver version 3.20
CUDA runtime version 3.20

Automatic parameters
threads per grid 1048576

running a simple selftest...
Selftest statistics
number of tests 31
successfull tests 31

selftest PASSED!
[/CODE]It seems this is an issue when mfaktc is compiled on CUDA 3.1 computers.

Brain 2011-03-14 21:19

Impressive
 
1 Attachment(s)
mfaktc 0.16: Overall Throughput > 200M/s (see attachment)

Intel Core i5-750 @ 3600 MHz (4 physical/logical cores)
Nvidia Geforce 560 Ti @ 823 MHz (384 CUDA cores)

Danke Oliver

S34960zz 2011-03-14 23:28

somewhat less impressive ...
 
1 Attachment(s)
mfaktc 0.16: Overall Throughput ~ 26 M/s (see attachment)

Brain's i5 / Geforce 560 TI is much faster, runs hotter, too, though!

I ran the same exponent as Brain's second instance, obtaining half the throughput for that instance (only took one mfaktc instance to fully load my GPU).

Intel Core i7-840QM @ 1.87 GHz (nominal) (4 cores, 8 logical CPUs)
Nvidia Quadro FX 2800M @ 600 MHz GPU core clock (96 CUDA cores), 999 MHz GPU memory clock, 1500 MHz GPU shader clock

+++++

Thank you to James Heinrich for providing me a copy of "cudart64_32_16.dll" (per post 588, [URL]http://www.mersenneforum.org/showpost.php?p=253371&postcount=588[/URL]).

+++++

drh ([URL="http://www.mersenneforum.org/showpost.php?p=255082&postcount=636%29:"]http://www.mersenneforum.org/showpost.php?p=255082&postcount=636):[/URL]

[QUOTE=drh;255082]I get it right after I try to run mfaktc -st ... mfaktc -tt runs fine, as far as I can tell. I get cudaSetDevice(0) failed. I also get it if I try to run the program with a -tf ...

Thanks,
Doug[/QUOTE]

Had same problem myself this morning. Solution:

Go here ([URL]http://developer.nvidia.com/object/cuda_3_2_downloads.html[/URL]) and get the developer drivers (notebook drivers 260.99) for your operating system, turn off your antivirus, install driver, turn antivirus back on.

James Heinrich 2011-03-15 00:01

[QUOTE=S34960zz;255187]Thank you to James Heinrich for providing me a copy of "cudart64_32_16.dll" (per post 588, [URL]http://www.mersenneforum.org/showpost.php?p=253371&postcount=588[/URL]).[/QUOTE]Actually, look at [url=http://www.mersenneforum.org/showpost.php?p=255068&postcount=632]my more recent post[/url] instead, where I actually attached the file; no need to PM/email me if you need the DLL.


All times are UTC. The time now is 23:06.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.