mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2013-12-16, 19:25   #2113
mognuts
 
mognuts's Avatar
 
Sep 2008
Bromley, England

43 Posts
Default

Quote:
Originally Posted by owftheevil View Post
Crashes how?
This is the console output:
Code:
C:\Users\John\Desktop\cudalucas>CUDALucas_205Beta_x64_r52 -cufftbench 2048 1 5
 ------- DEVICE 0 -------
name                GeForce GTX 460
Compatibility       2.1
clockRate (MHz)     1430
memClockRate (MHz)  1800
totalGlobalMem      1073741824
totalConstMem       65536
l2CacheSize         524288
sharedMemPerBlock   49152
regsPerBlock        32768
warpSize            32
memPitch            2147483647
maxThreadsPerBlock  1024
maxThreadsPerMP     1536
multiProcessorCount 7
maxThreadsDim[3]    1024,1024,64
maxGridSize[3]      65535,65535,65535
textureAlignment    512
deviceOverlap       1
 Thread bench, testing various thread sizes for ffts 1K to 2048K, doing 5 passes.
fft size = 1K, ave time = 6.5100 msec, Norm1 threads 32, Norm2 threads 32
fft size = 1K, ave time = 6.5094 msec, Norm1 threads 32, Norm2 threads 64
fft size = 1K, ave time = 6.5098 msec, Norm1 threads 32, Norm2 threads 128
fft size = 1K, ave time = 6.5084 msec, Norm1 threads 32, Norm2 threads 256
fft size = 1K, ave time = 6.5089 msec, Norm1 threads 32, Norm2 threads 512
fft size = 1K, ave time = 6.5085 msec, Norm1 threads 32, Norm2 threads 1024
fft size = 1K, ave time = 6.5084 msec, Norm1 threads 64, Norm2 threads 32
fft size = 1K, ave time = 6.5088 msec, Norm1 threads 64, Norm2 threads 64
fft size = 1K, ave time = 6.5085 msec, Norm1 threads 64, Norm2 threads 128
fft size = 1K, ave time = 6.5087 msec, Norm1 threads 64, Norm2 threads 256
fft size = 1K, ave time = 6.5080 msec, Norm1 threads 64, Norm2 threads 512
fft size = 1K, ave time = 6.5087 msec, Norm1 threads 64, Norm2 threads 1024
fft size = 1K, ave time = 6.5084 msec, Norm1 threads 128, Norm2 threads 32
fft size = 1K, ave time = 6.5080 msec, Norm1 threads 128, Norm2 threads 64
fft size = 1K, ave time = 6.5082 msec, Norm1 threads 128, Norm2 threads 128
fft size = 1K, ave time = 6.5079 msec, Norm1 threads 128, Norm2 threads 256
fft size = 1K, ave time = 6.5082 msec, Norm1 threads 128, Norm2 threads 512
fft size = 1K, ave time = 6.5072 msec, Norm1 threads 128, Norm2 threads 1024
fft size = 1K, ave time = 6.5090 msec, Norm1 threads 256, Norm2 threads 32
fft size = 1K, ave time = 6.5091 msec, Norm1 threads 256, Norm2 threads 64
fft size = 1K, ave time = 6.5085 msec, Norm1 threads 256, Norm2 threads 128
fft size = 1K, ave time = 6.5088 msec, Norm1 threads 256, Norm2 threads 256
fft size = 1K, ave time = 6.5078 msec, Norm1 threads 256, Norm2 threads 512
fft size = 1K, ave time = 6.5085 msec, Norm1 threads 256, Norm2 threads 1024
fft size = 1K, ave time = 6.5099 msec, Norm1 threads 512, Norm2 threads 32
fft size = 1K, ave time = 6.5098 msec, Norm1 threads 512, Norm2 threads 64
fft size = 1K, ave time = 6.5093 msec, Norm1 threads 512, Norm2 threads 128
fft size = 1K, ave time = 6.5098 msec, Norm1 threads 512, Norm2 threads 256
fft size = 1K, ave time = 6.5096 msec, Norm1 threads 512, Norm2 threads 512
fft size = 1K, ave time = 6.5099 msec, Norm1 threads 512, Norm2 threads 1024
fft size = 1K, ave time = 5.9309 msec, Norm1 threads 128, Mult threads 32, Norm2 threads 1024
fft size = 1K, ave time = 5.9307 msec, Norm1 threads 128, Mult threads 64, Norm2 threads 1024
fft size = 1K, ave time = 5.9311 msec, Norm1 threads 128, Mult threads 128, Norm2 threads 1024
fft size = 1K, ave time = 5.9318 msec, Norm1 threads 128, Mult threads 256, Norm2 threads 1024
Best time for fft = 1K, time: 5.9307, t0 = 128, t1 = 64, t2 = 1024
Followed by a dialogue box containing the following text:

CUDALucas_205Beta_x64_r52.exe has stopped working
A problem caused the program to stop working correctly. Windows will close the program and notify you if a solution is available.

This happens regardless of the parameters used for -cufftbench.
mognuts is offline   Reply With Quote
Old 2013-12-16, 21:24   #2114
mognuts
 
mognuts's Avatar
 
Sep 2008
Bromley, England

43 Posts
Default

On a more positive note, r52 correctly found 3 known primes.

M( 11213 )P, n = 1K, CUDALucas v2.05 Beta
M( 1257787 )P, n = 64K, CUDALucas v2.05 Beta
M( 2976221 )P, n = 256K, CUDALucas v2.05 Beta
mognuts is offline   Reply With Quote
Old 2013-12-17, 15:04   #2115
owftheevil
 
owftheevil's Avatar
 
"Carl Darby"
Oct 2012
Spring Mountains, Nevada

32×5×7 Posts
Default

R53 is up, fixing the sparse <gpu> fft.txt file issue, the uninitialized pointer causing mismatched residues in the self-test, an incorrect fft length in the threads bench and a bad bounday case condition in the fft initialization.

@mognuts: I could not get the behaviour your 460 showed to happen, so I don't know if the problem is fixed or not.

Windows version is not up yet.
owftheevil is offline   Reply With Quote
Old 2013-12-17, 15:29   #2116
ET_
Banned
 
ET_'s Avatar
 
"Luigi"
Aug 2002
Team Italia

2·3·11·73 Posts
Default

Quote:
Originally Posted by owftheevil View Post
R53 is up, fixing the sparse <gpu> fft.txt file issue, the uninitialized pointer causing mismatched residues in the self-test, an incorrect fft length in the threads bench and a bad bounday case condition in the fft initialization.

@mognuts: I could not get the behaviour your 460 showed to happen, so I don't know if the problem is fixed or not.

Windows version is not up yet.
You are referring to CUDALucas, not CUDAPm1 issues, aren't you?

Luigi
ET_ is offline   Reply With Quote
Old 2013-12-17, 15:29   #2117
flashjh
 
flashjh's Avatar
 
"Jerry"
Nov 2011
Vancouver, WA

1,123 Posts
Default

r53 is on SourceForge.

.ini file is updated, please re-download.

Formatting output can be customized now.

Please run the tests in this post and continute to post any issues or bugs.

Thanks!

Quote:
Originally Posted by ET_ View Post
You are referring to CUDALucas, not CUDAPm1 issues, aren't you?

Luigi
Yes

Last fiddled with by flashjh on 2013-12-17 at 15:30
flashjh is offline   Reply With Quote
Old 2013-12-17, 16:42   #2118
flashjh
 
flashjh's Avatar
 
"Jerry"
Nov 2011
Vancouver, WA

1,123 Posts
Default

Posted Win32 .exe files on SourceForge - first time I've built Win32 with 2.05 Beta, please test accordingly.
flashjh is offline   Reply With Quote
Old 2013-12-18, 02:10   #2119
flashjh
 
flashjh's Avatar
 
"Jerry"
Nov 2011
Vancouver, WA

1,123 Posts
Default

Successful test of Win32 version of r53

I am now able to build CUDA version 4.0 and up, 64 bit only, if anyone needs a version, let me know.
flashjh is offline   Reply With Quote
Old 2013-12-19, 01:42   #2120
flashjh
 
flashjh's Avatar
 
"Jerry"
Nov 2011
Vancouver, WA

100011000112 Posts
Default

SorceForge updated with latest commit, currently r55. Minor formatting changes and updated makefile.win file to allow for Win32 or x64 compiles with CUDA 4.0 up to 5.5.

If anyone wants help compiling with make or in MSVS, let me know.

Had another successful DC with Win32 version. With the help of petrw1 I have 23/24 good DCs. The bad one was probably caused by all my stopping/starting while compiling, etc. None the less, that's why we DC.
flashjh is offline   Reply With Quote
Old 2013-12-21, 03:39   #2121
flashjh
 
flashjh's Avatar
 
"Jerry"
Nov 2011
Vancouver, WA

112310 Posts
Default

Quote:
Originally Posted by mognuts View Post
This is the console output:
Code:
<snip>
Followed by a dialogue box containing the following text:

CUDALucas_205Beta_x64_r52.exe has stopped working
A problem caused the program to stop working correctly. Windows will close the program and notify you if a solution is available.

This happens regardless of the parameters used for -cufftbench.
I am running tests to cause the NVIDIA Windows Kernel Mode Driver failure. Testing all versions of NVidia WHQL drivers since 296.10. Those results later...

@mognuts, I was able to (accidentally) reproduce the results you experienced.

@owftheevil

-Anytime I run -cufftbench fft# smallerfft# 1 it causes CUDALucas to crash like mognuts experienced
-When I run -cufftbench fft# fft# any# it skips some of the fft tests completely

See the attached file for screenshot and bench.txt output for the skipped tests. I included the .exe file I'm using for testing. I'm currently on 314.22, but it doesn't seem to matter what driver I use.
Attached Files
File Type: zip cufftbench_errors.zip (148.3 KB, 61 views)
flashjh is offline   Reply With Quote
Old 2013-12-21, 21:53   #2122
owftheevil
 
owftheevil's Avatar
 
"Carl Darby"
Oct 2012
Spring Mountains, Nevada

1001110112 Posts
Default

I'll take a look.

New commit r56, fixes a regression concerning command line input. Try to specify a nonstandard fft like 3150k and you'll see what I'm talking about.
owftheevil is offline   Reply With Quote
Old 2013-12-22, 19:44   #2123
flashjh
 
flashjh's Avatar
 
"Jerry"
Nov 2011
Vancouver, WA

1,123 Posts
Default

Windows r56 executables posted to SourceForge

Last fiddled with by flashjh on 2013-12-22 at 19:44
flashjh is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Don't DC/LL them with CudaLucas LaurV Data 131 2017-05-02 18:41
CUDALucas / cuFFT Performance on CUDA 7 / 7.5 / 8 Brain GPU Computing 13 2016-02-19 15:53
CUDALucas: which binary to use? Karl M Johnson GPU Computing 15 2015-10-13 04:44
settings for cudaLucas fairsky GPU Computing 11 2013-11-03 02:08
Trying to run CUDALucas on Windows 8 CP Rodrigo GPU Computing 12 2012-03-07 23:20

All times are UTC. The time now is 21:03.


Sun Aug 1 21:03:46 UTC 2021 up 9 days, 15:32, 0 users, load averages: 1.46, 1.50, 1.50

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.