mersenneforum.org  

Old 2013-11-22, 18:12   #430
ET_
Banned
 
"Luigi"
Aug 2002
Team Italia

2·3·11·73 Posts

Quote:
Originally Posted by James Heinrich
Note that the "<gpu> fft.txt" and "<gpu> threads.txt" files are distinct from each other.

<gpu> fft.txt should look something like
Code:
Device              GeForce GTX 670
Compatibility       3.0
clockRate (MHz)     980
memClockRate (MHz)  3004

  fft    max exp  ms/iter
    4      85933   0.0697
   16     333803   0.1153
   32     657719   0.1306
   36     738083   0.1618
   48     978041   0.1635
... skip a whole bunch of fft lines ...
28800  511382147  76.5273
32768  580225813  79.6749
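As a rough illustration of how a table like this could be consulted, here is a minimal C sketch that picks the first row whose "max exp" covers a given exponent. The struct, the pick_fft helper and the sample_rows array are hypothetical names made up for this example; this is not how CUDAPm1 itself reads the file, and the rows skipped in the listing above are simply missing from the sample.

```c
#include <stddef.h>

/* One row of a "<gpu> fft.txt"-style table, columns as listed above. */
struct fft_row {
    int    fft;          /* "fft" column */
    long   max_exp;      /* "max exp" column */
    double ms_per_iter;  /* "ms/iter" column */
};

/* Hypothetical helper: rows must be sorted by ascending fft.
   Returns the first row whose max_exp is at least `exponent`,
   or NULL if no row in the table is large enough. */
const struct fft_row *pick_fft(const struct fft_row *rows, size_t n, long exponent)
{
    for (size_t i = 0; i < n; i++) {
        if (rows[i].max_exp >= exponent)
            return &rows[i];
    }
    return NULL;
}

/* Only the rows quoted above; the skipped middle part is not reproduced,
   so lookups that fall into that gap are not meaningful. */
static const struct fft_row sample_rows[] = {
    {     4,     85933,  0.0697 },
    {    16,    333803,  0.1153 },
    {    32,    657719,  0.1306 },
    {    36,    738083,  0.1618 },
    {    48,    978041,  0.1635 },
    { 28800, 511382147, 76.5273 },
    { 32768, 580225813, 79.6749 },
};

#define SAMPLE_ROW_COUNT (sizeof sample_rows / sizeof sample_rows[0])
```

With these rows, an exponent of 700000 selects the fft = 36 row, because the fft = 32 row tops out at 657719.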
Whereas "<gpu> threads.txt" should be quite short (and more cryptic), mine looks like:
Code:
17496  256   64  512  45.9160
 3456  256  128   32   8.0790
I suspect it didn't make a "<gpu> threads.txt" file for you because it appears to have failed partway through the process:
Thanks James.

From what you said, I assume that there should be 2 distinct files: the first created by -cufftbench 1 8192 1, the second by -cufftbench 4096 4096 4.

I'll try to modify the r parameter of the second bench run and see if it suffices.

Luigi
Old 2013-11-22, 18:26   #431
ET_
Banned
 
"Luigi"
Aug 2002
Team Italia

2×3×11×73 Posts

Quote:
Originally Posted by ET_
Thanks James.

From what you said, I assume that there should be 2 distinct files: the first created by -cufftbench 1 8192 1, the second by -cufftbench 4096 4096 4.

I'll try to modify the r parameter of the second bench run and see if it suffices.

Luigi
Sadly, I always get "CUDAPm1.cu(2163) : cufftSafeCall() CUFFT error 6: CUFFT_EXEC_FAILED" with r between 1 and 5 and Threads=128 or 256.

Hints?

Luigi
Old 2013-11-22, 19:57   #432
owftheevil
 
"Carl Darby"
Oct 2012
Spring Mountains, Nevada

3²·5·7 Posts

Does it always fail at the same place in the test?

Also try putting

Code:
cutilSafeThreadSync();
after the cufft call on line 2161 and after the square call on 2162. That will at least tell us what is failing.
Old 2013-11-22, 20:05   #433
ET_
Banned
 
"Luigi"
Aug 2002
Team Italia

2·3·11·73 Posts

Quote:
Originally Posted by owftheevil
Does it always fail at the same place in the test?

Also try putting

Code:
cutilSafeThreadSync();
after the cufft call on line 2161 and after the square call on 2162. That will at least tell us what is failing.
Yes, it always fails at the same place.

Added the line in the 2 places you asked. A new result:

Code:
CUDAPm1.cu(2165) : cufftSafeCall() CUFFT error 6: CUFFT_EXEC_FAILED
Added a new sync after line 2165: same error.

Luigi
Old 2013-11-22, 20:10   #434
owftheevil
 
"Carl Darby"
Oct 2012
Spring Mountains, Nevada

3²·5·7 Posts

Sorry, I jumped too quickly on the safecall stuff. More is needed. Let me think a bit.
Old 2013-11-22, 20:14   #435
ET_
Banned
 
"Luigi"
Aug 2002
Team Italia

4818₁₀ Posts

Quote:
Originally Posted by owftheevil
Sorry, I jumped too quickly on the safecall stuff. More is needed. Let me think a bit.
No hurry. I'm actually playing with Threads=128 and the program is working: I just tried to squeeze some more juice from it.

I'll be quietly waiting for your thoughts, thank you.

Luigi
Old 2013-11-22, 20:38   #436
owftheevil
 
"Carl Darby"
Oct 2012
Spring Mountains, Nevada

100111011₂ Posts

Could you try this little snippet after the square call on 2162?

Code:
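// Wait for all queued GPU work to finish so that any error it
// raised is visible to cudaGetLastError() below.
cudaThreadSynchronize();
{
  cudaError_t error = cudaGetLastError();  // most recent runtime error, if any
  if(error != cudaSuccess)
  {
    printf("CUDA error: %s\n", cudaGetErrorString(error));
    exit(2);
  }
}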
Old 2013-11-22, 21:13   #437
ET_
Banned
 
"Luigi"
Aug 2002
Team Italia

2·3·11·73 Posts

Quote:
Originally Posted by owftheevil
Could you try this little snippet after the square call on 2162?

Code:
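// Wait for all queued GPU work to finish so that any error it
// raised is visible to cudaGetLastError() below.
cudaThreadSynchronize();
{
  cudaError_t error = cudaGetLastError();  // most recent runtime error, if any
  if(error != cudaSuccess)
  {
    printf("CUDA error: %s\n", cudaGetErrorString(error));
    exit(2);
  }
}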
The error is:

Code:
CUDA error: too many resources requested for launch
while the environment is:

Code:
------- DEVICE 0 -------
name                GeForce GTX 580
Compatibility       2.0
clockRate (MHz)     1594
memClockRate (MHz)  2025
totalGlobalMem      1610285056
totalConstMem       65536
l2CacheSize         786432
sharedMemPerBlock   49152
regsPerBlock        32768
warpSize            32
memPitch            2147483647
maxThreadsPerBlock  1024
maxThreadsPerMP     1536
multiProcessorCount 16
maxThreadsDim[3]    1024,1024,64
maxGridSize[3]      65535,65535,65535
textureAlignment    512
deviceOverlap       1
HTH... thanks.

Luigi

Last fiddled with by ET_ on 2013-11-22 at 21:13 Reason: [/cuda] is not an allowed tag in this forum...
Old 2013-11-22, 21:15   #438
ET_
Banned
 
"Luigi"
Aug 2002
Team Italia

1001011010010₂ Posts

Sorry for the delay... I was dining.
Old 2013-11-22, 22:42   #439
owftheevil
 
"Carl Darby"
Oct 2012
Spring Mountains, Nevada

100111011₂ Posts

Thanks for getting back with that. The only thing I can think of right now is that somehow either t2 or the threads array has messed-up values. I'll look at it over the weekend and get back to you on Monday.
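
For what it's worth, one common way to get "too many resources requested for launch" is a block whose threads together need more registers than regsPerBlock allows, which is what an oversized thread count would do. Here is a minimal C sketch of that arithmetic using the two limits from the GTX 580 dump above. The helper names and the figure of 40 registers per thread are made up for illustration; the real per-kernel register count would have to come from compiling with nvcc --ptxas-options=-v. Register allocation granularity and shared memory use are ignored here.

```c
#include <stdbool.h>

/* Per-block limits copied from the device dump above (GeForce GTX 580). */
enum {
    REGS_PER_BLOCK        = 32768,  /* regsPerBlock */
    MAX_THREADS_PER_BLOCK = 1024    /* maxThreadsPerBlock */
};

/* Largest per-thread register count for which a block of this size
   still fits into the per-block register budget. */
int max_regs_per_thread(int threads_per_block)
{
    return REGS_PER_BLOCK / threads_per_block;
}

/* True if a block of threads_per_block threads, each using
   regs_per_thread registers, stays within both limits above.
   Simplified: ignores allocation granularity and shared memory. */
bool launch_fits(int threads_per_block, int regs_per_thread)
{
    if (threads_per_block < 1 || threads_per_block > MAX_THREADS_PER_BLOCK)
        return false;
    return (long)threads_per_block * regs_per_thread <= REGS_PER_BLOCK;
}
```

With the hypothetical 40 registers per thread, launch_fits(512, 40) holds (20480 registers), while launch_fits(1024, 40) does not (40960 registers), and max_regs_per_thread(1024) comes out at 32.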
Old 2013-11-23, 11:03   #440
ET_
Banned
 
"Luigi"
Aug 2002
Team Italia

12D2₁₆ Posts

Quote:
Originally Posted by owftheevil
Thanks for getting back with that. The only thing I can think of right now is that somehow either t2 or the threads array has messed-up values. I'll look at it over the weekend and get back to you on Monday.
Thanks

I should add that I am using Linux_64 with driver 304.88.

CUDA version info:

Code:
CUDA version info
  binary compiled for CUDA  4.10
  CUDA runtime version      4.10
  CUDA driver version       5.0
Luigi

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.