mersenneforum.org  

Old 2013-12-01, 04:08   #2058
kladner
 
"Kieren"
Jul 2011
In My Own Galaxy!

2×3×1,693 Posts
Default

Quote:
Originally Posted by flashjh View Post
What FFT lengths are you using when the restarts happen?
.......
Got the data, but forgot to post it until now:
30.8M exponent, 1728K, GTX 570
37.5M exponent, 2048K, GTX 580

Old 2013-12-01, 05:55   #2059
kladner
 
"Kieren"
Jul 2011
In My Own Galaxy!

27AE₁₆ Posts
Default

306.23 gives this-
Code:
E:\CUDA\2.05-BETA\CL_2.05_A>cudalucas -r

device_number >=  device_count ... exiting
(This is probably a driver problem)
I'll move back up to something a bit more recent and see what happens.

EDIT: Shoot! 314.22 gives the same result with CUDALucas.......R49.

Last fiddled with by kladner on 2013-12-01 at 06:36
Old 2013-12-01, 07:58   #2060
kladner
 
"Kieren"
Jul 2011
In My Own Galaxy!

2·3·1,693 Posts
Default

Back on driver 331.82. Seeming to run pretty well, again.

I have noticed that CUDALucas does not load the GPUs as heavily as mfaktc. My line-measured power consumption is down ~80 W with CL running on both cards. This is with nearly the same OC core settings that mfaktc will run at. I still feel better about turning the VRAM down, even from stock speeds, to run CL, and it does affect the iteration time and the power consumption.

Regardless, with CL running on both cards, the whole system is pulling ~720 W with P95 running all eight cores of an FX-8350 on P-1, with 24 GB of RAM allowed. If the GPUs were running mfaktc, the power draw would be a bit over 800 W.

Last fiddled with by kladner on 2013-12-01 at 08:12
Old 2013-12-01, 10:35   #2061
ET_
Banned
 
"Luigi"
Aug 2002
Team Italia

4818₁₀ Posts
Default

May I gently ask you to also post the GPU type, OS source name and release when you test new drivers?

Thanks

Luigi
Old 2013-12-01, 12:15   #2062
flashjh
 
"Jerry"
Nov 2011
Vancouver, WA

463₁₆ Posts
Default

Quote:
Originally Posted by kladner View Post
306.23 gives this-
Code:
E:\CUDA\2.05-BETA\CL_2.05_A>cudalucas -r

device_number >=  device_count ... exiting
(This is probably a driver problem)
I'll move back up to something a bit more recent and see what happens.

EDIT: Shoot! 314.22 gives the same result with CUDALucas.......R49.
320.18 is the first WHQL driver built on CUDA 5.5, so it should be the earliest release driver that works with this build of CUDALucas. I have every CUDA install from 3.2 and up, so I could try building against earlier versions if you're interested.
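For anyone scripting driver tests, the rule of thumb above can be sketched as a small check. This is only an illustration: the 320.18 floor comes from the post above ("should be the earliest"), and the function names are made up for the example.

```python
# Minimum driver taken from the post above: 320.18, first WHQL driver on CUDA 5.5.
# Treat it as the poster's estimate, not a documented requirement.
MIN_DRIVER = (320, 18)


def parse_driver(version):
    """Turn a string such as '331.82' into (331, 82) so versions compare numerically."""
    major, minor = version.strip().split(".")
    return int(major), int(minor)


def driver_supports_build(version, minimum=MIN_DRIVER):
    """True if the driver version is at or above the assumed minimum."""
    return parse_driver(version) >= minimum


if __name__ == "__main__":
    # The four driver versions mentioned in this thread.
    for v in ("306.23", "314.22", "320.18", "331.82"):
        verdict = "ok" if driver_supports_build(v) else "older than the CUDA 5.5 driver"
        print(v, verdict)
```

Comparing tuples of integers avoids the trap of comparing version strings as text or as floats.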
Old 2013-12-01, 13:21   #2063
Manpowre
 
"Svein Johansen"
May 2013
Norway

3×67 Posts
Default

Quote:
Originally Posted by kladner View Post
Back on driver 331.82. Seeming to run pretty well, again.

I have noticed that CUDALucas does not load the GPUs as heavily as mfaktc. My line-measured power consumption is down ~80 W with CL running on both cards. This is with nearly the same OC core settings that mfaktc will run at. I still feel better about turning the VRAM down, even from stock speeds, to run CL, and it does affect the iteration time and the power consumption.

Regardless, with CL running on both cards, the whole system is pulling ~720 W with P95 running all eight cores of an FX-8350 on P-1, with 24 GB of RAM allowed. If the GPUs were running mfaktc, the power draw would be a bit over 800 W.
This is probably due to the amount of memcpy traffic between host and device on each iteration (host->device->host). As far as I understand mfaktc, it keeps its data in device memory and therefore keeps the card a lot busier.
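To put a rough number on that hypothesis, here is a back-of-the-envelope sketch. It assumes the worst case, where the whole FFT array of 8-byte doubles is copied to the device and back once per iteration; whether CUDALucas actually does this is not confirmed anywhere in the thread. The bandwidth value in the example is an illustrative assumption, not a measurement.

```python
BYTES_PER_DOUBLE = 8


def round_trip_bytes(fft_length):
    """Bytes moved if the full array goes host->device and back once (assumed worst case)."""
    return 2 * fft_length * BYTES_PER_DOUBLE


def copy_time_ms(fft_length, bandwidth_bytes_per_s):
    """Time spent on that round trip at a given sustained bus bandwidth."""
    return 1000.0 * round_trip_bytes(fft_length) / bandwidth_bytes_per_s


if __name__ == "__main__":
    fft_2048k = 2048 * 1024           # the 2048K FFT length reported for the GTX 580
    assumed_bandwidth = 6e9           # bytes/s, illustrative assumption only
    print(round_trip_bytes(fft_2048k) / 2**20, "MiB per iteration")
    print(copy_time_ms(fft_2048k, assumed_bandwidth), "ms per iteration")
```

If the per-iteration copy time came out comparable to the iteration time itself, the GPU would sit idle for a good part of each iteration, which would fit the lower power draw.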
Old 2013-12-01, 14:46   #2064
kladner
 
"Kieren"
Jul 2011
In My Own Galaxy!

23656₈ Posts
Default

Quote:
Originally Posted by ET_ View Post
May I gently ask you to also post the GPU type, OS source name and release when you test new drivers?

Thanks

Luigi
Sorry. I was running in sloppy late-night mode.

Driver 331.82, latest WHQL
The cards are a Gigabyte GTX 570, and an Asus GTX 580.
Windows 7 Pro 64 bit, SP 1, all current Windows updates.
More on request if I missed something.

EDIT: Completed a DC on each card, matched residues on both. Before completion, the 580 log showed three batch file starts, or two restarts. This is an incomplete picture as it restarted several times in the previous evening. Some of these were spontaneous, while others had to do with switching out drivers.

Last fiddled with by kladner on 2013-12-01 at 15:00
Old 2013-12-01, 15:16   #2065
kladner
 
"Kieren"
Jul 2011
In My Own Galaxy!

10158₁₀ Posts
Default

Quote:
Originally Posted by flashjh View Post
It's no problem. Changes are posted on sourceforge. If anything else needs updating, etc. just post it and I'll include it in a future commit.

I want to bring in the custom output formatting from mfactx, so I'll look at line breaks as well. That will also allow for adding username and computer id to the results file line.
Thanks, Jerry. For some reason, I can sort out the lines more easily without the break, in spite of the rather wide box that requires.

EDIT: Another display driver restart, GTX 570 running CUDALucas_BETA_2.05_r49, 580 running mfaktc.
Code:
C:/CUDA/CuLu/src/CUDALucas.cu(372) : cudaSafeCall() Runtime API error 30: unknown error.
Aside from the CL batch file loop restart, this did not seem to cause any disruption. mfaktc appeared to be unaffected.
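The batch file itself is not posted in this thread, so the following is only a sketch of what such a restart loop does, written in Python. The function name, the restart limit and the delay are assumptions for illustration.

```python
import subprocess
import sys
import time


def run_with_restarts(cmd, max_restarts=5, delay_s=0.0):
    """Run cmd; if it exits non-zero, rerun it up to max_restarts times.

    Returns (last_exit_code, restarts_used).
    """
    restarts = 0
    while True:
        code = subprocess.call(cmd)
        if code == 0 or restarts >= max_restarts:
            return code, restarts
        restarts += 1
        time.sleep(delay_s)  # give the display driver a moment to recover


if __name__ == "__main__":
    # Stand-in command that always fails with exit code 3.
    # A real use would pass the CUDALucas command line instead.
    failing = [sys.executable, "-c", "import sys; sys.exit(3)"]
    print(run_with_restarts(failing, max_restarts=2))
```

Capping the number of restarts keeps a permanently broken setup from looping forever, and the returned restart count gives the same information as counting batch file starts in the log.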

Last fiddled with by kladner on 2013-12-01 at 16:03
Old 2013-12-01, 16:36   #2066
LaurV
Romulan Interpreter
 
Jun 2011
Thailand

7²×197 Posts
Default

Well, it took me a while to uncover this bug... I was beginning to think I am stupid, because for all of you it was working, but for me not...

Then suddenly it came... (I had to take the options one by one and play with them!)

I got so many errors about my cards not having enough memory, registers, wheels, purple lights, whatever, it even said I have minus a few terabytes of RAM (!?!?), I was ready to give up... Then I tried to use the -info switch to see what freaking card it believes I have...

... And with -info switch it worked!

Here is where it hit me! I have "PrintDeviceInfo=0" in the ini file ("who the heck needs that? I know what kind of card I have!").

If you have "PrintDeviceInfo=0" in the ini file, then the program not only skips printing the device properties on screen, but also skips reading them for itself...

Code:
e:\CudaLucas\CL0>cl205b_x64r49 -info

------- DEVICE 0 -------
name                GeForce GTX 580
Compatibility       2.0
clockRate (MHz)     1564
memClockRate (MHz)  2004
totalGlobalMem      1610612736
totalConstMem       65536
l2CacheSize         786432
sharedMemPerBlock   49152
regsPerBlock        32768
warpSize            32
memPitch            2147483647
maxThreadsPerBlock  1024
maxThreadsPerMP     1536
multiProcessorCount 16
maxThreadsDim[3]    1024,1024,64
maxGridSize[3]      65535,65535,65535
textureAlignment    512
deviceOverlap       1

mkdir: cannot create directory `backup0': File exists
Using threads: norm1 256, mult 128, norm2 128.
Starting M37500769 fft length = 2048K
        SIGINT caught, writing checkpoint. Estimated time spent so far: 0:39

<it works perfectly>

e:\CudaLucas\CL0>cl205b_x64r49

mkdir: cannot create directory `backup0': File exists
Using threads: norm1 256, mult 128, norm2 128.
over specifications Grid = 4096
try increasing norm1 threads (256) or decreasing FFT length (2048K)

<freaks out>

e:\CudaLucas\CL0>
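The failure mode described above can be sketched in a few lines. This is a Python model, not the CUDALucas source: the function names are invented, the device values are copied from the -info output above, and the grid formula is an assumption chosen only because it reproduces "Grid = 4096" for a 2048K FFT with 256 norm1 threads.

```python
# Values copied from the -info output above; everything else is hypothetical.
GTX_580 = {"name": "GeForce GTX 580", "maxGridSize": (65535, 65535, 65535)}


def query_device():
    """Stand-in for the real device query."""
    return dict(GTX_580)


def init_device_buggy(print_device_info):
    """Bug pattern: the properties are only read on the printing path."""
    props = {"name": "", "maxGridSize": (0, 0, 0)}  # never-filled placeholder
    if print_device_info:
        props = query_device()
        print(props["name"])
    return props


def init_device_fixed(print_device_info):
    """Fix pattern: always read the properties, print only on request."""
    props = query_device()
    if print_device_info:
        print(props["name"])
    return props


def grid_ok(fft_length, threads, props):
    """Assumed launch check: grid size must not exceed the device limit."""
    grid = fft_length // (2 * threads)  # 2048K and 256 threads give 4096
    return grid <= props["maxGridSize"][0]


if __name__ == "__main__":
    n = 2048 * 1024
    print("buggy, PrintDeviceInfo=0:", grid_ok(n, 256, init_device_buggy(False)))
    print("fixed, PrintDeviceInfo=0:", grid_ok(n, 256, init_device_fixed(False)))
```

With the limits never filled in, a perfectly reasonable grid of 4096 looks "over specifications", which matches the symptom: the same command works with -info and fails without it.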

Last fiddled with by LaurV on 2013-12-01 at 16:37
Old 2013-12-02, 01:23   #2067
flashjh
 
"Jerry"
Nov 2011
Vancouver, WA

1,123 Posts
Default

LaurV, Good find!

Found the problem in the init_device function.

I tested it, but please test again, thanks!

Committed the change and updated the .exe files.

Edit:
Quote:
Originally Posted by Prime95 View Post
I don't disagree, except that we are already vulnerable. Prime95 uses option 3 with "secret" code. Will we make matters any worse by giving CUDALucas the exact same vulnerability?
Quote:
Originally Posted by chalsall View Post
Perhaps this vulnerability should be closed.
I like the idea of keeping the code open.

I think the changes to Primenet/G72 are a great option for now, but they require a reasonable amount of work, right? (And there is no guarantee that someone getting assignments would use the right option anyway.) As such, I think leaving things as they are may work best, and once CUDALucas is stable and produces reliable results we can readdress the need for the secret code. Thoughts?

Also, CUDA 6 is going to (potentially significantly) change CUDALucas. This is one reason I don't think making big changes right now is a good idea.

Last fiddled with by flashjh on 2013-12-02 at 01:41
Old 2013-12-02, 03:41   #2068
LaurV
Romulan Interpreter
 
Jun 2011
Thailand

7²·197 Posts
Default

Just for the record, and as a guy who makes a living from writing code, I have nothing against "secret" CRCs. Put a small function in a dll, and CUDALucas can call it to generate some key, which may also depend on the assignment key (if the work was "legally" reserved). It can call the function every 1M iterations, and each time add a few characters to the key string. At the end, the key would be easy to verify without redoing the whole work. We should not be afraid of "vulnerabilities", and it does not need to be anything very complicated. Prime95 is fine as it is.
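A minimal sketch of that idea, assuming an HMAC over the assignment key, the iteration number and the interim residue. None of this is what Prime95 or CUDALucas actually does; the function names, the message layout and the four characters per checkpoint are all invented for illustration.

```python
import hashlib
import hmac


def checkpoint_tag(secret, assignment_key, iteration, residue, chars=4):
    """A few hex characters binding one checkpoint to the secret and the assignment."""
    msg = "{}:{}:{:016X}".format(assignment_key, iteration, residue).encode("ascii")
    return hmac.new(secret, msg, hashlib.sha256).hexdigest()[:chars]


def build_key(secret, assignment_key, checkpoints, chars=4):
    """checkpoints: (iteration, residue) pairs, e.g. one per 1M iterations."""
    return "".join(
        checkpoint_tag(secret, assignment_key, it, res, chars)
        for it, res in checkpoints
    )


def verify_key(secret, assignment_key, checkpoints, key, chars=4):
    """Server side: recompute the tags from the reported checkpoints and compare."""
    expected = build_key(secret, assignment_key, checkpoints, chars)
    return hmac.compare_digest(expected, key)


if __name__ == "__main__":
    secret = b"not-a-real-secret"                        # hypothetical shared secret
    cps = [(1000000, 0x1234ABCD), (2000000, 0x0F0F0F0F)]  # made-up residues
    key = build_key(secret, "ASSIGNMENT-KEY", cps)
    print(key, verify_key(secret, "ASSIGNMENT-KEY", cps, key))
```

Note the limits of the sketch: the server can only check the tags against interim residues the client reports, and anyone who extracts the secret from the binary can forge them, which is the same vulnerability discussed above.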

My point is that people who know how to exploit the vulnerability are too clever and too mature to use the exploit; they are "above" the "credit hunting fever". You don't get money for it (you cannot "fake" a prime, for example, as it will be verified by others immediately), and you don't even get "fame". On the contrary, someone can realize you are cheating the system, and you will have more to lose and will suffer for it in the community. The "guarding" has to be against "childish" and "cmd*-like" stuff, like editing a text line and reporting two times, which anybody could do. (I wanted to write "any kid", but realized that kids today are so clever... hehe...)


(* for the new users here, "cmd" is a mersenneforum user who liked to do this kind of stupid thing, like adding all numbers with 37 digits to factorDB)

Last fiddled with by LaurV on 2013-12-02 at 03:44

All times are UTC.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.