|
|
#221 |
|
Romulan Interpreter
Jun 2011
Thailand
7²×197 Posts |
Second clLucas test finished with success. I think this is a wonder, and either clLucas is more stable than CUDALucas, or (and it hurts me to say it!) AMD cards are more stable than NVIDIA cards. I say that because the computer suffered over 100 restarts and blue screens in this period (not related to cl-stuff nor to the video card, but to another chain of things this computer is now part of). Honestly, I expected a mismatch, but I let it run... I will dare now to start the third... (@kracker)
|
|
|
|
|
|
#222 |
|
Mar 2003
Melbourne
1000000011₂ Posts |
I have a 7990 (375W), 2x GPUs @ 1000MHz clock/1500MHz mem.
Using M30583963 as a test with FFT=2M I get iteration times around 3.9ms on both GPUs.

GPU0
Code:
Iteration 90000 M( 30583963 )C, 0x1bbfddfc1ddbe19f, n = 2097152, clLucas v1.01 err = 0.001099 (0:39 real, 3.8968 ms/iter, ETA 33:00:13)
Iteration 100000 M( 30583963 )C, 0xde1a5280cc5ab5cc, n = 2097152, clLucas v1.01 err = 0.001099 (0:39 real, 3.8989 ms/iter, ETA 33:00:39)
Iteration 110000 M( 30583963 )C, 0x5cfaabf2bb0e0487, n = 2097152, clLucas v1.01 err = 0.001099 (0:39 real, 3.8959 ms/iter, ETA 32:58:27)

Code:
Iteration 90000 M( 30583963 )C, 0x1bbfddfc1ddbe19f, n = 2097152, clLucas v1.01 err = 0.0009766 (0:39 real, 3.8872 ms/iter, ETA 32:55:20)
Iteration 100000 M( 30583963 )C, 0xde1a5280cc5ab5cc, n = 2097152, clLucas v1.01 err = 0.0009766 (0:39 real, 3.8944 ms/iter, ETA 32:58:20)
Iteration 110000 M( 30583963 )C, 0x5cfaabf2bb0e0487, n = 2097152, clLucas v1.01 err = 0.0009766 (0:39 real, 3.8989 ms/iter, ETA 32:59:58)

Code:
Iteration 470000 M( 30583963 )C, 0x64208421d1c227ee, n = 1835008, CUDALucas v2.03 err = 0.0283 (0:17 real, 1.7324 ms/iter, ETA 14:29:22)
Iteration 480000 M( 30583963 )C, 0x29d2686b8bb60915, n = 1835008, CUDALucas v2.03 err = 0.0283 (0:18 real, 1.7334 ms/iter, ETA 14:29:36)
Iteration 490000 M( 30583963 )C, 0x3308faaa69d0eef7, n = 1835008, CUDALucas v2.03 err = 0.0283 (0:17 real, 1.7254 ms/iter, ETA 14:25:18) |
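For anyone wanting to sanity-check the ETA figures in these logs, here is a minimal sketch of the arithmetic. It assumes the total iteration count is roughly the exponent itself and that the ETA is simply remaining iterations times the current ms/iter; the function names are made up for illustration, and the real programs may average timings differently, so expect small differences from the printed ETA.

```python
def eta_seconds(exponent, iteration, ms_per_iter):
    """Rough remaining time in seconds.

    Assumption: an LL test of M(p) needs about p iterations, so the
    remaining work is (exponent - iteration) iterations.
    """
    remaining = exponent - iteration
    return remaining * ms_per_iter / 1000.0


def format_hms(seconds):
    """Format a duration as h:mm:ss, the same shape as the log's ETA field."""
    total = int(round(seconds))
    hours, rest = divmod(total, 3600)
    minutes, secs = divmod(rest, 60)
    return f"{hours}:{minutes:02d}:{secs:02d}"


if __name__ == "__main__":
    # Values copied from the log lines above.
    print(format_hms(eta_seconds(30583963, 90000, 3.8968)))   # clLucas, n = 2097152
    print(format_hms(eta_seconds(30583963, 470000, 1.7324)))  # CUDALucas, n = 1835008
```

Both estimates land within a minute of the ETAs printed in the logs, which suggests the simple model is close enough for comparing cards.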
|
|
|
|
|
#223 |
|
Romulan Interpreter
Jun 2011
Thailand
9653₁₀ Posts |
That is "normal". Your Titan uses a much shorter (therefore faster) FFT for this exponent. That shorter FFT is not optimized for cl-FFT. People are still working to convince clLucas to deal with non-power-of-two FFT sizes (think of when CUDALucas switched from v1.48 to 1.69, then later to 2.0, last year). Therefore, the 1M9 FFT is much slower now for clLucas; the same test would take 80 hours on your card. Therefore the 2M09 FFT, which is a power of two, is used. Try testing an exponent on your Titan which uses a comparable FFT size (e.g. a 38M exponent). Then the comparison will be more accurate. Therefore posts like this.
(edit: by the way, my 580 is about 3% faster with 2097152 compared to 1835008; you can try it on your Titan, and you may get a speedup even for that 30M exponent you are using for testing)
Last fiddled with by LaurV on 2013-10-06 at 13:01 |
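To make the power-of-two point concrete, here is a small sketch that splits an FFT length into an odd factor times a power of two. The helper names are hypothetical and this says nothing about how clLucas or CUDALucas actually pick their lengths; it only shows which of the lengths quoted in this thread are pure powers of two.

```python
def split_power_of_two(n):
    """Return (odd, k) such that n == odd * 2**k."""
    if n <= 0:
        raise ValueError("FFT length must be positive")
    k = 0
    while n % 2 == 0:
        n //= 2
        k += 1
    return n, k


def is_power_of_two(n):
    """True when n is exactly 2**k for some k >= 0."""
    return n > 0 and (n & (n - 1)) == 0


if __name__ == "__main__":
    # FFT lengths taken from the logs in this thread.
    for length in (1835008, 2097152, 4194304):
        odd, k = split_power_of_two(length)
        print(f"{length} = {odd} * 2^{k}, power of two: {is_power_of_two(length)}")
```

So 1835008 is 7 * 2^18, a mixed-radix length, while 2097152 and 4194304 are 2^21 and 2^22.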
|
|
|
|
|
#224 | |
|
"Mr. Meeseeks"
Jan 2012
California, USA
2³·271 Posts |
Quote:
@LaurV: I'll ask again, try pushing your memory clock up!
|
|
|
|
|
|
|
#225 |
|
Mar 2003
Melbourne
5×103 Posts |
Aaah, the power of 2 FFT issue.
Ok, next time I get a chance to experiment I'll play with something more suitable. -- Craig |
|
|
|
|
|
#226 | |
|
"Rob Gahan"
Aug 2013
Ireland
24₁₆ Posts |
Quote:
M( 58191149 )C, 0x9108992abb23c5d1, n = 4194304, clLucas v1.01
10.5 days on 7870 with aggressive
More on the way.... |
|
|
|
|
|
|
#227 |
|
"Mr. Meeseeks"
Jan 2012
California, USA
878₁₆ Posts |
|
|
|
|
|
|
#228 |
|
"Rob Gahan"
Aug 2013
Ireland
24₁₆ Posts |
Cool,
Next due in 14hrs, one in 24hrs and one in 36hrs. Just bigger numbers take longer..... :-)
On a different subject, have many of you tried 100 million attempts? Trying with CUDALucas at the mo. 1st attempt was reading 4500 hrs (190 days) on 20971520 (recommended by someone ;-( ), but after following this thread I tried 2097152 and (guess what) it dropped to 19 days on a GTX 690. Should have results in two days' time... Just wondering if many were trying them yet?
Cheers
Rob. |
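A rough way to compare the two FFT lengths mentioned above is the average number of exponent bits each FFT word has to carry, i.e. exponent divided by FFT length. The sketch below is only that arithmetic. Assumptions, stated loudly: the post does not give the exponent, so 332192831 is a hypothetical stand-in for a 100-million-digit candidate; and the "fits in a double" test is a crude necessary condition (one product of two b-bit words needs about 2b bits against a 53-bit significand), not the criterion any of these programs actually use.

```python
import math

# Hypothetical exponent, NOT taken from the post: chosen only because
# 2**p - 1 then has at least 100 million decimal digits.
HYPOTHETICAL_EXPONENT = 332192831


def decimal_digits(exponent):
    """Number of decimal digits of 2**exponent - 1."""
    return math.floor(exponent * math.log10(2)) + 1


def bits_per_word(exponent, fft_length):
    """Average bits of the number stored in each FFT word."""
    return exponent / fft_length


def single_product_fits_double(exponent, fft_length):
    """Crude necessary check only: 2*b bits must fit a 53-bit significand."""
    return 2 * bits_per_word(exponent, fft_length) <= 53


if __name__ == "__main__":
    # Runs from this thread that completed or were progressing normally.
    for p, n in ((30583963, 2097152), (30583963, 1835008), (58191149, 4194304)):
        print(p, n, round(bits_per_word(p, n), 2))
    # The two lengths from the post, with the hypothetical exponent.
    for n in (20971520, 2097152):
        print(HYPOTHETICAL_EXPONENT, n, round(bits_per_word(HYPOTHETICAL_EXPONENT, n), 2),
              single_product_fits_double(HYPOTHETICAL_EXPONENT, n))
```

Under these assumptions the working runs in this thread sit at roughly 14 to 17 bits per word, and the 20971520 length gives a similar figure for the hypothetical exponent, whereas 2097152 would ask each word to carry about 158 bits.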
|
|
|
|
|
#229 | |
|
P90 years forever!
Aug 2002
Yeehaw, FL
7,537 Posts |
Quote:
|
|
|
|
|
|
|
#230 |
|
"Mr. Meeseeks"
Jan 2012
California, USA
4170₈ Posts |
|
|
|
|
|
|
#231 | |
|
"Rob Gahan"
Aug 2013
Ireland
36₁₀ Posts |
Quote:
Really? Can you explain why? Or is there a link to somewhere/a thread that covers this, please? It all looks like it's going fine at the moment anyway......?? |
|
|
|
|