mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   GPU Computing (https://www.mersenneforum.org/forumdisplay.php?f=92)
-   -   LL with OpenCL (https://www.mersenneforum.org/showthread.php?t=18297)

Robish 2013-10-17 18:33

[QUOTE=LaurV;356172]BTW, a supermod could mask the residue two posts above... Just in case...

@Robish: please do not post full residues of un-verified LL tests, some "credit hunters" will be tempted to "verify" them using your residue, and 20 years later we may find out we missed a prime, in case that residue is wrong.[/QUOTE]


Ok will (not) do ;-)

nucleon 2013-10-18 15:23

[QUOTE=nucleon;355457]Aaah, the power of 2 FFT issue.

Ok, next time I get a chance to experiment I'll play with something more suitable.

-- Craig[/QUOTE]

Ok, only a fortnight late. 7990@1GHz/1.5GHz mem clk

GPU0:
[CODE]
Iteration 10000 M( 38000009 )C, 0xfd9116e3760e4571, n = 2097152, clLucas v1.01 err = 0.1406 (0:39 real, 3.8668 ms/iter, ETA 40:48:19)
Iteration 20000 M( 38000009 )C, 0xd91fe21f272e5099, n = 2097152, clLucas v1.01 err = 0.1406 (0:39 real, 3.8636 ms/iter, ETA 40:45:38)
Iteration 30000 M( 38000009 )C, 0x7ec1302ed5173c26, n = 2097152, clLucas v1.01 err = 0.1406 (0:39 real, 3.8766 ms/iter, ETA 40:53:16)[/CODE]


GPU1:

[CODE]Iteration 10000 M( 38000009 )C, 0xfd9116e3760e4571, n = 2097152, clLucas v1.01 err = 0.1406 (0:38 real, 3.8664 ms/iter, ETA 40:48:05)
Iteration 20000 M( 38000009 )C, 0xd91fe21f272e5099, n = 2097152, clLucas v1.01 err = 0.1406 (0:39 real, 3.8693 ms/iter, ETA 40:49:15)
Iteration 30000 M( 38000009 )C, 0x7ec1302ed5173c26, n = 2097152, clLucas v1.01 err = 0.1406 (0:39 real, 3.8728 ms/iter, ETA 40:50:51)
[/CODE]

vs Titan:
[CODE]Iteration 10000 M( 38000009 )C, 0xfd9116e3760e4571, n = 2097152, CUDALucas v2.03 err = 0.1953 (0:18 real, 1.8289 ms/iter, ETA 19:17:59)
Iteration 20000 M( 38000009 )C, 0xd91fe21f272e5099, n = 2097152, CUDALucas v2.03 err = 0.1953 (0:18 real, 1.8124 ms/iter, ETA 19:07:16)
Iteration 30000 M( 38000009 )C, 0x7ec1302ed5173c26, n = 2097152, CUDALucas v2.03 err = 0.2031 (0:18 real, 1.8168 ms/iter, ETA 19:09:45)
[/CODE]
Titan is more efficient 250W vs 375W, but I guess th code for opencl is still early days.

-- Craig

Karl M Johnson 2013-10-18 17:39

[QUOTE=nucleon;356678]Ok, only a fortnight late. 7990@1GHz/1.5GHz mem clk

GPU0:
[CODE]
Iteration 10000 M( 38000009 )C, 0xfd9116e3760e4571, n = 2097152, clLucas v1.01 err = 0.1406 (0:39 real, 3.8668 ms/iter, ETA 40:48:19)
Iteration 20000 M( 38000009 )C, 0xd91fe21f272e5099, n = 2097152, clLucas v1.01 err = 0.1406 (0:39 real, 3.8636 ms/iter, ETA 40:45:38)
Iteration 30000 M( 38000009 )C, 0x7ec1302ed5173c26, n = 2097152, clLucas v1.01 err = 0.1406 (0:39 real, 3.8766 ms/iter, ETA 40:53:16)[/QUOTE]
Based on that data, R9 290X may very roughly yield from 3.514 to 2.811 ms/LL iter for that exp using that FFT (using default and boost clocks respectively).

kracker 2013-10-18 19:46

Perhaps. In gaming, it is neck and neck with the titan based on early benchies from Tom's and Anandtech, But compute will be what matters... :smile: Also, it is the first gpu to have a [I]512[/I] [I]bit[/I] memory bus.

Manpowre 2013-10-18 20:00

I get:

GTX 590 board:
[CODE]
Starting M38000009 fft length = 2097152
Iteration 10000 M( 38000009 )C, 0xfd9116e3760e4571, n = 2097152, CUDALucas v2.03
err = 0.1914 (0:40 real, 3.9348 ms/iter, ETA 41:31:21)
Iteration 20000 M( 38000009 )C, 0xd91fe21f272e5099, n = 2097152, CUDALucas v2.03
err = 0.1914 (0:38 real, 3.8383 ms/iter, ETA 40:29:37)
Iteration 30000 M( 38000009 )C, 0x7ec1302ed5173c26, n = 2097152, CUDALucas v2.03
err = 0.1914 (0:39 real, 3.8376 ms/iter, ETA 40:28:33)
[/CODE]

GTX Titan:
[CODE]
Iteration 10000 M( 38000009 )C, 0xfd9116e3760e4571, n = 2097152, CUDALucas v2.03
err = 0.2090 (0:19 real, 1.8580 ms/iter, ETA 19:36:26)
Iteration 20000 M( 38000009 )C, 0xd91fe21f272e5099, n = 2097152, CUDALucas v2.03
err = 0.2090 (0:17 real, 1.7758 ms/iter, ETA 18:44:04)
Iteration 30000 M( 38000009 )C, 0x7ec1302ed5173c26, n = 2097152, CUDALucas v2.03
err = 0.2090 (0:18 real, 1.7750 ms/iter, ETA 18:43:17)
Iteration 40000 M( 38000009 )C, 0x91312c6488821eac, n = 2097152, CUDALucas v2.03
err = 0.2090 (0:18 real, 1.7756 ms/iter, ETA 18:43:23)
[/CODE]

kracker 2013-10-18 20:17

Interesting. So I guess 590 and 7990 almost tie.

Just for the sake of curiosity, I'm curious on titan timings [I]without [/I]the DP switch.

Manpowre 2013-10-18 20:27

[QUOTE=kracker;356696]Interesting. So I guess 590 and 7990 almost tie.

Just for the sake of curiosity, I'm curious on titan timings [I]without [/I]the DP switch.[/QUOTE]

How do you get that kind of results without the DP switch ?
Mine switch is on. just logged into the node to check the setting.

sorry for hijacking cl thread.. but when I calculate, I get 58 ghz days LL testing with one titan per day with these timings on 73m exponents. It equals the statistics for that card, mabye a little better, but then I tweaked a little..

Manpowre 2013-10-19 07:18

I updated the titan thread with some timings for the titan, we dont need to discuss that here in LL thread :)

kracker 2013-10-20 15:32

Some timings for no reason really :razz:
2M FFT
HD-
7750: 15 ms
7770: 12 ms
7850: 8? ms
7870: 7 ms
7950: 4.2 ms
7970: 3.7 ms
7990: 3.9 ms(x2)

Manpowre 2013-10-20 17:02

[QUOTE=kracker;356864]Some timings for no reason really :razz:
2M FFT
HD-
7750: 15 ms
7770: 12 ms
7850: 8? ms
7870: 7 ms
7950: 4.2 ms
7970: 3.7 ms
7990: 3.9 ms(x2)[/QUOTE]

Very interesting actually.
It kind of shows us where to exptect R9 290x.. I guess 2.5-2.8ms with the same FFT length test ?

LaurV 2013-10-21 13:34

[QUOTE=kracker;356864]Some timings for no reason really :razz:
2M FFT
HD-
7750: 15 ms
7770: 12 ms
7850: 8? ms
7870: 7 ms
7950: 4.2 ms
7970: 3.7 ms
7990: 3.9 ms(x2)[/QUOTE]
Confirm for 7990 and 7970, with about 5% longer for me (as I know, you push the memory clock, which I can't do, the cards seems not to be so stable, they are air cooled, and here is still hot during the day, when nobody home and no aircond running).


All times are UTC. The time now is 22:30.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.