mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   GPU Computing (https://www.mersenneforum.org/forumdisplay.php?f=92)
-   -   CUDALucas (a.k.a. MaclucasFFTW/CUDA 2.3/CUFFTW) (https://www.mersenneforum.org/showthread.php?t=12576)

Mark Rose 2015-02-27 18:43

[QUOTE=MacFactor;396570]Thanks, but it didn't fix my problem.[/quote]

I had to compile it from source as I was running into the same issue. On Ubuntu 14.04, I installed nvidia-cuda-toolkit, changed the cuda install location at the top of Makefile to /usr, ran `make`, and it worked.

Mark Rose 2015-02-27 19:16

2 Attachment(s)
[QUOTE=James Heinrich;396554]Thanks, much appreciated -- I didn't have any Compute 5.2 benchmarks. It's within 3% of predicted so I'm happy.

To everyone else: I still don't have any benchmarks for Compute 5.0, 3.5 or 2.1 under CuLu 2.05, so more benchmarks are welcome, either here or [email]james@mersenne.ca[/email][/QUOTE]

Here are two benchmarks for low end 2.1 cards. Attached are the command outputs.

Device GeForce GT 430
Compatibility 2.1
clockRate (MHz) 1400
memClockRate (MHz) 600

fft max exp ms/iter
1024 19535569 13.1073
1152 21921901 14.9833
1225 23280269 16.5440
1280 24302527 17.4606
1296 24599717 17.5964
1323 25101101 18.1876
1372 26010389 20.1547
1568 29640913 20.1924
1600 30232693 20.5022
1728 32597297 23.1662
1792 33778141 24.0094
2048 38492887 26.3347
2304 43194913 31.8484
2401 44973503 33.1631
2592 48471289 36.1269
2744 51250889 40.7279
3200 59570449 40.8349
3456 64229677 45.8811
3888 72075517 55.4194
4096 75846319 55.6313
4320 79902611 73.0512
4375 80897867 73.7962
4480 82797151 74.6121
5184 95507747 75.5244
5488 100984691 82.0007
6272 115080019 83.8615
6400 117377567 85.8051
6912 126558077 98.1325
7776 142017539 110.4415
8192 149447533 115.0217

Device GeForce GT 520
Compatibility 2.1
clockRate (MHz) 1620
memClockRate (MHz) 600

fft max exp ms/iter
1024 19535569 19.1582
1152 21921901 21.1080
1260 23930909 25.1102
1296 24599717 25.1694
1344 25490893 26.9664
1440 27271147 27.6734
1568 29640913 29.6729
1600 30232693 31.0359
1728 32597297 32.7081
1792 33778141 36.0988
2048 38492887 37.7887
2240 42020509 45.4196
2304 43194913 46.2915
2592 48471289 50.0936
2880 53735041 58.4163
3136 58404433 62.8308
3200 59570449 68.3727
3360 62483353 71.3736
3456 64229677 72.2613
3584 66556463 74.7343
3600 66847171 78.6148
4096 75846319 80.5501
4320 79902611 93.3572
4608 85111207 96.0147
5040 92911087 104.0299
5184 95507747 106.3927
5600 103000823 124.7254
5760 105879517 126.8726
6048 111056879 131.7726
6144 112781477 135.7366
6272 115080019 137.5057
6300 115582697 143.4483
6480 118813021 147.0057
6720 123117161 151.9291
6912 126558077 157.1928
7168 131142761 157.2835
7200 131715607 159.5422
7776 142017539 161.8901
7840 143161159 175.8101
8064 147162241 180.3350
8192 149447533 180.9630

James Heinrich 2015-02-27 19:39

[QUOTE=Mark Rose;396581]Here are two benchmarks for low end 2.1 cards.[/QUOTE]Thanks. Those numbers are a fair bit higher (~25%) than expected. If someone has a higher-end Compute 2.1 card (e.g. GTX 460 or 560) that matches those numbers I'll feel more confident in updating the chart.

Mark Rose 2015-02-27 21:50

[QUOTE=James Heinrich;396582]Thanks. Those numbers are a fair bit higher (~25%) than expected. If someone has a higher-end Compute 2.1 card (e.g. GTX 460 or 560) that matches those numbers I'll feel more confident in updating the chart.[/QUOTE]

They were done with CUDA 5.5 if it matters.

owftheevil 2015-02-27 23:20

I also have results for a 570, 780, Titan, and Titan black if you want them.


[CODE]
Device GeForce GTX 560 Ti
Compatibility 2.1
clockRate (MHz) 1940
memClockRate (MHz) 2080


fft max exp ms/iter
1024 19535569 2.3434
1152 21921901 2.6485
1260 23930909 3.0699
1296 24599717 3.0931
1344 25490893 3.3730
1440 27271147 3.4673
1568 29640913 3.7194
1600 30232693 3.9279
1728 32597297 4.0755
1792 33778141 4.3713
2048 38492887 4.6964
2240 42020509 5.5626
2304 43194913 5.5691
2592 48471289 6.2081
2688 50227213 7.0311
2880 53735041 7.1812
3136 58404433 7.7523
3200 59570449 8.2700
3360 62483353 8.5920
3456 64229677 8.6056
3584 66556463 8.9110
3600 66847171 9.6526
3780 70115887 9.8984
4096 75846319 9.9148
4320 79902611 11.2540
4608 85111207 11.4486
5040 92911087 12.6529
5184 95507747 12.9797
5292 97454309 14.6345
5376 98967641 14.8351
5760 105879517 15.0610
6048 111056879 15.8011
6144 112781477 16.1751
6272 115080019 16.2582
6300 115582697 17.0884
6480 118813021 17.3343
6720 123117161 17.9061
6912 126558077 18.2475
7168 131142761 18.5232
7200 131715607 19.6253
7776 142017539 19.8418
7840 143161159 21.2217
8192 149447533 21.2728
[/CODE]

James Heinrich 2015-02-28 00:11

[QUOTE=owftheevil;396608]I also have results for a 570, 780, Titan, and Titan black if you want them.[/QUOTE]Your 560ti result was more in line with what I'd seen before. These numbers are from v2.05 correct?

I already have 570/580 results, but more is merrier if you have them handy. I would also be very interested in your 780/Titan/Black results please.

Mark Rose 2015-02-28 01:54

I can bench on three more 580's and a 760 if it would be useful.

owftheevil 2015-02-28 02:14

[CODE]
Device GeForce GTX 780
Compatibility 3.5
clockRate (MHz) 1032
memClockRate (MHz) 3004


fft max exp ms/iter
1024 19535569 1.2052
1080 20580341 1.4758
1152 21921901 1.4977
1280 24302527 1.6422
1296 24599717 1.6571
1323 25101101 1.8665
1344 25490893 1.8907
1440 27271147 1.9198
1568 29640913 1.9577
1600 30232693 2.0038
1728 32597297 2.2062
1792 33778141 2.2945
2048 38492887 2.4171
2304 43194913 2.8495
2560 47885689 3.2959
2592 48471289 3.3462
2646 49459153 3.8456
2700 50446621 3.8838
2800 52274087 3.9217
2880 53735041 3.9759
2916 54392209 3.9872
3136 58404433 4.0511
3200 59570449 4.2620
3240 60298969 4.5078
3584 66556463 4.5897
4096 75846319 4.9067
4608 85111207 5.9424
5120 94353877 6.7233
5184 95507747 6.9934
5292 97454309 7.5551
5600 103000823 7.5775
5832 107174381 8.0734
6272 115080019 8.3218
6400 117377567 8.9122
6912 126558077 9.1317
7168 131142761 9.4188
7200 131715607 9.7199
8192 149447533 10.2932
[/CODE]

owftheevil 2015-02-28 02:16

All with CUDAlucas 2.05.1 and CUDA-5.5

[CODE]
Device GeForce GTX TITAN
Compatibility 3.5
clockRate (MHz) 980
memClockRate (MHz) 3004


fft max exp ms/iter
1024 19535569 0.6874
1080 20580341 0.8469
1296 24599717 0.8590
1568 29640913 1.0307
1600 30232693 1.1269
1728 32597297 1.2268
2000 37609879 1.2813
2048 38492887 1.3269
2592 48471289 1.6516
2646 49459153 2.0153
2700 50446621 2.0498
3136 58404433 2.1236
3200 59570449 2.4147
3240 60298969 2.4481
4000 74106457 2.5415
4096 75846319 2.5937
4320 79902611 3.2794
4374 80879779 3.3026
4500 83158811 3.3899
4536 83809729 3.4085
5184 95507747 3.4727
5292 97454309 3.9633
5400 99399967 4.0524
5600 103000823 4.2176
5832 107174381 4.3864
6000 110194363 4.5187
6048 111056879 4.5813
6125 112440191 4.6094
6272 115080019 4.6856
6400 117377567 4.7771
6480 118813021 4.8461
6561 120266023 4.8945
6750 123654943 5.0736
6912 126558077 5.1885
7000 128134459 5.2399
7056 129137381 5.3293
8000 146019329 5.3398
8192 149447533 5.4728
[/CODE]

[CODE]
Device GeForce GTX TITAN Black
Compatibility 3.5
clockRate (MHz) 1110
memClockRate (MHz) 3500


fft max exp ms/iter
1024 19535569 0.6642
1080 20580341 0.8309
1296 24599717 0.8310
1568 29640913 0.9913
1600 30232693 1.0648
1728 32597297 1.1501
2000 37609879 1.2579
2048 38492887 1.2876
2592 48471289 1.6101
2744 51250889 1.9976
3136 58404433 2.0265
3200 59570449 2.4074
3240 60298969 2.4360
4000 74106457 2.4614
4096 75846319 2.5284
4320 79902611 3.2676
4374 80879779 3.2938
5184 95507747 3.3084
5292 97454309 3.9584
5488 100984691 4.0006
5600 103000823 4.1916
5832 107174381 4.3808
6048 111056879 4.5168
6125 112440191 4.5787
6250 114685037 4.6657
6272 115080019 4.6844
6400 117377567 4.7705
6480 118813021 4.8617
6561 120266023 4.8914
6750 123654943 5.0513
8000 146019329 5.1017
8192 149447533 5.1987

[/CODE]


[CODE]
Device GeForce GTX 570
Compatibility 2.0
clockRate (MHz) 1464
memClockRate (MHz) 1900


fft max exp ms/iter
1024 19535569 1.7232
1080 20580341 2.0214
1120 21325891 2.0459
1152 21921901 2.0549
1176 22368691 2.2586
1296 24599717 2.3083
1440 27271147 2.5121
1568 29640913 2.7261
1600 30232693 2.8704
1728 32597297 3.0130
2048 38492887 3.2886
2160 40551479 4.0701
2304 43194913 4.1211
2352 44075249 4.4544
2592 48471289 4.4614
2880 53735041 5.1680
3072 57237889 5.4704
3136 58404433 5.6023
3200 59570449 6.0137
3360 62483353 6.0838
3456 64229677 6.3130
3584 66556463 6.3526
4096 75846319 6.8811
4320 79902611 7.7969
4608 85111207 7.9405
5040 92911087 8.9266
5184 95507747 9.2909
5400 99399967 10.4533
5600 103000823 10.6724
5670 104260469 10.8861
5760 105879517 10.9340
6144 112781477 10.9955
6272 115080019 11.9533
6400 117377567 12.3915
6480 118813021 12.5910
7056 129137381 12.9406
7168 131142761 13.2404
7200 131715607 13.3282
7776 142017539 13.8598
7840 143161159 14.8712
8192 149447533 15.0081

[/CODE]

LaurV 2015-02-28 02:36

Your Titan/Titan Black are about 1-2% faster than mine :tantrum:.
Were they done with P95 running? (mine yes, I try to go for "as close to real conditions as possible" when I build them)
Or you keep them cooler? (how? mine are air cooled and with hot Thai days they lose a lot of productivity when the aircond is off, for example, the TDP go down, and the iterations go from 3 to 4 ms, or even 5 etc.)
Just curious.

MacFactor 2015-02-28 03:56

Please see this post:
[url]http://www.mersenneforum.org/showthread.php?p=396570#post396570[/url]


All times are UTC. The time now is 23:04.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.