mersenneforum.org


saviourz 2020-03-19 21:44

CUDALucas - Compile on CUDA10.0
 
Hi folks,

I am compiling CUDALucas 2.06 from the source on SourceForge ([url]https://sourceforge.net/projects/cudalucas/files/[/url]). After running make, I ran ./CUDALucas -r 1 to check whether the build is reliable.

Unfortunately, every residue came out as [0000000000000].

My OS is Ubuntu 18.04. I have already set the CUDA path in the Makefile and also set --generate-code arch=compute_60,code=sm_60.

Messages shown at the top of the ./CUDALucas output:
[CODE]binary compiled for CUDA 10.10
CUDA runtime version 10.0
CUDA driver version 10.20
[/CODE]

The GPU is a Tesla V100-PCIE and the driver version is 440.33.01.

I have read all the posts related to my question, but none of them solved my problem. In addition, when I set worktodo.txt to "Test=79437629", I got
"Illegal residue: 0x0000000000000000. See mersenneforum.org for help.".

Thanks in advance for any replies, and sorry if I posted in the wrong place.

axn 2020-03-20 03:11

[QUOTE=saviourz;540174]My OS is Ubuntu 18.04 and I have already changed CUDA path in makefile and also --generate-code arch=compute_60, code=sm_60.

Message showed on the top of ./CUDALucas:
binary compiled for CUDA 10.10
CUDA runtime version 10.0
CUDA driver version 10.20

GPU type is Tesla V100-PCIE and driver version is 440.33.01.[/QUOTE]

V100 or P100? V100 is CC7.0, not 6.0, so probably try recompiling with that. Ideally, even 6.0 should work, as long as a PTX is available, but ...
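
A minimal sketch of how the compute capability maps onto the nvcc flags discussed here. The helper name `gencode_flags` is hypothetical and not part of CUDALucas or its Makefile. It emits one flag for native code (`code=sm_XX`) and one that embeds PTX (`code=compute_XX`), which is what lets a newer GPU JIT-compile a build made for an older architecture. Note that there is no space after the comma.

```shell
#!/bin/sh
# Hypothetical helper (not part of CUDALucas): build nvcc --generate-code
# flags from a compute capability written as "major.minor", e.g. "7.0".
gencode_flags() {
    cc=$(printf '%s' "$1" | tr -d '.')   # "7.0" -> "70"
    # Native machine code (SASS) for exactly this architecture.
    printf '%s ' "--generate-code arch=compute_${cc},code=sm_${cc}"
    # PTX for the same virtual architecture, so newer GPUs can JIT-compile it.
    printf '%s\n' "--generate-code arch=compute_${cc},code=compute_${cc}"
}

# Example: flags for a V100 (compute capability 7.0).
gencode_flags 7.0
```

The printed flags would replace the existing --generate-code entry in the Makefile; where exactly that entry lives depends on the Makefile in use.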

saviourz 2020-03-20 03:27

[QUOTE=axn;540196]V100 or P100? V100 is CC7.0, not 6.0, so probably try recompiling with that. Ideally, even 6.0 should work, as long as a PTX is available, but ...[/QUOTE]

It's a V100. I tried CC 7.0 by setting "--generate-code arch=compute_70, code=sm_70".

But I still got "Illegal Residue 0x0000000".

Am I understanding correctly how to set the CC version to 7.0?

axn 2020-03-20 03:30

[QUOTE=saviourz;540198]Am I right for understanding how to set the CC version to 7.0?[/QUOTE]
That part is correct.

EDIT:- Next thing I'd look for is the fact that you have multiple CUDA versions in your system:
[CODE]binary compiled for CUDA 10.10
CUDA runtime version 10.0
CUDA driver version 10.20
[/CODE]
During compilation it is picking up 10.10, during execution it is picking up 10.0, and the driver is 10.20. I'm not sure what it all means.

Would it be possible to nuke all nvidia stuff from the system and do a clean install of latest drivers + cuda toolkit?
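
A rough sketch of a consistency check for the three numbers above. CUDA encodes versions as integers of the form 1000*major + 10*minor (10020 for CUDA 10.2). It is an assumption here that the "10.10", "10.0" and "10.20" printed by CUDALucas correspond to 10010, 10000 and 10020. The function names are hypothetical. The check reflects two general expectations: the driver should support a CUDA version at least as new as the runtime, and the binary should normally run against the runtime it was built with.

```shell
#!/bin/sh
# Convert a CUDA version integer (e.g. 10020) to "major.minor" (e.g. 10.2).
cuda_version_string() {
    echo "$(( $1 / 1000 )).$(( ($1 % 1000) / 10 ))"
}

# Hypothetical sanity check. Arguments: compiled, runtime and driver
# versions as CUDA-style integers. Returns 0 if they look consistent.
check_cuda_versions() {
    compiled=$1
    runtime=$2
    driver=$3
    status=0
    if [ "$runtime" -gt "$driver" ]; then
        echo "runtime $(cuda_version_string "$runtime") is newer than the driver supports ($(cuda_version_string "$driver"))"
        status=1
    fi
    if [ "$compiled" -ne "$runtime" ]; then
        echo "built against $(cuda_version_string "$compiled") but running with $(cuda_version_string "$runtime")"
        status=1
    fi
    if [ "$status" -eq 0 ]; then
        echo "versions look consistent"
    fi
    return "$status"
}

# The combination reported in this thread (assumed integer encoding).
check_cuda_versions 10010 10000 10020 || true
```

A mismatch between the compiled and runtime versions usually means the build found one toolkit's headers while the loader found another toolkit's libcudart, which is consistent with having several toolkits installed side by side.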

saviourz 2020-03-20 03:42

Thanks for the advice! I'll try to unify those three versions and see what happens.

saviourz 2020-03-20 22:42

Hi, I have now unified all three CUDA versions at 10.20:
[CODE]binary compiled for CUDA 10.20
CUDA runtime version 10.20
CUDA driver version 10.20
[/CODE]

But the problem persists: "Illegal Residue 0x0000000". I am now trying to run it on a Tesla M6 with "--generate-code arch=compute_60,code=sm_60" in the Makefile.

Which part should I suspect now? I checked nvidia-smi and saw that ./CUDALucas was running on the GPU.

axn 2020-03-21 04:14

Somebody faced a similar problem and found that the pre-compiled binary worked. Try reading the next few posts starting here: [url]https://www.mersenneforum.org/showthread.php?p=497460#post497460[/url]

But people have made it work with native compilation as well. Maybe you can try PM-ing those folks. Not sure what platform they were using.

saviourz 2020-03-21 20:20

[QUOTE=axn;540347]Somebody faced similar problem, and found that the pre-compiled binary worked. Try reading the next few posts starting here: [url]https://www.mersenneforum.org/showthread.php?p=497460#post497460[/url]

But people have made it work with native compilation as well. Maybe you can try PM-ing those folks. Not sure what platform they were using.[/QUOTE]

Thanks for your advice! I switched to CUDA 9.1 and used the pre-compiled Linux CUDALucas binary from [url]https://download.mersenne.ca/CUDALucas/old[/url]. The output looks good: no zero residue appears during the iterations.

But I am still trying to figure out my compilation problem. Open to any advice!

saviourz 2020-03-21 23:58

1 Attachment(s)
FYI,

I succeeded in compiling CUDALucas v2.06 on the Tesla M6 with CUDA runtime 10.20. The self-test passes with "./CUDALucas -r 1".

I don't clearly understand how the previous error happened. The only changes I made are in the Makefile:

[C]CUDA = "path to CUDA10.20"

--generate-code arch=compute_50,code=sm_50[/C]

I have attached the executable. Hope it helps.

kriesel 2020-03-22 00:15

[QUOTE=saviourz;540418]FYI,

I succeed in compiling CUDALucas v2.06 on Tesla M6 with CUDA runtime 10.20. The test is correct with "./CUDALucas -r 1".

I don't understand clearly about how the previous error happened. Only changes I made are in Makefile:

[C]CUDA = "path to CUDA10.20"

--generate-code arch=compute_50, code=sm_50[/C]

I attached this executive file. Hope it can help.[/QUOTE]

Outstanding! Thanks for posting the executable file.
Welcome to the forum.
If you haven't seen it yet, you might find some background and reference info interesting at [URL]https://www.mersenneforum.org/showthread.php?t=24607[/URL]
Please run some double-checks with that executable to confirm it produces correct runs.

saviourz 2020-03-22 00:18

[QUOTE=kriesel;540422]Outstanding! Thanks for posting the executable file.
Welcome to the forum.
If you haven't seen it yet, you might find some background and reference info interesting at [URL]https://www.mersenneforum.org/showthread.php?t=24607[/URL][/QUOTE]

Thanks for the useful information!

