![]() |
[QUOTE=SethTro;601925]I'm happy to look at this as a bug. I vaguely remember that I wasn't sure if I should always stop or only if the cofactor is composite.[/QUOTE]Thanks, but I've moved to using a separate machine with ecm.py and it is working as needed.
I'd leave as is unless others are interested. For me it is no longer an issue. Again, thank you for considering it. |
There seems to be an oddity about using a GPU in GMP-ECM.
The [C]--help[/C] switch shows the maximum size of a B1 value is 2^1018. Using Windows Calculator to calculate log(2) * 1018 says this is a 308 digit value. If I run this: [CODE]echo "2^14447-1" | gpuecm -gpu -maxmem 2048 5e6 10e6[/CODE]I get this message: [QUOTE]GMP-ECM 7.0.4-dev [configured with MPIR 2.7.2, --enable-gpu, --enable-openmp] [ECM] Input number is 2^14447-1 (4349 digits) GPU: Error, input number should be strictly lower than 2^1018 please report internal errors at <ecm-discuss@lists.gforge.inria.fr>. [/QUOTE]B1 in my example is 5e6 or 5,000,000. This is obviously smaller than a 308 digit number. I am missing something here and I cannot determine what it is. |
[QUOTE=storm5510;606417]There seems to be an oddity about using a GPU in GMP-ECM.
The [C]--help[/C] switch shows the maximum size of a B1 value is 2^1018. Using Windows Calculator to calculate log(2) * 1018 says this is a 308 digit value. If I run this: [CODE]echo "2^14447-1" | gpuecm -gpu -maxmem 2048 5e6 10e6[/CODE]I get this message: B1 in my example is 5e6 or 5,000,000. This is obviously smaller than a 308 digit number. I am missing something here and I cannot determine what it is.[/QUOTE] You are missing something. The inputted number is too big. You specified 2^14447-1. The max is 2^1018. :ermm: |
If you are willing to compile gmp-ecm from source you can test larger input numbers (it's also faster).
See [url]https://www.mersenneforum.org/showthread.php?t=27103[/url] |
[QUOTE=paulunderwood;606419]You are missing something. The inputted number is too big. You specified 2^14447-1. The max is 2^1018. :ermm:[/QUOTE]
Understood. IMHO, it is not good for much of anything at this size. |
Is there any known progress in implementing stage 1 in (at least somewhat optimised) OpenCL up to 512 or maybe 1024 bits?
|
[QUOTE=kruoli;625295]Is there any known progress in implementing stage 1 in (at least somewhat optimised) OpenCL up to 512 or maybe 1024 bits?[/QUOTE]
I don't know of anyone working on this. I did some quick google searches and didn't find any OpenCl arbitrary precision libraries. The old ECM gpu code was fairly straightforward grade school multiplication algorithm (IIRC) that could be adapted quickly for OpenCl, but if you/someone wanted competitive speeds from OpenCl more code would be needed for montgomery multiplication, various size kernels, etc. |
[QUOTE=SethTro;625329]I don't know of anyone working on this. I did some quick google searches and didn't find any OpenCl arbitrary precision libraries. The old ECM gpu code was fairly straightforward grade school multiplication algorithm (IIRC) that could be adapted quickly for OpenCl, but if you/someone wanted competitive speeds from OpenCl more code would be needed for montgomery multiplication, various size kernels, etc.[/QUOTE]
AFAIK, nothing meaningful has been done with [I]GMP-ECM[/I] for a long time. I stopped using it several years ago. It was not worth the time required to run anything. [I]OpenCL[/I] appears not to work a GPU very hard. I have used [I]gpuOwl[/I] a lot, so I know. It does a good job, but there is a limit to its GPU utilization. Around 60% seems to be its limit. |
[QUOTE=storm5510;625373][I]OpenCL[/I] appears not to work a GPU very hard. I have used [I]gpuOwl[/I] a lot, so I know. It does a good job, but there is a limit to its GPU utilization. Around [B]60%[/B] seems to be its limit.[/QUOTE]That statement is not consistent with my experience. I routinely see 99-100% GPU load indicated in GPU-Z for Gpuowl running on Radeon VII, RX480, or RX550.
|
[QUOTE=kriesel;625374]That statement is not consistent with my experience. I routinely see 99-100% GPU load indicated in GPU-Z for Gpuowl running on Radeon VII, RX480, or RX550.[/QUOTE]
I have no experience with [I]AMD[/I] GPU's. All of mine, past and present, are [I]Nvidia[/I] based. Something tells me that [I]AMD[/I] variations are much better suited to run OpenCL than [I]Nvidia[/I] models are. CUDA is much better for [I]Nvidia[/I], it seems. [I]mfaktc[/I] cranks out over 3,000 Ghz-d/day on my ever-older RTX-2080. I am satisfied with that. |
NVIDIA quadro k620, gpuowl v6.11-382, gpu-Z v2.52.0, indicated gpu load [B]100%[/B].
Most recent NVIDIA consumer GPUs have poor DP performance (PRP, LL, P-1) relative to SP performance (TF), while Teslas and AMD typically have decent DP performance ratios. I almost never use RTX2080 or GTX 1650 for anything other than TF because of the 32:1 SP/DP ratio. |
| All times are UTC. The time now is 04:22. |
Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.