![]() |
![]() |
#485 | |
"Mark"
Apr 2003
Between here and the
188416 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
#486 |
"Mark"
Apr 2003
Between here and the
22·3·523 Posts |
![]()
I have released 2.1.4. Here are the changes:
Code:
framework: Fixed an issue with creating GPU kernels on OS X. srseive2cl: new release Finally an OpenCL version of srsieve2. srsieve2cl is at least 3x faster than srsieve2, On my GPU it is limited to about 5000 sequences due to GPU memory limitations. I do not know what the limits are for other GPUs. It will switch to the GPU at p>1e6. |
![]() |
![]() |
![]() |
#487 | |
Sep 2011
Germany
2×3×461 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
#488 |
"Mark"
Apr 2003
Between here and the
11000100001002 Posts |
![]()
3257 sequences (9383 subsequences) using the GPU takes about 37 MB of RAM in the CPU and about 6 GB dedicated memory in the GPU (per Task Manager).
I do not recall how much CPU memory was used with 80000 sequences, but I thought it was around 2 GB. Last fiddled with by rogue on 2021-01-10 at 19:46 |
![]() |
![]() |
![]() |
#489 | |
Jun 2003
1,579 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
#490 |
"Mark"
Apr 2003
Between here and the
22·3·523 Posts |
![]()
I do not look at p/sec as it is calculated differently. I look at factors per second. It is far more accurate. Nevertheless srsieve2 and sr2sieve can be faster if your GPU isn't particularly fast.
|
![]() |
![]() |
![]() |
#491 |
"Dylan"
Mar 2017
23D16 Posts |
![]()
Might it be possible to update the primesieve code used by mtsieve to version 7.6? It seems to provide some improvements over 7.3 which is currently used:
|
![]() |
![]() |
![]() |
#492 | |
Sep 2011
Germany
2×3×461 Posts |
![]() Quote:
Tried now the cl version on a RTX 5500XT with 8GB RAM but hit the limit, there was a driver timeout because of too much RAM used, I think it was 7.4GB. srsieve2cl.exe -n2501 -N10000 -P1e9 -M 15000 -spl_remain.txt -fB 2021-01-11 19:57:22: Sieve completed at p=1000071173. Primes tested 50772480. Found 87459308 factors. 16098192 terms remaining. Time 239.43 seconds The speed is awesome, still running this on 16 cores srsieve2 to compare. Could be much better on faster cards with 16-24GB RAM. |
|
![]() |
![]() |
![]() |
#493 | |
"Mark"
Apr 2003
Between here and the
22×3×523 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
#494 | |
"Mark"
Apr 2003
Between here and the
22·3·523 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
#495 | |
Sep 2011
Germany
ACE16 Posts |
![]() Quote:
srsieve2 -n2501 -N10000 -P1e9 -W16 -spl_remain.txt -fB 2021-01-11 20:50:35: Sieve completed at p=1000000007. Primes tested 50847420. Found 92827983 factors. 10729517 terms remaining. Time 4990.80 seconds The CPU reduces the sievefile a bit more than GPU. |
|
![]() |
![]() |