![]() |
|
|
#859 |
|
Romulan Interpreter
Jun 2011
Thailand
72·197 Posts |
Test if I can attach it. (I did not expect that you generate gigabytes of those test results...
)
|
|
|
|
|
|
#860 | |
|
Jun 2010
Pennsylvania
2×467 Posts |
Quote:
Going along with his other idea there, GPUSievePrimes is at 92486. GPUSieveSize has been back at 64 ever since I reported the results with that value at 48. With these settings, TF of 72xxxxxx exponents has been hovering around 144-146 GHz-d/d. All of this work done in mfakto x32. Rodrigo Last fiddled with by Rodrigo on 2013-07-02 at 20:07 Reason: add'l info |
|
|
|
|
|
|
#862 | |
|
Nov 2010
Germany
3×199 Posts |
Quote:
What is the general opinion if any of the tests need to be repeated? I tried to come up with an estimate how "bad" the bug is ... My opinion is, that if anyone used GPUSieveProcessSize=24 along with really low GPUSieveSize(say, < 20), then chances of missing a factor (like one in 256 to one in 1024) are too high and the tests should be repeated. For GPUSieveSize=32 and above, missing between one in 4096 and one in 16384 factors is probably not worth the effort. But I'd like to hear other opinions. If you can provide a guideline for an "acceptable miss probability", then I can compile a list of GPUSieveSize settings that would require a re-test. |
|
|
|
|
|
|
#863 | |
|
Nov 2010
Germany
3·199 Posts |
Quote:
However, I do not yet have a good performance measurement of the GPU sieving itself - something to add soon. |
|
|
|
|
|
|
#864 |
|
Jun 2013
1538 Posts |
GPUSieveSize=64
GPUSieveProcessSize=16 What should I do? I don't think it's possible to "redo" or "double check" TF is it? Edit: Or are these settings safe? Upon reading it again it seems more like GPUSieveSize=64 only needs to be divisible by 3 for GPUSieveProcessSize=24? Edit2: Okay, I read more carefully, ignore this post (I would delete it but can't) Last fiddled with by blahpy on 2013-07-02 at 22:15 |
|
|
|
|
|
#865 | |
|
Nov 2010
Germany
3×199 Posts |
Quote:
|
|
|
|
|
|
|
#866 | |
|
Jun 2010
Pennsylvania
16468 Posts |
Quote:
Code:
# GPUSieveProcessSize defines how far many bits of the sieve each TF block # processes (in K bits). Larger values may lead to less wasted cycles by # reducing the number of times all threads in a warp are not TFing a # candidate. However, more shared memory is used which may reduce occupancy. # Smaller values should lead to a more responsive system (each kernel takes # less time to execute). GPUSieveProcessSize must be a multiple of 8. # # # Minimum: GPUSieveProcessSize=8 # Maximum: GPUSieveProcessSize=32 (requires GPUSievePrimes > 310) # # Default: GPUSieveProcessSize=24 GPUSieveProcessSize=16 ![]() But, whatever the reason, I seem to have stumbled onto the safest values for the time being. Rodrigo Last fiddled with by Rodrigo on 2013-07-03 at 02:35 |
|
|
|
|
|
|
#867 |
|
Nov 2010
Germany
3·199 Posts |
Hmm, the one that you can download with the win package reads:
Code:
# GPUSieveProcessSize defines how far many bits of the sieve each TF block # processes (in K bits). Larger values may lead to less wasted cycles by # reducing the number of times all threads in a warp are not TFing a # candidate. However, more shared memory is used which may reduce occupancy. # Smaller values should lead to a more responsive system (each kernel takes # less time to execute). GPUSieveProcessSize must be a multiple of 8. # # # Minimum: GPUSieveProcessSize=8 # Maximum: GPUSieveProcessSize=32 (requires GPUSievePrimes > 310) # # Default: GPUSieveProcessSize=16 GPUSieveProcessSize=16 |
|
|
|
|
|
#868 | |
|
"Kieren"
Jul 2011
In My Own Galaxy!
2×3×1,693 Posts |
Quote:
GPUSieveSize=128 GPUSieveProcessSize=8 I don't think I've ever used 24. |
|
|
|
|
|
|
#869 | |
|
Jun 2010
Pennsylvania
2·467 Posts |
Quote:
The only way that I can think of that could lead to the value changing in the comments line, would be if I had made the change manually to the comments line and never actually adjusted the real setting to the 24 that @kracker had recommended. (In which case I never changed it to OR from 16.) Although I have no memory of it, this is entirely possible: my wife and I spent several days doing some intense (physical and Web) car shopping. I can picture hurriedly making this change on the way out to yet another auto dealership. This must be what happened, there's no other sensible explanation. I plead temporary insanity.Rodrigo |
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| gpuOwL: an OpenCL program for Mersenne primality testing | preda | GpuOwl | 2718 | 2021-07-06 18:30 |
| mfaktc: a CUDA program for Mersenne prefactoring | TheJudger | GPU Computing | 3497 | 2021-06-05 12:27 |
| LL with OpenCL | msft | GPU Computing | 433 | 2019-06-23 21:11 |
| OpenCL for FPGAs | TObject | GPU Computing | 2 | 2013-10-12 21:09 |
| Program to TF Mersenne numbers with more than 1 sextillion digits? | Stargate38 | Factoring | 24 | 2011-11-03 00:34 |