|
|
#2058 | |
|
Basketry That Evening!
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88
3·29·83 Posts |
Quote:
By the way, how can I make it more responsive? I set GPUSieveProcessSize=8 and GPUSieveSize=16, and now it's usable, but still somewhat laggy. I don't really want to reduce GPUSieveSize further, so would something like reducing the sieve primes help? Or would that cause an even bigger performance hit than reducing GPUSieveSize further? |
|
|
|
|
|
|
#2059 |
|
"GIMFS"
Sep 2002
Oeiras, Portugal
3·491 Posts |
I've been running 0.20 on 60-61M exponents and am quite pleased with it. Faster, and "CPU-free", which is definitely a plus.
Nevertheless, I was surprised when I tried to run it on small exponents (by small I mean 2.6M exponents, from 62 to 64 bits). The GHz-d/d went down from ~258 to 40-45, and this even using the "LessClasses" version. It is way slower than 0.19 for this type of work. Is there any setting I should look into, or is it just the way it is? |
|
|
|
|
|
#2060 |
|
Banned
"Luigi"
Aug 2002
Team Italia
2·3·11·73 Posts |
Thank you Oliver and George
Now, my question. I am currently running 1 mfaktc 0.19 and 1 CUDALucas (DC) on my GTX 275, and 2 mfaktc 0.19 and 1 CUDALucas (DC+LL) on my GTX 580. If I run mfaktc 0.20, how much of the GPU can be used for CUDALucas? Luigi |
|
|
|
|
|
#2061 | |
|
"James Heinrich"
May 2004
ex-Northern Ontario
11×311 Posts |
Quote:
Experiment with GridSize (my exponents run in 10 seconds or less, so GridSize=0 made a big improvement for me), and use v0.20-LESS_CLASSES 64-bit (in CPU-sieving cases, 64-bit is faster; for GPU-sieving, 32-bit is faster). You may notice that the mfaktc v0.20 .plan now has a line for improved support below 2^64, but from talking to Oliver it's apparently non-trivial, so don't hold your breath. |
|
|
|
|
|
|
#2062 |
|
"James Heinrich"
May 2004
ex-Northern Ontario
D5D₁₆ Posts |
On your GTX 275: you can't -- GPU sieving isn't supported below CC 2.0 (that GPU is CC 1.3).
On any supported GPU: try and see. There's no controllable load-sharing; it's just a competition for GPU resources, whether it's mfaktc+CUDALucas or multiple instances of mfaktc. You'll likely get somewhere around a 50:50 balance, or it may be biased towards one program or the other, depending on how the code flows. The easiest way to find out is to try it. |
|
|
|
|
|
#2063 | |
|
Banned
"Luigi"
Aug 2002
Team Italia
4818₁₀ Posts |
Quote:
Luigi |
|
|
|
|
|
|
#2064 | |
|
Basketry That Evening!
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88
1C35₁₆ Posts |
Quote:
|
|
|
|
|
|
|
#2065 |
|
"Bill Staffen"
Jan 2013
Pittsburgh, PA, USA
650₈ Posts |
I will put up benchmarks for my GTX 480 when I get home, but I'll need to set the sieve back to default. I had gotten an extra 1.3 GHz-days/day by setting it down to 70000. When I get home from work I'll reset it to the default, run 5 numbers and post the results to your form.
It's currently doing 374.85 GHz-days/day. That is about 25 GHz-days/day faster than when I was running 4 instances of 0.19 (one per CPU core), because my CPU couldn't keep up with the GPU. |
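As a rough check on those figures, here is a small Python sketch of the arithmetic. It assumes the reported 374.85 already includes the 1.3 gained from the non-default sieve setting, and it treats "about 25" as exactly 25; both are assumptions, not measurements.

```python
# Figures quoted in the post (GHz-days/day).
current = 374.85     # reported rate for 0.20 on the GTX 480
gain_vs_019 = 25.0   # "about 25" faster than four 0.19 instances (treated as exact)
sieve_gain = 1.3     # extra throughput from the non-default sieve setting

# Implied rate of the old four-instance 0.19 setup.
old_rate = current - gain_vs_019

# Assumption: 374.85 was measured with the tweaked sieve setting,
# so the default-setting rate would be slightly lower.
default_rate = current - sieve_gain

speedup_pct = 100 * gain_vs_019 / old_rate
sieve_pct = 100 * sieve_gain / default_rate

print(f"implied 0.19 rate:       {old_rate:.2f} GHz-days/day")
print(f"gain from 0.20:          {speedup_pct:.1f} %")
print(f"gain from sieve setting: {sieve_pct:.2f} %")
```

Under these assumptions the move to 0.20 is worth roughly 7%, while the sieve tweak is well under 1%, so resetting it for the benchmark should barely change the result.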
|
|
|
|
|
#2066 |
|
May 2003
Belgium
116₁₆ Posts |
Is there a way to suppress the new line that is printed every 5 seconds?
For the benchmark (which asks for wall clock time), it would be quite useful to see only the first and last lines of the output. It was possible in the previous version, but I don't know whether it still is. |
|
|
|
|
|
#2067 | |
|
"Kieren"
Jul 2011
In My Own Galaxy!
27AE₁₆ Posts |
Quote:
Code:
    # possible values for PrintMode:
    # 0: print a new line for each finished class
    # 1: overwrite the current line (more compact output)
    #
    # Default: PrintMode=0
    PrintMode=0
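If you would rather script the change than edit the file by hand, a minimal Python sketch follows. The helper name `set_ini_value` is made up for illustration, and it assumes the ini file uses plain `key=value` lines with `#` comments, as in the excerpt above. It works on a string, so reading and writing the actual file is left to you.

```python
import re


def set_ini_value(text, key, value):
    """Return `text` with the first uncommented `key=...` line set to `value`.

    Lines starting with '#' are never touched. If the key is not present,
    a new `key=value` line is appended at the end.
    """
    pattern = re.compile(
        r"^[ \t]*" + re.escape(key) + r"[ \t]*=.*$", re.MULTILINE
    )
    replacement = f"{key}={value}"
    # Use a function as the replacement so backslashes in `value` stay literal.
    new_text, count = pattern.subn(lambda m: replacement, text, count=1)
    if count == 0:
        # Key not found: append it, making sure it starts on its own line.
        if text and not text.endswith("\n"):
            text += "\n"
        new_text = text + replacement + "\n"
    return new_text


sample = "# Default: PrintMode=0\nPrintMode=0\n"
print(set_ini_value(sample, "PrintMode", 1))
```

The same helper would apply to the other settings mentioned in this thread, such as GridSize or GPUSieveSize; keep a backup of the original file before overwriting it.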
|
|
|
|
|
|
#2068 | ||
|
"Oliver"
Mar 2005
Germany
11·101 Posts |
Quote:
Quote:
If you want to mix mfaktc and CUDALucas, you can run CUDALucas for half of the time and mfaktc for the rest. Oliver |
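One way to automate that kind of time-slicing is sketched below in Python. This is only an illustration: the function names are invented, the command lines are placeholders you would replace with your own, and it assumes both programs can be stopped and later resume from their own checkpoint or save files.

```python
import subprocess
import sys
import time


def run_slice(cmd, seconds):
    """Run `cmd` for at most `seconds`, then stop it; return its exit code."""
    proc = subprocess.Popen(cmd)
    try:
        return proc.wait(timeout=seconds)
    except subprocess.TimeoutExpired:
        # SIGTERM on POSIX, TerminateProcess on Windows. The program must be
        # able to pick up its work again on the next start (assumption).
        proc.terminate()
        return proc.wait()


def alternate(cmd_a, cmd_b, slice_seconds, rounds):
    """Run cmd_a and cmd_b in equal, alternating slices; return the run order."""
    order = []
    for _ in range(rounds):
        for name, cmd in (("a", cmd_a), ("b", cmd_b)):
            run_slice(cmd, slice_seconds)
            order.append(name)
    return order


if __name__ == "__main__":
    # Placeholder commands: a Python process that just sleeps stands in for
    # the real programs so the sketch can be tried without a GPU.
    stand_in = [sys.executable, "-c", "import time; time.sleep(30)"]
    started = time.monotonic()
    print(alternate(stand_in, stand_in, slice_seconds=1, rounds=1))
    print(f"elapsed: {time.monotonic() - started:.1f} s")
```

With equal slices each program gets the whole GPU for half of the wall-clock time, which is easier to reason about than letting two processes compete for it at once.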
||
|
|
|