|
|
#2685 |
|
Random Account
Aug 2009
7A4₁₆ Posts |
A simple question: Does mfaktc read the entire worktodo file into a queue, or does it take it one line at a time?
|
|
|
|
|
|
#2686 |
|
"James Heinrich"
May 2004
ex-Northern Ontario
11·311 Posts |
Pretty sure it reads line by line, skipping any invalid lines, until it finds a valid assignment.
Once it has finished the assignment, it rewrites the entire worktodo file, minus the assignment line it just completed. Side note: disk I/O can be a killer (e.g. 1-10 MB/s sustained) for things like TF>1G, where an assignment is completed every second or so and a large input buffer is maintained; a RAM drive is essential. |
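The behaviour described in this post can be sketched roughly as follows. This is a minimal illustration in Python, not mfaktc's actual code (mfaktc is written in C/CUDA). The line pattern `Factor=[key,]exponent,bit_min,bit_max` and the helper names `next_assignment` and `remove_line` are assumptions made for the example.

```python
import os
import re
import tempfile

# Assumed line shape: "Factor=[<assignment key>,]<exponent>,<bit_min>,<bit_max>".
# This pattern is an illustration only, not mfaktc's real parser.
ASSIGNMENT = re.compile(r"^Factor=(?:[^,]*,)?(\d+),(\d+),(\d+)\s*$")


def next_assignment(path):
    """Scan line by line and return (line_index, exponent, bit_min, bit_max)
    for the first valid assignment, skipping anything that does not parse."""
    with open(path) as fh:
        for index, line in enumerate(fh):
            match = ASSIGNMENT.match(line.strip())
            if match:
                exponent, bit_min, bit_max = map(int, match.groups())
                if bit_min < bit_max:
                    return index, exponent, bit_min, bit_max
    return None


def remove_line(path, index):
    """Rewrite the whole file without line `index` (hypothetical helper).

    The full rewrite is what makes every completed assignment cost
    I/O proportional to the size of the file."""
    with open(path) as fh:
        lines = fh.readlines()
    del lines[index]
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w") as tmp:
        tmp.writelines(lines)
    os.replace(tmp_path, path)  # swap the new file into place
```

Because the file is rewritten in full after each assignment, a long worktodo combined with one-second assignments means the whole file is written about once per second, which is consistent with the sustained disk traffic mentioned above.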
|
|
|
|
|
#2687 | |
|
Random Account
Aug 2009
3644₈ Posts |
Quote:
TF>1G: I suspected there were people out there doing this but I had no idea of the magnitude of it. A very interesting page; I bookmarked it.
|
|
|
|
|
|
|
#2688 |
|
"Curtis"
Sep 2016
Fort Collins, CO
A₁₆ Posts |
I think there would be a speedup for Pascal cards if the Linux version were compiled with CUDA 8.0. Actually, I cannot run the current binaries unless I change the makefile and compile them for compute capability 6.1. But even if you can get this to run on a Pascal card in its current form, my experience suggests that there is a performance penalty for running binaries compiled for compute capability <6.0 cards on the 1080 or 1070.
|
|
|
|
|
|
#2689 |
|
"Mike"
Aug 2002
202A₁₆ Posts |
We've had a (FE) GTX 1060 card for several months but never got around to running mfaktc.
We tried it today and it just worked, out of the box, without anything extra needed! In the past we had to install the CUDA toolkit but we didn't today. The card is doing roughly 530 GHz-d/day and the display has no lag whatsoever. The card is at 80 C and it is nearly silent. We didn't modify the fan curve or anything.
|
|
|
|
|
|
#2690 |
|
"Kieren"
Jul 2011
In My Own Galaxy!
2·3·1,693 Posts |
Do you have fan headroom to bring that down a bit from 80? I get nervous in the upper 70s.
|
|
|
|
|
|
#2691 |
|
Feb 2012
5 Posts |
Are there updated linux64 binaries available for cuda 8? I don't see them in the download section or in this thread.
If not, how difficult would it be to compile them? I recently upgraded from a 970 to 1070 and am getting the 'cudaGetLastError() returned 8: invalid device function' error. Thank you! |
|
|
|
|
|
#2692 |
|
Feb 2012
5 Posts |
I took a stab at compiling the linux64 binaries myself using the CUDA 8 toolkit and it's running successfully. The GHz-d/day is hovering around 780 in the terminal for my GTX 1070, and nvidia-smi shows GPU utilization in the high 90s.
When compiling, I added an nvcc flag for compute capability 6.1. I also had to remove the existing line for compute 1.1 (Tesla?), since it wouldn't compile with that flag. Otherwise I left all settings the same as in the source file for mfaktc with CUDA 6.5. I copied the compiled mfaktc.exe and the libraries for CUDA 8.0.61 on top of the existing folder structure for mfaktc with CUDA 6.5. Attached is the result if anyone else is looking for it or wants to test it. Be aware I'm not an expert, so use at your own risk. |
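The Makefile change described in this post (drop the compute 1.1 target, add a compute 6.1 target) might look roughly like the sketch below. It is written as a small Python text transform so it can be tried on a copy of the Makefile. It assumes the architecture targets appear on lines starting with `NVCCFLAGS` and use nvcc's `--generate-code arch=compute_XX,code=sm_XX` form. The function name `retarget_gencode` is hypothetical, and the real Makefile layout may differ.

```python
def retarget_gencode(makefile_text, drop="11", add="61"):
    """Replace NVCCFLAGS lines targeting compute_<drop> with one for compute_<add>.

    Assumption: targets live on lines that start with NVCCFLAGS and use the
    '--generate-code arch=compute_XX,code=sm_XX' form.
    """
    new_line = f"NVCCFLAGS += --generate-code arch=compute_{add},code=sm_{add}"
    lines = makefile_text.splitlines()

    def is_flag_line(line):
        return line.lstrip().startswith("NVCCFLAGS")

    # Avoid adding a duplicate if the new target is already present.
    already = any(is_flag_line(l) and f"compute_{add}," in l for l in lines)

    out = []
    for line in lines:
        if is_flag_line(line) and f"compute_{drop}," in line:
            if not already:
                out.append(new_line)  # put the new target where the old one was
                already = True
            continue  # drop the old target line
        out.append(line)
    if not already:
        out.append(new_line)
    return "\n".join(out) + "\n"
```

Running the function a second time on its own output leaves the text unchanged, so it is safe to apply repeatedly while experimenting.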
|
|
|
|
|
#2693 |
|
"Jerry"
Nov 2011
Vancouver, WA
2143₈ Posts |
Thank you
|
|
|
|
|
|
#2694 | |
|
"Tony Gott"
Aug 2002
Yell, Shetland, UK
2²×83 Posts |
Quote:
|
|
|
|
|
|
|
#2695 |
|
"Oliver"
Mar 2005
Germany
10001010111₂ Posts |
Code:
# ./mfaktc.exe -tf 66362159 75 76
mfaktc v0.21 (64bit built)
[...]
CUDA device info
  name                      Graphics Device
  compute capability        6.1
  max threads per block     1024
  max shared memory per MP  98304 byte
  number of multiprocessors 28
  clock rate (CUDA cores)   1582MHz
  memory clock rate:        5505MHz
  memory bus width:         352 bit
[...]
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Mar 23 23:43 |     0  0.1% |  7.003   1h51m |   1481.90    82485    n.a.%
Mar 23 23:44 |     4  0.2% |  6.980   1h51m |   1486.78    82485    n.a.%
Mar 23 23:44 |     9  0.3% |  7.003   1h51m |   1481.90    82485    n.a.%
Mar 23 23:44 |    12  0.4% |  7.110   1h53m |   1459.59    82485    n.a.%
Mar 23 23:44 |    16  0.5% |  7.494   1h59m |   1384.80    82485    n.a.%
Mar 23 23:44 |    24  0.6% |  7.928   2h06m |   1309.00    82485    n.a.%
Mar 23 23:44 |    25  0.7% |  7.955   2h06m |   1304.55    82485    n.a.%
After 20-25 seconds: limited by thermal target, hovers around ~190 W. Reason: needs more fresh air in the chassis.
Oliver
Last fiddled with by TheJudger on 2017-03-23 at 22:53 |
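As a rough check on how the ETA column relates to the per-class time, here is a small sketch. It assumes the run is split into 960 classes to be tested (the default mfaktc build is understood to use 4620 classes, of which 960 survive) and that the ETA is simply the time per class multiplied by the classes still to do. Neither assumption is stated in the log itself, and the function names are made up for the example.

```python
TOTAL_CLASSES = 960  # assumption: classes actually tested per assignment


def eta_seconds(seconds_per_class, classes_done, total_classes=TOTAL_CLASSES):
    """Estimate remaining time as per-class time times classes left."""
    return seconds_per_class * (total_classes - classes_done)


def format_eta(seconds):
    """Format seconds in the log's 'XhYYm' style, e.g. 1h51m."""
    hours, remainder = divmod(int(round(seconds)), 3600)
    return f"{hours}h{remainder // 60:02d}m"


def percent_done(classes_done, total_classes=TOTAL_CLASSES):
    """Progress as a percentage, rounded to one decimal like the Pct column."""
    return round(100.0 * classes_done / total_classes, 1)
```

With these assumptions, `format_eta(eta_seconds(7.003, 1))` gives `1h51m` and `format_eta(eta_seconds(7.928, 6))` gives `2h06m`, matching the first and sixth rows of the log. The ETA grows as the per-class time rises from about 7.0 s to about 7.9 s, which fits the card slowing down once it hits its thermal target.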
|
|
|