mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2018-09-29, 11:43   #2905
ATH
Einyen
 
ATH's Avatar
 
Dec 2003
Denmark

7×11×41 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
The LessClasses version should only be used for extremely-fast-running assignments (where each assignment only takes a few seconds).

mfaktc can be download from here or here.
The CUDA 10 binary does not work with just the .dll files, it wants CUDA 10 installed (I have CUDA 9.2 installed):
"ERROR: current CUDA driver version is lower than the CUDA toolkit version used during compile!
Please update your graphics driver."

I think the other binaries work with just the dll files.
ATH is offline   Reply With Quote
Old 2018-09-29, 11:54   #2906
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

111110 Posts
Default

Quote:
Originally Posted by VictordeHolland View Post
Still I find it a bit confusing that you need to compile for different architectures, right? A mfaktc compile with CUDA SDK 10, GTX980 won't work on a GTX1080 right? Cause the architecture/CUDA capability of the GTX1080 is higher (and somehow not backwards compatibe?) Or am I just being ignorent?
It is not that bad.
  • the CUDA runtime DLL must be exactly the same version used during compilation of mfaktc
  • the driver of your system must support the same or newer version of CUDA used for compiling mfaktc.
  • the binary (mfaktc.exe) must have support for your GPU. A single binary can support multiple GPU architectures, e.g. the CUDA 8 binary found here are compiled for Fermi, Kepler, Kepler "update", Maxwell and Pascal.

Oliver
TheJudger is offline   Reply With Quote
Old 2018-09-30, 12:27   #2907
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

100010101112 Posts
Default

JFYI: just built some Windows binaries using CUDA Toolkit 10.0. Will do some testing and provide binaries after sucessfull testing.

Oliver
TheJudger is offline   Reply With Quote
Old 2018-09-30, 14:57   #2908
ATH
Einyen
 
ATH's Avatar
 
Dec 2003
Denmark

7×11×41 Posts
Default

If you have time could you please also post a guide describing how you compile it for Windows.
ATH is offline   Reply With Quote
Old 2018-09-30, 15:29   #2909
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

3×7×257 Posts
Default

Quote:
Originally Posted by ATH View Post
If you have time could you please also post a guide describing how you compile it for Windows.
The readme.txt for mfaktc v0.21 CUDA 8.0 says
Code:
#############################
# 2.2 Compilation (Windows) #
#############################

The following instructions have been tested on Windows 7 64bit using Visual
Studio 2012 Professional. A GNU compatible version of make is also required
as the Makefile is not compatible with nmake. GNU Make for Win32 can be
downloaded from http://gnuwin32.sourceforge.net/packages/make.htm.

Run the Visual Studio 2012 x64 Win64 Command Prompt for x64 or
Run the Visual Studio 2012 x86 Native Tools Command Prompt for x86 (32 bit)

 and change into the "\src" subdirectory.

Run 'make -f Makefile.win' for a 64bit built (recommended on 64bit systems)
or 'make -f Makefile.win32' for a 32bit built.

You will have to adjust the paths to your CUDA installation and the
Microsoft Visual Studio binaries in the makefiles if you have something
other than CUDA 8.0 and MSVS 2012. The binaries "mfaktc-win-64.exe" or
"mfaktc-win-32.exe" are placed in the parent directory.
Presumably you're asking for an update or more detail.
kriesel is offline   Reply With Quote
Old 2018-09-30, 17:11   #2910
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

100010101112 Posts
Default

Hello!

Quote:
Originally Posted by ATH View Post
If you have time could you please also post a guide describing how you compile it for Windows.
  1. Installed Visual Studio 2017.8 "Community"
  2. Installed CUDA Toolkit 10 for Windows
  3. Installed MinGW as on of many options for GNU Make on Windows. In MinGW folder I've copied bin/mingw32-make.exe to bin/make.exe because I'm lazy. Careful when updating mingw32-make.exe...
  4. Configure Environment for "x64 Native Tools-Command Promt" - add MinGW/bin and CUDA/bin to PATH variable.

The just open "x64 Native Tools-Command Promt" and change into the directory with the mfaktc source files and run
Code:
make -f Makefile.win
I had to adjust some settings in Makefile.win:
Code:
CUDA_DIR = "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.0"

CC = cl
CFLAGS = /Ox /Oy /GL /W2 /fp:fast /I$(CUDA_DIR)\include /I$(CUDA_DIR)\include\cudart /nologo

NVCCFLAGS = --ptxas-options=-v
CUFLAGS = -ccbin "C:\Program Files (x86)\Microsoft Visual Studio\2017\Community\VC\Tools\MSVC\14.15.26726\bin\Hostx86\x64" -x cu -I$(CUDA_DIR)\/include --machine 64 --compile -Xcompiler "/wd 4819" -DWIN64 -Xcompiler "/EHsc /W3 /nologo /O2 /FS" $(NVCCFLAGS)

# generate code for various compute capabilities
# NVCCFLAGS += --generate-code arch=compute_11,code=sm_11 # CC 1.1, 1.2 and 1.3 GPUs will use this code (1.0 is not possible for mfaktc)
# NVCCFLAGS += --generate-code arch=compute_20,code=sm_20 # CC 2.x GPUs will use this code, one code fits all!
NVCCFLAGS += --generate-code arch=compute_30,code=sm_30 # all CC 3.x GPUs _COULD_ use this code 
NVCCFLAGS += --generate-code arch=compute_35,code=sm_35 # but CC 3.5 (3.2?) _CAN_ use funnel shift which is useful for mfaktc
NVCCFLAGS += --generate-code arch=compute_50,code=sm_50 # CC 5.x GPUs will use this code
NVCCFLAGS += --generate-code arch=compute_60,code=sm_60 # CC 6.x GPUs will use this code
NVCCFLAGS += --generate-code arch=compute_70,code=sm_70 # CC 7.x GPUs will use this code
# NVCCFLAGS += --generate-code arch=compute_75,code=sm_75 # CC 7.5 GPUs will use this code
Oliver
TheJudger is offline   Reply With Quote
Old 2018-09-30, 17:13   #2911
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

11·101 Posts
Default Wanted: Testrun on Turing card!

Hello,

anyone with a Turing GPU (e.g. Geforce RTX 2080 or 2080 Ti) willing to run some tests? Honza can't right now, I've asked him already.

Oliver
TheJudger is offline   Reply With Quote
Old 2018-10-05, 20:23   #2912
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

11×101 Posts
Default Initial Turing benchmarks (RTX 2080Ti)

Hello,

finally I was able to put my hands on a Turing (RTX 20x0 series) card. Because of this I was excited and I was right, Turing is a beast for mfaktc.

Unmodified mfaktc 0.21 sources (just adjusted the Makefile) + CUDA 10.0.130 on Linux:
Code:
# ./mfaktc.exe -tf 66362159 73 74
mfaktc v0.21 (64bit built)
[...]
CUDA device info
  name                      GeForce RTX 2080 Ti
  compute capability        7.5
  max threads per block     1024
  max shared memory per MP  65536 byte
  number of multiprocessors 68
  clock rate (CUDA cores)   1635MHz
  memory clock rate:        7000MHz
  memory bus width:         352 bit
[...]
got assignment: exp=66362159 bit_min=73 bit_max=74 (28.83 GHz-days)
Starting trial factoring M66362159 from 2^73 to 2^74 (28.83 GHz-days)
 k_min =  71160531149400
 k_max =  142321062305090
Using GPU kernel "barrett76_mul32_gs"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Oct 05 22:12 |    0   0.1% |  0.630  10m04s |   4118.14    82485    n.a.%
Oct 05 22:12 |    4   0.2% |  0.563   8m59s |   4608.22    82485    n.a.%
Oct 05 22:12 |    9   0.3% |  0.562   8m58s |   4616.42    82485    n.a.%
[...]
Oct 05 22:21 | 4612  99.9% |  0.599   0m01s |   4331.27    82485    n.a.%
Oct 05 22:21 | 4617 100.0% |  0.600   0m00s |   4324.05    82485    n.a.%
no factor for M66362159 from 2^73 to 2^74 [mfaktc 0.21 barrett76_mul32_gs]
tf(): total time spent:  9m 30.800s
This is a founders editions card, starting with a cold card. Power draw is ~260W on average so limited by power target. Temperature is a bit below 80°C and average clock is about 1680MHz once the card is "hot".

New performance king but slightly behind Tesla V100 in terms of energy efficency.

Oliver
TheJudger is offline   Reply With Quote
Old 2018-10-05, 20:37   #2913
petrw1
1976 Toyota Corona years forever!
 
petrw1's Avatar
 
"Wayne"
Nov 2006
Saskatchewan, Canada

124A16 Posts
Default

Quote:
Originally Posted by TheJudger View Post
Hello,

finally I was able to put my hands on a Turing (RTX 20x0 series) card. Because of this I was excited and I was right, Turing is a beast for mfaktc.

Unmodified mfaktc 0.21 sources (just adjusted the Makefile) + CUDA 10.0.130 on Linux:
Code:
# ./mfaktc.exe -tf 66362159 73 74
mfaktc v0.21 (64bit built)
[...]
CUDA device info
  name                      GeForce RTX 2080 Ti
  compute capability        7.5
  max threads per block     1024
  max shared memory per MP  65536 byte
  number of multiprocessors 68
  clock rate (CUDA cores)   1635MHz
  memory clock rate:        7000MHz
  memory bus width:         352 bit
[...]
got assignment: exp=66362159 bit_min=73 bit_max=74 (28.83 GHz-days)
Starting trial factoring M66362159 from 2^73 to 2^74 (28.83 GHz-days)
 k_min =  71160531149400
 k_max =  142321062305090
Using GPU kernel "barrett76_mul32_gs"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Oct 05 22:12 |    0   0.1% |  0.630  10m04s |   4118.14    82485    n.a.%
Oct 05 22:12 |    4   0.2% |  0.563   8m59s |   4608.22    82485    n.a.%
Oct 05 22:12 |    9   0.3% |  0.562   8m58s |   4616.42    82485    n.a.%
[...]
Oct 05 22:21 | 4612  99.9% |  0.599   0m01s |   4331.27    82485    n.a.%
Oct 05 22:21 | 4617 100.0% |  0.600   0m00s |   4324.05    82485    n.a.%
no factor for M66362159 from 2^73 to 2^74 [mfaktc 0.21 barrett76_mul32_gs]
tf(): total time spent:  9m 30.800s
This is a founders editions card, starting with a cold card. Power draw is ~260W on average so limited by power target. Temperature is a bit below 80°C and average clock is about 1680MHz once the card is "hot".

New performance king but slightly behind Tesla V100 in terms of energy efficency.

Oliver
Wow...I can hardly wait. Mine is supposed to arrive Tuesday...hopefully alive within a week. Good to hear it works without any mfaktc compatibility issues.
petrw1 is offline   Reply With Quote
Old 2018-10-05, 20:42   #2914
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

2·3·569 Posts
Default

https://www.mersenne.ca/mfaktc.php has been updated with the new benchmark.
James Heinrich is offline   Reply With Quote
Old 2018-10-05, 20:50   #2915
ET_
Banned
 
ET_'s Avatar
 
"Luigi"
Aug 2002
Team Italia

24·7·43 Posts
Default

What is the Street Price of this monster?
ET_ is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
mfakto: an OpenCL program for Mersenne prefactoring Bdot GPU Computing 1676 2021-06-30 21:23
The P-1 factoring CUDA program firejuggler GPU Computing 753 2020-12-12 18:07
gr-mfaktc: a CUDA program for generalized repunits prefactoring MrRepunit GPU Computing 32 2020-11-11 19:56
mfaktc 0.21 - CUDA runtime wrong keisentraut Software 2 2020-08-18 07:03
World's second-dumbest CUDA program fivemack Programming 112 2015-02-12 22:51

All times are UTC. The time now is 08:23.


Tue Jul 27 08:23:39 UTC 2021 up 4 days, 2:52, 0 users, load averages: 1.62, 1.77, 1.77

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.