It took me a while, but I succeeded in building a 64-bit Windows 10 executable using the NVIDIA GPU Computing Toolkit v.11.1, Visual Studio 2019 Community and Msys64 for mfaktc adding the new 8.0 and 8.6 architecture (which my RTX 3060 ti uses).
Now, I get a CUDA runtime error.
Code:
CUDA version info
binary compiled for CUDA 11.10
CUDA runtime version 11.10
CUDA driver version 11.10
CUDA device info
name GeForce RTX 3060 Ti
compute capability 8.6
max threads per block 1024
max shared memory per MP 102400 byte
number of multiprocessors 38
clock rate (CUDA cores) 1800MHz
memory clock rate: 7001MHz
memory bus width: 256 bit
Automatic parameters
threads per grid 622592
GPUSievePrimes (adjusted) 82486
GPUsieve minimum exponent 1055144
########## testcase 1/2867 ##########
Starting trial factoring M50804297 from 2^67 to 2^68 (0.59 GHz-days)
Using GPU kernel "75bit_mul32_gs"
Date Time | class Pct | time ETA | GHz-d/day Sieve Wait
Dec 06 14:53 | 3387 0.1% | 0.001 n.a. | n.a. 82485 n.a.%
ERROR: cudaGetLastError() returned 98: invalid device function
Any advice?
BTW, I also get the same error running on a RTX 2060 card.