![]() |
|
|
#12 |
|
"Alexander"
Nov 2008
The Alamo City
24×43 Posts |
FWIW I think xilman's post is a more appropriate quote than mine, since I was describing a Kubuntu (Linux) computer. Do you have any advice for us Linux users?
|
|
|
|
|
|
#13 | |
|
Apr 2010
2·83 Posts |
Quote:
First you have to get rid of the cub that comes with msieve and use the cub that comes with cuda. On debian I did these commands in the msieve directory: Code:
cd cub rm -rf cub ln -s /usr/include/cub Code:
Index: Makefile
===================================================================
--- Makefile (Revision 1037)
+++ Makefile (Arbeitskopie)
@@ -34,6 +34,9 @@
-DMSIEVE_SVN_VERSION="\"$(SVN_VERSION)\"" \
-I. -Iaprcl -Iinclude -Ignfs -Ignfs/poly -Ignfs/poly/stage1
+CUDA = 1
+NO_ZLIB = 1
+
# tweak the compile flags
ifeq ($(ECM),1)
@@ -197,10 +200,7 @@
#---------------------------------- GPU file lists -------------------------
GPU_OBJS += \
- stage1_core_sm20.ptx \
- stage1_core_sm30.ptx \
- stage1_core_sm35.ptx \
- stage1_core_sm50.ptx \
+ stage1_core_sm75.ptx \
cub/built
#---------------------------------- NFS file lists -------------------------
@@ -346,5 +346,8 @@
stage1_core_sm50.ptx: $(NFS_GPU_HDR)
$(NVCC) -arch sm_50 -ptx -o $@ $<
+stage1_core_sm75.ptx: $(NFS_GPU_HDR)
+ $(NVCC) -arch sm_75 -ptx -o $@ $<
+
cub/built:
- cd cub && make WIN=$(WIN) WIN64=$(WIN64) sm=200,300,350,520 && cd ..
+ cd cub && make WIN=$(WIN) WIN64=$(WIN64) sm=750 && cd ..
Index: cub/Makefile
===================================================================
--- cub/Makefile (Revision 1037)
+++ cub/Makefile (Arbeitskopie)
@@ -16,7 +16,7 @@
NVCC = "$(shell which nvcc)"
CUDA_ROOT = $(shell dirname $(NVCC))/../
EXT = so
- NVCCFLAGS += -Xptxas -v -Xcudafe -\# -shared -Xptxas -abi=no \
+ NVCCFLAGS += -Xptxas -v -Xcudafe -\# -shared \
-Xcompiler -fPIC -Xcompiler -fvisibility=hidden
endif
@@ -27,6 +27,10 @@
SM_ARCH = 200
endif
+ifeq (750, $(findstring 750, $(SM_ARCH)))
+ SM_TARGETS += -gencode=arch=compute_75,code=\"sm_75,compute_75\"
+ SM_DEF += -DSM750
+endif
ifeq (520, $(findstring 520, $(SM_ARCH)))
SM_TARGETS += -gencode=arch=compute_52,code=\"sm_52,compute_52\"
SM_DEF += -DSM520
Index: gnfs/poly/stage1/stage1_sieve_gpu.c
===================================================================
--- gnfs/poly/stage1/stage1_sieve_gpu.c (Revision 1037)
+++ gnfs/poly/stage1/stage1_sieve_gpu.c (Arbeitskopie)
@@ -1113,7 +1113,7 @@
CUDA_TRY(cuModuleLoad(&t->gpu_module, "stage1_core_sm35.ptx"))
}
else if (d->gpu_info->compute_version_major >= 5) {
- CUDA_TRY(cuModuleLoad(&t->gpu_module, "stage1_core_sm50.ptx"))
+ CUDA_TRY(cuModuleLoad(&t->gpu_module, "stage1_core_sm75.ptx"))
}
else
{
|
|
|
|
|
|
|
#14 |
|
Apr 2010
2468 Posts |
For ecm to compile with current CUDA I had to apply this patch:
Code:
Index: cudakernel_default.cu
===================================================================
--- cudakernel_default.cu (Revision 3092)
+++ cudakernel_default.cu (Arbeitskopie)
@@ -7,7 +7,7 @@
carry_t cytemp;
unsigned int thm1;
- while(__any(cy[threadIdx.x])!=0)
+ while(__any_sync(__activemask(), cy[threadIdx.x])!=0)
{
thm1 = (threadIdx.x - 1) % ECM_GPU_NB_DIGITS;
cytemp = cy[thm1];
|
|
|
|
|
|
#15 |
|
"Alexander"
Nov 2008
The Alamo City
24·43 Posts |
Thank you, thank you, thank you! They both build now. yafu doesn't build, though. I'll report that to Ben.
|
|
|
|
|
|
#16 |
|
Apr 2010
Over the rainbow
23×52×13 Posts |
I do have the PTX but I still have trouble with the sort_engine...
edit : adding second screenshot Last fiddled with by firejuggler on 2021-01-28 at 10:24 Reason: adding another screenshot and a report |
|
|
|
|
|
#17 |
|
Apr 2010
2·83 Posts |
I think that the 1660 Ti needs a different shader model. But I don't know which.
|
|
|
|
|
|
#18 |
|
Apr 2010
Over the rainbow
1010001010002 Posts |
a quick google search tell me it is shader model 6.5... so I should modify the sm_arch to 65?
|
|
|
|
|
|
#19 | |
|
Apr 2010
2·83 Posts |
Quote:
Code:
-gencode arch=compute_35,code=sm_35 -gencode arch=compute_37,code=sm_37 -gencode arch=compute_50,code=sm_50 -gencode arch=compute_52,code=sm_52 -gencode arch=compute_60,code=sm_60 -gencode arch=compute_61,code=sm_61 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_86,code=compute_86 |
|
|
|
|
|
|
#20 | |
|
"Ben"
Feb 2007
3×1,171 Posts |
Quote:
Code:
Msieve v. 1.54 (SVN 998) Tue Jun 8 11:24:02 2021 random seeds: 2f8aea7c 74608973 factoring 138924029959401366454963864059579437250850355925904953363654825080008713183159095653855715163496880698665441863162263 (117 digits) searching for 15-digit factors commencing number field sieve (117-digit input) commencing number field sieve polynomial selection polynomial degree: 5 max stage 1 norm: 2.24e+18 max stage 2 norm: 7.93e+14 min E-value: 4.22e-10 poly select deadline: 6326 time limit set to 1.76 CPU-hours expecting poly E from 5.27e-10 to > 6.06e-10 searching leading coefficients from 1 to 213804 using GPU 0 (TITAN RTX) selected card has CUDA arch 7.5 deadline: 6326 CPU-seconds per coefficient error (line 1116): CUDA_ERROR_FILE_NOT_FOUND |
|
|
|
|
|
|
#21 |
|
Apr 2010
A616 Posts |
My SVN revision is a bit newer, but the solution is the same:
Code:
Index: gnfs/poly/stage1/stage1_sieve_gpu.c
===================================================================
--- gnfs/poly/stage1/stage1_sieve_gpu.c (Revision 1030)
+++ gnfs/poly/stage1/stage1_sieve_gpu.c (Arbeitskopie)
@@ -1113,7 +1113,7 @@
CUDA_TRY(cuModuleLoad(&t->gpu_module, "stage1_core_sm35.ptx"))
}
else if (d->gpu_info->compute_version_major >= 5) {
- CUDA_TRY(cuModuleLoad(&t->gpu_module, "stage1_core_sm50.ptx"))
+ CUDA_TRY(cuModuleLoad(&t->gpu_module, "stage1_core_sm75.ptx"))
}
else
{
|
|
|
|
|
|
#22 |
|
"Ben"
Feb 2007
3·1,171 Posts |
Working now, thanks!
|
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| CUDA 5.5 | ET_ | GPU Computing | 2 | 2013-06-13 15:50 |
| AVX CPU LL vs CUDA LL | nucleon | GPU Computing | 11 | 2012-01-04 17:52 |
| Best CUDA GPU for the $$ | Christenson | GPU Computing | 24 | 2011-05-01 00:06 |
| CUDA P-1? | nucleon | GPU Computing | 2 | 2010-11-17 17:52 |
| CUDA? | Xentar | Conjectures 'R Us | 6 | 2010-03-31 07:43 |