mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2019-06-01, 00:36   #1519
kracker
ἀβουλία
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

23·271 Posts
Default

Got a Ryzen 5 2400G up, which probably has the fastest integrated gpu out there at the moment...
Code:
Resulting speed for M74000077:
bit_min - bit_max  GHz-days/day  kernelname
     60 -      69       228.758  cl_barrett15_69_gs  
     69 -      70       217.379  cl_barrett15_71_gs  
     70 -      73       193.854  cl_barrett15_73_gs  
     73 -      76       182.299  cl_barrett32_76_gs  
     76 -      77       172.009  cl_barrett32_77_gs  
     77 -      81       171.178  cl_barrett15_82_gs  
     81 -      87       165.730  cl_barrett32_87_gs  
     87 -      88       155.828  cl_barrett32_88_gs  
     88 -      92       144.272  cl_barrett32_92_gs
kracker is online now   Reply With Quote
Old 2019-06-05, 23:56   #1520
ixfd64
Bemusing Prompter
 
ixfd64's Avatar
 
"Danny"
Dec 2002
California

44038 Posts
Default

I've noticed a small issue: on my Mac Pro, pressing Ctrl + C twice does not immediately terminate the program. The message telling users they can press Ctrl + C again to stop mfakto only appears after the current class finishes. I'm not sure whether this affects other operating systems.

Last fiddled with by ixfd64 on 2019-06-05 at 23:57
ixfd64 is offline   Reply With Quote
Old 2019-06-06, 03:20   #1521
Rodrigo
 
Rodrigo's Avatar
 
Jun 2010
Pennsylvania

32·103 Posts
Default

Quote:
Originally Posted by ixfd64 View Post
I've noticed a small issue: on my Mac Pro, pressing Ctrl + C twice does not immediately terminate the program. The message telling users they can press Ctrl + C again to stop mfakto only appears after the current class finishes. I'm not sure whether this affects other operating systems.
Yup, it works the same way in Windows (7).
Rodrigo is offline   Reply With Quote
Old 2019-06-07, 20:26   #1522
ixfd64
Bemusing Prompter
 
ixfd64's Avatar
 
"Danny"
Dec 2002
California

3·769 Posts
Default

Another minor issue: AMD no longer supports the APP SDK and has removed it from their website. See: https://community.amd.com/thread/228059#comment-2866963

The readme and makefile should be updated.
ixfd64 is offline   Reply With Quote
Old 2019-06-07, 22:08   #1523
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

107478 Posts
Default

Quote:
Originally Posted by ixfd64 View Post
Another minor issue: AMD no longer supports the APP SDK and has removed it from their website. See: https://community.amd.com/thread/228059#comment-2866963

The readme and makefile should be updated.
For Windows however, while the main SDK page does not show it, there is a set of links in https://community.amd.com/thread/203821
kriesel is online now   Reply With Quote
Old 2019-06-21, 19:38   #1524
mnd9
 
Jun 2019
Boston, MA

3×13 Posts
Default

Hi all, attempting to run mfakto on an ATI HD 4650 which should be supported per the documentation, I can get it to run fine, but anytime I enable GPU sieving I get a boatload of error messages. I'm using the 0.15pre6 binaries from mersenne.ca, see this excerpt:

Quote:
Runtime options
Inifile mfakto.ini
Verbosity 1
SieveOnGPU yes
MoreClasses yes
GPUSievePrimes 4096
GPUSieveProcessSize 24Ki bits
WARNING: GPUSieveSize=4M must be a multiple of GPUSieveProcessSize=24k, adjusting GPUSieveSize to 6M
GPUSieveSize 6Mi bits
FlushInterval 8
WorkFile worktodo.txt
ResultsFile results.txt
Checkpoints enabled
CheckpointDelay 300s
Stages enabled
StopAfterFactor class
PrintMode compact
V5UserID none
ComputerID none
TimeStampInResults yes
VectorSize 2
GPUType VLIW5
SmallExp no
UseBinfile mfakto_Kernels.elf
Compiletime options

Select device - Get device info:

WARNING: Device does not support atomic operations. This may lead to errors when multiple factors are found in the same block. Possible errors include reporting just one of the factors, or (less likely) scrambled factors. If the reported factor(s) are not accepted by primenet, please re-run this test on the CPU, or on a GPU with atomics.

OpenCL device info
name ATI RV730 (Advanced Micro Devices, Inc.)
device (driver) version OpenCL 1.0 AMD-APP (937.2) (CAL 1.4.1734)
maximum threads per block 128
maximum threads per grid 2097152
number of multiprocessors 8 (640 compute elements)
clock rate 700MHz

Automatic parameters
threads per grid 0
optimizing kernels for VLIW5

Loading binary kernel file mfakto_Kernels.elf

Cannot use binary kernel: its build options (-I. -DVECTOR_SIZE=2 -DVLIW5 -O3 -DMORE_CLASSES) are different than the current build options (-I. -DVECTOR_SIZE=2 -DVLIW5 -O3 -DMORE_CLASSES -DCL_GPU_SIEVE). Rebuilding kernels.
Compiling kernels.

BUILD OUTPUT
".\common.cl", line 31: warning: unrecognized #pragma #pragma "Enabling printf"

".\common.cl", line 44: warning: unrecognized #pragma #pragma "Emulating popcount"

".\common.cl", line 60: warning: unrecognized #pragma #pragma "No double precision available"

".\common.cl", line 73: warning: unrecognized #pragma #pragma "Replacing atomic_inc by non-atomics"

".\common.cl", line 81: warning: unrecognized #pragma #pragma "Emulating amd_bitalign"

".\common.cl", line 94: warning: unrecognized #pragma #pragma "Emulating amd_max3"

".\gpusieve.cl", line 96: warning: integer conversion resulted in a change of sign
1<<16, 1<<17, 1<<18, 1<<19, 1<<20, 1<<21, 1<<22, 1<<23, 1<<24, 1<<25, 1<<26, 1<<27, 1<<28, 1<<29, 1<<30, 1<<31,

".\gpusieve.cl", line 216: error: write to < 32 bits via pointer not allowed unless cl_khr_byte_addressable_store is enabled locsieve8[bclr >> 3] |= 1 << (bclr & 7);

".\gpusieve.cl", line 224: error: write to < 32 bits via pointer not allowed unless cl_khr_byte_addressable_store is enabled if (! (val & mask)) locsieve8[bytenum] = val | mask;
mnd9 is offline   Reply With Quote
Old 2019-06-21, 20:13   #1525
ixfd64
Bemusing Prompter
 
ixfd64's Avatar
 
"Danny"
Dec 2002
California

44038 Posts
Default

Try updating your drivers. I believe mfakto does not officially support OpenCL 1.0.
ixfd64 is offline   Reply With Quote
Old 2019-06-21, 20:27   #1526
mnd9
 
Jun 2019
Boston, MA

3×13 Posts
Default

Quote:
Originally Posted by ixfd64 View Post
Try updating your drivers. I believe mfakto does not officially support OpenCL 1.0.
I have the most up to date drivers per the AMD website (catalyst 13.9), and the HD 4000 series should be supported per the mfakto documentation.

I also don't quite understand the nomenclature re OpenCL versions--according to GPU-Z my "platform version" of OpenCL is 1.2 AMD-APP (937.2) while the "version" is OpenCL 1.0 AMD-APP (937.2).

Is there a way to get newer drivers aside from AMD directly?
mnd9 is offline   Reply With Quote
Old 2019-06-21, 21:36   #1527
kracker
ἀβουλία
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

23·271 Posts
Default

Quote:
Originally Posted by mnd9 View Post
I have the most up to date drivers per the AMD website (catalyst 13.9), and the HD 4000 series should be supported per the mfakto documentation.

I also don't quite understand the nomenclature re OpenCL versions--according to GPU-Z my "platform version" of OpenCL is 1.2 AMD-APP (937.2) while the "version" is OpenCL 1.0 AMD-APP (937.2).

Is there a way to get newer drivers aside from AMD directly?

May not help, but try running older mfakto versions(0.13/0.14)


EDIT: Also set the VectorSize to 4 in the ini, if I recall correctly it's faster for pre-GCN cards.

Last fiddled with by kracker on 2019-06-21 at 21:37
kracker is online now   Reply With Quote
Old 2019-06-21, 23:59   #1528
mnd9
 
Jun 2019
Boston, MA

3×13 Posts
Default

Quote:
Originally Posted by kracker View Post
May not help, but try running older mfakto versions(0.13/0.14)


EDIT: Also set the VectorSize to 4 in the ini, if I recall correctly it's faster for pre-GCN cards.
I get similar errors on older versions including 0.14, 0.13 and the "0.12 hd 4000" special version. I'll try changing the vector size and see if that works...
mnd9 is offline   Reply With Quote
Old 2019-07-04, 05:56   #1529
ixfd64
Bemusing Prompter
 
ixfd64's Avatar
 
"Danny"
Dec 2002
California

3·769 Posts
Default

I'd like to propose a change to the GPU detection code.

mfakto currently uses if-then statements for each family of devices:

Code:
if (strstr(deviceinfo.d_name, "Capeverde")  ||    // 7730, 7750, 7770, 8760, 8740, R7 250X
        strstr(deviceinfo.d_name, "Pitcairn")   ||    // 7850, 7870, 8870
        strstr(deviceinfo.d_name, "Bonaire")    ||    // 7790, R7 260, R7 260X
        strstr(deviceinfo.d_name, "Oland")      ||    // 8670, 8570, R9 240, R9 250
        strstr(deviceinfo.d_name, "Sun")        ||    // 85x0M
        strstr(deviceinfo.d_name, "Mars")       ||    // 86x0M, 87x0M
        strstr(deviceinfo.d_name, "Venus")      ||    // 88x0M
        strstr(deviceinfo.d_name, "Saturn")     ||    // 8930M, 8950M
        strstr(deviceinfo.d_name, "Neptune")    ||    // 8970M, 8990M
        strstr(deviceinfo.d_name, "Curacao")    ||    // R9 265, R9 270, R9 270X
        strstr(deviceinfo.d_name, "Tonga")      ||    // R9 285
        strstr(deviceinfo.d_name, "Hainan")      ||    // R9 285
        strstr(deviceinfo.d_name, "Kalindi")          // GCN APU, Kabini, R7 ???
        )
    {
      mystuff.gpu_type = GPU_GCN;
}
However, good programming practice dictates that static values should be defined as constants. Therefore, I want to turn each group of devices names into an array (as described here) so that anyone who wants to add a device only has to add a string to an array. Does anyone have any objections?

I also want to use this opportunity to add newer GPUs to the list of detected devices. If you have a GPU that mfakto doesn't recognize, please feel free to post the name here. I'm aware Wikipedia has a fairly complete list of AMD GPUs, but the name returned by clGetDeviceInfo() isn't always consistent.

Last fiddled with by ixfd64 on 2019-07-04 at 05:57
ixfd64 is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
gpuOwL: an OpenCL program for Mersenne primality testing preda GpuOwl 2523 2020-10-22 22:36
mfaktc: a CUDA program for Mersenne prefactoring TheJudger GPU Computing 3398 2020-10-22 12:57
LL with OpenCL msft GPU Computing 433 2019-06-23 21:11
OpenCL for FPGAs TObject GPU Computing 2 2013-10-12 21:09
Program to TF Mersenne numbers with more than 1 sextillion digits? Stargate38 Factoring 24 2011-11-03 00:34

All times are UTC. The time now is 00:15.

Fri Oct 23 00:15:18 UTC 2020 up 42 days, 21:26, 0 users, load averages: 1.26, 1.53, 1.63

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2020, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.