![]() |
|
|
#2168 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
5×11×137 Posts |
If you specify the MM2_CHAIN setting and a different MM2_CHAIN setting is auto-generated, I do not know which one will win.
|
|
|
|
|
|
#2169 | |
|
∂2ω=0
Sep 2002
República de California
2D7E16 Posts |
Quote:
Related question for you & Mihai: Can the program determine at runtime if the MM2_CHAIN setting needs upping? Because ROEs are not necessarily monotonic with exponent (much depends on the particluar DWT weights and their rounding-to-double) I found it useful in Mlucas to allow runtime-detection of such conditions, culminating in an upping of FFT length (and reset of the same-FFT-length ease-up params) if the highest setting proved insufficient for the exponent under test. But that all relies on per-iteration ROE data sampling. Oh, have you tried forcing MM2_CHAIN=1 in your own runs? It would be useful to see how broadly the "this runs faster" effect applies. Just Radeon VIIs? Just some subset thereof? Last fiddled with by ewmayer on 2020-05-09 at 23:05 |
|
|
|
|
|
|
#2170 | |||
|
P90 years forever!
Aug 2002
Yeehaw, FL
165578 Posts |
Quote:
Quote:
Quote:
|
|||
|
|
|
|
|
#2171 |
|
∂2ω=0
Sep 2002
República de California
1164610 Posts |
Cross-posting from the "R7 @ newegg for $500" thread - new-build is alive, same Ubuntu 19.10 image I used to upgrade my Haswell system to host a Radeon VII (but that system remains on ROCm 2.10 for now), ROCm 3.3 installed, latest gpuowl built, but having OpenCL issues - first hit a missing-shared-lib error on program invocation which Paul Underwood helped me look into. Here the OpenCL-install info from the system as of last night:
Code:
apt-cache search libOpenCL ocl-icd-libopencl1 - Generic OpenCL ICD Loader libopencl-clang-dev - thin wrapper for clang -- development files libopencl-clang9 - thin wrapper for clang nvidia-libopencl1-331 - Transitional package for nvidia-libopencl1-340 nvidia-libopencl1-331-updates - Transitional package for nvidia-libopencl1-340 nvidia-libopencl1-340 - NVIDIA OpenCL Driver and ICD Loader library nvidia-libopencl1-340-updates - Transitional package for nvidia-libopencl1-340 nvidia-libopencl1-384 - Transitional package for nvidia-headless-390 Code:
apt list --installed | grep libopencl1 WARNING: apt does not have a stable CLI interface. Use with caution in scripts. Code:
ocl-icd-libopencl1/eoan,now 2.2.11-1ubuntu1 amd64 [installed] Code:
2020-05-13 13:31:31 gpuowl v6.11-278-ga39cc1a 2020-05-13 13:31:31 Note: not found 'config.txt' 2020-05-13 13:31:31 device 0, unique id 'df7080c172fd5d6e' 2020-05-13 13:31:31 df7080c172fd5d6e 104954387 FFT: 5.50M 1K:11:256 (18.20 bpw) 2020-05-13 13:31:31 df7080c172fd5d6e Expected maximum carry32: 50D10000 Segmentation fault (core dumped) Last fiddled with by ewmayer on 2020-05-13 at 21:15 |
|
|
|
|
|
#2172 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
165578 Posts |
Did you install libncurses5? rocm-dev?
Does clinfo work? |
|
|
|
|
|
#2173 | |
|
∂2ω=0
Sep 2002
República de California
2×32×647 Posts |
I did the same install I used for the Haswell system, which IIRC was geared toward ROCm 3.0 (or maybe it was 3.1), which I later overrode to 2.10 to be able to run:
wget -qO - http://repo.radeon.com/rocm/apt/debian/rocm.gpg.key | sudo apt-key add - echo 'deb [arch=amd64] http://repo.radeon.com/rocm/apt/debian/ xenial main' | sudo tee /etc/apt/sources.list.d/rocm.list sudo apt update && sudo apt install rocm-dev Quote:
Code:
Command 'clinfo' not found, but can be installed with: sudo apt install clinfo |
|
|
|
|
|
|
#2174 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
753510 Posts |
install libncurses5
Last fiddled with by ewmayer on 2020-05-14 at 01:55 Reason: I did - as noted "also grabbed the libncurses5 per your suggestion". Thanks! |
|
|
|
|
|
#2175 |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
124538 Posts |
Code:
2020-05-14 04:53:20 gpuowl v6.11-278-ga39cc1a 2020-05-14 04:53:20 config: -user kriesel -cpu asr2/radeonvii3 -d 3 -use NO_ASM -maxAlloc 15000 2020-05-14 04:53:20 device 3, unique id '' 2020-05-14 04:53:20 asr2/radeonvii3 worktodo.txt line ignored: "Test=(AID),91493761,77,1" 2020-05-14 04:53:20 asr2/radeonvii3 Bye |
|
|
|
|
|
#2176 |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
5,419 Posts |
Mihai,
Please add pseudorandom shift to gpuowl. Its absence is interfering with doublecheck sampling of higher exponents. (I'm attempting to fill in double checks for LL and for PRP from the current state to where there's at least one of each for every million-exponent-range bin up to 200M, well ahead of the first-test wavefront. https://www.mersenneforum.org/showpo...77&postcount=3 https://www.mersenneforum.org/showpo...81&postcount=6) As Radeon VIIs become more common in the GIMPS fleet, and further conversion from cudalucas to gpuowl occurs on NVIDIA, the issue will become more common in LL and PRP at the wavefront also. It's tedious to check shifts one by one, and I missed a few. Code:
2020-05-14 18:31:17 asr2/radeonvii2-w2 140000177 OK 139800000 99.86%; 2590 us/it; ETA 0d 00:09; 420066ee63e325a2 (check 1.42s)
2020-05-14 18:39:59 asr2/radeonvii2-w2 140000177 OK 140000000 100.00%; 2604 us/it; ETA 0d 00:00; d33ef20fe4d7b3c8 (check 1.54s)
{"status":"C", "exponent":"140000177", "worktype":"PRP-3", "res64":"892fa228d6b157__", "residue-type":"1", "errors":{"gerbicz":"0"}, "fft-length":"8388608", "program":{"name":"gpuowl", "version":"v6.11-278-ga39cc1a"}, "user":"kriesel", "computer":"asr2/radeonvii2-w2", "timestamp":"2020-05-14 23:40:02 UTC"}
Code:
2020-05-15 10:07:43 asr2/radeonvii 152171251 OK 152150000 99.99%; 2624 us/it; ETA 0d 00:01; 09166e3101f3f7a1 (check 1.53s) 28 errors
{"status":"C", "exponent":"152171251", "worktype":"PRP-3", "res64":"d4e28827ea97dd__", "residue-type":"1", "errors":{"gerbicz":"28"}, "fft-length":"8388608", "program":{"name":"gpuowl", "version":"v6.11-278-ga39cc1a"}, "user":"kriesel", "computer":"asr2/radeonvii", "timestamp":"2020-05-15 15:08:42 UTC"}
The good news is the PRP res64s on that one match to the extent it can be checked, despite 28 GEC errors detected and calculations redone from the previous check. Well done, Dr. Gerbicz, Mihai, George, et al. Last fiddled with by kriesel on 2020-05-16 at 13:51 |
|
|
|
|
|
#2177 | |
|
Romulan Interpreter
Jun 2011
Thailand
72×197 Posts |
Quote:
|
|
|
|
|
|
|
#2178 | |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
5,419 Posts |
Quote:
Code:
2020-05-15 00:28:04 asr2/radeonvii2 121642771 OK 121600000 99.96%; 2151 us/it; ETA 0d 00:02; f394cb39ecc84d04 (check 1.16s)
{"status":"C", "exponent":"121642771", "worktype":"PRP-3", "res64":"a3569f57e1792d__", "residue-type":"1", "errors":{"gerbicz":"0"}, "fft-length":"7340032", "program":{"name":"gpuowl", "version":"v6.11-278-ga39cc1a"}, "user":"kriesel", "computer":"asr2/radeonvii2", "timestamp":"2020-05-15 05:29:38 UTC"}
|
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| mfakto: an OpenCL program for Mersenne prefactoring | Bdot | GPU Computing | 1676 | 2021-06-30 21:23 |
| GPUOWL AMD Windows OpenCL issues | xx005fs | GpuOwl | 0 | 2019-07-26 21:37 |
| Testing an expression for primality | 1260 | Software | 17 | 2015-08-28 01:35 |
| Testing Mersenne cofactors for primality? | CRGreathouse | Computer Science & Computational Number Theory | 18 | 2013-06-08 19:12 |
| Primality-testing program with multiple types of moduli (PFGW-related) | Unregistered | Information & Answers | 4 | 2006-10-04 22:38 |