mersenneforum.org  

2020-02-03, 23:13   #56
paulunderwood
Sep 2002
Database er0rr
5·937 Posts

Quote:
Originally Posted by ewmayer
Thanks.
Per the readme, I used a single minus sign. From within a subdir 'run0', where I have created a worktodo.txt file containing a pair of PRP assignments, I tried 'sudo ../gpuowl -user ewmayer'. After I entered my sudo password, the run echoed the same output as the 2nd failure above, just with an added 'config: -user ewmayer' line. Trying instead to log in as root and run that way [this is the Ubuntu 19.10 setup I created last week], using the same password, gives 'Authentication failure'. I don't recall entering any other password during the set-password phase of the Ubuntu 19.10 setup.

Not needed yet since I can't run at all, but how do I determine the max stock voltage of my R7?

Two things:

Make sure you are in the group "video" by running id ewmayer. If you are not, you need to be added to it and then log in again.

To create a root password, run sudo passwd root and take it from there. It is best not to run X as root; instead, type su in a terminal and enter root's password.
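
The group check can also be scripted. This is only a minimal sketch, assuming a POSIX shell and the standard id utility; the helper names has_word and in_group are made up for illustration and are not part of gpuowl or ROCm.

```shell
#!/bin/sh
# has_word WORD LIST: succeed if WORD is one of the space-separated words in LIST.
# (hypothetical helper, used only by this sketch)
has_word() {
    printf '%s\n' "$2" | tr ' ' '\n' | grep -qxF -- "$1"
}

# in_group USER GROUP: succeed if USER is a member of GROUP.
# id -nG prints the group names of USER separated by spaces.
in_group() {
    has_word "$2" "$(id -nG "$1" 2>/dev/null)"
}

# Example use (not run here; adjust the user name):
#   in_group ewmayer video || sudo usermod -aG video ewmayer
#   # then log out and back in so the new group takes effect
```

The exact-match test matters: a plain substring search would also accept a group whose name merely contains "video".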

Last fiddled with by paulunderwood on 2020-02-03 at 23:42
2020-02-03, 23:17   #57
S485122
"Jacob"
Sep 2006
Brussels, Belgium
2×977 Posts

Quote:
Originally Posted by ewmayer
...
I don't recall entering any other pwd during the set-pwd phase of Ubuntu 19.10 setup.
...

If my memory serves me well, a fresh install has no password set for the root account, so the account can't be used until you set one via "sudo passwd root".

Jacob
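
Whether root has a usable password can be checked before and after running "sudo passwd root". A minimal sketch, assuming the passwd from the shadow suite as shipped on Debian and Ubuntu, where the second field of the passwd -S status line is L (locked), NP (no password) or P (usable password); other distributions may print different codes. The function name passwd_state is made up for illustration.

```shell
#!/bin/sh
# passwd_state: read one `passwd -S` status line on stdin and print what
# its second field means. (hypothetical helper name)
passwd_state() {
    awk '{ s = $2 }
         END {
             if (s == "L") print "locked"
             else if (s == "NP") print "no password"
             else if (s == "P") print "usable"
             else print "unknown"
         }'
}

# Example use (not run here):
#   sudo passwd -S root | passwd_state
```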
2020-02-03, 23:41   #58
ewmayer
2ω=0
Sep 2002
República de California
2²·2,939 Posts

Quote:
Originally Posted by S485122
If my memory serves me well, a fresh install has no password set for the root account, so the account can't be used until you set one via "sudo passwd root".

Jacob

That worked, thanks. But even running as root, I still get the getDeviceIDs error, whether I use -user ewmayer, -user root, or no -user option at all.

I've PMed Mihai; hopefully he can provide further guidance.
2020-02-03, 23:47   #59
paulunderwood
Sep 2002
Database er0rr
5×937 Posts

Quote:
Originally Posted by ewmayer
That worked, thanks. But even running as root, I still get the getDeviceIDs error, whether I use -user ewmayer, -user root, or no -user option at all.

I've PMed Mihai; hopefully he can provide further guidance.

Did you log in as root in a terminal by using su?

Last fiddled with by paulunderwood on 2020-02-03 at 23:48
2020-02-04, 02:52   #60
ewmayer
2ω=0
Sep 2002
República de California
2²·2,939 Posts

Quote:
Originally Posted by paulunderwood
Did you log in as root in a terminal by using su?

Yes: 'su' using the newly set root password, instead of 'sudo'.
2020-02-04, 03:35   #61
kriesel
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
7823₁₀ Posts

Quote:
Originally Posted by ewmayer
First I'd like to play with some basic single-instance running, but something is borked. The readme says "Self-test: simply start gpuowl with any valid exponent..." but does not say how to specify that exponent via command-line flags. I tried just sticking a prime exponent in there, then ran without any arguments whatever; both gave the following kind of error:
Code:
ewmayer@ewmayer-haswell:~/gpuowl$ ./gpuowl 90110269
2020-02-01 18:43:36 gpuowl v6.11-142-gf54af2e
2020-02-01 18:43:36 Note: not found 'config.txt'
2020-02-01 18:43:36 config: 90110269 
2020-02-01 18:43:36 device 0, unique id ''
2020-02-01 18:43:36 Exception gpu_error: DEVICE_NOT_FOUND clGetDeviceIDs(platforms[i], kind, 64, devices, &n) at clwrap.cpp:77 getDeviceIDs
2020-02-01 18:43:36 Bye
ewmayer@ewmayer-haswell:~/gpuowl$ ./gpuowl
2020-02-01 18:44:02 gpuowl v6.11-142-gf54af2e
2020-02-01 18:44:02 Note: not found 'config.txt'
2020-02-01 18:44:02 device 0, unique id ''
2020-02-01 18:44:02 Exception gpu_error: DEVICE_NOT_FOUND clGetDeviceIDs(platforms[i], kind, 64, devices, &n) at clwrap.cpp:77 getDeviceIDs
2020-02-01 18:44:02 Bye
Matt had noted to me, "If the PRP test starts we are good to go. If it fails with something along the lines of clGetDeviceId then gpuowl couldn't see the card." How to debug that latter problem?

./gpuowl -help
should give a big list of FFT lengths, then at the end a list of detected GPUs. If that list is empty, confirm it with something else, such as an OpenCL diagnostic tool, possibly one of the tools listed at the top of page 3 of the pdf attachment at https://www.mersenneforum.org/showpo...74&postcount=6
Then fix the OpenCL driver installation somehow. (I can't help you there; I have too little GPU Linux experience to go by.)

On Windows one can also use GPU-Z to check driver version, OpenCL and other standards' parameters, etc. Some of the other hardware monitoring tools listed on pages 1-2 of that same attachment might also allow that.
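
For scripted checks, the failure in the log above can be recognized by its DEVICE_NOT_FOUND text. This is only a sketch: the function name gpu_visible is made up, it assumes the error string stays as shown in the quoted log, and it uses the -h help flag that a later post in this thread reports as working.

```shell
#!/bin/sh
# gpu_visible: read gpuowl output on stdin; fail if it contains the
# DEVICE_NOT_FOUND error seen in the log above. (hypothetical helper name)
gpu_visible() {
    ! grep -q 'DEVICE_NOT_FOUND'
}

# Example use (not run here):
#   ./gpuowl -h 2>&1 | gpu_visible || echo "gpuowl cannot see a GPU; check the OpenCL install"
```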
2020-02-04, 08:21   #62
preda
"Mihai Preda"
Apr 2015
2²×3×11² Posts

Quote:
Originally Posted by ewmayer
That worked, thanks. But even running as root, I still get the getDeviceIDs error, whether I use -user ewmayer, -user root, or no -user option at all.

I've PMed Mihai; hopefully he can provide further guidance.

Does clinfo work (i.e., does it detect any devices)?

If clinfo does not detect anything, then the problem is with the OpenCL setup in the system (i.e. drivers, ROCm).
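
A rough way to turn that clinfo check into a yes/no answer is to sum the "Number of devices" lines it prints. This is a sketch under assumptions: it assumes clinfo reports one such line per platform with the count as the last field, and the function name count_cl_devices is made up for illustration.

```shell
#!/bin/sh
# count_cl_devices: read clinfo output on stdin and print the total of all
# "Number of devices" lines (0 if there are none). (hypothetical helper name)
count_cl_devices() {
    awk '/Number of devices/ { total += $NF } END { print total + 0 }'
}

# Example use (not run here):
#   if command -v clinfo >/dev/null 2>&1; then
#       clinfo | count_cl_devices
#   else
#       echo "clinfo is not on the PATH"
#   fi
```

A result of 0 points at the OpenCL setup (drivers, ROCm) rather than at gpuowl.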
2020-02-04, 10:10   #63
M344587487
"Composite as Heck"
Oct 2017
2·5²·19 Posts

Maybe the issue is that I recommended he install ROCm using the upstream drivers ( https://github.com/RadeonOpenCompute...kernel-drivers ). It's been a long time since I did a ROCm setup, and something in the installation procedure or environment may have changed, breaking this method or requiring extra steps. rocm-smi can see the card, but I didn't have him check clinfo or rocminfo.
2020-02-04, 10:46   #64
preda
"Mihai Preda"
Apr 2015
1452₁₀ Posts

Quote:
Originally Posted by M344587487
Maybe the issue is that I recommended he install ROCm using the upstream drivers ( https://github.com/RadeonOpenCompute...kernel-drivers ). It's been a long time since I did a ROCm setup, and something in the installation procedure or environment may have changed, breaking this method or requiring extra steps. rocm-smi can see the card, but I didn't have him check clinfo or rocminfo.

If it's ROCm 3.0, it may have broken OpenCL; see https://github.com/RadeonOpenCompute/ROCm/issues/977

ROCm 2.10 works for me.

Last fiddled with by preda on 2020-02-04 at 10:46
2020-02-04, 10:57   #65
paulunderwood
Sep 2002
Database er0rr
5·937 Posts

Quote:
Originally Posted by preda
If it's ROCm 3.0, it may have broken OpenCL; see https://github.com/RadeonOpenCompute/ROCm/issues/977

ROCm 2.10 works for me.

This looks like the best advice:

Quote:
OlegSmelov commented on Dec 23, 2019

For those wondering how to revert to a previous version on Debian-based distros:

sudo apt autoremove rocm-dkms rock-dkms
sudo vim /etc/apt/sources.list.d/rocm.list

Replace http://repo.radeon.com/rocm/apt/debian/ with http://repo.radeon.com/rocm/apt/2.10.0/

sudo apt update
sudo apt install rocm-dkms # or any other set of packages you need
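
The vim step in the quoted recipe can also be done non-interactively. A minimal sketch, assuming the sources file contains the repository URL exactly as written above; the function name pin_rocm_repo is made up for illustration, and it only prints the rewritten file rather than changing anything in place.

```shell
#!/bin/sh
# pin_rocm_repo FILE: print FILE with the rolling ROCm apt URL replaced by
# the 2.10.0 URL from the quoted comment. (hypothetical helper name)
pin_rocm_repo() {
    sed 's|http://repo.radeon.com/rocm/apt/debian/|http://repo.radeon.com/rocm/apt/2.10.0/|' "$1"
}

# Example use (not run here):
#   pin_rocm_repo /etc/apt/sources.list.d/rocm.list > /tmp/rocm.list
#   cat /tmp/rocm.list            # inspect the result before installing it
#   sudo cp /tmp/rocm.list /etc/apt/sources.list.d/rocm.list
#   sudo apt update
```

Writing to a temporary file first avoids reading and overwriting the same file in one pipeline.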
2020-02-04, 20:05   #66
ewmayer
2ω=0
Sep 2002
República de California
2²×2,939 Posts

Quote:
Originally Posted by kriesel
./gpuowl -help
should give a big list of FFT lengths, then at the end a list of detected GPUs. If that list is empty, confirm it with something else, such as an OpenCL diagnostic tool, possibly one of the tools listed at the top of page 3 of the pdf attachment at https://www.mersenneforum.org/showpo...74&postcount=6
Then fix the OpenCL driver installation somehow. (I can't help you there; I have too little GPU Linux experience to go by.)

Thanks. Note that the help command needs --help or -h. Anyhow, that gives me
Code:
[build version]
Command line options:
...
-device <N>: select a specific device:
[timestamp] Exception gpu_error: DEVICE_NOT_FOUND clGetDeviceIDs(platforms[i], kind, 64, devices, &n) at clwrap.cpp:77 getDeviceIDs
I don't see the big list of FFT lengths you mentioned.

Quote:
Originally Posted by preda
Does clinfo work (i.e., does it detect any devices)?

If clinfo does not detect anything, then the problem is with the OpenCL setup in the system (i.e. drivers, ROCm).

Is that supposed to be an installed command? 'which clinfo' comes up empty, and I don't see any such command in /opt/rocm/bin.

Quote:
Originally Posted by preda
If it's ROCm 3.0, it may have broken OpenCL; see https://github.com/RadeonOpenCompute/ROCm/issues/977

ROCm 2.10 works for me.

That sounds like a possible suspect, given that I installed Ubuntu 19.10, which is newer than the 19.04 Matt based his setup recipe on.

How do I query the version number for the ROCm install on my system?

Once I do that, if it indeed is 3.0, I'll try the Debian-distro reversion commands Paul dug up, which hopefully will work similarly on Ubuntu.
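
One possible way to answer the version question, sketched under the assumption that ROCm was installed from the apt repository as the rocm-dkms package named in the reversion commands: ask dpkg for the package version and test whether it starts with 3.0. The function name is_rocm_3_0 is made up for illustration, and the version string format is a guess.

```shell
#!/bin/sh
# is_rocm_3_0 VERSION: succeed if VERSION is 3.0 or a 3.0.x / 3.0-x release.
# (hypothetical helper name)
is_rocm_3_0() {
    case "$1" in
        3.0|3.0.*|3.0-*) return 0 ;;
        *) return 1 ;;
    esac
}

# Example use (not run here):
#   ver=$(dpkg-query -W -f='${Version}' rocm-dkms)
#   echo "rocm-dkms version: $ver"
#   is_rocm_3_0 "$ver" && echo "this is the release suspected of breaking OpenCL"
```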

Last fiddled with by ewmayer on 2020-02-04 at 20:10
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.
