mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing > GpuOwl

Reply
 
Thread Tools
Old 2018-10-30, 20:08   #12
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

2×11×37 Posts
Default

Right now mem clock ceiling is around 900MHz. I know the card can go further because I've done it for mining but that was on windows and the tools just worked. The best so far at 1401MHz core is 0.945V core, 900MHz mem, 2.34ms/it drawing 142W. Compared to 2.47ms/it at 100W it's way worse. There's more to do between 1269 and 1401MHz but I think I will have to flash with Vega 64 bios to progress further and probably will. It's a shame the bios are signed though.
M344587487 is offline   Reply With Quote
Old 2018-10-31, 04:18   #13
xx005fs
 
"Eric"
Jan 2018
USA

22×53 Posts
Default

Quote:
Originally Posted by M344587487 View Post
Right now mem clock ceiling is around 900MHz. I know the card can go further because I've done it for mining but that was on windows and the tools just worked. The best so far at 1401MHz core is 0.945V core, 900MHz mem, 2.34ms/it drawing 142W. Compared to 2.47ms/it at 100W it's way worse. There's more to do between 1269 and 1401MHz but I think I will have to flash with Vega 64 bios to progress further and probably will. It's a shame the bios are signed though.
That to me sounds like Hynix HBM as my card with Samsung could overclock way further than that even with Vega 56 stock BIOS. I reckon that you can change the bios switch to power saving bios and try to lower power even further.
xx005fs is offline   Reply With Quote
Old 2018-11-16, 09:22   #14
SELROC
 

22×52×61 Posts
Default

Quote:
Originally Posted by preda View Post
I use ROCm 1.9.1, Ubuntu 18.04 with Linux kernel 4.18.8.
With dual Vega64 (air with the standard "blower" cooler).
Here are my observations:

1. ROCm is in general faster then amdgpu-pro (better compiler, producing better ISA code).
2. My sweet-spot is p-state 5 (rocm-smi --setsclk 5), which results in 1401MHz, GPU fan at 2300 RPM (automatic), 150W power, 75degC temperature.

If I set the frequency higher (p-state 6, or 7, or automatic (default)), the GPU quickly reaches 82-84 decC and there does thermal throttling. This thermal throttling results in worse performance then p-state 5, so it's a lose-lose: higher temperature, higher power use, lower performance.

I do not set the fan speed manually, I leave it on automatic, which is enough cooling for 150W with 75C.

Impressions of ROCm + RX580 on Debian:
1. faster than amdgpu-pro for large exponents
2. automatic and default settings, core clock 1319MHz, memory clock 2000MHz, fan 2130 rpm, power 144W, 77 deg C
  Reply With Quote
Old 2018-12-29, 08:11   #15
SELROC
 

816610 Posts
Default

Quote:
Originally Posted by preda View Post
I use ROCm 1.9.1, Ubuntu 18.04 with Linux kernel 4.18.8.
With dual Vega64 (air with the standard "blower" cooler).
Here are my observations:

1. ROCm is in general faster then amdgpu-pro (better compiler, producing better ISA code).
2. My sweet-spot is p-state 5 (rocm-smi --setsclk 5), which results in 1401MHz, GPU fan at 2300 RPM (automatic), 150W power, 75degC temperature.

If I set the frequency higher (p-state 6, or 7, or automatic (default)), the GPU quickly reaches 82-84 decC and there does thermal throttling. This thermal throttling results in worse performance then p-state 5, so it's a lose-lose: higher temperature, higher power use, lower performance.

I do not set the fan speed manually, I leave it on automatic, which is enough cooling for 150W with 75C.
Quote:
Originally Posted by SELROC View Post
Impressions of ROCm + RX580 on Debian:
1. faster than amdgpu-pro for large exponents
2. automatic and default settings, core clock 1319MHz, memory clock 2000MHz, fan 2130 rpm, power 144W, 77 deg C

At 1319MHz core clock it is in ROCm profile 6.
GpuOwl Version 5.0 https://github.com/preda/gpuowl/comm...a518a6913e9cff


Timing 4.48 ms/sq with 88M exponent.
  Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
gpuOwL: an OpenCL program for Mersenne primality testing preda GpuOwl 2718 2021-07-06 18:30
gpuowl: runtime error SELROC GpuOwl 59 2020-10-02 03:56
How to interface gpuOwl with PrimeNet preda PrimeNet 2 2017-10-07 21:32
Organizational tuning biwema Software 12 2006-01-17 03:02

All times are UTC. The time now is 02:40.


Sat Jul 17 02:40:36 UTC 2021 up 50 days, 27 mins, 1 user, load averages: 1.57, 1.56, 1.48

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.