mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2019-03-26, 09:17   #89
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

2×52×19 Posts
Default

The price of the PowerColor version on ebuyer has dropped by £10 to £640. First time I've seen the price drop below £650 which is interesting: https://www.ebuyer.com/875303-powerc...i-16gbhbm2-3dh
M344587487 is offline   Reply With Quote
Old 2019-04-05, 12:08   #90
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

26548 Posts
Default

I now have a Radeon VII, I'd like to share my setup. I overclocked the RAM from 1000 to 1100, and undervolted a bit.
Initial pp_od_clk_voltage:
OD_SCLK:
0: 808Mhz
1: 1801Mhz
OD_MCLK:
1: 1000Mhz
OD_VDDC_CURVE:
0: 808Mhz 695mV
1: 1304Mhz 791mV
2: 1801Mhz 1089mV

Modified to:
D_SCLK:
0: 808Mhz
1: 1801Mhz
OD_MCLK:
1: 1100Mhz
OD_VDDC_CURVE:
0: 808Mhz 695mV
1: 1304Mhz 780mV
2: 1801Mhz 1070mV

I run it with --setsclk 4 (1547Mhz). I get 0.92ms/it at wavefront (FFT 4608K), and the GPU uses 160W. Fan on auto, temperature reported by sensors is 105C, but I suspect this value is with 20C over the real temperature (because the limit is reported at 118C).
preda is offline   Reply With Quote
Old 2019-04-05, 12:57   #91
SELROC
 

11101011002 Posts
Default

Quote:
Originally Posted by preda View Post
I now have a Radeon VII, I'd like to share my setup. I overclocked the RAM from 1000 to 1100, and undervolted a bit.
Initial pp_od_clk_voltage:
OD_SCLK:
0: 808Mhz
1: 1801Mhz
OD_MCLK:
1: 1000Mhz
OD_VDDC_CURVE:
0: 808Mhz 695mV
1: 1304Mhz 791mV
2: 1801Mhz 1089mV

Modified to:
D_SCLK:
0: 808Mhz
1: 1801Mhz
OD_MCLK:
1: 1100Mhz
OD_VDDC_CURVE:
0: 808Mhz 695mV
1: 1304Mhz 780mV
2: 1801Mhz 1070mV

I run it with --setsclk 4 (1547Mhz). I get 0.92ms/it at wavefront (FFT 4608K), and the GPU uses 160W. Fan on auto, temperature reported by sensors is 105C, but I suspect this value is with 20C over the real temperature (because the limit is reported at 118C).

I have ambient temp 22~24 C and the gpu is at 107 C with good cooling.
  Reply With Quote
Old 2019-04-08, 10:40   #92
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

22×3×112 Posts
Default

With its 16GB of RAM, R7 is also quite good at P-1.
I'm doing P-1(B1=300K,B2=9M) on 91M exponents in about 17min/test.
(and I found this 118-bit factor: 420168247365933163207630527781851871 )
preda is offline   Reply With Quote
Old 2019-04-08, 10:44   #93
SELROC
 

2×19×131 Posts
Default

Quote:
Originally Posted by preda View Post
With its 16GB of RAM, R7 is also quite good at P-1.
I'm doing P-1(B1=300K,B2=9M) on 91M exponents in about 17min/test.
(and I found this 118-bit factor: 420168247365933163207630527781851871 )

I am waiting to deplenish my current worktodo.txt before starting P-1 :-)
  Reply With Quote
Old 2019-04-08, 10:51   #94
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

3B616 Posts
Default

Quote:
Originally Posted by preda View Post
With its 16GB of RAM, R7 is also quite good at P-1.
I'm doing P-1(B1=300K,B2=9M) on 91M exponents in about 17min/test.
(and I found this 118-bit factor: 420168247365933163207630527781851871 )
Do you have to manually set anything to do P-1 with gpuowl? I'm not familiar with P-1 and these lines generated from this calculator were ignored: https://www.mersenne.ca/prob.php

Code:
Pminus1=1,2,344587487,-1,1645000,32900000,82
 Pfactor=1,2,344587487,-1,82,2

edit: I figured it out, the calculator is older than an identifying hash being added to the line.

Last fiddled with by M344587487 on 2019-04-08 at 10:53
M344587487 is offline   Reply With Quote
Old 2019-04-08, 11:10   #95
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

22×3×112 Posts
Default

Quote:
Originally Posted by M344587487 View Post
Do you have to manually set anything to do P-1 with gpuowl? I'm not familiar with P-1 and these lines generated from this calculator were ignored: https://www.mersenne.ca/prob.php

Code:
Pminus1=1,2,344587487,-1,1645000,32900000,82
 Pfactor=1,2,344587487,-1,82,2

edit: I figured it out, the calculator is older than an identifying hash being added to the line.
Yes, assignments like
PFactor=AID,1,2,91157513,-1,77,2

these are easily generated/submitted by e.g.:
gpuowl/primenet.py -u user -p passwd --dirs workdir -w PM1 --tasks 40

and for B1/B2 I add to openowl e.g.:
./openowl -B1 300000
./openowl -B1 1000000 -rB2 25

Bounds (B1,B2) can also be specified per-exponent, by prefixing the worktodo line with:
B1=x;line
B1=x,B2=y;line

Last fiddled with by preda on 2019-04-08 at 11:13
preda is offline   Reply With Quote
Old 2019-04-08, 11:21   #96
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

11101101102 Posts
Default

Thanks. Could it make sense to do a P-1 test and a PRP test simultaneously? Two PRP tests improves throughput, two P-1 cannot be done simultaneously as they both want to max out RAM. One of each could make the most of the hardware.
M344587487 is offline   Reply With Quote
Old 2019-04-08, 11:29   #97
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

22·3·112 Posts
Default

Quote:
Originally Posted by M344587487 View Post
Thanks. Could it make sense to do a P-1 test and a PRP test simultaneously? Two PRP tests improves throughput, two P-1 cannot be done simultaneously as they both want to max out RAM. One of each could make the most of the hardware.
Yes I think that would work. I'm not doing it myself, because the benefit is too small IMO, but it may be worth if you're patient. You should watch the memory use reported by the P-1 at start of test (e.g. "P-1 GPU RAM fits 388 stage2 buffers @ 40.0 MB each, using 360"), to make sure the PRP has enough space -- in this example there is plenty of buffers between 360 and 388 for the PRP.
preda is offline   Reply With Quote
Old 2019-04-08, 13:52   #98
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

24×3×163 Posts
Default

Quote:
Originally Posted by preda View Post
With its 16GB of RAM, R7 is also quite good at P-1.
I'm doing P-1(B1=300K,B2=9M) on 91M exponents in about 17min/test.
(and I found this 118-bit factor: 420168247365933163207630527781851871 )
For such an exponent, B1=300K, or B2=9M, look low to me.
See for example https://www.mersenne.org/report_expo...1140901&full=1
B1= 745 000
B2= 17 135 000
run in CUDAPm1 v0.20 on a GTX1070 (8GB)

Or M90888739, B1=870000, B2=18052500, e=12, in CUDAPm1 v0.22 on a GTX1080Ti (11GB)
kriesel is online now   Reply With Quote
Old 2019-04-08, 15:31   #99
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

95010 Posts
Default

Here's a rough test exploring PRP and P-1 simultaneously. Timings in various configs of a 90M P-1 (B1=500000, B2=15000000) and 88M PRP, auto test settings, 1546 SCLK 1200 MCLK:
Code:
[PRP ms/it]  [PRP ms/it] [P-1 Stage 1 ms/it] [P-1 Stage 2 ms/it] [Power W] [P-1 Stage Completion minutes est.]
 1.02         0           0                   0                   170
 1.89         1.89        0                   0                   180
 0            0           1.02                0                   170       ~12
 0            0           0                   1.2                 160       ~16
 1.89         0           1.89                0                   180       ~22
 2.06         0           0                   2.09                175       ~28
Which I make out to be:
  • ~7.9% increase in throughput from 1 PRP to 2 simultaneous PRP at ~6% increase in power. Definitely worth it
  • P-1 running at ~56% speed relative to solo and PRP running at ~51.5% speed relative to solo at ~6% increase in power. Still worth it for throughput but the power numbers make efficiency less certain.
M344587487 is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Vega 20 announced with 7.64 TFlops of FP64 M344587487 GPU Computing 4 2018-11-08 16:56
GTX 1180 Mars Volta consumer card specs leaked tServo GPU Computing 20 2018-06-24 08:04
RX Vega performance xx005fs GPU Computing 5 2018-01-17 00:22
Radeon Pro Duo 0PolarBearsHere GPU Computing 0 2016-03-15 01:32
AMD Radeon R9 295X2 firejuggler GPU Computing 33 2014-09-03 21:42

All times are UTC. The time now is 15:05.


Fri Jul 7 15:05:21 UTC 2023 up 323 days, 12:33, 0 users, load averages: 1.46, 1.29, 1.19

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔