mersenneforum.org  

Old 2022-12-22, 18:57   #23
Magellan3s
 
Mar 2022
Earth

176₈ Posts

Has anyone been able to benchmark the 7900 XTX?
Old 2022-12-22, 23:15   #24
James Heinrich
 
"James Heinrich"
May 2004
ex-Northern Ontario

7×13×47 Posts

Quote:
Originally Posted by Magellan3s
Has anyone been able to benchmark the 7900 XTX?

If there are any mfakto or gpuowl benchmarks for the 7900 (XT or XTX, I don't care), I would desperately like to see them; the numbers on my charts are more or less made up.
Old 2022-12-23, 14:11   #25
Xyzzy
 
Aug 2002

21D2₁₆ Posts

https://old.reddit.com/r/Amd/comment...e_7xxx_series/

Old 2022-12-23, 15:16   #26
M344587487
 
"Composite as Heck"
Oct 2017

2·5²·19 Posts

I don't know what to make of that thread. This quote in particular seems misguided:
Quote:
...dual SIMD is useless for some (most) applications since the added second SIMD per CU doesn't support integer ops...
given that most applications care about floating point (FP), AFAIK.

The locked PowerPlay (PP) tables sound concerning, but from what I recall we could do as we pleased with Vega 10/20, and this quote:
Quote:
There is some small sliver of hope that AMD will eventually unlock the PPtables, but looking at Vega10/20, that doesn't seem likely.
seems to contradict that. If they're wrong about that, I'm not convinced they know what they're talking about.

Quote:
...Also, indications are that they've moved instruction pipeline responsibilities to software, meaning you now need to carefully reorder instructions to not get pipeline stalls and/or provide hints (there's a new instruction for this specific purpose, s_delay_alu). Since many software kernels are hand-rolled in raw assembly, this is potentially a huge pain point for developers - since this platform needs specific instructions that no other platform does....
Does this apply to gpuowl or mfakto? I don't think .cl files are assembly, or that assembly is used at all, but I could be wrong.
Old 2022-12-28, 09:22   #27
Magellan3s
 
Mar 2022
Earth

2×3²×7 Posts

Quote:
Originally Posted by James Heinrich
If there are any mfakto or gpuowl benchmarks for the 7900 (XT or XTX, I don't care), I would desperately like to see them; the numbers on my charts are more or less made up.

7900 XTX

Code:
2022-12-28 01:30:43 GpuOwl VERSION v7.2-70-g212618e
2022-12-28 01:30:43 config: log 1000
2022-12-28 01:30:43 config:
2022-12-28 01:30:43 device 0, unique id ''
2022-12-28 01:30:43 gfx1100-0 77936867 FFT: 4M 1K:8:256 (18.58 bpw)
2022-12-28 01:30:43 gfx1100-0 77936867 OpenCL args "-DEXP=77936867u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=8u -DAMDGPU=1 -DMM_CHAIN=1u -DMM2_CHAIN=2u -DMAX_ACCURACY=1 -DWEIGHT_STEP=0.33644726404543274 -DIWEIGHT_STEP=-0.25174750481886216 -DIWEIGHTS={0,-0.44011820345520131,-0.37306474779553728,-0.29798072935699788,-0.21390437908665341,-0.11975874301407295,-0.014337887291734644,-0.44814572555075455,} -DFWEIGHTS={0,0.78609128957452257,0.5950610473469905,0.42446232150303748,0.2721098723818392,0.1360521812214803,0.014546452690911484,0.81207258201996746,} -cl-std=CL2.0 -cl-finite-math-only "
2022-12-28 01:30:44 gfx1100-0 77936867 OpenCL compilation in 1.07 s
2022-12-28 01:30:44 gfx1100-0 77936867 trig table : 65 points, cos 73.86 bits, sin 73.34 bits
2022-12-28 01:30:44 gfx1100-0 77936867 trig table : 257 points, cos 72.90 bits, sin 73.11 bits
2022-12-28 01:30:45 gfx1100-0 77936867 trig table : 262145 points, cos 72.03 bits, sin 72.56 bits
2022-12-28 01:30:45 gfx1100-0 77936867 maxAlloc: 0.0 GB
2022-12-28 01:30:45 gfx1100-0 77936867 You should use -maxAlloc if your GPU has more than 4GB memory. See help '-h'
2022-12-28 01:30:45 gfx1100-0 77936867 P1(0) 0 bits
2022-12-28 01:30:45 gfx1100-0 77936867 PRP starting from beginning
2022-12-28 01:30:45 gfx1100-0 77936867 OK 0 on-load: blockSize 400, 0000000000000003
2022-12-28 01:30:45 gfx1100-0 77936867 validating proof residues for power 8
2022-12-28 01:30:45 gfx1100-0 77936867 Proof using power 8
2022-12-28 01:30:46 gfx1100-0 77936867 OK 800 0.00% 1579c241dc63eca6 784 us/it + check 0.36s + save 0.11s; ETA 16:58
2022-12-28 01:30:54 gfx1100-0 77936867 10000 0.01% fc4f135f7cf4ad29 785 us/it
2022-12-28 01:31:02 gfx1100-0 77936867 20000 0.03% 3cd1bd9d5e09cbc5 788 us/it
2022-12-28 01:31:09 gfx1100-0 77936867 30000 0.04% c4e0ff35e3290d98 791 us/it
2022-12-28 01:31:17 gfx1100-0 77936867 40000 0.05% dffe1b1b0d748128 793 us/it
2022-12-28 01:31:25 gfx1100-0 77936867 50000 0.06% 52e286945371ed29 793 us/it
2022-12-28 01:31:33 gfx1100-0 77936867 60000 0.08% 0945da4dc08bdd95 795 us/it
2022-12-28 01:31:41 gfx1100-0 77936867 70000 0.09% 7131fa4eb77f4bb2 795 us/it

Last fiddled with by Magellan3s on 2022-12-28 at 09:23
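
For anyone reading the log above, two of its figures can be cross-checked from the other numbers it prints. This is a minimal Python sketch, not anything GpuOwl itself runs. It assumes that "4M" means 4 × 2^20 FFT words, that the FFT size equals 2 × WIDTH × MIDDLE × SMALL_HEIGHT, and that the ETA is simply exponent × time per iteration with the check and save overhead ignored. The function names are made up for illustration.

```python
# Sanity-check figures from the GpuOwl log using only numbers it prints.
# Assumptions (not stated in the log): "4M" = 4 * 2**20 words, the FFT size
# is 2 * WIDTH * MIDDLE * SMALL_HEIGHT, and ETA = exponent * time per iteration.

EXPONENT = 77936867                          # from "-DEXP=77936867u"
FFT_WORDS = 4 * 2**20                        # from "FFT: 4M"
WIDTH, MIDDLE, SMALL_HEIGHT = 1024, 8, 256   # from "1K:8:256" and the -D flags


def bits_per_word(exponent, fft_words):
    """Average number of exponent bits stored in each FFT word."""
    return exponent / fft_words


def eta_hours_minutes(exponent, us_per_it):
    """Time for `exponent` iterations at `us_per_it`, as (hours, minutes)."""
    total_seconds = exponent * us_per_it / 1_000_000
    hours, remainder = divmod(total_seconds, 3600)
    return int(hours), int(remainder // 60)


def slowdown_percent(first_us_per_it, last_us_per_it):
    """Relative increase in time per iteration, in percent."""
    return (last_us_per_it - first_us_per_it) / first_us_per_it * 100


bpw = bits_per_word(EXPONENT, FFT_WORDS)
eta = eta_hours_minutes(EXPONENT, 784)   # 784 us/it: first timing in the log
drift = slowdown_percent(784, 795)       # 795 us/it: last timing in the log

print(f"{bpw:.2f} bpw")
print(f"ETA {eta[0]}:{eta[1]:02d}")
print(f"{drift:.1f}% slower")
```

Under those assumptions the script prints 18.58 bpw, ETA 16:58 and 1.4% slower, which agrees with the "18.58 bpw" and "ETA 16:58" fields in the log. The 784 to 795 us/it drift over the first 70000 iterations is small; the log does not say what causes it.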
Old 2022-12-28, 11:11   #28
axn
 
Jun 2003

2³×683 Posts

Quote:
Originally Posted by Magellan3s
7900 XTX

Code:
2022-12-28 01:30:43 GpuOwl VERSION v7.2-70-g212618e

Can you try running two parallel instances of gpuowl (you can use 77936923 for the second instance)? I'd like to see what throughput gains, if any, we can get.
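
To make the comparison concrete, here is a rough Python sketch of how a two-instance result could be weighed against the single-instance timing. Only the 784 us/it figure comes from the log posted in this thread; the two-instance timings below are hypothetical placeholders, not measurements, and the helper names are invented for illustration.

```python
# Compare aggregate throughput of parallel instances with a single instance.
# Only 784 us/it is taken from the posted log; the two-instance timings are
# HYPOTHETICAL placeholders to be replaced with real measurements.


def iterations_per_second(us_per_it_each):
    """Sum of iterations per second over all concurrently running instances."""
    return sum(1_000_000 / t for t in us_per_it_each)


def gain_percent(single_us_per_it, parallel_us_per_it):
    """Throughput change of the parallel setup relative to a single instance."""
    base = iterations_per_second([single_us_per_it])
    combined = iterations_per_second(parallel_us_per_it)
    return (combined / base - 1) * 100


single = 784.0                 # us/it, from the posted log
two_instances = [1500.0, 1500.0]   # us/it each, HYPOTHETICAL

print(f"{gain_percent(single, two_instances):+.1f}%")
```

With a single instance at 784 us/it, the break-even point is 1568 us/it per instance: two instances only help if each one runs faster than that.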
Old 2022-12-28, 12:47   #29
M344587487
 
"Composite as Heck"
Oct 2017

2×5²×19 Posts

Quote:
Originally Posted by axn
Can you try running two parallel instances of gpuowl (you can use 77936923 for the second instance)? I'd like to see what throughput gains, if any, we can get.

The results are from here: https://mersenneforum.org/showthread.php?t=28303
Old 2023-01-09, 17:15   #30
preda
 
"Mihai Preda"
Apr 2015

2²·3·11² Posts

Chips and Cheese benchmarking the 7900 XTX:

https://chipsandcheese.com/2023/01/0...-architecture/
Old 2023-01-13, 10:05   #31
M344587487
 
"Composite as Heck"
Oct 2017

2·5²·19 Posts

The ISA guide has been published. It's a nice light read at 600 pages: https://gpuopen.com/rdna3-isa-guide-now-available/
All times are UTC.



Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.
