mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2017-12-07, 23:56   #89
diep
 
diep's Avatar
 
Sep 2006
The Netherlands

11001001112 Posts
Default

Ok i'm digging in older articles.

https://www.anandtech.com/show/4455/...-for-compute/4

Here is see first time in my life picture with 16 ALU's for each SIMD as description for older GCN.
diep is offline   Reply With Quote
Old 2017-12-08, 00:02   #90
diep
 
diep's Avatar
 
Sep 2006
The Netherlands

3·269 Posts
Default

Ok if i understand it correctly AMD moved to 16 alu's in each SIMD, which is pathetic little, to sail around an old problem they had, that's that they can't execute different kernels (wavefronts) at the same time.

By having now 4 different SIMDs in each CU, it allows within the same CU different wavefronts to execute. So instructions that take longer than 1 clock throughput then also can get handled meanwhile the other execution units still can execute simple instructions then, instead of needing to wait long time.

edit: Ok so i need to dig more into how many wavefronts can most ideally get executed simultaneously (not the maximum as that's clear) to still get good IPC. Blindfolded i would gamble at 8 for GCN.

Last fiddled with by diep on 2017-12-08 at 00:05
diep is offline   Reply With Quote
Old 2017-12-08, 00:13   #91
diep
 
diep's Avatar
 
Sep 2006
The Netherlands

3·269 Posts
Default

Ok that opens possibilities! More than 4 wavefronts @ 64 streamcores doesn't make sense it seems if i estimate it here.

Means 64KB / 4 = 16KB LDS available.

For the internal iterations of the FFT that's more than sufficient.
That's 2K doubles or 2^11 is 11 iterations.
diep is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Cost to compute prime Unregistered Information & Answers 30 2013-12-18 03:34
Doubled compute power for a day? Christenson PrimeNet 19 2011-10-26 08:29
New Compute Box Christenson Hardware 0 2011-01-15 04:44
Compute billions of digits of Pi using GMP M0CZY Miscellaneous Math 5 2010-10-14 09:40
My throughput does not compute... petrw1 Hardware 9 2007-08-13 14:38

All times are UTC. The time now is 15:01.


Fri Jul 7 15:01:30 UTC 2023 up 323 days, 12:30, 0 users, load averages: 1.25, 1.20, 1.15

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔