![]() |
|
|
#342 | |
|
Dec 2011
New York, U.S.A.
11000012 Posts |
Quote:
For that matter, AVX is also AVX-256. You're getting AVX-256 and AVX-512 confused. SSE is 128 bit registers. AVX is 256 bit registers. AVX-512 is 512 bit registers. |
|
|
|
|
|
|
#343 |
|
"/X\(‘-‘)/X\"
Jan 2013
55648 Posts |
|
|
|
|
|
|
#344 |
|
"/X\(‘-‘)/X\"
Jan 2013
B7416 Posts |
|
|
|
|
|
|
#345 | |
|
Dec 2011
New York, U.S.A.
97 Posts |
Quote:
What you're calling AVX-256 is plain old original AVX, which has been supported by AMD for as long as Intel has supported it. AMD's implementation was crippled, however, so it hasn't been used here until Zen 2. With Zen 2, it's useful, finally -- but it's always been there. Zen2 supports FMA3. And it supports AVX. And, finally, they're as good as Intel's implementation. They don't support AVX-512, but that's a whole different discussion. Edit: Please see the Wikipedia page for the Piledriver architecture: https://en.wikipedia.org/wiki/Piledr...oarchitecture) . It clearly states that Piledriver supports AVX and FMA3. Last fiddled with by AG5BPilot on 2019-08-20 at 19:31 |
|
|
|
|
|
|
#346 |
|
Bemusing Prompter
"Danny"
Dec 2002
California
2·5·239 Posts |
George, could you please have a look at this issue?
https://mersenneforum.org/showpost.p...&postcount=296 On macOS, Prime95 doesn't let me set one core per worker unless I edit the configuration files. This happens even for non-100 million digit work types. Last fiddled with by ixfd64 on 2019-08-20 at 20:57 |
|
|
|
|
|
#347 | |
|
Jul 2019
the Netherlands
2×11 Posts |
Quote:
Sigh. No I'm not confused. AVX also has an 128-bit compatibility mode. Which is used on current Zen models. Of which the older don't have 256-bit logic, so they used two consecutive 128-bit operations. The newer Zen models can execute AVX-256 bit code without penalty. |
|
|
|
|
|
|
#348 |
|
Jul 2019
the Netherlands
2×11 Posts |
I should elaborate to prevent confusion:
vxorpd %xmm0,%xmm0,%xmm0 -> AVX-128 bit vxorpd %ymm0,%ymm0,%ymm0 -> AVX-256 bit See the difference? There are also two FMA3s: a 128-bit one, and a 256-bit one. The '3' only implies the number of arguments: vfmaddpd213 %xmm2,%xmm1,%xmm1 -> FMA3-128 bit vfmaddpd213 %ymm2,%ymm1,%ymm1 -> FMA3-256 bit Last fiddled with by Evil Genius on 2019-08-20 at 21:23 |
|
|
|
|
|
#349 | |
|
Dec 2011
New York, U.S.A.
97 Posts |
Quote:
The 128-bit xmm# registers are SSE registers (also usable by AVX instructions). Are you trying to say that Piledriver lacked the 16 256 bit ymm# registers? |
|
|
|
|
|
|
#350 | |
|
P90 years forever!
Aug 2002
Yeehaw, FL
2×53×71 Posts |
Quote:
What do I need to do differently? |
|
|
|
|
|
|
#351 | |
|
Bemusing Prompter
"Danny"
Dec 2002
California
95616 Posts |
Quote:
|
|
|
|
|
|
|
#352 | |
|
Jul 2019
the Netherlands
101102 Posts |
Quote:
No, I'm saying that although there's a 256-bit implementation on the outside, on the inside many AVX compatible processors implemented 256-bit operations as 2 consecutive 128-bit operations. Zen 1(+) was no exception, but this changed with Zen 2. Also of note is that if there's no native 256-bit implementation, the 128-bit implementation is faster. Last fiddled with by Evil Genius on 2019-08-20 at 21:49 |
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Prime95 version 29.2 | Prime95 | Software | 71 | 2017-09-16 16:55 |
| Prime95 version 29.1 | Prime95 | Software | 95 | 2017-08-22 22:46 |
| Prime95 version 26.5 | Prime95 | Software | 175 | 2011-04-04 22:35 |
| Prime95 version 25.9 | Prime95 | Software | 143 | 2010-01-05 22:53 |
| Prime95 version 25.8 | Prime95 | Software | 159 | 2009-09-21 16:30 |