mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware

Reply
 
Thread Tools
Old 2019-07-14, 17:01   #89
maxzor
 
Apr 2017

22×5 Posts
Default

All this human effort trying to figure out what a CPU can do when AI could do it much better...
maxzor is offline   Reply With Quote
Old 2019-07-22, 13:35   #90
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

2·52·19 Posts
Default

Damn these supply issues I want some tasty 3900X benchmarks. It seems like they underestimated 3900X demand as the other new SKUs have the usual supply issues but nowhere near to the same extent. I'm expecting worse for the 3950X when it supposedly gets launched in September, which will be competing with Epyc for the well-binned 8 core chips.


Speaking of the 16 core chips, that's almost certainly going to be an issue with dual channel memory. Are there any recommended ancillary GIMPS/prime programs that run mainly from L2 cache that can soak up the remaining compute available?
M344587487 is offline   Reply With Quote
Old 2019-07-22, 15:56   #91
Mysticial
 
Mysticial's Avatar
 
Sep 2016

38010 Posts
Default

Quote:
Originally Posted by M344587487 View Post
Damn these supply issues I want some tasty 3900X benchmarks. It seems like they underestimated 3900X demand as the other new SKUs have the usual supply issues but nowhere near to the same extent. I'm expecting worse for the 3950X when it supposedly gets launched in September...
And there's an unknown # of people like me who are holding out for the 3950X. It's probably gonna be a mad rush and I'll definitely be pre-ordering. Reminds of the Skylake X launch 2 years ago. Had I not pre-ordered, I wouldn't have gotten the chip for several more weeks.

I'm also keeping an eye on the 32GB DIMM situation. Those also just launched and also can't stay in stock. But the ones that I want (higher speed + RGB) aren't available yet.

Quote:
... which will be competing with Epyc for the well-binned 8 core chips.
I'm also waiting to see if AMD throws a curve ball with AVX512 on the TR or Epyc. While Ryzen doesn't have AVX512, AMD declined to confirm it was absent all of Zen 2.
Mysticial is offline   Reply With Quote
Old 2019-07-22, 16:50   #92
mackerel
 
mackerel's Avatar
 
Feb 2016
UK

26·7 Posts
Default

Quote:
Originally Posted by M344587487 View Post
It seems like they underestimated 3900X demand as the other new SKUs have the usual supply issues but nowhere near to the same extent.
I'd speculate this is a problem born from AMDs marketing position. They want to show a progression of boost clocks up the range, and it gets more difficult to bin enough of those chiplets, while at the same time saving the even better ones for eventual 3950X.

I've heard but not confirmed that the 3900X has one higher binned chiplet than the other. This makes sense as far as boost applies more so at lower active core counts.

Quote:
Are there any recommended ancillary GIMPS/prime programs that run mainly from L2 cache that can soak up the remaining compute available?
At <64k FFT sizes you can look for tiny primes with LLR/PFGW/genefer. I'm wondering if sieves are a suitable task but I'm not sure how these work. They usually do benefit greatly from HT/SMT.

Quote:
Originally Posted by Mysticial View Post
But the ones that I want (higher speed + RGB) aren't available yet.
The lack of RGB is probably not helping in my attempts at setting speed records
mackerel is offline   Reply With Quote
Old 2019-07-22, 17:13   #93
hansl
 
hansl's Avatar
 
Apr 2019

5×41 Posts
Default

Quote:
Originally Posted by Mysticial View Post
I'm also keeping an eye on the 32GB DIMM situation. Those also just launched and also can't stay in stock. But the ones that I want (higher speed + RGB) aren't available yet.
Don't the higher densities mean higher rank, and generally more latency?
hansl is offline   Reply With Quote
Old 2019-07-22, 17:40   #94
Mysticial
 
Mysticial's Avatar
 
Sep 2016

22×5×19 Posts
Default

Quote:
Originally Posted by hansl View Post
Don't the higher densities mean higher rank, and generally more latency?
Correct, but I value capacity more than latency.

For my purposes, the order of importance is:
  1. Total capacity.
  2. Memory frequency. (bandwidth)
  3. Memory timings. (latency)
Mysticial is offline   Reply With Quote
Old 2019-07-22, 19:45   #95
nordi
 
Dec 2016

7F16 Posts
Default

Quote:
Originally Posted by M344587487 View Post
Are there any recommended ancillary GIMPS/prime programs that run mainly from L2 cache that can soak up the remaining compute available?
You can use mprime to do ECM on small exponents. Exponents up to ~3,000,000 should fit into the 512kB of L2 cache (stage 2 of ECM uses some RAM bandwidth, but accounts only for ~30% of execution time). You can get assignments from mersenne.org's manual assignment page.
nordi is offline   Reply With Quote
Old 2019-07-22, 22:15   #96
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

2×52×19 Posts
Default

Quote:
Originally Posted by Mysticial View Post
...
I'm also waiting to see if AMD throws a curve ball with AVX512 on the TR or Epyc. While Ryzen doesn't have AVX512, AMD declined to confirm it was absent all of Zen 2.
That would be a curveball. They all use the same chiplets so the main reason it would be absent from Ryzen 3000 is that as a consumer part they'd want to guarantee stock clocks were high enough to satisfy a general audience. Even if TR and Epyc parts come out that do support AVX512 (maybe even a microcode update for Ryzen enabling it), I wouldn't believe it's a fully fledged implementation until benchmarks confirm it. If anything it's more realistic to expect that even if AVX512 does appear it'll mainly be via doubled AVX2 (like they did supporting AVX2 with doubled AVX on zen) so the gains will be a minimal improvement to instruction density and whatever benefit that may bring. Proper AVX512 smells more like a zen3 thing but I'd love to be proved wrong.

Quote:
Originally Posted by mackerel View Post
I'd speculate this is a problem born from AMDs marketing position. They want to show a progression of boost clocks up the range, and it gets more difficult to bin enough of those chiplets, while at the same time saving the even better ones for eventual 3950X.
Releasing on the 7th of July was a big part of their marketing which may have taken priority over trivial things like supply.

Quote:
Originally Posted by mackerel View Post
I've heard but not confirmed that the 3900X has one higher binned chiplet than the other. This makes sense as far as boost applies more so at lower active core counts.
I don't know if it's intentional, probably not, but it is possible to have a higher binned chip. It's possible that the two CCX's on the same chip clock differently. These chips already self-boost to get the most out of themselves out of the box (acting much more like GPUs do than what we expect of CPUs), but one of the recommended ways I've seen to try and overclock beyond that is to do it on a CCX basis. This video shows a manually overclocked 3900X with a 150 MHz disparity between the highest and lowest CCX (4490, 4441, 4341, 4341, the chiplets do clock quite differently but this is a sample size of 1): https://www.youtube.com/watch?v=M5pHUHGZ7hU&t=45

Thanks for the program suggestions.
M344587487 is offline   Reply With Quote
Old 2019-07-22, 22:27   #97
Mysticial
 
Mysticial's Avatar
 
Sep 2016

22×5×19 Posts
Default

Quote:
Originally Posted by M344587487 View Post
That would be a curveball. They all use the same chiplets so the main reason it would be absent from Ryzen 3000 is that as a consumer part they'd want to guarantee stock clocks were high enough to satisfy a general audience. Even if TR and Epyc parts come out that do support AVX512 (maybe even a microcode update for Ryzen enabling it), I wouldn't believe it's a fully fledged implementation until benchmarks confirm it. If anything it's more realistic to expect that even if AVX512 does appear it'll mainly be via doubled AVX2 (like they did supporting AVX2 with doubled AVX on zen) so the gains will be a minimal improvement to instruction density and whatever benefit that may bring. Proper AVX512 smells more like a zen3 thing but I'd love to be proved wrong.
I don't think anybody is expecting it to have native AVX512. It doesn't need to anyway since there's a lot more to AVX512 than just the 512-bit width.
Mysticial is offline   Reply With Quote
Old 2019-07-23, 00:02   #98
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

2×52×19 Posts
Default

Quote:
Originally Posted by Mysticial View Post
I don't think anybody is expecting it to have native AVX512. It doesn't need to anyway since there's a lot more to AVX512 than just the 512-bit width.
I didn't realise, it appears that AVX512 has a load of optional extras. If the wiki page is to be believed the Ice Lake chips could be interesting not just for the fabled 10nm. Does anything jump out as particularly useful?
M344587487 is offline   Reply With Quote
Old 2019-07-23, 00:35   #99
Mysticial
 
Mysticial's Avatar
 
Sep 2016

22×5×19 Posts
Default

Quote:
Originally Posted by M344587487 View Post
I didn't realise, it appears that AVX512 has a load of optional extras. If the wiki page is to be believed the Ice Lake chips could be interesting not just for the fabled 10nm. Does anything jump out as particularly useful?
Off the top of my head, the non-width related things that I've already found useful are:
  • 32 registers
  • 64-bit arithmetic right shift.
  • 64-bit unsigned comparison.
  • double <-> int64 conversion (AVX512-DQ)
  • Built-in masking.
  • Fault-suppressing masked load/store.
  • 2-vector permute.
  • 16-bit granular permute. (AVX512-BW)
  • 8-bit granular permute. (AVX512-VBMI)
  • vpmullq (AVX512-DQ)
  • valignd/q
  • vpternlog
  • vrndscale
  • Integer FMA (AVX512-IFMA)
  • Scatter
  • 64-bit min/max
  • 14-bit reciprocal
  • Bitwise rotate.
  • Embedded rounding for FP.
I haven't really tried to use the Sunny Cove/Ice Lake stuff yet. But the things that stand out and may be useful for my purposes are:
  • Double vector shifts. (AVX512-VBMI2)
  • Galois Field. (AVX512-GFNI)

Last fiddled with by Mysticial on 2019-07-23 at 00:45
Mysticial is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
RX470 and RX460 announced VictordeHolland GPU Computing 0 2016-07-30 13:05
Intel Xeon D announced VictordeHolland Hardware 7 2015-03-11 23:26
Factoring details mturpin Information & Answers 4 2013-02-08 02:43
Euler (6,2,5) details. Death Math 10 2011-08-03 13:49
Larrabee instruction set announced fivemack Hardware 0 2009-03-25 12:09

All times are UTC. The time now is 16:37.


Fri Jul 7 16:37:54 UTC 2023 up 323 days, 14:06, 1 user, load averages: 3.44, 2.72, 2.22

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔