![]() |
|
|
#166 |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
10100111100102 Posts |
prime95 on Windows 10 gave mostly 2 cores/worker optimal, until reaching large exponents (~300M). But latency can be long compared to assignment expiration. See second attachment of https://www.mersenneforum.org/showpo...4&postcount=11 for prime95/win10 exhaustive benchmark, for comparison to any ubuntu/mprime results to follow.
Last fiddled with by kriesel on 2021-03-05 at 20:36 |
|
|
|
|
|
#167 | |
|
∂2ω=0
Sep 2002
República de California
103×113 Posts |
Quote:
My own rule of thumb for manycore-system runs is to go with whichever combo has the fewest #workers and yields at least 95% of the maximum total throughput, to keep completion times reasonable. |
|
|
|
|
|
|
#168 | |
|
∂2ω=0
Sep 2002
República de California
101101011101112 Posts |
Of possible interest to our UK readers, a cheap refurb 68C/272T 7250 KNL.
Priced some RAM modules, going to hold off for now, but do occasional price-checks, posting links here for reference. From the SuperMicro mobo manual: Quote:
Code:
DDR4 1.2V-2133: GB-Per-DIMM Recommended Modules 8: Samsung M393A1G40DB0-CPB $50 per dimm Hynix HMA41GR7AFR8N $44 on Amzn 16: Hynix HMA42GR7AFR4N-TF $56 on Amzn 32: Samsung M393A4K40BB0-CPB $139 on Amzn Hynix HMA84GR7MFR4N-TF $139 on Amzn DDR4 1.2V-2400: GB-Per-DIMM Recommended Modules 8: Samsung M393A1G40EB1-CRC $50 per dimm Hynix HMA41GR7AFR4N-UH $40 on Amzn 16: Samsung M393A2K40BB1-CRC $73 on Amzn Samsung M393A2K40CB1-CRC $80 on Amzn Hynix HMA42GR7AFR4N-UH $65 on Amzn Hynix HMA82GR7AFR8N-UH $85 on Amzn, but as low as $58 used [MicroMark had best price] 32: Samsung M393A4K40BB1-CRC $157 on Amzn Samsung M393A4K40CB1-CRC $172 on Amzn; this was the 1 module where the Sponsored-Ad top-placer in my DDG searches, Nemix.com, had the best price, $150 Hynix HMA84GR7AFR4N-UH $157 on Amzn 64: Samsung M386A8K40BM1-CRC4Q $297 on Amzn Micron MTA72ASS8G72LZ-2G3B2 $298 on eBay, $350 on Amzn Last fiddled with by ewmayer on 2021-05-22 at 00:36 |
|
|
|
|
|
|
#169 |
|
6809 > 6502
"""""""""""""""""""
Aug 2003
101×103 Posts
230708 Posts |
That sold days ago.
Last fiddled with by ewmayer on 2021-05-02 at 03:06 Reason: Still sets a price level... |
|
|
|
|
|
#170 | |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
536210 Posts |
Quote:
It might be better to go with 2133 RAM. From https://www.mersenneforum.org/showpo...72&postcount=8 (bold emphasis mine): Code:
I found the following in the specification update: "KNL25.Operating With DDR4-2400 Memory May Cause Unpredictable System Behavior Problem:Operating the processor with DDR4 memory configured to operate at 2400 MT/s may cause unpredictable system behavior. Implication:When the erratum occurs, the system will exhibit unpredictable system behavior. Workaround:It is possible for BIOS to contain a workaround for this erratum. Status:No Fix" Last fiddled with by kriesel on 2021-05-07 at 11:06 |
|
|
|
|
|
|
#171 |
|
Sep 2002
Database er0rr
3,739 Posts |
Scary number of pads on the "chip" -- would have to be carefully accurate putting into the socket.
Last fiddled with by paulunderwood on 2021-05-07 at 10:59 |
|
|
|
|
|
#172 |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
536210 Posts |
From https://www.supermicro.com/manuals/s.../MNL-1891.pdf:
"These units are sold as complete systems with the motherboard, processor, and heat sink assembly factory installed. These components are not user serviceable, and the user should not attempt to remove or modify these components other than adjustments specifically outlined in this manual. Making other adjustments to these components risks damaging the system and voiding the system warranty." There's a liquid cooling loop block mounted atop the package by two alignment pins, 4 Torx fasteners, and distinct tightening and loosening orders given on the block label. Like torquing down a multicylinder engine head. I've decided not to mess with mine. Last fiddled with by kriesel on 2021-05-07 at 15:04 |
|
|
|
|
|
#173 |
|
∂2ω=0
Sep 2002
República de California
101101011101112 Posts |
I updated the current-prices table in my post above to add the officially supported 2133 DIMMs - as with 2400, the current sweet spot is at 16GB, one could buy 6 of those and thus max out the memory bandwidth to the resulting 96GB for around $360.
|
|
|
|
|
|
#174 |
|
∂2ω=0
Sep 2002
República de California
1163910 Posts |
Ordered 192GB = 6 x 32GB-per-dimm of RAM for my KNL last night - the 2400MHz actually proved slightly cheaper (I walked both 2133 and 2400 through to checkout in separate-site shopping baskets), as the seller didn't charge me CA sales tax, saving around $80. Guess I'll find out if the "Unpredictable System Behavior" issue Kriesel mentioned is rare or not.
I ended up going with 192GB as opposed to the slightly-cheaper-per-GB 96GB option, because I need to do shakedown testing of the Mlucas v20 p-1 code, and 192GB should allow me to run 17 simultaneous p-1 jobs, each on 4 distinct cores, with a maxalloc setting of 12GB per job, which will yield 60-70% of stage 2 primes paired. (The min-RAM option, which needs ~1.5GB RAM per p-1 instance for current-wavefront exponents, yields ~25% primes-paired.) Or, assuming a random mix of progress among the 17 jobs and each spending roughly the same amount of time in stages 1 and 2, I might be able to use maxalloc of around 20GB per job, which would cut an additional 10% off the stage 2 runtime. We shall see. |
|
|
|
|
|
#175 |
|
Sep 2006
The Netherlands
36 Posts |
hello i see kriesel benchmarked woltmann code on it. How fast does it work for say much smaller prime testing if we'd try to use it for riesel testing? Which is what is it some millions of digits max.
How fast is it versus LLR with latest woltman inside on a Hashwell or broadwell? On ebay i see the 2699ES getting offered. Can be put in a 2 socket system. Will be 2.1Ghz clocked so that's: 2 cpu's * 2.1Ghz * 22 cores * 32 flops (hashwell/broadwell) = 2956.8 gflops = 3 Tflops and of course each cpu has its own memory channels to it. Roughly 1500 euro to asemble system for so i counted for motherboard+cpu's+ram A 7290 system times 2 cpu's would be more expensive than this and intel will not like to support it anymore and i do not know about development tools for these cpu's. Usually the intel c++ compiler and such is paid when i toy at home as i have a company (which is not for profit busy in prime numbers - commerciel for robots/drones and especially 3d printers - as all this here is my hobby obviously). So what matters is how fast Woltman is with his library at the knights landing. Any benchmarks done there for non-mersenne primes? |
|
|
|
|
|
#176 |
|
Sep 2002
Database er0rr
3,739 Posts |
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| AMD vs Intel | dtripp | Software | 3 | 2013-02-19 20:20 |
| Intel NUC | nucleon | Hardware | 2 | 2012-05-10 23:53 |
| Intel RNG API? | R.D. Silverman | Programming | 19 | 2011-09-17 01:43 |
| AMD or Intel | mack | Information & Answers | 7 | 2009-09-13 01:48 |
| Intel Mac? | penguain | NFSNET Discussion | 0 | 2006-06-12 01:31 |