![]() |
|
|
#1 |
|
Mar 2021
118 Posts |
Hi All
I’m trying to assemble dedicated hardware for LL testing. Currently I have a few I7-9700K (8 cores @ 3.6GHz) processors with a single 4GB ram stick but performance is not anywhere near what I expected. Testing a 103M exponent with one worker on all 8 cores takes around 30 days. (25ms / iteration) In comparison I have a laptop doing LL testing with a E3-1575M (4 cores @ 3GHz) processor and 4 x 16GB memory and that takes 8-9 days (6-7ms / iteration). Since the I7-9700K should be faster I was excepting it to take less time. When I look at the memory usage the entire system is using less than 500MB so 4GB memory should be enough. I’m wondering if there is a bottleneck on the memory bandwidth? Would it help if I installed 2 or even 4 ram sticks to give more bandwidth? I have been trying to find a way to see how busy the memory is with no luck. I’m using Fedora with Linux64,Prime95,v30.3,build 6. Any points would be highly apricated. Also if someone is having a recipe for a 2021 bang for bug hardware list I would very much like to hear about it. Thanks. |
|
|
|
|
|
#2 | |
|
Jun 2003
23×683 Posts |
Quote:
Either 2x16GB or 4x8GB of the fastest RAM that you can get will give you the most bandwidth. Performance will scale (near) linearly with RAM bandwidth. Last fiddled with by axn on 2021-03-15 at 07:59 |
|
|
|
|
|
|
#3 | |
|
Mar 2021
32 Posts |
Quote:
Thanks a lot for fast reply. I will order 4 of the fastest supported RAM i can find and test with 2 and 4 modules. Is there any way i can check the system to see if i'm maxing out the memory bandwidth? |
|
|
|
|
|
|
#4 | |
|
Jun 2003
23×683 Posts |
Quote:
I don't know how to monitor the memory bandwidth usage directly, sorry. |
|
|
|
|
|
|
#5 | |
|
Mar 2021
32 Posts |
Quote:
Thanks for the clarification. I have just tested with a single core and I’m getting the same speed as when I’m running on all 8 cores. For reference the current setup is as follows: Motherboard: TUF H370-PRO GAMING CPU: I7-9700K RAM: 1 x Kingston ValueRAM - DDR4 - 4 GB - DIMM 288-PIN - 2400 MHz / PC4-19200 - CL17 - 1.2 V PSU: 1650W (Totally overkill) I have now ordered: 2 x HyperX Predator - DDR4 8 GB - DIMM 288-PIN - 2666 MHz / PC4-21300 - CL13 - 1.35 V 2 x HyperX FURY - DDR4 4GB - DIMM 288-PIN - 2666 MHz / PC4-21300 - CL16 - 1.2 V When the RAM is delivered and I’m getting some time to play around with it I will post results here. If anyone else have any inputs please do not hesitate to post :) |
|
|
|
|
|
|
#6 |
|
Feb 2016
UK
7008 Posts |
Faster ram always helps in this scenario, but with a H370 mobo I believe you're limited to whatever the official speed is supported by the CPU. It would have cost more, but perform better to get a Z370/Z390 mobo and faster ram.
If you keep the current mobo, then the best you can do is to put in 4 modules. Capacity doesn't matter. This is in two parts: firstly you get dual channel which already doubles what you had. Secondly, you get more than one rank per channel. This helps you get more effective usage of the bandwidth of dual channels. 4x4gb might be the most economic if you can still find modules that small. You can try testing with the mismatched pairs already ordered. 4GB modules probably were always single rank. 8GB modules way back when it was still relatively new in 2015-ish might have been dual rank, but they've been single rank for a long time. 16GB modules were all dual rank, but I understand newer ones coming out now are single rank. So generally speaking if you don't need capacity, the cheapest way to rank up is to use 4 modules on a dual channel system. Even on a quad core 6700k dual channel ram is wholly inadequate. It was a while ago, but from memory going 2xSR to 4xSR or equivalently 2xDR at 3000 speed gave around 20-25% speedup. CL rating doesn't seem to make much difference in my testing so I wouldn't pay extra for it. |
|
|
|
|
|
#7 | |
|
Mar 2021
32 Posts |
Quote:
Thanks. That was very helpful. I have now ordered the following: Gigabyte Z390 M Micro-ATX LGA1151 Intel Z390 4 x CORSAIR Vengeance DDR4 8GB 3600MHz CL18 Will update once i get everything tested. :) |
|
|
|
|
|
|
#8 |
|
6809 > 6502
"""""""""""""""""""
Aug 2003
101×103 Posts
22×2,767 Posts |
Please don't do LL tests, unless they are to double check. PRP is know the preferred test time for first time tests. It has superior error checking built into it. (Making errors that pas through very few and far between.) Also, using the latest version of either Prime95 or GpuOwL will produce a file that will allow the run to be verified quickly on another nachine. This will save 95% of the effort that a traditional Double Check would take,
Also, consider adding a GPU. You will get more throughput from a good GPU than the CPU. |
|
|
|
|
|
#9 | |
|
Mar 2021
916 Posts |
Quote:
Thanks for your advice. I have briefly looked into PRP and i need to look more into it but i will definitely use PRP going forward. In regards to GPU i have picked up 10 used Nvidia Tesla K80 i'm hoping to get up and run soon. Just waiting for some power adapters to arrive. Long delivery time. From what i could see from benchmark reports the K80 and GPU's in general is that they are way faster to be used for factor checking compare to LL testing. Maybe this is not the case with PRP? For that reasons my plan was to get some fast CPU's to do LL/PRP testing and use the GPU's for factor checking. Please correct me if i am mistaken :) |
|
|
|
|
|
|
#10 |
|
6809 > 6502
"""""""""""""""""""
Aug 2003
101×103 Posts
22·2,767 Posts |
There are some GPU's that are better than others at PRP. This chart can be an approximate guide: https://www.mersenne.ca/cudalucas.php
This one covers factoring: https://www.mersenne.ca/mfaktc.php You can punch in different cards and compare them. PRP and LL compare well for speed comparisons. |
|
|
|
|
|
#11 | |
|
Aug 2020
2×3×29 Posts |
Quote:
Gigabyte B365M DS3H, I7-9700K, four sticks of DDR-2666 4GB ram. The motherboard limits the ram to 2666. Attached is a picture of my testing data for the CPU. It might be useful to you. You can save a lot of energy by decreasing the CPU clock without decreasing mprime throughput. |
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| The prime-crunching on dedicated hardware FAQ (II) | jasonp | Hardware | 46 | 2016-07-18 16:41 |
| Xbox for TF (aka not higher performance hardware) | spaz | Hardware | 4 | 2009-12-14 17:11 |
| The prime-crunching on dedicated hardware FAQ | jasonp | Hardware | 142 | 2009-11-15 23:20 |
| Optimal Hardware for Dedicated Crunching Computer | Angular | Hardware | 5 | 2004-01-16 12:37 |
| Athlon: poor performance need help | phalanx | Software | 4 | 2003-12-12 07:00 |