mersenneforum.org  

2023-03-27, 22:46   #1
cardogab7341
 
Mar 2013
Dallas, TX

37 Posts
Ryzen 7950X Optimization

I recently upgraded my PC from an i7-3820 to a Ryzen 7950X. On the Intel machine, Prime95 ran 4 workers, one on each core. When I installed Prime95 on the new machine, it set itself up with 4 workers on 4 cores, with the rest of the cores as helpers. Is this the optimal configuration? Has the 7950X been characterized to find the most efficient affinity profile? My work type is set to Let GIMPS decide.

Thanks
2023-03-30, 05:48   #2
Jurzal
 
Jan 2023
Riga, Latvia

2²·3·5 Posts

Congrats on your upgrade!

The 7950X is a dual-chiplet CPU with 32 MB of L3 cache per 8 cores, so 2x8 cores with 2x32 MB of cache.
I have a 5900X, which is in a similar situation with 12 cores: 2x6 cores, 2x32 MB of cache. So, optimally, you want 2 workers: one for the first 8 cores and the other for the remaining 8.

Currently my system runs 1 worker on double-check (DC) LL, since that is mostly done within cache. The second worker is set to PRP, since it takes most of the memory bandwidth. You can do P-1 if you want.

Running both workers on either P-1 or PRP is not really effective, since RAM bottlenecks the system and the CPU spends more of its time waiting on memory.
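
As a rough illustration of the one-worker-per-chiplet split described above, here is a minimal Python sketch. It is not a Prime95 setting or API: the function names are hypothetical, and it assumes the cores are numbered contiguously within each chiplet, which depends on how the OS enumerates them.

```python
# Hypothetical helper: split physical cores into per-chiplet groups.
# Assumption: core indices are contiguous within each chiplet.

def split_cores_by_chiplet(total_cores: int, chiplets: int) -> list[list[int]]:
    """Return one list of core indices per chiplet."""
    if chiplets <= 0 or total_cores % chiplets != 0:
        raise ValueError("cores must divide evenly across chiplets")
    per_chiplet = total_cores // chiplets
    return [
        list(range(i * per_chiplet, (i + 1) * per_chiplet))
        for i in range(chiplets)
    ]


def describe_layout(total_cores: int, chiplets: int) -> list[dict]:
    """One worker per chiplet: the first core leads, the rest are helpers."""
    layout = []
    groups = split_cores_by_chiplet(total_cores, chiplets)
    for worker, cores in enumerate(groups, start=1):
        layout.append(
            {"worker": worker, "main_core": cores[0], "helper_cores": cores[1:]}
        )
    return layout


if __name__ == "__main__":
    # 7950X: 16 cores over 2 chiplets; 5900X: 12 cores over 2 chiplets.
    for cores in (16, 12):
        print(f"{cores} cores:")
        for entry in describe_layout(cores, 2):
            print("  ", entry)
```

For 16 cores this gives 2 workers with 7 helpers each, and for 12 cores 2 workers with 5 helpers each. Keeping each worker on a single chiplet means its threads share one 32 MB cache instead of reaching across chiplets.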
2023-03-30, 05:50   #3
Jurzal
 
Jan 2023
Riga, Latvia

60₁₀ Posts

Here is a screenshot of the 5900X at work.
Attached image: p95 5900x.png
2023-03-30, 14:42   #4
cardogab7341
 
Mar 2013
Dallas, TX

37 Posts

Thanks for the tips. I will look into reorganizing work types. Can you post a screenshot showing the ms/iteration for the PRP test?

Thanks.
2023-03-30, 15:24   #5
Jurzal
 
Jan 2023
Riga, Latvia

2²×3×5 Posts

No problem, here you go. At idle, it usually sits at 3.6 - 3.7 ms/iteration; if I have YouTube, Discord or whatever open, then it is around 3.9 - 4.0 for PRP. Yours should be much faster because of DDR5 and faster/more cores. I tried out different setups (LL + LL, PRP + PRP, etc.) and found LL + PRP most effective for my Zen 3 system.

My memory bandwidth typically maxes out at 25-28 GB/s. I am using 4x8 GB of RAM at DDR4-3800 CL16 with tuned subtimings.
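
To put that bandwidth figure in context, the sketch below works out the theoretical peak for this memory setup. It rests on assumptions not stated in the post: that the 25-28 figure is in gigabytes per second, that the memory runs in dual-channel mode with a 64-bit (8-byte) bus per channel, and that "3800" means 3800 mega-transfers per second. The function names are illustrative only.

```python
# Back-of-the-envelope peak memory bandwidth.
# Assumptions: dual channel, 8 bytes per transfer per channel, 3800 MT/s.

def peak_bandwidth_gb_s(
    mega_transfers_per_s: float, channels: int = 2, bus_bytes: int = 8
) -> float:
    """Theoretical peak in GB/s: transfers/s * bytes per transfer * channels."""
    return mega_transfers_per_s * 1e6 * bus_bytes * channels / 1e9


def utilization(observed_gb_s: float, peak_gb_s: float) -> float:
    """Fraction of the theoretical peak that is actually being used."""
    return observed_gb_s / peak_gb_s


if __name__ == "__main__":
    peak = peak_bandwidth_gb_s(3800)
    for observed in (25.0, 28.0):
        share = utilization(observed, peak)
        print(f"{observed:.0f} GB/s is {share:.0%} of the {peak:.1f} GB/s peak")
```

Under those assumptions the peak is about 60.8 GB/s, so 25-28 GB/s would be roughly 41% to 46% of it. Sustained bandwidth in real workloads is normally well below the theoretical peak, so this does not contradict the observation that RAM is the bottleneck.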
Attached image: iters and sensors.png
2023-04-01, 22:32   #6
cardogab7341
 
Mar 2013
Dallas, TX

25₁₆ Posts

I did some quick tweaking, and it looks like the optimal configuration for me for PRP in the 116M range, running a 6272K FFT, is 2 workers and 14 helpers, split evenly between chiplets (1 worker, 7 helpers per chiplet). I get around 3.6 ms/iteration on one and 3.9 ms/iteration on the other.
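
Two quick calculations help interpret these numbers; the Python sketch below is an estimate under stated assumptions, not anything Prime95 reports. It assumes a 6272K FFT holds 6272 × 1024 double-precision (8-byte) values and ignores any extra tables the program keeps, and it assumes a PRP test needs roughly as many iterations as the exponent, using 116,000,000 as a stand-in for "the 116M range". The function names are hypothetical.

```python
# Rough estimates for a 6272K FFT PRP test on a 32 MB-per-chiplet cache.
# Assumptions: 8 bytes per FFT element, iterations ~= exponent.

MIB = 1024 * 1024


def fft_working_set_bytes(fft_k: int, bytes_per_element: int = 8) -> int:
    """Size of the main FFT data: fft_k * 1024 elements of 8 bytes each."""
    return fft_k * 1024 * bytes_per_element


def fits_in_cache(fft_k: int, cache_mib: int) -> bool:
    """True if the main FFT data alone would fit in the given cache size."""
    return fft_working_set_bytes(fft_k) <= cache_mib * MIB


def days_per_test(exponent: int, ms_per_iteration: float) -> float:
    """Approximate wall-clock days for one test at a steady iteration time."""
    seconds = exponent * ms_per_iteration / 1000.0
    return seconds / 86400.0


if __name__ == "__main__":
    size = fft_working_set_bytes(6272)
    print(f"6272K FFT data: {size / MIB:.1f} MiB, "
          f"fits in 32 MiB: {fits_in_cache(6272, 32)}")
    for ms in (3.6, 3.9):
        print(f"{ms} ms/iteration: {days_per_test(116_000_000, ms):.2f} days")
```

The FFT data alone comes to 49 MiB, which is more than one chiplet's 32 MB of cache, which fits the earlier point that PRP at this size leans on memory bandwidth. At 3.6 and 3.9 ms/iteration, one test works out to roughly 4.8 and 5.2 days respectively.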

Thanks for the tips.
All times are UTC.



Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.
