mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Software

Reply
 
Thread Tools
Old 2019-07-31, 18:02   #34
Evil Genius
 
Evil Genius's Avatar
 
Jul 2019
the Netherlands

2×11 Posts
Default

Machine (11GB total) + Package L#0
NUMANode L#0 (P#0 11GB)
L3 L#0 (16MB)
L2 L#0 (512KB) + L1d L#0 (32KB) + L1i L#0 (32KB) + Core L#0
PU L#0 (P#0)
PU L#1 (P#1)
L2 L#1 (512KB) + L1d L#1 (32KB) + L1i L#1 (32KB) + Core L#1
PU L#2 (P#2)
PU L#3 (P#3)
L2 L#2 (512KB) + L1d L#2 (32KB) + L1i L#2 (32KB) + Core L#2
PU L#4 (P#4)
PU L#5 (P#5)
L2 L#3 (512KB) + L1d L#3 (32KB) + L1i L#3 (32KB) + Core L#3
PU L#6 (P#6)
PU L#7 (P#7)
L3 L#1 (16MB)
L2 L#4 (512KB) + L1d L#4 (32KB) + L1i L#4 (32KB) + Core L#4
PU L#8 (P#8)
PU L#9 (P#9)
L2 L#5 (512KB) + L1d L#5 (32KB) + L1i L#5 (32KB) + Core L#5
PU L#10 (P#10)
PU L#11 (P#11)
L2 L#6 (512KB) + L1d L#6 (32KB) + L1i L#6 (32KB) + Core L#6
PU L#12 (P#12)
PU L#13 (P#13)
L2 L#7 (512KB) + L1d L#7 (32KB) + L1i L#7 (32KB) + Core L#7
PU L#14 (P#14)
PU L#15 (P#15)
Evil Genius is offline   Reply With Quote
Old 2019-07-31, 21:31   #35
ewmayer
2ω=0
 
ewmayer's Avatar
 
Sep 2002
República de California

2×32×647 Posts
Default

Quote:
Originally Posted by Prime95 View Post
Although it will prove nothing if it succeeds, try running hwloc's stand-alone program called lstopo or lstopo-no-graphics from https://www.open-mpi.org/software/hwloc/v2.0/
Is there a way to compare the hwloc-reported topology on Windows with that given by hwloc (or simply in /proc/cpuinfo) on a same-CPU Linux system?

Last fiddled with by ewmayer on 2019-07-31 at 21:31
ewmayer is offline   Reply With Quote
Old 2019-08-02, 19:35   #36
mackerel
 
mackerel's Avatar
 
Feb 2016
UK

6638 Posts
Default

Since I'm tinkering with the boxes now, I just reproduced the problem with Windows 7, which strictly speaking neither MS nor AMD support with Zen 2 CPUs.

The interesting thing is, the first time I tried running 4 cores, 4 workers with a 3600, I got a load of "Error setting affinity to core #xyz. There are 6 cores." messages on screen, before Windows reported an application error. Nothing in log after hwloc. Subsequent runs just went to the application error without those affinity messages, presumably due to something written in config files after 1st run.

Might take some time but I'm going to drop in a 2600 shortly to see if that is also affected. I didn't get any testers on the other forum I posted on.
mackerel is offline   Reply With Quote
Old 2019-08-02, 20:13   #37
mackerel
 
mackerel's Avatar
 
Feb 2016
UK

3×5×29 Posts
Default

2600 temporarily installed. I tried 4 cores, 4 workers, and it crashed just like it did on 3600 and 3700X. Since it was mentioned earlier in the thread, I also tried 2 cores 2 workers, and it also gave a load of affinity errors but completed without crashing. The errors didn't appear in the log, but as it didn't crash I was able to copy and save it in attached text file.

So new information right now is:
It also happens in Windows 7, not just Windows 10.
It also happens with Zen+ CPU, not limited to Zen 2.

I have a crazy idea to try out, back shortly :)

Edit: and the results are in. I went into the bios and disabled half the cores, so it is running in 3+0 configuration. One CCX. Tried a bench with 2 core 2 workers, ran fine, no errors. Similar 3c3w. Is there something about splitting work across CCX that is causing the problem?
Attached Files
File Type: txt p95error2600-2c2w.txt (39.9 KB, 88 views)

Last fiddled with by mackerel on 2019-08-02 at 20:20
mackerel is offline   Reply With Quote
Old 2019-08-02, 22:38   #38
PhilF
 
PhilF's Avatar
 
Feb 2005
Colorado

5·131 Posts
Default

After it crashes without an error message, have you checked the Windows Event Viewer to see the code reported by the application crash?
PhilF is offline   Reply With Quote
Old 2019-08-19, 13:19   #39
hansl
 
hansl's Avatar
 
Apr 2019

20510 Posts
Default

I don't have a Zen 2 to test on, but would the recent fix in 29.8b6 apply to the issues in this thread?
hansl is offline   Reply With Quote
Old 2019-08-19, 20:10   #40
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

1D6E16 Posts
Default

Quote:
Originally Posted by hansl View Post
I don't have a Zen 2 to test on, but would the recent fix in 29.8b6 apply to the issues in this thread?
Yes. The bug was in prime95 running on a CPU with multiple L3 caches. The benchmark code that makes sure a worker's threads are all running in the same L3 cache was flawed.
Prime95 is online now   Reply With Quote
Old 2019-08-20, 19:24   #41
ewmayer
2ω=0
 
ewmayer's Avatar
 
Sep 2002
República de California

2×32×647 Posts
Default

Quote:
Originally Posted by Prime95 View Post
Yes. The bug was in prime95 running on a CPU with multiple L3 caches. The benchmark code that makes sure a worker's threads are all running in the same L3 cache was flawed.
Is this an extension to the core-affinity considerations? I.e. do various cores statically map to a given L3 cache, or is that mapping something the OS can fiddle at runtime?
ewmayer is offline   Reply With Quote
Old 2019-08-20, 20:12   #42
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

2×3,767 Posts
Default

Quote:
Originally Posted by ewmayer View Post
Is this an extension to the core-affinity considerations? I.e. do various cores statically map to a given L3 cache, or is that mapping something the OS can fiddle at runtime?
Maybe the OS is smart enough to group different threads from the same process into the same L3 cache -- or maybe not. Hwloc libraries give you enough control to ensure this happens.
Prime95 is online now   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Prime95 crashes everytime on same core chiro Software 18 2018-04-12 12:41
Memtest86+ shows no errors but computer crashes with Prime95 TObject Hardware 11 2013-05-09 11:43
Prime95 v27.7 crashes x3r Software 26 2012-06-23 01:25
Newb help (it crashes) Proggie Software 4 2005-01-05 07:35
Everything crashes my computer Unregistered Hardware 6 2004-08-09 19:28

All times are UTC. The time now is 18:00.


Sun Aug 1 18:00:47 UTC 2021 up 9 days, 12:29, 0 users, load averages: 2.61, 2.38, 2.17

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.