mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Software

Reply
 
Thread Tools
Old 2022-01-19, 20:11   #177
WraithX
 
WraithX's Avatar
 
Mar 2006

3×173 Posts
Default

Hello Prime95, I just ran into a small issue when compiling 30.7b9 with MingW64 in MSYS. I just needed to make two small changes to the makemw64 file to correct these issues and get the gwnum.a library to build.

First, on line 29 I needed to add a line continuation character \.
Second, I needed to add a target to build radix.o.

Third, I needed to add "#include <sys/time.h>" to line 68 of gwutil.h to get rid of a warning during compile:
Code:
gcc -I.. -I../sqlite-amalgamation-3180000 -DX86_64 -DWINDOWS64 -O2 -o mw64/giants.o -c giants.c
In file included from giants.c:23:
gwutil.h:68:26: warning: 'struct timeval' declared inside parameter list will not be visible outside of this definition or declaration
 int gettimeofday (struct timeval *tp, void *tzp);
                          ^~~~~~~
gcc -I.. -I../sqlite-amalgamation-3180000 -DX86_64 -DWINDOWS64 -O2 -o mw64/radix.o -c radix.c
In file included from radix.c:17:
gwutil.h:68:26: warning: 'struct timeval' declared inside parameter list will not be visible outside of this definition or declaration
 int gettimeofday (struct timeval *tp, void *tzp);
                          ^~~~~~~
I have attached the updated makemw64.
Attached Files
File Type: txt makemw64.txt (3.3 KB, 41 views)
WraithX is offline   Reply With Quote
Old 2022-01-19, 23:17   #178
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

41×193 Posts
Default

Quote:
Originally Posted by WraithX View Post
Third, I needed to add "#include <sys/time.h>" to line 68 of gwutil.h to get rid of a warning during compile
Which, of course, breaks the MSVC build. Is there a MINGW ifdef I can test?
Prime95 is offline   Reply With Quote
Old 2022-01-20, 01:09   #179
WraithX
 
WraithX's Avatar
 
Mar 2006

3·173 Posts
Default

Quote:
Originally Posted by Prime95 View Post
Which, of course, breaks the MSVC build. Is there a MINGW ifdef I can test?
Ah, sorry. Good catch! Yes, there are two you can use:
Code:
#define __MINGW32__ 1
#define __MINGW64__ 1
I'd recommend using the mingw64 target. If someone needs the mingw32 target, we can check at that time how to resolve any issues they have.
WraithX is offline   Reply With Quote
Old 2022-02-06, 13:41   #180
S485122
 
S485122's Avatar
 
"Jacob"
Sep 2006
Brussels, Belgium

1,823 Posts
Default

This is not relevant any more since there is no more GIMPS work with that FFT length, but it seems that 2940 KiB FFT's are slower than they should be.
That FFT size seems to be AVX512 specific, but shouldn't it be 2944 KiB instead ? (An odd multiple of 4096 while the "neighbouring" FFTs are multiples of 65536...)
Just curious.
S485122 is offline   Reply With Quote
Old 2022-02-06, 14:12   #181
axn
 
axn's Avatar
 
Jun 2003

19×283 Posts
Default

2944 = 2^7 * 23. Ignore the powers of 2. AFAIK, George hasn't implemented a 23 FFT.
2940 = 2^2 * 3 * 5 * 7^2, which looks better. Still... a multiple of 735 is unusual. Probably will be slower than expected. 3072 ought to be faster.

2916 & 2880 seems to be more "normal" smaller sizes.
axn is offline   Reply With Quote
Old 2022-02-06, 15:56   #182
S485122
 
S485122's Avatar
 
"Jacob"
Sep 2006
Brussels, Belgium

1,823 Posts
Default

Quote:
Originally Posted by axn View Post
2944 = 2^7 * 23. Ignore the powers of 2. AFAIK, George hasn't implemented a 23 FFT.
...
In other words I just showed my ignorance of how the FFT innards work. :-(
S485122 is offline   Reply With Quote
Old 2022-02-06, 15:58   #183
Zhangrc
 
"University student"
May 2021
Beijing, China

32·29 Posts
Default

Quote:
Originally Posted by axn View Post
2944 = 2^7 * 23. Ignore the powers of 2. AFAIK, George hasn't implemented a 23 FFT.
2940 = 2^2 * 3 * 5 * 7^2, which looks better. Still... a multiple of 735 is unusual. Probably will be slower than expected. 3072 ought to be faster.

2916 & 2880 seems to be more "normal" smaller sizes.
I wonder if there's a 5888k FFT (which handles 110M exponents).
(5888=2^8*23, which is an auspicious number in Chinese, because it sounds roghly like "I'm rich rich rich")
Zhangrc is offline   Reply With Quote
Old 2022-02-18, 12:25   #184
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

5·1,319 Posts
Default prime95 or Win10 or hwloc confused by dimms added to xeon phi system

Xeon Phi 7250 is a 68-core & x4 HT CPU with 8 channels MCDRAM (totaling 16GiB). A 7210 is similar but 64-core.
The motherboard Supermicro K1SPE has 6 DIMM slots. With all DIMM slots empty, prime95 shows activity of one thread per physical core in Task Manager, where the hyperthreads of a specific core appear in succession. Windows 10 Task Manager displays the correct numbers of physical and logical cores, correct cache amounts, etc.

When DIMMs are added (observed with 1 32 or 64GiB in slot A, 3 32GiB in A-C, 6 32GiB in A-F), Task Manager shows 234 physical cores, 272 logical for the 7250, or 208 physical, 256 logical for the 7210, and odd figures for cache amounts. Prime95 shows core numbers higher than are physically present per Intel spec.
With any DIMMs installed, Task Manager shows a different pattern of logical core use that appears to be using multiple hyperthreads in prime95 when intending not to. Iteration times are noticeably longer than before the DIMMs were added.

In Ubuntu 18.04 atop WSL1 on Win10, after the DIMMs are added, stream.c shows ~30% higher memory bandwidth.
Attached Thumbnails
Click image for larger version

Name:	build1909.png
Views:	39
Size:	80.4 KB
ID:	26556   Click image for larger version

Name:	after 64GB DIMM added taskmgr cpu tab seems confused.png
Views:	39
Size:	35.7 KB
ID:	26557   Click image for larger version

Name:	7210 win10 confused on caches with 32GB DIMM installed.png
Views:	35
Size:	43.2 KB
ID:	26558   Click image for larger version

Name:	hydra win10 prime95 shows too high core numbers odd logical core use pattern.png
Views:	38
Size:	123.7 KB
ID:	26559   Click image for larger version

Name:	7210 mcdram only prime95 physical core use pattern.png
Views:	43
Size:	136.1 KB
ID:	26560  


Last fiddled with by kriesel on 2022-02-18 at 12:27
kriesel is offline   Reply With Quote
Old 2022-02-18, 15:25   #185
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

41×193 Posts
Default

Run a prime95 benchmark (abort it). Results.bench.txt will contain hwloc's description of your machine.
Prime95 is offline   Reply With Quote
Old 2022-02-19, 00:36   #186
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

5×1,319 Posts
Default

hwloc's portion looks ok to me.
Attached Files
File Type: zip xeonphi7250-with-dimms-prime95-results.bench.zip (2.9 KB, 25 views)
kriesel is offline   Reply With Quote
Old 2022-02-20, 17:21   #187
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

147038 Posts
Default Very disparate ETAs on Xeon Phi 7210

Hi,

On Win10 pro, Xeon Phi 7210, prime95 v30.7b9, PrimeNet connected, manual assignments, I'm seeing very disparate ETAs between status output and worker windows, that leads to months discrepancies for exponents currently in progress, and YEARS difference in ETAs for following work (which should all complete within ~1 year).
Also the CPU clock rate ("Speed") is misrepresented on my Computer Properties page, and I have not found a way to fix that. Should be 1.3 GHz, is 0.346 GHz. (3.76:1 ratio)

W1 332897017 ~39. d vs. ~204. d ~5.23 ratio;
W2 344587487 ~18.27 d vs 126. d ~6.90 ratio.

Help!
Attached Thumbnails
Click image for larger version

Name:	xeon phi 7210 prime95 30.7b9 very disparate etas and runtime estimates.png
Views:	37
Size:	79.2 KB
ID:	26573  

Last fiddled with by kriesel on 2022-02-20 at 17:22
kriesel is offline   Reply With Quote
Reply

Thread Tools


All times are UTC. The time now is 22:00.


Tue Jul 5 22:00:57 UTC 2022 up 82 days, 20:02, 0 users, load averages: 0.79, 1.24, 1.32

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2022, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔