![]() |
|
|
#1 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
1D7716 Posts |
Can someone with a Skylake X CPU please run a quick test. You'll need linux and gcc installed.
See https://www.mersenneforum.org/showpo...0&postcount=29 |
|
|
|
|
|
#2 |
|
Sep 2003
5·11·47 Posts |
For Ernst's main.c from the linked post, I compiled with gcc -O3 -march=skylake-avx512 main.c using gcc --version = gcc (GCC) 7.3.1 20180303 (Red Hat 7.3.1-5)
Running a.out gives no output. This is on an AWS machine, Intel(R) Xeon(R) Platinum 8124M CPU @ 3.00GHz I don't know if "Skylake X" is different from some other kind of Skylake??? |
|
|
|
|
|
#3 | |
|
Einyen
Dec 2003
Denmark
2·1,579 Posts |
You need to run:
time ./a.out I did on a c5d.large in the other thread: Quote:
Let us know if you need testing on more cores? |
|
|
|
|
|
|
#4 |
|
Sep 2003
50318 Posts |
On a one-core (2 vCPU) c5d.large on AWS I got:
Code:
real 0m0.765s user 0m0.765s sys 0m0.000s Code:
real 0m0.707s user 0m0.707s sys 0m0.000s Code:
real 0m0.961s user 0m0.957s sys 0m0.004s There is a discrepancy with ATH's timing on the identical c5d.large instance. I did pause mprime while I ran the test. Perhaps different compiler versions, or different kernels with Spectre fixes that slow down execution? I am using the latest version of Amazon Linux 2. Perhaps the likeliest explanation is simply the shared tenancy with other AWS users on the same physical hardware. I have some instances running mprime about 10% faster than others on the same instance type, it's the luck of the draw. Last fiddled with by GP2 on 2018-12-01 at 07:07 |
|
|
|
|
|
#5 | |
|
∂2ω=0
Sep 2002
República de California
101101011111112 Posts |
Quote:
Presumably George is hoping to find some hw/sw configuration which fails the test due to the same issue he encountered. |
|
|
|
|
|
|
#6 |
|
(loop (#_fork))
Feb 2006
Cambridge, England
23·11·73 Posts |
model name : Intel(R) Core(TM) i9-7940X CPU @ 3.10GHz
pineapple@pineapple:/tmp$ time ./a.out real 0m0.755s user 0m0.755s sys 0m0.000s model name : Intel(R) Xeon(R) Silver 4114 CPU @ 2.20GHz oak@oak:~$ time ./mayer-test real 0m0.879s user 0m0.876s sys 0m0.004s |
|
|
|
|
|
#7 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
19×397 Posts |
|
|
|
|
|
|
#8 | ||
|
∂2ω=0
Sep 2002
República de California
19·613 Posts |
Quote:
Quote:
EVEX.256.66.0F38.WIG 32 /r You have more experience with this low-level opcode-parsing than I - can you discern from the numbers you posted whether your assembler is producing the low-4-bytes-of-input-xmm-to-low-4-doubles-of-output-ymm half-width version of the instruction? |
||
|
|
|
|
|
#9 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
19·397 Posts |
Actually, we have the same level of experience -- none. I have not tried to figure out what "EVEX.256.66.0F38.WIG 32" means. I got the "62h, 62h, 7dh, 48h, 32h, 0f8h" bytes frome the defuse.ca web site, which uses the GNU tool chain.
|
|
|
|
|
|
#10 |
|
∂2ω=0
Sep 2002
República de California
19×613 Posts |
What input format does that tool need? I have .s and .o files from compilation of the little test code I posted, perhaps seeing what the assembly for that code's vpmovzxbq instruction can be compared versus yours above. Here are gzipped .s and .o from my code, in case that might be of help:
|
|
|
|
|
|
#11 |
|
Jan 2008
France
10001001102 Posts |
I tried with two different disassemblers, the one from GNU and the one that comes with LLVM. I don't know if it's of any help...
Code:
$ llvm-objdump-3.8 -d main.o | grep vpmov
337: 62 62 7d 48 32 f0 vpmovzxbq %xmm0, %zmm30
33d: 62 62 7d 48 32 f9 vpmovzxbq %xmm1, %zmm31
$ objdump -d main.o | grep vpmov
337: 62 62 7d 48 32 f0 vpmovzxbq %xmm0,%zmm30
33d: 62 62 7d 48 32 f9 vpmovzxbq %xmm1,%zmm31
|
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Skylake vs Kabylake | ET_ | Hardware | 17 | 2017-05-24 16:19 |
| Skylake's power management under Linux is dreadful and you shouldn't buy one until it's fixed | Dubslow | Hardware | 8 | 2016-04-15 09:14 |
| Skylake and RAM scaling | mackerel | Hardware | 34 | 2016-03-03 19:14 |
| Skylake AVX-512 | clarke | Software | 15 | 2015-03-04 21:48 |
| blend / small FFT torture test on linux ? | lukasz | Linux | 1 | 2007-01-09 16:16 |