mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   Lounge (https://www.mersenneforum.org/forumdisplay.php?f=7)
-   -   Let's buy GIMPS an Opteron! (https://www.mersenneforum.org/showthread.php?t=964)

gbvalor 2003-09-05 21:45

Just for comparation, here are the results of my Athlon XP (Barton) BOX,

[code:1]
gbv@gauss:~/glucas/glucas-2.9.1> less selftest.out
You have new mail in /var/spool/mail/gbv
gbv@gauss:~/glucas/glucas-2.9.1> cat /proc/cpuinfo
processor : 0
vendor_id : AuthenticAMD
cpu family : 6
model : 10
model name : AMD Athlon(tm) XP 2500+
stepping : 0
cpu MHz : 1830.138
cache size : 512 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 1
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 mmx fxsr sse syscall mmxext 3dnowext 3dnow
bogomips : 3643.80
[/code:1]

And the timings:
[code:1] milsec/iter (user time)
FFT(k) round check
on / off
--- ---------
512 53/ 50
576 63/ 60
640 74/ 70
768 88/ 84
896 107/100
1024 115/116
1152 138/130
1280 149/142
1536 181/172
1792 217/213
2048 238/228
[/code:1]

Guillermo

Bok 2003-09-05 23:16

Ok, ran that Glucas test (your link needs /pub/ btw)

I guess you cleaned up the results so I'll try and do the same, I think I'm interpreting it correctly.

milsec/iter (user time)
FFT(k) round check
on / off
--- ---------
512 43/40
576 52/48
640 56/51
768 66/62
896 80/76
1024 88/84
1152 107/100
1280 115/107
1536 137/130
1792 166/158
2048 183/175

I'll try making with -m64 -m128bit-long-double as well

Bok

Prime95 2003-09-05 23:46

Bok, try this:

Run "mprime -m"
Choose 18
exit mprime
vi local.ini
add line "CpuSupportsSSE2=0"
repeat choice 18

Prime95 2003-09-05 23:55

Bok, also please try this in a fresh directory

Get ftp://mersenne.org/gimps/mprtst.tar.gz

Run mprime -m
Choose 5
exit
delete worktodo.ini
Run mprime -m
Choose 8, exponent = 20000000, iterations = 10
exit

Post the results.txt file - full of weird timer numbers

Bok 2003-09-06 00:21

[quote="Prime95"]Bok, try this:

Run "mprime -m"
Choose 18
exit mprime
vi local.ini
add line "CpuSupportsSSE2=0"
repeat choice 18[/quote]

This is the results.txt for this one

AMD Engineering Sample
CPU speed: 1799.79 MHz
CPU features: RDTSC, CMOV, PREFETCH, MMX, SSE, SSE2
L1 cache size: 64 KB
L2 cache size: 1024 KB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
L1 TLBS: 32
L2 TLBS: 512
Prime95 version 22.12, RdtscTiming=1
Best time for 256K FFT length: 16.347 ms.
Best time for 320K FFT length: 21.512 ms.
Best time for 384K FFT length: 25.947 ms.
Best time for 448K FFT length: 31.366 ms.
Best time for 512K FFT length: 34.787 ms.
Best time for 640K FFT length: 46.058 ms.
Best time for 768K FFT length: 56.882 ms.
Best time for 896K FFT length: 69.160 ms.
Best time for 1024K FFT length: 76.882 ms.
Best time for 1280K FFT length: 98.280 ms.
Best time for 1536K FFT length: 118.428 ms.
Best time for 1792K FFT length: 145.785 ms.

AMD Engineering Sample
CPU speed: 1800.20 MHz
CPU features: RDTSC, CMOV, PREFETCH, MMX, SSE
L1 cache size: 64 KB
L2 cache size: 1024 KB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
L1 TLBS: 32
L2 TLBS: 512
Prime95 version 22.12, RdtscTiming=1
Best time for 256K FFT length: 17.196 ms.
Best time for 320K FFT length: 22.334 ms.
Best time for 384K FFT length: 29.183 ms.
Best time for 448K FFT length: 32.451 ms.
Best time for 512K FFT length: 35.374 ms.
Best time for 640K FFT length: 46.979 ms.
Best time for 768K FFT length: 56.599 ms.
Best time for 896K FFT length: 68.014 ms.
Best time for 1024K FFT length: 75.773 ms.
Best time for 1280K FFT length: 100.296 ms.
Best time for 1536K FFT length: 120.103 ms.
Best time for 1792K FFT length: 145.532 ms.

Bok 2003-09-06 00:24

[quote="Prime95"]Bok, also please try this in a fresh directory

Get ftp://mersenne.org/gimps/mprtst.tar.gz

Run mprime -m
Choose 5
exit
delete worktodo.ini
Run mprime -m
Choose 8, exponent = 20000000, iterations = 10
exit

Post the results.txt file - full of weird timer numbers[/quote]

And here is the results of this one....

Test 0: 0.000 sec. (121 clocks), avg: 0.000 sec. (127 clocks)
Test 1: 0.000 sec. (2131 clocks), avg: 0.000 sec. (2140 clocks)
Test 2: 0.000 sec. (654142 clocks), avg: 0.000 sec. (660652 clocks)
Test 3: 0.001 sec. (2623567 clocks), avg: 0.001 sec. (2685738 clocks)
Test 4: 0.002 sec. (2840212 clocks), avg: 0.002 sec. (2854997 clocks)
Test 1000: 0.000 sec. (528147 clocks), avg: 0.000 sec. (537882 clocks)
Test 1001: 0.001 sec. (1547728 clocks), avg: 0.001 sec. (1581690 clocks)
Test 1002: 0.001 sec. (2323676 clocks), avg: 0.001 sec. (2347559 clocks)
Test 1003: 0.000 sec. (525146 clocks), avg: 0.000 sec. (525155 clocks)
Test 1004: 0.001 sec. (2460613 clocks), avg: 0.001 sec. (2470975 clocks)
Test 1005: 0.002 sec. (4224755 clocks), avg: 0.002 sec. (4397632 clocks)
Test 1006: 0.000 sec. (528147 clocks), avg: 0.000 sec. (529015 clocks)
Test 1007: 0.001 sec. (2485652 clocks), avg: 0.001 sec. (2512956 clocks)
Test 1008: 0.002 sec. (3813135 clocks), avg: 0.002 sec. (4017296 clocks)
Test 1009: 0.001 sec. (1040145 clocks), avg: 0.001 sec. (1042419 clocks)
Test 1010: 0.003 sec. (4907879 clocks), avg: 0.003 sec. (4914776 clocks)
Test 1011: 0.003 sec. (4732079 clocks), avg: 0.003 sec. (4976143 clocks)
Test 1012: 0.000 sec. (504955 clocks), avg: 0.000 sec. (505812 clocks)
Test 1013: 0.000 sec. (831666 clocks), avg: 0.000 sec. (834674 clocks)
Test 1014: 0.001 sec. (1120767 clocks), avg: 0.001 sec. (1142366 clocks)
Test 1015: 0.001 sec. (1581311 clocks), avg: 0.001 sec. (1590955 clocks)
Test 1016: 0.001 sec. (1290771 clocks), avg: 0.001 sec. (1312691 clocks)
Test 1017: 0.001 sec. (1901221 clocks), avg: 0.001 sec. (1910335 clocks)
Test 1018: 0.001 sec. (1152645 clocks), avg: 0.001 sec. (1154434 clocks)
Test 1019: 0.001 sec. (2025428 clocks), avg: 0.001 sec. (2029746 clocks)
[Fri Sep 5 16:23:34 2003]
timer 0: 72921256
timer 1: 57044016
timer 2: 579204
timer 3: 72340708
timer 4: 10620074
timer 5: 6002809
timer 6: 13630747
timer 9: 12004685
timer 10: 13410368
timer 13: 7685964
timer 14: 7410400
timer 16: 9099429
timer 17: 5984708
timer 18: 8094904
timer 20: 12457308
timer 21: 13360638
timer 24: 7918447
timer 26: 18079
timer 27: 36327



Bok

GP2 2003-09-06 03:14

Bok, perhaps you ought to try mprime version 23.5 instead of 22.12?

Bok 2003-09-06 04:42

ok, ran it again with version 23.5

AMD Engineering Sample
CPU speed: 1799.80 MHz
CPU features: RDTSC, CMOV, PREFETCH, MMX, SSE, SSE2
L1 cache size: 64 KB
L2 cache size: 1024 KB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
L1 TLBS: 32
L2 TLBS: 512
Prime95 version 23.5, RdtscTiming=1
Best time for 384K FFT length: 25.615 ms.
Best time for 448K FFT length: 30.690 ms.
Best time for 512K FFT length: 34.733 ms.
Best time for 640K FFT length: 43.128 ms.
Best time for 768K FFT length: 52.786 ms.
Best time for 896K FFT length: 63.544 ms.
Best time for 1024K FFT length: 71.629 ms.
Best time for 1280K FFT length: 95.534 ms.
Best time for 1536K FFT length: 116.598 ms.
Best time for 1792K FFT length: 140.507 ms.
Best time for 2048K FFT length: 158.316 ms.
[Fri Sep 5 20:38:36 2003]
Compare your results to other computers at http://www.mersenne.org/bench.htm
That web page also contains instructions on how your results can be included.

AMD Engineering Sample
CPU speed: 1799.76 MHz
CPU features: RDTSC, CMOV, PREFETCH, MMX, SSE
L1 cache size: 64 KB
L2 cache size: 1024 KB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
L1 TLBS: 32
L2 TLBS: 512
Prime95 version 23.5, RdtscTiming=1
Best time for 384K FFT length: 28.408 ms.
Best time for 448K FFT length: 32.031 ms.
Best time for 512K FFT length: 34.807 ms.
Best time for 640K FFT length: 47.125 ms.
Best time for 768K FFT length: 55.922 ms.
Best time for 896K FFT length: 67.866 ms.
Best time for 1024K FFT length: 74.954 ms.
Best time for 1280K FFT length: 96.482 ms.
Best time for 1536K FFT length: 117.145 ms.
Best time for 1792K FFT length: 139.925 ms.
Best time for 2048K FFT length: 156.999 ms.


Should it have been slower ???

Bok

GP2 2003-09-06 05:39

[quote="Bok"]ok, ran it again with version 23.5



Should it have been slower ???

Bok[/quote]

When comparing results, note that version 22 output starts at 256K FFT and ends at 1792K, while version 23 output starts at 384K and ends at 2048K.

So your results show that version 23.5 is actually a bit faster than 22.12 for your Opteron.

gbvalor 2003-09-06 08:03

It seems there is no significative advantage using Revision C for Glucas. Actually, the timings are near the same. The version Bok tested is a bit slower because I supressed an optimization causing problems in other targets.

Other interesting thing is that Glucas and mprime are closest than ever :shock: . Well, when magic George's hands touch again the code this will disapear :) .

And I also have work to do with Glucas ...

[quote]I'll try making with -m64 -m128bit-long-double as well
[/quote]

Bok, I don't think it will run, at least it will not give any advantage.

Guillermo

Prime95 2003-09-06 20:33

[quote="gbvalor"]It seems there is no significative advantage using Revision C for Glucas. [/quote]

From the timer values I see no significant advantage for rev C for prime95 either.


All times are UTC. The time now is 21:21.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.