mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Software

Reply
 
Thread Tools
Old 2005-03-30, 23:21   #1
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

1AFC16 Posts
Default Early Beta of version 24.11

Version 24.11 has two major improvements.

1) There is a 64-bit version for 64-bit Windows. It contains all new faster factoring code.

2) An Athlon64 optimization was found for both the 32-bit and 64-bit versions of prime95. You'll get about a 15% performance boost. Still not as fast as a similarly clocked P4, but it is much closer.

Only AMD64 machines should try this version. It is not well QA'ed. Save your work before installing, just in case there is a problem.

You can download from:

Windows: ftp://mersenne.org/gimps/p95v2411.zip
Windows 64-bit: ftp://mersenne.org/gimps/p64v2411.zip
Windows NT service: ftp://mersenne.org/gimps/winnt2411.zip
Linux: ftp://mersenne.org/gimps/mprime2411.tar.gz
Linux (static link): ftp://mersenne.org/gimps/sprime2411.tar.gz

Let me know if you find any problems.
Prime95 is offline   Reply With Quote
Old 2005-03-30, 23:23   #2
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

22·11·157 Posts
Default

and source is at ftp://mersenne.org/gimps/source24.zip

If any AMD64 gurus want to try and improve the factoring code in factor64.asm, I'm more than willing to incorporate any improvements.
Prime95 is offline   Reply With Quote
Old 2005-03-31, 07:16   #3
Peter Nelson
 
Peter Nelson's Avatar
 
Oct 2004

232 Posts
Default Excellent!

Excellent to see this and AMD users will be delighted to finally get their hands on it.

Few thoughts occured....

a) will you be building a 64 bit version for LINUX users (eg Suse 9.2 on AMD64 and other distros which are 64-bit)?

b) as the Pentium 6xx series now have Intel EM64T which has almost same instruction set as Athlon64, might the code run on 6xx too?
I appreciate the architectures are different eg cache, and that there are very few of these in the field yet but it might be forward thinking to support Intel's 64 bit efforts too.

c) will you be able to maintain a single source code tree or have to have different versions for Intel/AMD/32/64 bits? The target architecture (32/64) could be specified at compile time, other differences as existing code by cpu detection. The range of cpu types is likely to further increase ie dualcore where two processors share memory subsystem which will slow mem accesses outside the L2 cache. This may mean different optimisations.

d) please would it be possible to include a short test of trial factoring speed in the benchmark (of the release version) because this would be very useful to know and compare the benefit of your optimisations. Maybe it could just appear when Fullbench option is specified.
Peter Nelson is offline   Reply With Quote
Old 2005-03-31, 11:17   #4
sonjohan
 
sonjohan's Avatar
 
May 2003
Belgium

2×139 Posts
Default

It is likely to be a silly question, but how do I know wether I have a 64-bit Windows or not??
sonjohan is offline   Reply With Quote
Old 2005-03-31, 13:30   #5
dsouza123
 
dsouza123's Avatar
 
Sep 2002

2×331 Posts
Default

Unless you downloaded Window XP PRO 64 or got it through MSDN
you don't have it. It is/was a beta release from Microsoft, and
hasn't shipped on PCs for retail yet.

Has gone Gold today, shipping to manufacturing, so probably on
shipping PCs later part of April.
dsouza123 is offline   Reply With Quote
Old 2005-03-31, 14:47   #6
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

1AFC16 Posts
Default

a) I have no immediate plans to try a 64-bit Linux port. I'm not sure if objcopy can convert the MASM object files into ELF64 format.

b) The code should run on Intel 6xx machines too.

c) There is just one source tree. Right now there are 4 different versions of the FFT:

x87 - optimized for Pentium Pro runs on any x87 machine
x87 - optimized for Athlons (also used by P3s and later)
SSE2 - P4 optimized
SSE2 - AMD64 optimized

d) Maybe
Prime95 is offline   Reply With Quote
Old 2005-04-01, 17:39   #7
penguin22
 
Sep 2002

22·3 Posts
Default

I have been using version 24.6 with the CPUSupportsSSE2=0 option and was wondering if it would be better to delete that line now that this version is out and has support for the features in the A64?

Thanks for your hard work.
penguin22 is offline   Reply With Quote
Old 2005-04-01, 17:51   #8
Jeff Gilchrist
 
Jeff Gilchrist's Avatar
 
Jun 2003
Ottawa, Canada

7·167 Posts
Default

Can anyone with A64/Opteron boxes post some benchmarks with 23.x, 24.6, and 24.11 comparisons please?
Jeff Gilchrist is offline   Reply With Quote
Old 2005-04-01, 20:20   #9
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

22·11·157 Posts
Default

Quote:
Originally Posted by penguin22
I have been using version 24.6 with the CPUSupportsSSE2=0 option and was wondering if it would be better to delete that line now that this version is out and has support for the features in the A64?
Time it both ways. I'm betting the SSE2 code is now faster for the same FFT size.
Prime95 is offline   Reply With Quote
Old 2005-04-01, 20:59   #10
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

24·3·23 Posts
Default

Quote:
Originally Posted by Prime95
a) I have no immediate plans to try a 64-bit Linux port. I'm not sure if objcopy can convert the MASM object files into ELF64 format.
very sad to hear :(

is it possible for you to make a binary with an other asm compiler for non-windoze os'ses (even if it's a bit slower)?

thejudger
TheJudger is offline   Reply With Quote
Old 2005-04-01, 21:21   #11
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

22·11·157 Posts
Default

Quote:
Originally Posted by TheJudger
is it possible for you to make a binary with an other asm compiler for non-windoze os'ses (even if it's a bit slower)?
No. Converting all that assembly code to another assembler format would be a monumental task.

I'm sure the binutils guys will make objcopy work eventually if they haven't done so already. The source is available if someone wants to try a 64-bit linux port.

My next task is more optimizations, especially making use of the extra SSE2 registers in 64-bit mode. Don't expect much - a few percent on AMD64, perhaps a little more on the P4.
Prime95 is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Prime95 beta version 28.4 Prime95 Software 20 2014-03-02 02:51
Prime95 beta version 28.3 Prime95 Software 68 2014-02-23 05:42
Prime95 version 27.1 early preview, not-even-close-to-beta release Prime95 Software 126 2012-02-09 16:17
Beta version 24.12 available Prime95 Software 33 2005-06-14 13:19
Beta version of PRP Prime95 PSearch 15 2004-09-17 19:21

All times are UTC. The time now is 04:49.

Sun Jul 5 04:49:23 UTC 2020 up 102 days, 2:22, 1 user, load averages: 1.18, 1.12, 1.17

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2020, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.