![]() |
![]() |
#1 |
P90 years forever!
Aug 2002
Yeehaw, FL
41×199 Posts |
![]()
Version 24.11 has two major improvements.
1) There is a 64-bit version for 64-bit Windows. It contains all new faster factoring code. 2) An Athlon64 optimization was found for both the 32-bit and 64-bit versions of prime95. You'll get about a 15% performance boost. Still not as fast as a similarly clocked P4, but it is much closer. Only AMD64 machines should try this version. It is not well QA'ed. Save your work before installing, just in case there is a problem. You can download from: Windows: ftp://mersenne.org/gimps/p95v2411.zip Windows 64-bit: ftp://mersenne.org/gimps/p64v2411.zip Windows NT service: ftp://mersenne.org/gimps/winnt2411.zip Linux: ftp://mersenne.org/gimps/mprime2411.tar.gz Linux (static link): ftp://mersenne.org/gimps/sprime2411.tar.gz Let me know if you find any problems. |
![]() |
![]() |
![]() |
#2 |
P90 years forever!
Aug 2002
Yeehaw, FL
41·199 Posts |
![]()
and source is at ftp://mersenne.org/gimps/source24.zip
If any AMD64 gurus want to try and improve the factoring code in factor64.asm, I'm more than willing to incorporate any improvements. |
![]() |
![]() |
![]() |
#3 |
Oct 2004
10000100012 Posts |
![]()
Excellent to see this and AMD users will be delighted to finally get their hands on it.
Few thoughts occured.... a) will you be building a 64 bit version for LINUX users (eg Suse 9.2 on AMD64 and other distros which are 64-bit)? b) as the Pentium 6xx series now have Intel EM64T which has almost same instruction set as Athlon64, might the code run on 6xx too? I appreciate the architectures are different eg cache, and that there are very few of these in the field yet but it might be forward thinking to support Intel's 64 bit efforts too. c) will you be able to maintain a single source code tree or have to have different versions for Intel/AMD/32/64 bits? The target architecture (32/64) could be specified at compile time, other differences as existing code by cpu detection. The range of cpu types is likely to further increase ie dualcore where two processors share memory subsystem which will slow mem accesses outside the L2 cache. This may mean different optimisations. d) please would it be possible to include a short test of trial factoring speed in the benchmark (of the release version) because this would be very useful to know and compare the benefit of your optimisations. Maybe it could just appear when Fullbench option is specified. |
![]() |
![]() |
![]() |
#4 |
May 2003
Belgium
23·5·7 Posts |
![]()
It is likely to be a silly question, but how do I know wether I have a 64-bit Windows or not??
|
![]() |
![]() |
![]() |
#5 |
Sep 2002
10100101102 Posts |
![]()
Unless you downloaded Window XP PRO 64 or got it through MSDN
you don't have it. It is/was a beta release from Microsoft, and hasn't shipped on PCs for retail yet. Has gone Gold today, shipping to manufacturing, so probably on shipping PCs later part of April. |
![]() |
![]() |
![]() |
#6 |
P90 years forever!
Aug 2002
Yeehaw, FL
41×199 Posts |
![]()
a) I have no immediate plans to try a 64-bit Linux port. I'm not sure if objcopy can convert the MASM object files into ELF64 format.
b) The code should run on Intel 6xx machines too. c) There is just one source tree. Right now there are 4 different versions of the FFT: x87 - optimized for Pentium Pro runs on any x87 machine x87 - optimized for Athlons (also used by P3s and later) SSE2 - P4 optimized SSE2 - AMD64 optimized d) Maybe |
![]() |
![]() |
![]() |
#7 |
Sep 2002
22×3 Posts |
![]()
I have been using version 24.6 with the CPUSupportsSSE2=0 option and was wondering if it would be better to delete that line now that this version is out and has support for the features in the A64?
Thanks for your hard work. |
![]() |
![]() |
![]() |
#8 |
Jun 2003
Ottawa, Canada
3·17·23 Posts |
![]()
Can anyone with A64/Opteron boxes post some benchmarks with 23.x, 24.6, and 24.11 comparisons please?
|
![]() |
![]() |
![]() |
#9 | |
P90 years forever!
Aug 2002
Yeehaw, FL
41·199 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
#10 | |
"Oliver"
Mar 2005
Germany
2·557 Posts |
![]() Quote:
is it possible for you to make a binary with an other asm compiler for non-windoze os'ses (even if it's a bit slower)? thejudger |
|
![]() |
![]() |
![]() |
#11 | |
P90 years forever!
Aug 2002
Yeehaw, FL
815910 Posts |
![]() Quote:
I'm sure the binutils guys will make objcopy work eventually if they haven't done so already. The source is available if someone wants to try a 64-bit linux port. My next task is more optimizations, especially making use of the extra SSE2 registers in 64-bit mode. Don't expect much - a few percent on AMD64, perhaps a little more on the P4. |
|
![]() |
![]() |
![]() |
Thread Tools | |
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Prime95 beta version 28.4 | Prime95 | Software | 20 | 2014-03-02 02:51 |
Prime95 beta version 28.3 | Prime95 | Software | 68 | 2014-02-23 05:42 |
Prime95 version 27.1 early preview, not-even-close-to-beta release | Prime95 | Software | 126 | 2012-02-09 16:17 |
Beta version 24.12 available | Prime95 | Software | 33 | 2005-06-14 13:19 |
Beta version of PRP | Prime95 | PSearch | 15 | 2004-09-17 19:21 |