![]() |
![]() |
#45 |
I moo ablest echo power!
May 2013
26×29 Posts |
![]()
Here's the exe along with the pthreads dll and sort engine dll and ptx files. I added a CC5.2 ptx compilation. Don't know which are needed or not, but everything is there except for the cudart dlls. They'd be too big for posting here, I think. CPU is an Ivy Bridge.
|
![]() |
![]() |
![]() |
#46 |
Sep 2009
1001100101012 Posts |
![]()
I made it work with CUDA 7.5 by editing the makefiles to remove CC 1.1 and 1.3:
http://mersenneforum.org/showpost.ph...2&postcount=20 Chris |
![]() |
![]() |
![]() |
#47 |
Einyen
Dec 2003
Denmark
2·3·52·23 Posts |
![]()
SVN991 compiled fine with CUDA 7.5 without tinkering with the Makefile beyond the usual parameters. It even worked with ZLIB on. What is the benefit of Zlib, it compresses relations? It is better to leave NO_ZLIB=1 ?
It seems to work, it searches for a poly at least. I tried to use parameters from this old RSA896 thread: http://www.mersenneforum.org/showthread.php?t=17460 but it is not finding the same polynomials or I do not know what I'm doing, which is far more likely. Anyone have some more recent parameters that should find a polynomial just for a test? Last fiddled with by ATH on 2016-01-06 at 00:48 |
![]() |
![]() |
![]() |
#48 |
Tribal Bullet
Oct 2004
356510 Posts |
![]()
Compiling zlib in allows the binary to read and write compressed relation files.
I don't think there's a controlled test you can run that will find a known polynomial. There's a lot of checking in polyselect stage 2, so if you find any polynomials at all it's probably working fine. Of course with a hot modern GPU you will find stage 1 hits so fast that performing stage 2 will leave the GPU mostly idle. Sorry to everyone that it took so long to get back to a working state. Last fiddled with by jasonp on 2016-01-06 at 01:56 |
![]() |
![]() |
![]() |
#49 |
Romulan Interpreter
"name field"
Jun 2011
Thailand
24·643 Posts |
![]()
If you have enough space, and don't intend to move the files much around, and don't have a terrible slow hdd, then yes. Disadvantages, as shown by the former sentence, are that the uncompressed files are big, taking a lot of space and being difficult to move from folders to folders, or share on the web, the are slow to read when resuming the work, etc. Advantages is that if the zipped file crash - and the probability is not null, because the files are big and not fast to handle/read/write - then you can most probably say bye-bye to all your relations, but when the file is in clear, the crashed lines are just ignored, and most of the relations are still recoverable.
|
![]() |
![]() |
![]() |
#50 |
Einyen
Dec 2003
Denmark
345010 Posts |
![]()
Here are the svn 991 files compiled without Zlib:
msieve-svn991-cuda75-haswell.zip msieve-svn991-cuda75-sandybridge.zip |
![]() |
![]() |
![]() |
#51 |
Apr 2010
2·131 Posts |
![]()
On my System (Linux, GeForce GTX 650 Ti) the new version (compiled with Cuda 6.5, using sm30) is about 20% slower than the old version (compiled with Cuda 6.0, using sm20).
For the comparison I disabled the randomization. The card isn't used for the display. I haven't had the time to try the old version with Cuda 6.5. |
![]() |
![]() |
![]() |
#52 |
Sep 2009
17248 Posts |
![]()
I only use rotating HDDs, and using gzipped relations, as produced by NFS@Home clients for bandwidth reasons, usually saves filtering time compared to dealing with uncompressed relations. Possibly square root time as well, as the bottleneck when reading relations is on I/O.
On modern zlib versions, the CPU cost of dealing with uncompressed output is minimal. |
![]() |
![]() |
![]() |
#53 | |
"Victor de Hollander"
Aug 2011
the Netherlands
32·131 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
#54 | |
Einyen
Dec 2003
Denmark
2×3×52×23 Posts |
![]() Quote:
I also added "nvcuda.dll" to the gpu versions just to be safe, although it might be present on all computers with Nvidia drivers installed? |
|
![]() |
![]() |
![]() |
#55 |
Dec 2015
1416 Posts |
![]()
991-haswell doesn't work unless I delete nvcuda.dll - apparently that DLL is highly driver-specific and must be loaded from the system.
|
![]() |
![]() |
![]() |
Thread Tools | |
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Newer milestone thread | Uncwilly | Data | 3668 | 2023-04-20 12:24 |
Help needed - M77 232 917 celebration build | jbpace | Hardware | 19 | 2018-01-10 16:32 |
Performance of cuda-ecm on newer hardware? | fivemack | GMP-ECM | 14 | 2015-02-12 20:10 |
Newer msieves are slow on Core i7 | mklasson | Msieve | 9 | 2009-02-18 12:58 |
Use of large memory pages possible with newer linux kernels | Dresdenboy | Software | 3 | 2003-12-08 14:47 |