mersenneforum.org  

Go Back   mersenneforum.org > Factoring Projects > Msieve

Reply
 
Thread Tools
Old 2016-01-05, 01:15   #45
wombatman
I moo ablest echo power!
 
wombatman's Avatar
 
May 2013

23·223 Posts
Default MSieve 991M compiled with CUDA 7

Here's the exe along with the pthreads dll and sort engine dll and ptx files. I added a CC5.2 ptx compilation. Don't know which are needed or not, but everything is there except for the cudart dlls. They'd be too big for posting here, I think. CPU is an Ivy Bridge.
Attached Files
File Type: 7z msieve_CUDA7.7z (998.1 KB, 445 views)
wombatman is offline   Reply With Quote
Old 2016-01-05, 17:29   #46
chris2be8
 
chris2be8's Avatar
 
Sep 2009

1000100000102 Posts
Default

Quote:
Originally Posted by ATH View Post
What about CUDA 7.0 and 7.5? Are they still not compatible?
I made it work with CUDA 7.5 by editing the makefiles to remove CC 1.1 and 1.3:
http://mersenneforum.org/showpost.ph...2&postcount=20

Chris
chris2be8 is offline   Reply With Quote
Old 2016-01-06, 00:47   #47
ATH
Einyen
 
ATH's Avatar
 
Dec 2003
Denmark

2×7×227 Posts
Default

SVN991 compiled fine with CUDA 7.5 without tinkering with the Makefile beyond the usual parameters. It even worked with ZLIB on. What is the benefit of Zlib, it compresses relations? It is better to leave NO_ZLIB=1 ?

It seems to work, it searches for a poly at least. I tried to use parameters from this old RSA896 thread:
http://www.mersenneforum.org/showthread.php?t=17460
but it is not finding the same polynomials or I do not know what I'm doing, which is far more likely.

Anyone have some more recent parameters that should find a polynomial just for a test?

Last fiddled with by ATH on 2016-01-06 at 00:48
ATH is offline   Reply With Quote
Old 2016-01-06, 01:55   #48
jasonp
Tribal Bullet
 
jasonp's Avatar
 
Oct 2004

3·1,181 Posts
Default

Compiling zlib in allows the binary to read and write compressed relation files.

I don't think there's a controlled test you can run that will find a known polynomial. There's a lot of checking in polyselect stage 2, so if you find any polynomials at all it's probably working fine. Of course with a hot modern GPU you will find stage 1 hits so fast that performing stage 2 will leave the GPU mostly idle.

Sorry to everyone that it took so long to get back to a working state.

Last fiddled with by jasonp on 2016-01-06 at 01:56
jasonp is offline   Reply With Quote
Old 2016-01-06, 02:41   #49
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
"name field"
Jun 2011
Thailand

9,787 Posts
Default

Quote:
Originally Posted by ATH View Post
SVN991 compiled fine with CUDA 7.5 without tinkering with the Makefile beyond the usual parameters. It even worked with ZLIB on. What is the benefit of Zlib, it compresses relations? It is better to leave NO_ZLIB=1 ?
If you have enough space, and don't intend to move the files much around, and don't have a terrible slow hdd, then yes. Disadvantages, as shown by the former sentence, are that the uncompressed files are big, taking a lot of space and being difficult to move from folders to folders, or share on the web, the are slow to read when resuming the work, etc. Advantages is that if the zipped file crash - and the probability is not null, because the files are big and not fast to handle/read/write - then you can most probably say bye-bye to all your relations, but when the file is in clear, the crashed lines are just ignored, and most of the relations are still recoverable.
LaurV is offline   Reply With Quote
Old 2016-01-06, 14:21   #50
ATH
Einyen
 
ATH's Avatar
 
Dec 2003
Denmark

1100011010102 Posts
Default

Here are the svn 991 files compiled without Zlib:

msieve-svn991-cuda75-haswell.zip
msieve-svn991-cuda75-sandybridge.zip
ATH is offline   Reply With Quote
Old 2016-01-07, 07:03   #51
Gimarel
 
Apr 2010

23×23 Posts
Default Slower

On my System (Linux, GeForce GTX 650 Ti) the new version (compiled with Cuda 6.5, using sm30) is about 20% slower than the old version (compiled with Cuda 6.0, using sm20).

For the comparison I disabled the randomization. The card isn't used for the display.

I haven't had the time to try the old version with Cuda 6.5.
Gimarel is offline   Reply With Quote
Old 2016-01-07, 07:15   #52
debrouxl
 
debrouxl's Avatar
 
Sep 2009

3D216 Posts
Default

I only use rotating HDDs, and using gzipped relations, as produced by NFS@Home clients for bandwidth reasons, usually saves filtering time compared to dealing with uncompressed relations. Possibly square root time as well, as the bottleneck when reading relations is on I/O.

On modern zlib versions, the CPU cost of dealing with uncompressed output is minimal.
debrouxl is offline   Reply With Quote
Old 2016-01-07, 12:24   #53
VictordeHolland
 
VictordeHolland's Avatar
 
"Victor de Hollander"
Aug 2011
the Netherlands

23·3·72 Posts
Default

Quote:
Originally Posted by ATH View Post
Here are the Msieve svn988 I compiled a few days ago without CUDA support (before I figured out how to):

msieve-svn988-nogpu-haswell.zip
msieve-svn988-nogpu-sandybridge.zip
msieve-svn988-nogpu-core2.zip
These need libwinpthread-1.dll to work on a machine without minGW and Visual Studio installed.
Attached Files
File Type: 7z libwinpthread-1_dll.7z (20.5 KB, 249 views)
VictordeHolland is offline   Reply With Quote
Old 2016-01-07, 15:38   #54
ATH
Einyen
 
ATH's Avatar
 
Dec 2003
Denmark

2×7×227 Posts
Default

Quote:
Originally Posted by VictordeHolland View Post
These need libwinpthread-1.dll to work on a machine without minGW and Visual Studio installed.
Thanks, I did not get a chance to test them on another computer. I added my "libwinpthread-1.dll" from MSYS2 to all the zip-files.

I also added "nvcuda.dll" to the gpu versions just to be safe, although it might be present on all computers with Nvidia drivers installed?
ATH is offline   Reply With Quote
Old 2016-01-09, 05:29   #55
Googulator
 
Dec 2015

22·5 Posts
Default

991-haswell doesn't work unless I delete nvcuda.dll - apparently that DLL is highly driver-specific and must be loaded from the system.
Googulator is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Newer milestone thread Uncwilly Data 3469 2021-10-13 15:55
Help needed - M77 232 917 celebration build jbpace Hardware 19 2018-01-10 16:32
Performance of cuda-ecm on newer hardware? fivemack GMP-ECM 14 2015-02-12 20:10
Newer msieves are slow on Core i7 mklasson Msieve 9 2009-02-18 12:58
Use of large memory pages possible with newer linux kernels Dresdenboy Software 3 2003-12-08 14:47

All times are UTC. The time now is 11:51.


Mon Oct 25 11:51:42 UTC 2021 up 94 days, 6:20, 0 users, load averages: 2.78, 1.93, 1.44

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.