![]() |
|
|
#408 |
|
Feb 2012
32×7 Posts |
I am using a GTX 980 on my laptop with 2048 cuda cores and running 1024 stage 1 instances per process. I have been running curves using ecm_gpu (stage 1 on my gpu and multi-threading stage 2 on my cpu using ecm.py script). Suddenly, the gpu speed cut in half. I tried playing around with the number of curves running on my gpu using the -gpucurves tag. It was still slow. Then I shut my laptop off and restarted... it returned to normal. Has anybody ever experienced this? My concern is that it might go back to "half speed" later on in the middle of some big job...
|
|
|
|
|
|
#409 | |
|
Bamboozled!
"๐บ๐๐ท๐ท๐ญ"
May 2003
Down not across
1179610 Posts |
Quote:
|
|
|
|
|
|
|
#410 | |
|
Nov 2008
1FD16 Posts |
Quote:
|
|
|
|
|
|
|
#411 |
|
I moo ablest echo power!
May 2013
74116 Posts |
Yeah, my assumption would be the GPU getting too hot. GPU-ECM will definitely raise the temperature.
|
|
|
|
|
|
#412 |
|
Romulan Interpreter
"name field"
Jun 2011
Thailand
41×251 Posts |
Take the GPU-Z and run it in parallel with GPU-ECM, watch it close. If there was a temperature issue, then it will happen again in about the same amount of time, then look to what GPU-Z says in that moment (it says why the card's speed is restricted, like temperature issues, power issues, etc). Look if the clock changes (it may, to reduce the power, or it may not, and reduce the power by inserting idle clocks, each method has advantages and disadvantages, but for a clear temperature issue, the clock will be cut, for sure).
Last fiddled with by LaurV on 2016-08-11 at 05:57 |
|
|
|
|
|
#413 | ||
|
Feb 2012
3F16 Posts |
Quote:
I was wondering you have have thought about what the steps that you took to compile gmp-ecm for gpu-ecm. I have the standard 1018-bit version and was wondering if I could try to see if a lower bit version would work more quickly and be effective (as in finding factors in stage 1 or stage 2 on the cpu). This guy from a much earlier post seems to have gotten it to work: Quote:
Last fiddled with by cgy606 on 2016-08-19 at 22:34 |
||
|
|
|
|
|
#414 |
|
I moo ablest echo power!
May 2013
3×619 Posts |
Here's the program compiled with NB_DIGITS set to 20. I would try finding a known factor to ensure it works properly. It passed the first few tests in test.gpuecm, which generally indicates it is working, but it's good to be sure.
|
|
|
|
|
|
#415 |
|
Feb 2012
32·7 Posts |
Some interesting behavior to say the least. I ran a test on a C147 that I know has a p10, p17, p20, p20 at B1 = 250K. I ran 512 curves on the GPU using the ndigits = 20. It found all factors after stage 1:
Code:
GMP-ECM 7.0.1-dev [configured with MPIR 2.7.0, --enable-gpu, --enable-openmp] [ECM] Input number is (99!+5)/9176362385 (147 digits) Using B1=250000, B2=0, sigma=3:3407157017-3:3407157528 (512 curves) Block: 20x32x1 Grid: 16x1x1 Computing 512 Step 1 took 2328ms of CPU time / 19709ms of GPU time ********** Factor found in step 1: 5275321151 Found probable prime factor of 10 digits: 5275321151 Composite cofactor ((99!+5)/9176362385)/5275321151 has 137 digits ********** Factor found in step 1: 42645646522247063 Found probable prime factor of 17 digits: 42645646522247063 Composite cofactor (((99!+5)/9176362385)/5275321151)/42645646522247063 has 120 digits ********** Factor found in step 1: 61133702826671342149 Found probable prime factor of 20 digits: 61133702826671342149 Composite cofactor ((((99!+5)/9176362385)/5275321151)/42645646522247063)/61133702826671342149 has 100 digits ********** Factor found in step 1: 31905776268663843113 Found probable prime factor of 20 digits: 31905776268663843113 Probable prime cofactor (((((99!+5)/9176362385)/5275321151)/42645646522247063)/61133702826671342149)/31905776268663843113 has 81 digits The scientist in me decided that I should try another 512 curves at B1 = 250K (a t30 search) using the default ndigits = 32. Here is the output I got: Code:
ON GPU
GMP-ECM 7.0.1-dev [configured with MPIR 2.7.0, --enable-gpu, --enable-openmp] [ECM]
Input number is (126!+5)/79768672096773991353065 (189 digits)
Using B1=250000, B2=0, sigma=3:290623459-3:290623970 (512 curves)
Block: 32x32x1 Grid: 16x1x1
300000 iterations to go
200000 iterations to go
100000 iterations to go
90000 iterations to go
80000 iterations to go
70000 iterations to go
60000 iterations to go
50000 iterations to go
40000 iterations to go
30000 iterations to go
20000 iterations to go
10000 iterations to go
GPU: factor 324295084094116127662247 found in Step 1 with curve 368 (-sigma 3:290623827)
Computing 512 Step 1 took 2344ms of CPU time / 33876ms of GPU time
********** Factor found in step 1: 324295084094116127662247
Found probable prime factor of 24 digits: 324295084094116127662247
Composite cofactor ((126!+5)/79768672096773991353065)/324295084094116127662247 has 165 digits
STAGE 2 RESUME ON CPU
-> ___________________________________________________________________
-> | Running ecm.py, a Python driver for distributing GMP-ECM work |
-> | on a single machine. It is copyright, 2011-2016, David Cleaver |
-> | and is a conversion of factmsieve.py that is Copyright, 2010, |
-> | Brian Gladman. Version 0.40 (Python 2.6 or later) 6th Aug 2016 |
-> |_________________________________________________________________|
-> Resuming work from resume file: 126fac5_250e3_0.save
-> Spreading the work across 8 thread(s)
->=============================================================================
-> Working on the number(s) in the resume file: 126fac5_250e3_0.save
-> Using up to 8 instances of GMP-ECM...
-> Found 512 unique resume lines to work on.
-> Will start working on the 512 resume lines.
-> ecm -resume resume_job_126fac5_250e3_0-save_inp_t00.txt 250000 > resume_job_126fac5_250e3_0-save_out_t00.txt (64 resume lines)
-> ecm -resume resume_job_126fac5_250e3_0-save_inp_t01.txt 250000 > resume_job_126fac5_250e3_0-save_out_t01.txt (64 resume lines)
-> ecm -resume resume_job_126fac5_250e3_0-save_inp_t02.txt 250000 > resume_job_126fac5_250e3_0-save_out_t02.txt (64 resume lines)
-> ecm -resume resume_job_126fac5_250e3_0-save_inp_t03.txt 250000 > resume_job_126fac5_250e3_0-save_out_t03.txt (64 resume lines)
-> ecm -resume resume_job_126fac5_250e3_0-save_inp_t04.txt 250000 > resume_job_126fac5_250e3_0-save_out_t04.txt (64 resume lines)
-> ecm -resume resume_job_126fac5_250e3_0-save_inp_t05.txt 250000 > resume_job_126fac5_250e3_0-save_out_t05.txt (64 resume lines)
-> ecm -resume resume_job_126fac5_250e3_0-save_inp_t06.txt 250000 > resume_job_126fac5_250e3_0-save_out_t06.txt (64 resume lines)
-> ecm -resume resume_job_126fac5_250e3_0-save_inp_t07.txt 250000 > resume_job_126fac5_250e3_0-save_out_t07.txt (64 resume lines)
GMP-ECM 7.0.1-dev [configured with MPIR 2.7.0, --enable-gpu, --enable-openmp] [ECM]
Using B1=250000-250000, B2=128992510, polynomial Dickson(3), 8 threads
____________________________________________________________________________
Curves Complete | Average seconds/curve | Runtime | ETA
-----------------|---------------------------|---------------|--------------
17 of 512 | Stg1 0.000s | Stg2 0.523s | 0d 00:00:02 | 0d 00:02:09
Resume line 17 out of 512:
Using B1=250000-250000, B2=128992510, polynomial Dickson(3), sigma=3:290623649
Step 1 took 0ms
Step 2 took 594ms
********** Factor found in step 2: 452655830807187689684039
Found probable prime factor of 24 digits: 452655830807187689684039
Probable prime cofactor (((126!+5)/79768672096773991353065)/324295084094116127662247)/452655830807187689684039 has 142 digits
I think something is grossly wrong in the ecm code when ndigits is changed from the default (I know cyril has pointed to this), but haven;t a clue what it is... Last fiddled with by Batalov on 2016-08-20 at 15:49 Reason: formatted the program output chunks into /code/ blocks |
|
|
|
|
|
#416 |
|
Just call me Henry
"David"
Sep 2007
Liverpool (GMT/BST)
3×23×89 Posts |
From memory only 16 and 32 worked when I looked at it a long while ago. Not a clue why.
|
|
|
|
|
|
#417 |
|
"Curtis"
Feb 2005
Riverside, CA
2·2,927 Posts |
I agree with Henry- only power-of-2 values worked, and only 32 worked without issue. 16 and 64 "mostly worked", but were never found to be 100% reliable for anyone.
IMO, it's not worth the missed factors to gain the time savings from using 16, unless you're running CPU-only ECM in parallel. |
|
|
|
|
|
#418 |
|
I moo ablest echo power!
May 2013
3·619 Posts |
Yeah, I played around NB_DIGITS before, but only bumping it up as I recall. It will compile and run and can find factors in Stage 1, but it always had issues with Stage 2.
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Running CUDA on non-Nvidia GPUs | Rodrigo | GPU Computing | 3 | 2016-05-17 05:43 |
| Error in GMP-ECM 6.4.3 and latest svn | ATH | GMP-ECM | 10 | 2012-07-29 17:15 |
| latest SVN 1677 | ATH | GMP-ECM | 7 | 2012-01-07 18:34 |
| Has anyone seen my latest treatise? | davieddy | Lounge | 0 | 2011-01-21 19:29 |
| Latest version? | [CZ]Pegas | Software | 3 | 2002-08-23 17:05 |