![]() |
|
|
#1 |
|
(loop (#_fork))
Feb 2006
Cambridge, England
72×131 Posts |
2^941-1 reservations are closed, and it will soon enter the two-month linear algebra phase, so it's time to start figuring out what to do next.
I'm not sure there's much point manually organising any more sieving, given that NFS@home exists. So I've set my computers over to doing sieving for NFS@home for the moment. I'm doing a polynomial search with msieve_gpu for the 180-digit EM43 number; my aim is to get a decent polynomial (two GeForce 275-weeks ought to suffice) and then present it to frmky as a candidate for 'first NFS@home GNFS'. I'm tempted to organise manually a polynomial search, using msieve_gpu, for some local-record-size GNFS, probably the C188 from 7^347+1 - I have checked that the polynomial for RSA200 is too hard to sieve with 32-bit large primes and gnfs-lasieve4I16e, and that I don't have the resources to run a matrix job with 33-bit large primes, but I'm hoping that the polynomial for that C188 might be on the edge of practicality. I don't know whether a new msieve release might use 96-bit arithmetic on CPU for polynomial search at this level, it's in one sense a complete waste of time to implement since the GPU code is so much faster. But I find that msieve_gpu leaves the machine it's running on pretty much interactively unusable - you get random five-second pauses every fifteen seconds during which the mouse doesn't move and windows don't refresh - and I expect most people have their biggest GPU in their main desktop and wouldn't be willing to use it under those conditions. Any comments? Last fiddled with by fivemack on 2009-11-22 at 22:02 |
|
|
|
|
|
#2 |
|
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2
36·13 Posts |
For another gnfs-180 (2,2254L), msieve144_gpu bailed after half a day of a run. The pol51 poly for the same number has
# norm 1.16e+25 alpha -6.20 Murphy_E 6.31e-14 It could be the card of course. Is your search producing "save"s? |
|
|
|
|
|
#3 |
|
(loop (#_fork))
Feb 2006
Cambridge, England
72·131 Posts |
Yes, I'm getting saves, though it took eight hours to get the first one.
# norm 4.644635e-18 alpha -6.486823 e 3.759e-14 is the best polynomial I've got after twelve hours, but twelve hours into a GNFS-180 polynomial search really isn't very far. I'm getting to the point of believing that a) the card I have is actually reasonably reliable and b) as soon as you have written outside the bounds of an array in CUDA code running on the GPU, your card has entered a wobbly state and will fall over for no clear reason within the next two minutes to 48 hours, almost surely making you think that some other bit of code is to blame. Which means I probably should buy an absolute-minimal computer, with a processor so slow that I don't run sieving on it, install the GTX275 in it, and count on rebooting it after each CUDA segfault, if I plan to do any CUDA development at all. |
|
|
|
|
|
#4 | |
|
Jul 2003
So Cal
2·34·13 Posts |
Quote:
What would be reasonable parameters for this size GNFS? Approximately equivalent to a 270-digit SNFS? I want to hold off for a while getting into anything larger than 2,941-. |
|
|
|
|
|
|
#5 |
|
Tribal Bullet
Oct 2004
3,541 Posts |
Serge has asked before that the 96-bit poly selection code be moved into a CPU version, and it wouldn't be that hard. I estimate that a single modern GPU would be around 35x faster than one core of a comparable CPU when doing poly selection, but Bruce has hundreds of such CPUs available and is not afraid to let them run for a while.
I've noticed the unresponsiveness and there's not much I can do about it, short of using the card in smaller bites and sacrificing some time for extra overhead. At least in windows the refresh rate on the monitor goes way down (a single GPU kernel lasts about 1.2 seconds) but at least background processes do not get stuck. If 2,941 is almost too large for the postprocessing, how much larger can we go before hitting a postprocessing limit? With NFS@home, enormous sieving jobs can take a few weeks, so it won't be enough to rely on Moore's law making memory cheap by the time we need more of it. The obvous solutions are parallel Lanczos and parallel Wiedemann, and while the second is out of the question given my time budget, the first is doable with a concentrated effort. Otherwise my preference is to continue with the stage 2 root sieve for degree 6; it would be kind of cool to see what a good degree 6 polynomial for RSA >= 200 digits would look like. Last fiddled with by jasonp on 2009-11-23 at 05:04 |
|
|
|
|
|
#6 | |
|
"Ben"
Feb 2007
1101101110012 Posts |
Quote:
That, and the job submission queue the clusters use, are barriers to NFS@Home... I'll only be using it manually. Last fiddled with by bsquared on 2009-11-23 at 05:01 Reason: no more air quotes... |
|
|
|
|
|
|
#7 |
|
Oct 2004
Austria
2×17×73 Posts |
I have used a C2D with 1 GB RAM and Windoze XP for sieving so far (no cluster available), and I have seen that 16e jobs are definitely too large to run in the background (a 1M-sized sieving job at the low end of the range took me several weeks of running it just overnight, stopping it every morning and "-R'ing" it in the evening, with the second thread left idle), so I will not participate in record-sized NFSes any more. I plan to shift this cpu to either ECM or to one of the aliqueit subprojects or to the aliqueit-4788-team-sieves.
Last fiddled with by Andi47 on 2009-11-23 at 06:34 Reason: typo |
|
|
|
|
|
#8 | |
|
Jul 2003
So Cal
2·34·13 Posts |
Quote:
|
|
|
|
|
|
|
#9 | ||
|
Jul 2003
So Cal
2×34×13 Posts |
Quote:
Quote:
After spending about 2 1/2 months administering NFS@Home, I'm beginning to think that sieving a 250-digit SNFS in a few days is normal. It's a strange feeling, actually.
|
||
|
|
|
|
|
#10 | |
|
Jun 2008
10010002 Posts |
Quote:
|
|
|
|
|
|
|
#11 | |
|
(loop (#_fork))
Feb 2006
Cambridge, England
11001000100112 Posts |
Quote:
See http://www.mersenneforum.org/showthread.php?t=10495 for the last 180-digit GNFS I had a hand in. |
|
|
|
|