mersenneforum.org  

2020-08-28, 16:14   #485
EdH
"Ed Hall"
Dec 2009
Adirondack Mtns
EE9₁₆ Posts

Quote:
Originally Posted by EdH
. . . More later.
The more:

An incompatible change was made to CADO-NFS sometime in March 2020, and WUs fail when they cross that barrier. The server and all clients therefore need to be built from commits on the same side of March 2020.

Currently, there is an open issue that causes an Error -6 interruption on rare occasions. When this error occurs, CADO-NFS cannot complete filtering and won't move on. However, I have been able to run Msieve on the relations found. I have also been able to rerun CADO-NFS to add relations by setting "tasks.sieve.rels_wanted" in the snapshot file and restarting the run from that snapshot, but the rerun still failed at filtering in my tests.
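
For illustration, the snapshot edit described above might be scripted roughly as follows. This is only a sketch: it assumes the snapshot file holds one "key = value" setting per line, and the helper name set_rels_wanted is made up here, not part of CADO-NFS.

```shell
#!/bin/sh
# Hypothetical helper (not part of CADO-NFS): set tasks.sieve.rels_wanted
# in a parameters snapshot file, assuming one "key = value" setting per line.
# Usage: set_rels_wanted SNAPSHOT_FILE NEW_COUNT
set_rels_wanted() {
    snapshot="$1"
    count="$2"
    if grep -q '^tasks\.sieve\.rels_wanted[ =]' "$snapshot"; then
        # Replace the existing line; sed keeps the original as SNAPSHOT_FILE.bak.
        sed -i.bak "s/^tasks\.sieve\.rels_wanted[ =].*/tasks.sieve.rels_wanted = ${count}/" "$snapshot"
    else
        # The key is absent, so append it as a new line.
        printf 'tasks.sieve.rels_wanted = %s\n' "$count" >> "$snapshot"
    fi
}
```

After the edit, the run would be restarted by passing the snapshot file to cado-nfs.py as its only argument. Check the file by hand before restarting, since the "key = value" layout is an assumption.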

2020-08-28, 17:35   #486
RichD
Sep 2008
Kansas
2⁴×211 Posts

So upgrading everything to the above "commit" should be fine, whether server, client or standalone.

2020-08-28, 17:47   #487
VBCurtis
"Curtis"
Feb 2005
Riverside, CA
12FD₁₆ Posts

Quote:
Originally Posted by RichD
So upgrading everything to the above "commit" should be fine, whether server, client or standalone.
Except for the sometimes-broken filtering.

2020-08-28, 17:52   #488
RichD
Sep 2008
Kansas
D30₁₆ Posts

Quote:
Originally Posted by EdH
Currently, ...
Quote:
Originally Posted by VBCurtis
Except for the sometimes-broken filtering.
Ah, yes.

2020-09-07, 19:12   #489
EdH
"Ed Hall"
Dec 2009
Adirondack Mtns
7351₈ Posts

Just thought I'd post some more about the latest commit I'm using, since I mentioned the error (exit -6) trouble.

I'm still running the same commit across my entire "farm." I have experienced the troublesome error only once or twice more, but the bulk of my factorizations have been for composites <160 dd.

I have experienced a couple of other issues, but they may have been present in earlier commits, too. The most annoying: if I run the server without the "--server" switch, quite often one or more of the localhost polyselect processes gets lost. The server then stops everything (except the constant checking for it) until the timeout occurs, after which it reissues the WU and all is well. But waiting an hour (or even the 15 minutes I now have it set at) for a lost WU to be replaced is, as mentioned, "annoying."

Because of this, I now use the "--server" switch and invoke the clients on the server machine externally.
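
As a rough sketch of that setup: the server is started with "--server", and each client is then launched separately with cado-nfs-client.py, using the --server= and --certsha1= values that the server prints at startup. The helper name client_cmd and the URL and hash in the example are placeholders, not real values.

```shell
#!/bin/sh
# Hypothetical helper: print the command line for one external client.
# Usage: client_cmd SERVER_URL CERT_SHA1
client_cmd() {
    printf './cado-nfs-client.py --server=%s --certsha1=%s\n' "$1" "$2"
}

# Example with placeholder values; substitute what your own server prints.
client_cmd "https://myserver:45909" "0123456789abcdef0123456789abcdef01234567"
```

The printed command would be run once for each client process wanted on the server machine.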

2021-04-22, 08:22   #490
bur
Aug 2020
285₁₀ Posts

I read the README and don't quite comprehend why it makes sense to have servers and clients running on the local machine. This example
Code:
Run a full factorization on the local machine, using 8 threads for the
server (this includes the linear algebra), and 4 jobs of 2 threads
each for the polynomial selection and the sieving
makes it seem to me that the server does the LA and the clients the poly-selection and sieving?

But if it's a single machine why not use all cores for poly and sieving and afterwards for the LA?

Or put simply: if I have no intention of running CADO distributed, do I need to concern myself with server & clients? Otherwise I'd just use this to factor a C130 with a custom parameters file:

Code:
./cado-nfs.py --parameters ./parameters/factor/params.c130 C130

Last fiddled with by bur on 2021-04-22 at 08:25

2021-04-22, 14:32   #491
charybdis
Apr 2020
11·31 Posts

Quote:
Originally Posted by bur
I read the README and don't quite comprehend why it makes sense to have servers and clients running on the local machine. This example
Code:
Run a full factorization on the local machine, using 8 threads for the
server (this includes the linear algebra), and 4 jobs of 2 threads
each for the polynomial selection and the sieving
makes it seem to me that the server does the LA and the clients the poly-selection and sieving?

But if it's a single machine why not use all cores for poly and sieving and afterwards for the LA?
This would be useful if, say, your machine has 16 threads and you only want to use 8 of them for CADO. But then you could use -t 8 and I'm not sure what the difference is.

Quote:
Or put simply: if I have no intention of running CADO distributed, do I need to concern myself with server & clients? Otherwise I'd just use this to factor a C130 with a custom parameters file:

Code:
./cado-nfs.py --parameters ./parameters/factor/params.c130 C130
If you want to use all threads on your machine, then this is fine. I'd give you one extra piece of advice, which is to create a directory to store the data and logs for each factorization you run and add workdir=/my/c130/directory to the command line. Otherwise the relations and logs end up in a temporary directory that gets wiped when the job finishes, and if your machine happens to reboot then you've lost the factors.
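
A minimal sketch of that advice: create the per-job directory first, then build the command line around it. The helper name cado_cmd is made up for illustration, and it uses the --workdir= spelling that appears in a command later in this thread. It only prints the command so it can be checked before anything is launched.

```shell
#!/bin/sh
# Hypothetical helper: create a per-job directory and print the command line.
# Usage: cado_cmd WORKDIR PARAMS_FILE NUMBER
cado_cmd() {
    workdir="$1"
    params="$2"
    number="$3"
    # Make sure the directory exists before the job is started.
    mkdir -p "$workdir" || return 1
    printf './cado-nfs.py --parameters %s --workdir=%s %s\n' "$params" "$workdir" "$number"
}
```

For example, cado_cmd /my/c130/directory ./parameters/factor/params.c130 followed by the number to factor prints the full command, which can then be pasted into the shell from the CADO-NFS directory.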

Last fiddled with by charybdis on 2021-04-22 at 14:36 Reason: correct misleading info

2021-04-23, 07:20   #492
bur
Aug 2020
285₁₀ Posts

Thanks, that's helpful advice.

2021-04-23, 17:39   #493
bur
Aug 2020
3×5×19 Posts

I started CADO on a C112 and after this output nothing happens anymore. top shows no activity:

Code:
$ ./cado-nfs.py --parameters ./parameters/factor/vbc/params.c110.txt --workdir=./factorization/C112 2493852636725492741019529213832529872212666052422896535749837232438979207048925012754355124942131700804532974163
Info:root: No database exists yet
Info:Database: Opened connection to database factorization/C112/c110.db
Info:root: Set tasks.linalg.bwc.threads=10 based on detected physical cores
Info:root: Set tasks.threads=20 based on detected logical cpus
Info:root: tasks.threads = 20 [via tasks.threads]
Info:root: tasks.polyselect.threads = 2 [via tasks.polyselect.threads]
Info:root: tasks.sieve.las.threads = 2 [via tasks.sieve.las.threads]
Info:root: tasks.linalg.bwc.threads = 10 [via tasks.linalg.bwc.threads]
Info:root: tasks.sqrt.threads = 8 [via tasks.sqrt.threads]
Info:root: Command line parameters: ./cado-nfs.py --parameters ./parameters/factor/vbc/params.c110.txt --workdir=./factorization/C112 2493852636725492741019529213832529872212666052422896535749837232438979207048925012754355124942131700804532974163
Info:root: If this computation gets interrupted, it can be resumed with ./cado-nfs.py ./factorization/C112/c110.parameters_snapshot.0
Info:HTTP server: Using non-threaded HTTPS server
Info:Lattice Sieving: param rels_wanted is 9200000
Warning:Complete Factorization / Discrete logarithm: Parameter tasks.filter.maxlevel = 21 was not used anywhere
Info:Complete Factorization / Discrete logarithm: Factoring 2493852636725492741019529213832529872212666052422896535749837232438979207048925012754355124942131700804532974163
Info:HTTP server: serving at https://florian-Precision-3640-Tower:45909 (0.0.0.0)
Info:HTTP server: For debugging purposes, the URL above can be accessed if the server.only_registered=False parameter is added
Info:HTTP server: You can start additional cado-nfs-client.py scripts with parameters: --server=https://florian-Precision-3640-Tower:45909 --certsha1=4db43b1c5fb238c8f6fe14fa5bdbb123511ffce9
Info:HTTP server: If you want to start additional clients, remember to add their hosts to server.whitelist
Info:Polynomial Selection (size optimized): Starting
Info:Polynomial Selection (size optimized): 0 polynomials in queue from previous run
Info:Polynomial Selection (size optimized): Adding workunit c110_polyselect1_0-840 to database
Info:Polynomial Selection (size optimized): Adding workunit c110_polyselect1_840-1680 to database
Info:Polynomial Selection (size optimized): Adding workunit c110_polyselect1_1680-2520 to database
Info:Polynomial Selection (size optimized): Adding workunit c110_polyselect1_2520-3360 to database
Info:Polynomial Selection (size optimized): Adding workunit c110_polyselect1_3360-4200 to database
Info:Polynomial Selection (size optimized): Adding workunit c110_polyselect1_4200-5040 to database
Info:Polynomial Selection (size optimized): Adding workunit c110_polyselect1_5040-5880 to database
Info:Polynomial Selection (size optimized): Adding workunit c110_polyselect1_5880-6720 to database
Info:Polynomial Selection (size optimized): Adding workunit c110_polyselect1_6720-7560 to database
Info:Polynomial Selection (size optimized): Adding workunit c110_polyselect1_7560-8400 to database

Also, is it possible to make CADO's screen output more verbose? -v doesn't seem to be recognized. It'd be nice to have some idea of the progress so far, the way yafu/msieve show the number of relations needed and found.

Last fiddled with by bur on 2021-04-23 at 17:53

2021-04-23, 19:05   #494
VBCurtis
"Curtis"
Feb 2005
Riverside, CA
4,861 Posts

Looks like no clients ran. I don't have experience with using params files in non-default locations, but you can try adding --tasks.slaves.nrclients=10 to the command line and see if that triggers your setup to fire 10 (two-threaded by default) client processes for poly select and sieving.

Both phases will show progress in much detail on-screen, when they're actually running.

You might try invoking cado with no flags at all to see if that works; that would help us figure out why the custom params file isn't triggering client startup.

2021-04-23, 19:31   #495
charybdis
Apr 2020
101010101₂ Posts

I think this could be a manifestation of a bug that was introduced by revisions 0cb1e01c and f15ee502 in November. It caused clients to die immediately and silently with certain versions of Python, and I don't think it was ever fully fixed, because I kept getting the bug with Python 3.5.2 even after the supposed fix. Never got round to talking to the developers about it.

Try building an old revision - perhaps 6605b6e4, the last one before the bug was introduced - and see if you still have problems.
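
If the source tree was cloned with git, switching to that revision could look something like the sketch below. The helper name checkout_revision is hypothetical, and the rebuild step is left as a comment because it assumes the usual "make" build in the source directory.

```shell
#!/bin/sh
# Hypothetical helper: check out a given revision in a git working tree and
# print the abbreviated hash that is now checked out.
# Usage: checkout_revision REPO_DIR REVISION
checkout_revision() {
    git -C "$1" checkout --quiet "$2" || return 1
    git -C "$1" rev-parse --short=8 HEAD
}

# Afterwards, rebuild in REPO_DIR (assumed here to be a plain "make").
```

For example, checkout_revision /path/to/cado-nfs 6605b6e4 should print 6605b6e4 once the switch has succeeded. The path is a placeholder.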