 2019-06-11, 09:55 #221 SethTro     "Seth" Apr 2019 A4₁₆ Posts Also, we're 10% done with sieving! The ETA has finally stabilized at late August, which is ~76 days out (assuming vebis stays on; otherwise ~150 days).
 2019-06-11, 11:20 #222 lukerichards     "Luke Richards" Jan 2018 Birmingham, UK 2⁵×3² Posts Can someone with access to the raw data check what's happening with lukerichards-pre1 please? Been stuck on 9.1% for a while with no updates since 2019-06-10 14:32:13 despite the fact the instance is running and 'top' reads ~800% CPU usage, with ~hourly network traffic spikes. As CADO is running as a cron job, I can't check the shell output, but will check the log files later. EDIT: Having said that, I can't see anything in the cado folder which is obviously a local log file... does such a thing exist? Last fiddled with by lukerichards on 2019-06-11 at 11:26
2019-06-11, 12:18   #223
Thomas11

Feb 2003

3×5×127 Posts

Quote:
 Originally Posted by lukerichards EDIT: Having said that, I can't see anything in the cado folder which is obviously a local log file... does such a thing exist?
I'm not aware of any local log file.
But you may have a look into your work directory: lukerichards-pre1.xxxxxxx.work.
If your client is still running properly, then the output file 2330L.c207.xxxxx000-xxxxx000.gz should be updated every few minutes...
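
The "updated every few minutes" check can be automated. Below is a minimal sketch, not part of cado-nfs: the function name `stale_outputs` and the 30-minute threshold are assumptions chosen for illustration, and it only looks at the modification time of `.gz` files in whatever work directory you pass in.

```python
import time
from pathlib import Path


def stale_outputs(work_dir, max_age_seconds=1800, now=None):
    """Return (file name, age in seconds) for every .gz file in work_dir
    whose last modification is older than max_age_seconds.

    The 1800 s default is an arbitrary assumption; pick a threshold that
    suits how often your client normally writes.
    """
    now = time.time() if now is None else now
    stale = []
    for path in sorted(Path(work_dir).glob("*.gz")):
        age = now - path.stat().st_mtime
        if age > max_age_seconds:
            stale.append((path.name, age))
    return stale
```

An empty list means every output file was touched recently. Note that a directory containing no `.gz` files at all also returns an empty list, so check that the directory actually contains output before trusting the result.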

2019-06-11, 15:02   #224
lukerichards

"Luke Richards"
Jan 2018
Birmingham, UK

2⁵×3² Posts

Quote:
 Originally Posted by Thomas11 I'm not aware of any local log file. But you may have a look into your work directory: lukerichards-pre1.xxxxxxx.work. If your client is still running properly, then the output file 2330L.c207.xxxxx000-xxxxx000.gz should be updated every few minutes...
Code:
{username}@lukerichards-pre1:~/cado$ ls -lt -d *.work
drwxr-xr-x 2 {username} {username} 4096 Jun  6 15:34 lukerichards-pre1.23245b81.work
Interesting that the instance seems to have started dumping the workfiles into a localhost folder, not a 'lukerichards-pre1' folder.

I literally have not changed a thing since it was working - this change did not start when I began running it from boot in crontab; it started after that. So I've no idea what's gone wrong! Is there any way to check if my work units are being successfully reported, albeit under localhost? I note there are some 'localhost' entries on the list, with ~7 CPU days of work, which is about right considering it has been about a day of 8 CPUs...

2019-06-11, 15:20   #225
Thomas11

Feb 2003

3561₈ Posts

Quote:
 Originally Posted by lukerichards Interesting that the instance seems to have started dumping the workfiles into a localhost folder, not a 'lukerichards-pre1' folder. I literally have not changed a thing since it was working - this change did not start when I began running it from boot in crontab; it started after that. So I've no idea what's gone wrong!
Don't know what's going wrong there.
But you could force it to use your intended ID by adding the following to the command line:
Code:
--clientid=lukerichards-pre1
or
Code:
--clientid=lukerichards-pre1.1
--clientid=lukerichards-pre1.2
--clientid=lukerichards-pre1.3
--clientid=lukerichards-pre1.4
...
if you're running multiple instances on the same machine. The trailing number just replaces the "random" number generated by cado-nfs.
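
If you launch several instances from a script, the IDs can be generated rather than typed out. This is only a sketch of the naming scheme described above: `client_id_args` is a hypothetical helper, and it builds nothing but the `--clientid=...` strings, leaving the rest of your client command line untouched.

```python
def client_id_args(base, instances=1):
    """Build one --clientid argument per instance.

    A single instance gets the bare name; several instances get
    base.1, base.2, ... so each has a distinct, predictable ID.
    """
    if instances < 1:
        raise ValueError("instances must be at least 1")
    if instances == 1:
        return [f"--clientid={base}"]
    return [f"--clientid={base}.{i}" for i in range(1, instances + 1)]
```

For example, `client_id_args("lukerichards-pre1", 4)` yields the four arguments listed above, one for each instance's command line.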

 2019-06-11, 17:12 #226 VBCurtis     "Curtis" Feb 2005 Riverside, CA 2²·23·47 Posts Localhost has been submitting workunits for a few days now; I can get the IP address PMed to you to confirm it's you.
2019-06-11, 18:47   #227
lukerichards

"Luke Richards"
Jan 2018
Birmingham, UK

100100000₂ Posts

Quote:
 Originally Posted by VBCurtis Localhost has been submitting workunits for a few days now; I can get the IP address PMed to you to confirm it's you.

 2019-06-12, 00:31 #228 SethTro     "Seth" Apr 2019 2²·41 Posts I'm hungry to add more badges. Can anyone explain (or point me at resources explaining) what Average J and special-q represent?
Code:
Info:Lattice Sieving: Total number of relations: 19266332
Info:Lattice Sieving: Average J: 7782.82 for 77739 special-q, max bucket fill: 0.628863
 2019-06-12, 04:25 #229 VBCurtis     "Curtis" Feb 2005 Riverside, CA 2²×23×47 Posts special-q are the actual prime values we sieve over. When you run a workunit of Q from, say, 30200000 to 30202000, you're running the sieve program las over each prime (I think? Are some primes skipped for algorithmic reasons?) within that range. Quite a lot of the variation in relations found is related to some ranges having more primes than others. So, while we speak in terms of yield of relations per Q-range, it's actually more accurate to speak in terms of relations per special-q sieved. Both kinds of yield decrease as Q rises: there are fewer primes in each range, and fewer relations found per prime. If you divide total relations found by total Q searched, our yield is around 9.0 so far. My test-sieving indicates the yield at Q=100M will be in the low 7's, a 20% drop. By 200M, it's 6.0 and sec/rel is around double its current value. I don't know what J means in CADO context.
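
To see how much the number of primes per workunit varies, you can count them directly. The sketch below is not how las selects special-q (as noted above, some primes may be skipped); it just counts primes in a half-open range with a small segmented sieve. Whether a workunit's range is half-open is an assumption, and `primes_in_range`, `expected_primes` and `relative_drop` are illustrative names.

```python
from math import isqrt, log


def primes_in_range(lo, hi):
    """Count primes p with lo <= p < hi using a segmented sieve."""
    limit = isqrt(hi) + 1
    # Ordinary sieve for the small primes up to about sqrt(hi).
    small = bytearray([1]) * (limit + 1)
    small[0:2] = b"\x00\x00"
    for i in range(2, isqrt(limit) + 1):
        if small[i]:
            small[i * i::i] = bytearray(len(range(i * i, limit + 1, i)))
    # Cross out multiples of each small prime inside [lo, hi).
    segment = bytearray([1]) * (hi - lo)
    for p in range(2, limit + 1):
        if small[p]:
            start = max(p * p, ((lo + p - 1) // p) * p)
            for multiple in range(start, hi, p):
                segment[multiple - lo] = 0
    # 0 and 1 are not prime.
    for n in range(lo, min(hi, 2)):
        segment[n - lo] = 0
    return sum(segment)


def expected_primes(lo, hi):
    """Prime number theorem estimate: roughly (hi - lo) / ln(midpoint)."""
    return (hi - lo) / log((lo + hi) / 2)


def relative_drop(before, after):
    """Fractional decrease from before to after, e.g. a yield of 9.0 -> 6.0."""
    return (before - after) / before


if __name__ == "__main__":
    print(primes_in_range(30_200_000, 30_202_000))
    print(expected_primes(30_200_000, 30_202_000))
    print(expected_primes(200_000_000, 200_002_000))
    print(relative_drop(9.0, 6.0))
```

The estimate gives roughly 116 primes per 2000-wide range near Q=30.2M and roughly 105 near Q=200M, so thinning primes account for only part of the drop in yield per Q-range; the rest comes from fewer relations per prime.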
2019-06-12, 05:16   #230
SethTro

"Seth"
Apr 2019

2²·41 Posts

Quote:
 Originally Posted by VBCurtis special-q are the actual prime values we sieve over. When you run a workunit of Q from, say, 30200000 to 30202000, you're running the sieve program las over each prime (I think? Are some primes skipped for algorithmic reasons?) within that range. Quite a lot of the variation in relations found is related to some ranges having more primes than others. So, while we speak in terms of yield of relations per Q-range, it's actually more accurate to speak in terms of relations per special q sieved. Both kinds of yield decrease as Q rises: there are fewer primes in each range, and fewer relations found per prime. If you divide total relations found by total Q searched, our yield is around 9.0 so far. My test-sieving indicates the yield at Q=100M will be in the low 7's, a 20% drop. By 200M, it's 6.0 and sec/rel is around double its current value. I don't know what J means in CADO context.
Thanks for the detailed description. It's informative and solidifies my understanding.

Is there anything else you'd like for monitoring? Maybe time since last workunit?

 2019-06-13, 15:49 #231 lukerichards     "Luke Richards" Jan 2018 Birmingham, UK 2⁵×3² Posts
Code:
host               # of workunits  Relations             CPU-days  Last workunit submitted
instance-1         970             9236551 (2.8% total)  45.6      2019-06-05 23:27:08,078
lukerichards-pre1  175             3180220 (0.9% total)  21.9      2019-06-13 08:18:30,440
Both machines are my VM instances running on Google Cloud. I'm fairly sure both have the same architecture (although they could be different). lukerichards-pre1 has managed 48% of the CPU-days of instance-1 but found only 34% as many relations. Is this within expected bounds of variance? I'm assuming the huge difference in # of work units is because of the change in work unit size?
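
The comparison is easier to judge as relations per CPU-day. Here is a quick sketch using only the figures from the table above; the variable names are illustrative, and it says nothing about which Q ranges each host was given, which affects yield as discussed earlier in the thread.

```python
# Figures copied from the table above.
hosts = {
    "instance-1": {"relations": 9236551, "cpu_days": 45.6},
    "lukerichards-pre1": {"relations": 3180220, "cpu_days": 21.9},
}

# Relations found per CPU-day for each host.
rate = {
    name: stats["relations"] / stats["cpu_days"]
    for name, stats in hosts.items()
}

new, old = hosts["lukerichards-pre1"], hosts["instance-1"]
cpu_share = new["cpu_days"] / old["cpu_days"]        # about 0.48
relation_share = new["relations"] / old["relations"]  # about 0.34
rate_ratio = rate["lukerichards-pre1"] / rate["instance-1"]

if __name__ == "__main__":
    for name, value in rate.items():
        print(f"{name}: {value:,.0f} relations per CPU-day")
    print(f"rate ratio: {rate_ratio:.3f}")
```

On these numbers lukerichards-pre1 is finding relations at roughly 72% of instance-1's rate per CPU-day. Since instance-1 last reported on 2019-06-05, it probably worked at lower Q, where yield is higher, so at least part of the gap may be expected.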

