mersenneforum.org  

Go Back   mersenneforum.org > Other Stuff > Archived Projects > NFSNET Discussion

 
 
Thread Tools
Old 2004-01-26, 17:11   #1
Wacky
 
Wacky's Avatar
 
Jun 2003
The Texas Hill Country

44116 Posts
Default Trouble at t' mill

At around 1500 UT today, Monday 26th January, the NFSNET server here at
MSR Cambridge UK lost its disk which holds all the NFSNET data. Neither
the operating system nor the BIOS can see that disk any longer. This is
not good news 8-(

Richard Wackerbarth and I have worked to restore service. The data
disk here is still unavailable so we put a skeleton server into
operation on the system disk. There is just enough information in place
to allow us to redirect all the clients which were taking tasks from the
Cantab and the Fenland pools to their counterpart pools which are hosted
on Richard's server in Austin, Texas. At the time of typing it's about
20 minutes after setting up the redirections and several dozen clients
have already transferred.


Now that the emergency work has been completed, I will concentrate on
getting my server back into operation. I will try very hard to regain
access to the disk and its data because there is approximately 15 hours
worth of sieving from about 125 clients on it which is not stored
anywhere else. All data which arrived before 0000UT today has already
been copied to Austin for safekeeping.

Apologies for the drop in service. We were relatively fortunate in that
it occurred during the working day in both Austin and Cambridge, that we
noticed in good time and that we had an emergency response procedure
prepared and ready to be activated.


More news when we have it.


Paul & Richard
Wacky is offline  
Old 2004-01-26, 17:30   #2
junky
 
junky's Avatar
 
Jan 2004

7·19 Posts
Default

that's bad :(
I've a question about pool:
i've remark sometime is Fenland, sometime is Texas. what that mean ?
it is just the place where its getting the info from ?
thanks.
junky is offline  
Old 2004-01-27, 03:39   #3
Jeff Gilchrist
 
Jeff Gilchrist's Avatar
 
Jun 2003
Ottawa, Canada

49516 Posts
Default

The pool is the server you are connecting to in order to get assignments. There are several pools setup (in the UK and Texas). If there is a problem with one server (like above) people will be redirected to another so they can continue to process work. It is mainly used to spread out the load from all the users.
Jeff Gilchrist is offline  
Old 2004-01-27, 04:28   #4
dsouza123
 
dsouza123's Avatar
 
Sep 2002

2×331 Posts
Default

Maybe a safety feature of storing the last 24 hours/last dozen results.txt files on the PC so if the work is lost on the server it could be resent.

The client could tell the server the date/time of it's stored files if they are old ( hadn't done work in a while) or they are securely on a server ( if I understand correctly, the work gets backed up to another server roughly daily) they are deleted.

On my PC Athlon 1200 1.5 hours = 24kb results file so 24 hours worth would be about 384kb.
dsouza123 is offline  
Old 2004-01-27, 05:11   #5
junky
 
junky's Avatar
 
Jan 2004

2058 Posts
Default

RAID5 isn't there for that purpose ?
junky is offline  
Old 2004-01-27, 17:31   #6
xilman
Bamboozled!
 
xilman's Avatar
 
"π’‰Ίπ’ŒŒπ’‡·π’†·π’€­"
May 2003
Down not across

10,753 Posts
Smile MSR Cambridge server back up again

The server at MSR Cambridge is now serving again. Almost all
NFSNET clients are being moved back to the Cantab and Fenland
pools. Several dozen have already transferred.

I replaced the failed disk this morning and restored data from
a backup.

The old disk did spin up again when I plugged it into another
machine and it is now approximately 80% of the way through
copying all its data on to another machine. So, apart from a
couple of hours yesterday between the disk dying and everyone
being moved over to a Texas server, we haven't lost any data.

It will take a little time before the data on the old disk makes
its way into the warehouse and the stats appear on the web pages.
With luck, they may appear tomorrow. Otherwise, they should be
there on Thursday.

Again, apologies for the interruption in our service.


Paul
xilman is offline  
Old 2004-01-27, 17:56   #7
junky
 
junky's Avatar
 
Jan 2004

7×19 Posts
Default

hi paul,
im now getting that output:
Code:
17:54:40 NFSNET Client - V 1.0
17:54:40 Initializing...
17:54:40 Initialized.
17:54:40 Requesting assignment...
17:54:43 Received assignment (  IDLE  0-0)...
why its idle? still getting the info from the backup??

thanks.
junky is offline  
 



Similar Threads
Thread Thread Starter Forum Replies Last Post
The Mill CPU architecture ewmayer Hardware 5 2014-05-03 19:58
asm trouble science_man_88 Programming 52 2010-10-06 21:52
Is Entropia in trouble? ekugimps PrimeNet 1 2005-09-09 16:18
39 - 39.25 Mill to 2^61 moo Lone Mersenne Hunters 7 2005-04-30 02:08
Trouble with display ThomRuley Hardware 3 2005-01-17 23:11

All times are UTC. The time now is 23:56.


Fri Jul 16 23:56:52 UTC 2021 up 49 days, 21:44, 1 user, load averages: 1.94, 1.74, 1.52

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.