#4192 |
|
"Kieren"
Jul 2011
In My Own Galaxy!
2·3·1,693 Posts |
#4193 |
|
If I May
"Chris Halsall"
Sep 2002
Barbados
3³×19² Posts |
Yeah... Thanks...
Another fscking Seagate drive!!! And this time it didn't even throw any SMART warnings!
I was editing a file on the server, and went to save it. The console hung. Went to another console and asked for a SMART report (smartctl -a /dev/sda) and that hung. Went to yet another console, and killed the smartctl command; kernel panic. Logged into the serial console and asked for a reboot:
Code:
Sep 5 13:06:51 gpu72 kernel: ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
Sep 5 13:06:51 gpu72 kernel: ata1.00: failed command: FLUSH CACHE EXT
Sep 5 13:06:51 gpu72 kernel: ata1.00: cmd ea/00:00:00:00:00/00:00:00:00:00/a0 tag 2#012 res 40/00:01:00:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
Sep 5 13:06:51 gpu72 kernel: ata1.00: status: { DRDY }
Sep 5 13:06:51 gpu72 kernel: ata1: hard resetting link
Sep 5 13:06:51 gpu72 kernel: ata1: link is slow to respond, please be patient (ready=0)
Sep 5 13:06:51 gpu72 kernel: ata1: COMRESET failed (errno=-16)
Sep 5 13:06:51 gpu72 kernel: ata1: hard resetting link
Sep 5 13:06:51 gpu72 kernel: ata1: link is slow to respond, please be patient (ready=0)
Sep 5 13:06:51 gpu72 kernel: ata1: COMRESET failed (errno=-16)
Sep 5 13:06:51 gpu72 kernel: ata1: hard resetting link
Sep 5 13:06:51 gpu72 kernel: ata1: link is slow to respond, please be patient (ready=0)
Sep 5 13:06:51 gpu72 kernel: ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Sep 5 13:06:51 gpu72 kernel: ata1.00: ACPI cmd f5/00:00:00:00:00:00 (SECURITY FREEZE LOCK) filtered out
Sep 5 13:06:51 gpu72 kernel: ata1.00: ACPI cmd b1/c1:00:00:00:00:00 (DEVICE CONFIGURATION OVERLAY) filtered out
Sep 5 13:06:51 gpu72 kernel: ata1.00: qc timeout (cmd 0xec)
Sep 5 13:06:51 gpu72 kernel: ata1.00: failed to IDENTIFY (I/O error, err_mask=0x4)
Sep 5 13:06:51 gpu72 kernel: ata1.00: failed to IDENTIFY after ACPI commands
Sep 5 13:06:51 gpu72 kernel: ata1.00: revalidation failed (errno=-5)
Sep 5 13:06:51 gpu72 kernel: ata1: hard resetting link
Sep 5 13:06:51 gpu72 kernel: ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Sep 5 13:06:51 gpu72 kernel: ata1.00: ACPI cmd f5/00:00:00:00:00:00 (SECURITY FREEZE LOCK) filtered out
Sep 5 13:06:51 gpu72 kernel: ata1.00: ACPI cmd b1/c1:00:00:00:00:00 (DEVICE CONFIGURATION OVERLAY) filtered out
Sep 5 13:06:51 gpu72 kernel: ata1.00: ACPI cmd f5/00:00:00:00:00:00 (SECURITY FREEZE LOCK) filtered out
Sep 5 13:06:51 gpu72 kernel: ata1.00: ACPI cmd b1/c1:00:00:00:00:00 (DEVICE CONFIGURATION OVERLAY) filtered out
Sep 5 13:06:51 gpu72 kernel: ata1.00: configured for UDMA/133
Sep 5 13:06:51 gpu72 kernel: ata1.00: retrying FLUSH 0xea Emask 0x4
Sep 5 13:06:51 gpu72 kernel: ata1: EH complete
1&1 have scheduled a drive replacement (AGAIN!!!!). Third time for this machine. They also promised to check all the cabling, and if this ever happens again they'll replace the machine.
Have I ever mentioned I hate computers? (And don't even get me started on SeaCrap!)
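For anyone wanting to triage a log like the one above, here is a minimal sketch in Python that tallies libata error-handling events per ATA port. The message substrings are taken from the log lines in this post; the labels, `PATTERNS`, and `summarize` are names made up for illustration, not part of any standard tool, and the sample is only a subset of the lines above.

```python
import re
from collections import Counter

# A subset of the syslog lines from the post above, copied verbatim.
SAMPLE_LOG = """\
Sep 5 13:06:51 gpu72 kernel: ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
Sep 5 13:06:51 gpu72 kernel: ata1.00: failed command: FLUSH CACHE EXT
Sep 5 13:06:51 gpu72 kernel: ata1: hard resetting link
Sep 5 13:06:51 gpu72 kernel: ata1: COMRESET failed (errno=-16)
Sep 5 13:06:51 gpu72 kernel: ata1: hard resetting link
Sep 5 13:06:51 gpu72 kernel: ata1: COMRESET failed (errno=-16)
Sep 5 13:06:51 gpu72 kernel: ata1.00: failed to IDENTIFY (I/O error, err_mask=0x4)
Sep 5 13:06:51 gpu72 kernel: ata1: EH complete
"""

# Captures the port ("ata1") and the message, ignoring an optional device suffix (".00").
LINE_RE = re.compile(r"kernel: (ata\d+)(?:\.\d+)?: (.*)$")

# Hypothetical labels mapped to substrings seen in the log above.
PATTERNS = {
    "hard_reset": "hard resetting link",
    "comreset_failed": "COMRESET failed",
    "failed_command": "failed command:",
    "identify_failed": "failed to IDENTIFY",
}


def summarize(log_text):
    """Return {port: Counter(label -> count)} for the known message patterns."""
    counts = {}
    for line in log_text.splitlines():
        match = LINE_RE.search(line)
        if not match:
            continue  # not a libata kernel message
        port, message = match.groups()
        for label, needle in PATTERNS.items():
            if needle in message:
                counts.setdefault(port, Counter())[label] += 1
    return counts


if __name__ == "__main__":
    for port, tally in summarize(SAMPLE_LOG).items():
        print(port, dict(tally))
```

Repeated hard resets and COMRESET failures on one port with no preceding SMART warnings are consistent with what the post describes, but a tally like this cannot by itself distinguish a failing drive from bad cabling or a bad backplane.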
#4194 |
|
If I May
"Chris Halsall"
Sep 2002
Barbados
3³·19² Posts |
OK. The girl is back... No data-loss (I don't think).
And although 1&1 insisted on installing yet another SeaCRAP drive, at least it's a different model this time: "Seagate Constellation CS" instead of a "Seagate Barracuda 7200.14 (AF)". They also changed the drive carriages to a newer model with better airflow.
RAID1 rebuild is currently under way, so there may be a bit of sluggishness for an hour or so.
Please let me know if anyone sees anything weird. You never know with this kind of thing....
Edit: OK. Except for the file I was editing at the time of the crash, everything looks good. DB sanity checks passed, etc.
Last fiddled with by chalsall on 2018-09-05 at 22:05
#4195 | ||
|
If I May
"Chris Halsall"
Sep 2002
Barbados
3³×19² Posts |
Hey All.
Just to document the exchange I had with 1&1 after the outage, trying to give constructive criticism. Quote:
Quote:
And, so, you manage the situation... Know the lay of the land, and always be prepared to deal with everything which could possibly happen.... |
#4196 |
|
Romulan Interpreter
Jun 2011
Thailand
23×419 Posts |
Hey Chris, I remember reading this a few days ago, but I skipped it at the time, not being interested. It seems to be repeating, though: I also got assigned 39M TF from 71 to 74 (?!?), 18 (eighteen) of them. Was that supposed to happen? After changing the bit level to higher/lower, I started getting correct assignments.
For now, I will let them finish, but for the future, if you offer assignments outside the expected ranges, please add them to the tables. I always look at the tables when I request assignments, and I don't like surprises. Mind that I do not question your reasons for assigning this or that; I only question the fact that you gave me some work which I didn't expect, because it was not shown as available. If you assign the 39M range for whatever reason, add it to the table, so I can see it.
#4197 | ||
|
If I May
"Chris Halsall"
Sep 2002
Barbados
3³·19² Posts |
Quote:
I somehow managed to forget to remove the last few candidates (24#) from the DB, and you chose unusual criteria in your query, which resulted in those being given to you.
Quote:
For the last several months when my GPUs are working at lower levels I've been working "off the books", to avoid this kind of thing. And I've made sure that the GPU72 DB is clean of such potential erroneous assignments. |
#4198 |
|
If I May
"Chris Halsall"
Sep 2002
Barbados
3³·19² Posts |
Hey All. Just a heads up...
Barbados is about to be hit by Tropical Storm Kirk. We're prepared, but there's a chance I might lose power and/or connectivity for a while. GPU72 is pretty steady-state at the moment, so there shouldn't be any issues. But if you don't see me around for a bit, you'll know why. |
#4199 | |
|
"Kieren"
Jul 2011
In My Own Galaxy!
23656₈ Posts |
Quote:
#4200 |
|
If I May
"Chris Halsall"
Sep 2002
Barbados
3³·19² Posts |
Thanks. And yeah, our home is about 100 m above sea level. Although Linda's office is only about 5 m above...
Kirk turned out to be a bit anti-climactic. It passed north of us, so we only saw about 50 km/h winds, and about 15 cm of rain over eight hours.
There's a saying around Bimshire: "God is a Bajan." This annoys the heck out of me, because people use it as a rationale for why they don't need to be properly prepared for storms. But, in Kirk's case, the empirical evidence supports the argument: after passing us it then dropped back down south to our latitude and continued on east....
Last fiddled with by chalsall on 2018-09-28 at 14:54 Reason: s/shouldn't be/don't need to be/;
#4201 |
|
"Kieren"
Jul 2011
In My Own Galaxy!
2·3·1,693 Posts |
That's good to hear. Five meters could be a close thing on the bad side of a strong storm. I am happy for the miss.
Last fiddled with by kladner on 2018-09-29 at 06:43 |
#4202 |
|
1976 Toyota Corona years forever!
"Wayne"
Nov 2006
Saskatchewan, Canada
11112₈ Posts |
AWOL?