teh bigbro blog(tm)
Bigbro's foray into the scary world of blogging

Tue, 07 Aug 2007

Outage for atlas...

It appears that Hetzner had a problem with one of their server rooms last week, resulting in atlas (one of my servers) losing connectivity to the outside world. Once again, the Hetzner team were quick to respond once I requested them to take a look at the server, and a reboot restored it from whatever state it had gotten itself into.
My munin stats show about 24 hours of missing time, which correlates well with their description of the outage times and when I realised the server was off the air. More annoying though is that I appear to be missing a large chunk of smokeping graphs. It looks like a couple of weeks worth of data in the month of July was dumped. I can only surmise that there was a power outage just as the RRD files were being written, resulting in a large amount of data being unceremoniously lost :-(
I guess the 24 hour outage highlights that I should have something monitoring my monitoring server. I've added it to my 'TODO' list...
posted at: 11:15 | path: /technical | permanent link to this entry


copyright © 2005-2008, Gareth Eason