Jun 26th, 2011

Oh well. Short power failure killed nodes n0001-n0004. n0002 upon reboot 'lost' two cores (bloody BIOS update to 3.1 and bloody 'green' UPS's). Decided to try my luck remotely via flashbios hoping to go back to BIOS v.1.4. Unfortunately, but not unexpectedly, this didn't work as expected, especially when considering that I used a BIOS dump of a different node. After a short trip to access the cluster physically, and a few problems with MAC addresses, n0002 is up again, but still on BIOS v.3.1. Damn …

2011/06/26 22:43

Jun 25th, 2011

TS over station. Five nodes didn't respond to wake-on-LAN, they will have to wait till tomorrow (assuming the power stays on o/n, which appears to be doubtful).

Second close-spaced black-out killed another two nodes. The Déjà vu is strong :

”…

Three little Soldier boys walking in the zoo; A big bear hugged one and then there were two.

Two Little Soldier boys sitting in the sun; One got frizzled up and then there was one.

One little Soldier boy left all alone; He went out and hanged himself and then there were none.”

2011/06/26 00:05

<< Newer entries | Older entries >>

The full maintenance archive is kept here

…and finally, The infamous MBG's Power Failure Log

about/maintenance.txt · Last modified: 2011/01/31 17:56 (external edit)