> Hi folks,
>> I've run into a slight problem here, namely a dead Linux cluster!
What type of cluster?
> Basically, on start-up I am being informed that one of the disks in the
> RAID is dead or dying.
> That's fair enough.. however, the system then goes on to boot and hits a
> kernel panic and stops booting.
Whats the panic say then?
> I've a sneaking feeling that we've got some corruption in the root file
> system.
> Obviously this isn't exactly great news. However... at the moment I have
> one quick and simple question.
> Should I try and get in as a single user or using the rescue environment
> to see what I can do now (FSCK?)
> or would it be more prudent to wait until I replace the dodgy disk in the
> RAID?
Think of a RAID array as simply a LUN which has lost its redundancy.
Doing an fsck will simply add more I/O's (during a reconstruction), so the
question should be:
Can you afford to wait for the new disk? Does this have to go immediately
back into production? Do you have any (verified) backups in case it goes
tits-up?
I recently dealt with such an issue - but the entire LUN was gone due to
silliness, and when they went to restore from backup, they discovered the
backups were giving I/O errors. Luckily they had a secondary backup which
worked grand.
> Cheers
> Austin
--
Conor Wynne
http://www.conorwynne.com/
YZF-R6 http://yzf-r6.kicks-ass.net
Maintained by the ILUG website team. The aim of Linux.ie is to
support and help commercial and private users of Linux in Ireland. You can
display ILUG news in your own webpages, read backend
information to find out how. Networking services kindly provided by HEAnet, server kindly donated by
Dell. Linux is a trademark of Linus Torvalds,
used with permission. No penguins were harmed in the production or maintenance
of this highly praised website. Looking for the
Indian Linux Users' Group? Try here. If you've read all this and aren't a lawyer: you should be!