[LUNI] oh oh, my raid it having problems
Jay Strauss
me at heyjay.com
Thu Aug 2 16:09:24 CDT 2007
On 8/2/07, Craig Van Tassle <craig at codestorm.org> wrote:
> I would check /proc/mdstat and if it shows one drive failed then I would replace the drive.
> You should be able to tell what drive is bad when you look at mdstat. I'm assuming you are only using the full drive for
> raid. Get a drive of the same size, remove the failed drive. Install the new drive. Partision it and then hotadd it to
> the array. Depending on size of the partision you should be able to sync it from any time between 10 minutes to 3 hours.
> watch -n 1 'cat /proc/mdstat' will show you the drive as it resyncs. It should not take that long.
>
> Then I would check RMA status and also run a badblocks on the failed drive. That should tell you what went wrong.
> Another thing to do is check the smart status of the drive. That should show if the drive failed and why.
>
> HTH
>
> Craig
Hi Craig, thanks for advice. Here is the output:
iron:~# cat /proc/mdstat
Personalities : [raid1]
md0 : active raid1 hda1[0]
1951744 blocks [2/1] [U_]
md1 : active raid1 hda2[0]
115266304 blocks [2/1] [U_]
unused devices: <none>
I'm guessing it's hda1 that is the problem. I don't know how to print
the partition table. I have 2 disks which I broke into 2 partitions.
On top of those I built two raid 1 devices. On one of the devices
(the big one) I built an LVM.
So how do I print the partition table?
Do I power down the box, remove the drive, replace with new, power
back on and try to sync OR do I do it all while the box is running?
What is RMA how do I check it? Do I stick the failed drive in a
different computer?
Thanks
Jay
More information about the luni
mailing list