[LUNI] oh oh, my raid it having problems
Craig Van Tassle
craig at codestorm.org
Thu Aug 2 14:46:24 CDT 2007
I would check /proc/mdstat and if it shows one drive failed then I would replace the drive.
You should be able to tell what drive is bad when you look at mdstat. I'm assuming you are only using the full drive for
raid. Get a drive of the same size, remove the failed drive. Install the new drive. Partision it and then hotadd it to
the array. Depending on size of the partision you should be able to sync it from any time between 10 minutes to 3 hours.
watch -n 1 'cat /proc/mdstat' will show you the drive as it resyncs. It should not take that long.
Then I would check RMA status and also run a badblocks on the failed drive. That should tell you what went wrong.
Another thing to do is check the smart status of the drive. That should show if the drive failed and why.
HTH
Craig
Jay Strauss wrote:
> Hi,
>
> I have a raid1 fileserver at home, where I store all my home
> directories. I just noticed in my mail the messages below.
>
> I'm a bit frightened.
>
> All my files seem to be accessable, but not sure what my next steps should be.
>
> Any help, suggestions, guidence would be appreciated.
>
> Thank you
> Jay
>
>> U 1 root at localhost.lo Sat Jul 02 18:50 21/718 Fail event on /dev/md1:iron
> U 2 root at localhost.lo Sat Oct 28 20:54 21/736 DegradedArray
> event on /dev/md0:iron
> U 3 root at localhost.lo Sat Oct 28 20:54 21/736 DegradedArray
> event on /dev/md1:iron
> U 4 root at localhost.lo Sat Oct 28 21:18 21/736 DegradedArray
> event on /dev/md0:iron
> U 5 root at localhost.lo Sat Oct 28 21:18 21/736 DegradedArray
> event on /dev/md1:iron
> U 6 root at localhost.lo Sat Oct 28 21:51 21/736 DegradedArray
> event on /dev/md0:iron
> U 7 root at localhost.lo Sat Oct 28 21:51 21/736 DegradedArray
> event on /dev/md1:iron
> U 8 root at localhost.lo Sat Oct 28 22:19 21/736 DegradedArray
> event on /dev/md0:iron
> U 9 root at localhost.lo Sat Oct 28 22:19 21/736 DegradedArray
> event on /dev/md1:iron
> U 10 root at localhost.lo Sat Oct 28 22:22 21/736 DegradedArray
> event on /dev/md0:iron
> U 11 root at localhost.lo Sat Oct 28 22:22 21/736 DegradedArray
> event on /dev/md1:iron
> U 12 root at localhost.lo Sat Oct 28 22:25 21/736 DegradedArray
> event on /dev/md0:iron
> U 13 root at localhost.lo Sat Oct 28 22:25 21/736 DegradedArray
> event on /dev/md1:iron
> U 14 root at localhost.lo Sun Feb 04 23:01 21/736 DegradedArray
> event on /dev/md0:iron
> U 15 root at localhost.lo Sun Feb 04 23:01 21/736 DegradedArray
> event on /dev/md1:iron
> U 16 root at localhost.lo Sun Apr 29 20:53 21/736 DegradedArray
> event on /dev/md0:iron
> U 17 root at localhost.lo Sun Apr 29 20:53 21/736 DegradedArray
> event on /dev/md1:iron
> U 18 root at localhost.lo Sun May 13 10:18 21/736 DegradedArray
> event on /dev/md0:iron
> U 19 root at localhost.lo Sun May 13 10:18 21/736 DegradedArray
> event on /dev/md1:iron
> U 20 root at localhost.lo Thu May 17 21:06 21/736 DegradedArray
> event on /dev/md0:iron
> U 21 root at localhost.lo Thu May 17 21:06 21/736 DegradedArray
> event on /dev/md1:iron
> U 22 root at localhost.lo Thu May 17 21:15 21/736 DegradedArray
> event on /dev/md0:iron
> U 23 root at localhost.lo Thu May 17 21:15 21/736 DegradedArray
> event on /dev/md1:iron
> U 24 root at localhost.lo Fri May 18 08:17 21/736 DegradedArray
> event on /dev/md1:iron
> U 25 root at localhost.lo Fri May 18 08:17 21/736 DegradedArray
> event on /dev/md0:iron
> U 26 root at localhost.lo Mon Jul 30 21:48 21/736 DegradedArray
> event on /dev/md0:iron
> U 27 root at localhost.lo Mon Jul 30 21:48 21/736 DegradedArray
> event on /dev/md1:iron
> U 28 Mailer-Daemon at loc Thu Aug 02 13:25 38/1394 Mail delivery
> failed: returning message to sender
> U 29 Mailer-Daemon at loc Thu Aug 02 13:26 38/1394 Mail delivery
> failed: returning message to sender
> & 1
> Message 1:
>>From root at localhost.localdomain Sat Jul 02 18:50:43 2005
> Envelope-to: root at localhost.localdomain
> Delivery-date: Sat, 02 Jul 2005 18:50:43 -0500
> From: mdadm monitoring <root at localhost.localdomain>
> To: root at localhost.localdomain
> Subject: Fail event on /dev/md1:iron
> Date: Sat, 02 Jul 2005 18:50:36 -0500
>
> This is an automatically generated mail message from mdadm
> running on iron
>
> A Fail event had been detected on md device /dev/md1.
>
> Faithfully yours, etc.
>
> & q
> Saved 1 message in /home/jstrauss/mbox
> Held 28 messages in /var/mail/jstrauss
More information about the luni
mailing list