How to gracefully swap a failing SAS drive in a RAID5 array on a PowerEdge PERC controller?

Posted by Snot-p@reddit | sysadmin

Hi all,

I'm in a bit of a situation and could use some guidance on hardware I inherited. I have five 1.2TB SAS drives in a RAID5 array on an older PowerEdge R540 with a PERC H740P hardware RAID controller.

One of the five drives in the RAID5 is throwing SMART errors and is in a predictive failure state, but it is still online for now. I have an identical 1.2TB SAS drive listed as ready as a global hot spare on this PERC controller. It's not dedicated to that RAID5 array.

I strongly suspect it's bad practice to yank the failing drive and force a failover onto that global hot spare, since the array would then be degraded and I'd be risking a puncture during the rebuild. From what I've read, you're supposed to do a Replace Member on the PERC instead. The issue: as far as I can see, iDRAC exposes no option to mark a drive for Replace Member and kick off the safe preemptive build onto the hot spare.

I see that you can use PERCCLI to kick off a Replace Member. Is this just a Dell utility that runs on the hypervisor? Is this the right way to go about it? Or are people just yanking the drive, immediately slapping in a new healthy one, and letting the array do the work?

Thanks