Hi everyone, we are utilizing vSAN 6.x for our production server environment. We have 5 host. We just had a major event where 2 disk needed to be replaced. We replaced those disk and then a resync of data kicked off. A few days later the resync was rebalancing 70TB worth of data. It brought the vSAN to a crawl while our host would become unresponsive as their disks became full. Got on the phone with support and they noticed we had our rebalance configuration set at 95%. They believe that configuration plus the replacing of two disk caused the problem. We have a plan of action to change the rebalance to their recommended 80% threshold when this is over with. We lost services to our mission critical servers for two days.
Now I need to plan for redundancy. What if this happened again? How redundant is vSAN? what if vSAN itself becomes corrupt? Thinking outside the box I realized we have no other storage array to move data to and that would have saved us in this event.
What do most people do for vSAN redundancy? what is recommended? What are the best options here? I am now planning for the future as I consider this a disaster situation. Any ideas and thoughts are appreciated!