vSphere manual Disaster Recovery failback when using VMware SRM

While many of you wonder why I am explaining manual Disaster Recovery failback when using VMware Site Recovery Manager where its an automated feature that is included in VMware vCenter Site Recovery Manager 5. Two main scenarios where this apply, the first one if you still using any vCenter Site Recovery Manager prior to VMware SRM 5. The other one if you need to break the replication then recover back without syncing back the changes that happened at the Disaster Recovery Site. In this scenario you won’t be able to use the automated failback feature of VMware vCenter Site Recovery Manager 5 as that sync the changes back to production in the process which you might not desire. For both of these scenarios the below steps become handy.

  1. Confirm Protected VMs recovered to DR Site are no longer being used and can be powered down.
  2. Confirm that you have an inventory of your VMs, DataStores, IPs, Network interfaces
  3. Power down the protected VMs in the DR Site.
  4. Perform a cleanup of the directory in the DR site that contained the placeholder VMs created during protection group creation in Protected site if placeholders exist.
  5. Delete earlier created Protection Groups and Recovery Plans.
  6. Connect to the vCenter Server in the protected site and remove all the VMs in the protected site that was recovered to the DR site from inventory.
  7. Connect to the vCenter Server in the DR site and remove all the VMs in the DR site that was recovered to it when the protected site failed over from inventory. (Only you need to remove it from inventory not from disk, in case you need to come back to it)
  8. Present the original LUNs back to the protected site as full read and write
  9. Right click the cluster where the the protected VMs reside in vCenter on the production side => Rescan for Datastores
  10. Go to each host configuration tab=> Storage. Then verify that all the datastores required are mounted.
  11. Add the VMs back to the inventory of the main site. Use the list created during the failover stage.
  12. If you see a small yellow warning on the VM  then right click it and open the console on that VM. Then from the dialog choose “I moved it” and hit OK
  13. Reconfigure the VMs Network interfaces (Port groups) in case they are connected to a Distributed Switch. Note: Virtual Machines vDS Network Interfaces configuration in VMX file is lost upon removing it from inventory
  14. Establish replication from the protected site to the DR site.
  15. Recreate all the protection groups and recovery groups again.
  16. Run a failover test & clean up to ensure all is back in place.

I hope this help one execute or document the procedure as required.





Leave a Reply