17 Sep 2009

VMware FT...Can you afford a SAN failure?

In today’s world where mission critical applications need to be available 24x7 with 99.99% availability, companies are throwing millions of dollars or pounds at implementing redundant and fault tolerant infrastructures. We all know that the money we spend today will save us much more in the future. Some companies make two to three million profit each and every day. In order to be competitive in the current climate, they need business applications such as messaging and collaboration to be available at all times. Imagine if a business with hundreds of employees one day suddenly lost the ability to send and receive email.

This may sound unheard of, but just this very week I’ve dealt with such a case where a company employing almost 10,000 people had no email, collaboration, database systems and even a corporate website for more than 24 hours, just because a critical component failed on their main SAN. Now let’s just say they make on average 2 million pounds profit each day. They had no business trading on the day of the failure, costing them a huge part of that 2 million pounds profit as the 10,000 people they employ had to be paid for that day an no one could do any meaningful work. If they had only spend half of that on a replication solution for their SAN, they could have avoided such a lengthy outage.

Companies now see massive operating cost savings in their Data Centres with virtualization. They have less physical hardware, use less power, and can provision new servers and services within minutes. This is all good and well, but how do we ensure that what we deploy in our virtualized infrastructures is replicated in real time and then in the event of a failure is failed over without incurring a second of down time? VMware’s answer to that question is Fault Tolerance. VMware Fault Tolerance, is the only feature available today that can guarantee the uptime of virtual machines in your environment. It provides zero downtime, zero data loss and continues availability for your applications even in the event of an ESX host hardware failure, and that without the cost and complexity of traditional hardware and software clustering solutions. VMware is the only vendor that provides capabilities and features such as Fault Tolerance, and is therefore the only virtualization vendor that can actually guarantee the ongoing availability of your critical applications.

Unlike VMware HA (High Availability) which restarts virtual machines on other ESX hosts with available capacity, Fault Tolerance eliminates downtime altogether. Fault Tolerance is based on leading edge technology called vLockstep and eliminates even the smallest of data loss, corruption or disruption to service. Fault tolerance can protect any virtual machine, enabling a single and simple technology to protect all applications. This enables applications that cannot be protected by traditional clustering methods to be protected. As Fault Tolerance can easily be “turned on” or “turned off” for individual virtual machines in your existing HA clusters, Fault Tolerance provides a simple solution to the complex problems that surrounds redundant systems and clustering.

How it works:

Fault Tolerance uses the well known encapsulation properties of virtualization by building high availability directly into the x86 hypervisor in order to deliver hardware style fault tolerance to virtual machines. It requires no custom hardware or software (no need to purchase expensive software editions such Enterprise Edition for applications such as SQL, Exchange and even operating systems such as Windows Server  Enterprise Edition). VMware Fault Tolerance relies on VMware vLockstep technology to establish and maintain an active secondary virtual machine that runs in virtual lockstep with the primary virtual machine. The secondary virtual machine resides on a different host and executes exactly the same sequence of guest instructions as the primary virtual machine. The secondary VM observes the same input as the primary VM and is ready to take over at any time without any data loss or interruption of service should the primary VM fail. Both VMs are managed as a single unit, but run on different physical ESX hosts.

Enabling Fault Tolerance does not require any additional configuration changes to the virtual machine’s guest OS or its applications. It's enabled on an existing virtual machine with a few clicks. This eliminates the need to purchase enterprise edition licenses for your corporate operating systems and applications. As it’s enabled on individual Virtual Machines, you can choose to protect only the applications that are mission critical. You can even run the primary and secondary virtual machines in different buildings, providing an instant DR solution.

Image
VMware FT is enabled by a few simple clicks

In the event that the primary virtual machine becomes unavailable, the secondary virtual machine takes over with no interruption to service. After the initial failover, VMware HA automatically selects a new host that is operating properly and has available resource and starts a new secondary virtual machine instance on that host. The hypervisor where new primary virtual machine is running establishes vLockstep with the new secondary hypervisor, thus re-enabling redundancy. From this point onward, the virtual machine is protected one more against future failures.

Have a look at this demonstration video for VMware Fault Tolerance.

Written by  0 comment
Last modified on Friday, 12 March 2010 12:39
Rate this item
(0 votes)

Comments (0)

There are no comments posted here yet

Leave your comments

Posting comment as a guest. Sign up or login to your account.
0 Characters
Attachments (0 / 3)
Share Your Location

@unicorn_1485 What's your number? I can arrange a call 😉
Follow Rynardt Spies on Twitter