Você está na página 1de 33

INF-BCO2655

VMware vSphere Fault Tolerance for Multiprocessor Virtual Machines Technical Preview
Jim Chow, VMware, Inc. Shrinand Javadekar, VMware, Inc.

#vmworldinf

Disclaimer

This session may contain product features that are


currently under development.

This session/overview of the new technology represents


no commitment from VMware to deliver these features in any generally available product.

Features are subject to change, and must not be included in


contracts, purchase orders, or sales agreements of any kind.

Technical feasibility and market demand will affect final delivery. Pricing and packaging for any new technologies or features
discussed or presented have not been determined.

Agenda

vSphere Availability Portfolio Why Fault Tolerance Multiprocessor Fault Tolerance details Live Demo Performance Numbers Questions

Disasters Happen. Do You Need Protection?

43% of companies experiencing disasters never re-open, and 29% close within two years. (McGladrey and Pullen)

93% of business that lost their data center for 10 days went bankrupt within one year.
(National Archives & Records Administration)

Top executives say 10 hours to recovery; IT managers say up to 30 hours.


(Harris Interactive)

Do you need protection?

Server failures happen


Google released some data about their server failures
2% to 4% servers fail, 1% to 5% of disk drives crash. 20 rack failures: 40-80 machines instantly disappeared 1-6 hours to get back

Sources
http://content.dell.com/us/en/gen/d/large-business/google-data-center

vSphere Offers Protection at Every Level

Protection against hardware failures Planned maintenance with zero downtime Protection against unplanned downtime and disasters
Site Recovery Manager Storage vMotion

High Availability, Fault Tolerance, vMotion, DRS NIC Teaming, Storage Multipathing

Backup Solutions

Component

Server

Storage

Data

Site

vSphere Availability Portfolio

Coverage Application

App Monitoring APIs

Guest OS VM
Fault Tolerance

Guest Monitoring

Infrastructure HA

Hardware
none minutes

Downtime

vSphere Availability Portfolio

Coverage Application

App Monitoring APIs

Guest OS VM
Fault Tolerance

Guest Monitoring

Infrastructure HA

Hardware
none minutes

Downtime

Why Fault Tolerance?

Continuous Availability
Zero downtime Zero data loss No loss of TCP connections Completely transparent to guest software
No dependency on Guest OS, applications No application specific management and learning

Background

2009: vSphere Fault Tolerance in vSphere 4.0 2010: Updates to vSphere Fault Tolerance in vSphere 4.1 2011: Updates to vSphere Fault Tolerance in vSphere 5.0 Details: http://www.vmware.com/products/fault-tolerance/ Problem: FT only for uni-processor VMs Is FT for multi-processor VMs possible?
An impressively hard problem Concerted effort to find an approach

Reached a key milestone


Wed like to share it

10

A Starting Point: vSphere FT

vLockstep

FT LOGGING
vSphere ESX (Primary) vSphere ESX (Secondary)

Shared VMDKs

11

A Clean Slate

FT LOGGING
vSphere ESX (Primary) vSphere ESX (Secondary)

Shared VMDKs

12

A Clean Slate

FT LOGGING
vSphere ESX (Primary) vSphere ESX (Secondary)

Next: FT in practice

13

Turning on Multiprocessor FT

Creating two VMs


Primary VM Secondary VM

Config Disk 1 Disk 2

Config Disk 1 Disk 2

A new VM, but identical configuration


vRAM, # vCPUs, vNICs, etc.

Each VM owns a complete set of VM files


Separate vmdks completely owned by each VM
14

Datastores

Primary VM

Secondary VM

Config Disk 1 Disk 2

Config Disk 1 Disk 2

15

Datastores

Primary VM

Secondary VM

Config Disk 1 Disk 2

Config Disk 1 Disk 2

16

Datastores

One datastore must be common


Primary VM Secondary VM

Config Disk 1 Disk 2 Tie Break Datastore

Config Disk 1 Disk 2

Ensures only one running copy of the VM at any time


17

18

Initial placement of secondary

Not tied to the host

19

Intel vs AMD vMotion compatible

20

Co-located on single Datastore by default

21

22

23

All done!

24

Live Demo

Central management server Continuous availability difficult Multiprocessor FT makes it simple


Natural fit

VMware vCenter Server

25

Live Failover

Continuous availability through


server failure

vCenter Server

vSphere Web Client

26

Backing up FT VMs

Support for vStorage APIs for Data Protection (VADP)


API for non-disruptive snapshots

Many VADP solutions on the market

27

Live Demo Summary

FT in action
Principles to keep in mind Doing backups of FT VMs Ensure continuous availability of multiprocessor workloads Presented a good solution Client oblivious to FT operation Zero downtime, zero data loss Taste for performance / bandwidth

But thats not all

28

Performance Numbers
% Throughput (FT/non FT)
(higher is better)
100 80 60 40 20 0 Microsoft SQL Server 2-vCPU Microsoft SQL Server 4-vCPU Oracle Swingbench 2vCPU Oracle Swingbench 4vCPU

Similar configuration to vSphere 4 FT Performance Whitepaper


Models real-world workloads: 60% CPU utilization
29

vSphere FT Summary

Why Fault Tolerance


Continuous availability

Fault Tolerance for multi-processor VMs


Good solution to impressively hard problem A new design Demonstrated similar experience to existing vSphere FT
But more vCPUs

30

Thank you!

Questions?

31

FILL OUT A SURVEY


EVERY COMPLETE SURVEY IS ENTERED INTO DRAWING FOR A $25 VMWARE COMPANY STORE GIFT CERTIFICATE

INF-BCO2655

VMware vSphere Fault Tolerance for Multiprocessor Virtual Machines Technical Preview
Jim Chow, VMware, Inc. Shrinand Javadekar, VMware, Inc.

#vmworldinf

Você também pode gostar