Welcome to Project Resilire

High availability means that a system and the services it hosts remain continuously operational over a long period of time. Whole-system replication is a conventional way to increase system availability: when a primary machine fails, the running applications are migrated to, and resumed on, one or more backup machines. However, several limitations make this method unattractive to deploy. It needs specialized hardware and software, which are usually expensive, and it may require complex, customized configurations that are difficult to manage and maintain.

Virtualization overcomes these limitations. All applications run inside a virtual machine (VM), so whole-system replication can be implemented easily and efficiently: a copy of the whole VM is continuously checkpointed and saved on a backup machine. Because VMs are independent of the underlying hardware, the cost is much lower than the hardware expenses of traditional high availability solutions. Virtualization also makes it easier to manage multiple VMs on a single physical machine. With a virtual machine monitor (VMM), service applications are decoupled from physical machines, providing increased flexibility and improved performance.
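As a rough illustration of the checkpoint-and-resume idea described above, the following is a minimal toy sketch, not Resilire's implementation. The names `ToyVM`, `Backup`, `checkpoint`, and `failover` are hypothetical, VM memory is modeled as a plain dictionary of pages, and copying only the pages written since the previous checkpoint is an assumption made to keep the example short.

```python
from dataclasses import dataclass, field


@dataclass
class ToyVM:
    """Hypothetical stand-in for a VM: memory is a dict of page number -> bytes."""
    pages: dict = field(default_factory=dict)
    dirty: set = field(default_factory=set)

    def write(self, page_no: int, data: bytes) -> None:
        self.pages[page_no] = data
        self.dirty.add(page_no)  # remember what changed since the last checkpoint


@dataclass
class Backup:
    """Hypothetical backup host holding the last complete checkpoint."""
    pages: dict = field(default_factory=dict)
    epoch: int = 0

    def apply(self, delta: dict) -> None:
        self.pages.update(delta)
        self.epoch += 1  # one epoch per completed checkpoint


def checkpoint(vm: ToyVM, backup: Backup) -> int:
    """Copy pages dirtied since the previous checkpoint; return how many were sent."""
    delta = {n: vm.pages[n] for n in vm.dirty}
    backup.apply(delta)
    vm.dirty.clear()
    return len(delta)


def failover(backup: Backup) -> ToyVM:
    """Resume from the last checkpoint; writes made after it are lost."""
    return ToyVM(pages=dict(backup.pages))


primary = ToyVM()
backup = Backup()

primary.write(0, b"boot")
primary.write(1, b"app-state-v1")
sent_first = checkpoint(primary, backup)    # both pages are new

primary.write(1, b"app-state-v2")
sent_second = checkpoint(primary, backup)   # only page 1 changed

primary.write(2, b"not-yet-checkpointed")   # primary "fails" before the next checkpoint
resumed = failover(backup)
```

In this toy model the resumed VM sees the state as of the second checkpoint, and the write to page 2 is lost. This is the trade-off any checkpoint-based scheme has to manage: more frequent checkpoints lose less state on failure but cost more copying.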

The Resilire project is developing techniques and mechanisms for high availability through VM migration, from single VMs to multiple VMs running on different physical hosts and interconnected by a virtual network (i.e., virtual distributed environments, or VDEs). The effort is based on the Xen 3.4 VMM, but is portable to more recent releases of Xen and to other VMMs with full virtualization. The project's implementations do not require modifications to applications or guest OSes inside the VMs.

Resilire Team

Resilire in progress

  • VDEchp: Globally Consistent Checkpointing for Virtual Distributed Environments
  • FGBI: Fine-Grained Block Identification
  • LLM: Lightweight Live Migration

Using Resilire

Prerequisites:

Start Using Resilire:

Documentation & Publications

Related Efforts

Last modified on 10/25/11 16:19:16