Welcome to Project Resilire
High availability means that a system and its associated service implementation remain continuously operational for a long period of time. Whole-system replication is a conventional way to increase system availability: when the primary machine fails, the running applications are migrated to and resumed on one or more backup machines. However, several limitations make this method unattractive to deploy: it needs specialized hardware and software, which are usually expensive, and it may also require complex customized configurations, which are difficult to manage and maintain.
Virtualization overcomes these limitations: all applications run inside a virtual machine (VM), so whole-system replication can be implemented easily and efficiently by continuously checkpointing the whole VM and saving the copy on a backup machine. Because VMs are hardware-independent, the cost is much lower than the hardware expenses of traditional high-availability solutions. Virtualization also makes it easier to manage multiple VMs on a single physical machine. With a virtual machine monitor (VMM), service applications are decoupled from physical machines, which provides increased flexibility and improved performance.
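The checkpoint-and-resume idea can be illustrated with a toy model. This is a minimal sketch and not Resilire's implementation: `ToyVM`, `Backup`, `checkpoint`, and `failover` are hypothetical names, memory is modeled as a dictionary of pages, and sending only the pages dirtied since the last checkpoint is an assumption borrowed from common practice rather than something stated above.

```python
from dataclasses import dataclass, field


@dataclass
class ToyVM:
    """Hypothetical stand-in for a VM: memory is a dict of page number -> bytes."""
    pages: dict = field(default_factory=dict)
    dirty: set = field(default_factory=set)

    def write(self, page_no, data):
        # Every write marks the page dirty so the next checkpoint picks it up.
        self.pages[page_no] = data
        self.dirty.add(page_no)


@dataclass
class Backup:
    """Hypothetical backup machine holding the last completed checkpoint."""
    pages: dict = field(default_factory=dict)
    epoch: int = 0

    def apply(self, delta):
        self.pages.update(delta)
        self.epoch += 1


def checkpoint(vm, backup):
    """Copy the pages dirtied since the last checkpoint to the backup."""
    delta = {n: vm.pages[n] for n in vm.dirty}
    backup.apply(delta)
    vm.dirty.clear()
    return len(delta)


def failover(backup):
    """Resume a VM from the last completed checkpoint held by the backup."""
    return ToyVM(pages=dict(backup.pages))


primary = ToyVM()
backup = Backup()

primary.write(0, b"boot")
primary.write(1, b"app-v1")
sent_first = checkpoint(primary, backup)    # both pages are new

primary.write(1, b"app-v2")
sent_second = checkpoint(primary, backup)   # only page 1 changed

primary.write(2, b"lost")                   # written after the last checkpoint
resumed = failover(backup)                  # primary "fails" here
```

In this model the resumed VM sees everything up to the last completed checkpoint and nothing written afterwards, which is why the checkpoint interval bounds how much state a failure can lose.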
The Resilire project is developing techniques and mechanisms for high availability through VM migration, from single VMs to multiple VMs running on different physical hosts and interconnected by a virtual network (i.e., virtual distributed environments, or VDEs). The effort is based on the Xen 3.4 VMM, but is portable to more recent releases of Xen and to other VMMs with full virtualization. The project's implementations do not require modifications to applications or guest OSes inside the VMs.
Resilire in progress
- VDEchp: Globally Consistent Checkpointing for Virtual Distributed Environments
- FGBI: Fine-Grained Block Identification
- LLM: Lightweight Live Migration
Using Resilire
Prerequisites:
- Install Linux (CentOS, though other distributions are also fine)
- Download and install the Xen hypervisor
- Install Remus for the sub-projects LLM and FGBI (skip this step if you only want to try VDEchp)
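Before going further, it can help to confirm that the host is actually running under Xen and that a Xen toolstack is installed. The following is a minimal sketch, not part of Resilire: the function names are hypothetical, and it assumes a Linux host that exposes `/sys/hypervisor/type` and a toolstack command named `xm` or `xl` on the `PATH`. It does not check for Remus.

```python
import shutil
from pathlib import Path


def hypervisor_type(sysfs_path="/sys/hypervisor/type"):
    """Return the hypervisor name reported by sysfs, or None if unavailable."""
    try:
        return Path(sysfs_path).read_text().strip()
    except OSError:
        # The file is absent when the kernel is not running under a hypervisor
        # that reports itself through sysfs.
        return None


def find_toolstack(candidates=("xm", "xl")):
    """Return (name, path) of the first Xen toolstack command found, else None."""
    for name in candidates:
        path = shutil.which(name)
        if path:
            return name, path
    return None


def check_prerequisites(sysfs_path="/sys/hypervisor/type"):
    """Collect the two checks into one dictionary."""
    return {
        "running_on_xen": hypervisor_type(sysfs_path) == "xen",
        "toolstack": find_toolstack(),
    }


if __name__ == "__main__":
    print(check_prerequisites())
```

If `running_on_xen` is false, the machine was most likely booted into a plain Linux kernel rather than the Xen entry in the boot loader.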
Start Using Resilire:
Documentation & Publications
Related Efforts
- The Xen Hypervisor: Xen: Open Source Industry Standard for Virtualization
- The Remus project: Remus: Transparent High Availability for Xen
- The Kemari project: A VM Synchronization Mechanism for KVM
- The VNsnap project: Snapshots for Virtual Networked Environments