High-Availability vs Fault-Tolerance vs Disaster Recover
High-Availability
- when it fail, its component can replace as soon as possible
- = maximize the online time of a service/uptime = minimize downtime
- measure by available time (e.g. 99.999% = 5.26 minute/ year downtime)
- allow interuption
Fault-Tolerance
- more than high-availability
- it is a property that enables a system to continue operating properly in the event of the failure of some of its components
- actively connected to multiple servers
- minimize interruption occur even with failure
- can cost much more
Disaster Recover
- a set of policies, tools and procedures to enable the recovery or continuation of vital technology infrastructure & systems following a natural or human-induced disaster
- pre-planning + (automated) DR process
- back-up plan when disaster
- multiple site of backups