Info

The reliability of an application/infra/component is determined by it’s ability to recover from crashes.

Examples

  • Availability: The system should be available 99.9% of the time.
  • Fault Tolerance: The system should continue to operate in the event of hardware or software failures.
  • Disaster Recovery: The system should have a backup and recovery plan in case of catastrophic failures.