It is a process and mechanism for detecting failures and resource shortages that occur in the system and notifying the system administrator by periodically checking whether the servers, applications, networks, etc. running in the system are operating normally.