One of our internal server monitoring systems went down and failed to alert us to an issue this morning. Because we did not receive the alert, it took us longer than normal to diagnose and fix the problem.
We've since found and fixed the issue to restore service.
To prevent this in the future, we are looking to improve our monitoring and notifications.