Healthchecks.io Status

Need support?

E-mail us
Welcome to the Healthchecks.io status page. If there are interruptions to service, we will post a report here, and on our Mastodon account. 

Past incidents

Jul, 2024
    
Jun, 2024
     
May, 2024
  
  
  1. Jul, 2024

    1. Started:
      Duration:
      We're investigating intermittent failing requests on healthchecks.io.

      Identified:

      The HTTP request failures on https://healthchecks.io seem to be a knock-on effects of connectivity issues to our object storage provider.

      We have temporarily disabled API calls that retrieve ping bodies from object storage.

      Identified:

      We have verified the failing requests on Healthchecks.io dashboard were indeed related to object storage availability issues. The temporary solution of disabling "get object" API calls has restored the dashboard functionality. Our object storage provider has acknowledged the issue and is working on it.

      Currently on Healthchecks.io dashboard, when you view details of a recent ping you will see  a "The request body data is not yet available, please check back later." message. The request bodies are not lost, and will be available for viewing eventually.

      Identified:

      Our object storage provider have implemented a fix. We are currently working through a backlog of not yet uploaded ping bodies.

      Resolved:

      We have fully caught up with ping body backlog, and the service is now operating normally.
  2. Jun, 2024

    1. Delayed ping body processingNotification SenderDashboard
      Started:
      Duration:
      Our object storage provider is currently performing maintenance (scheduled on June 18th, 2024, from 18:00 UTC to 03:00 UTC). Some percentage of object storage API calls are currently failing, which causes the following problems for Healthchecks.io:

      * There's a delay in ping body uploads to object storage. When you view details of a recent ping you may see  a "The request body data is not yet available, please check back later." message. The request bodies are not lost, and will be available eventually.
      * Our notification sender process can crash (and silently drop the notification it was about to send) when it hits an unexpected error from object storage. We're currently working on handling these errors. 

      Identified:

      We have deployed a fix for the notification sender process crashing when it hits an unexpected error from object storage.

      Resolved:

      Our object storage provider has completed the maintenance. Ping body processing is now back to working normally with no delays.
  3. May, 2024

    1. Database connectivity isssueNotification SenderPing APIDashboard
      Started:
      Duration:
      We're investigating an issue with our database.

      Investigating:

      We have failed over to a standby database instance.

      Verifying:

      The systems are back up and working through a backlog of outgoing notifications (caused by pings missed during the downtime).

      Resolved:

      All systems are operational again. We are currently setting up a new standby database server, to replace the standby that was promoted to primary.

      The root cause of the outage was hardware failure: a broken power supply of one of the neighboring servers triggered a fuse in the database server's rack.