We have failed over to a standby database instance.
Monitoring:
The systems are back up and working through a backlog of outgoing notifications (caused by pings missed during the downtime).
Resolved:
All systems are operational again. We are currently setting up a new standby database server to replace the standby that was promoted to primary.
The root cause of the outage was a hardware failure: a failed power supply in one of the neighboring servers tripped a fuse in the database server's rack.