Intermittent System Errors Affecting Console Access
Incident Report for Wasabi Technologies
Postmortem

Between 10:32 UTC on 2024-08-06 and 20:40 UTC on 2024-08-07, we experienced three outages affecting both S3 and user services in all regions.

Starting at 10:32 UTC on 2024-08-06, our queueing service reached full capacity, which caused our database cache to become unresponsive. The Wasabi Operations team restarted the primary database in an attempt to clear all stale database connections while simultaneously clearing the queueing service queue. When this action failed to bring the database into a fully operational state, the secondary database instance was promoted to primary. At 11:20 UTC the S3 service was fully operational again. Between 13:17 UTC and 13:23 UTC, Operations restarted the database once more in order to fully incorporate our queueing service library.

Between 02:55 UTC and 03:35 UTC on 2024-08-07, a second event occurred when our Operations team identified a configuration issue involving the queueing service and the previously promoted secondary database instance. This configuration issue was causing timeouts on user services such as our Web Console, WAC API, and WACM interface. Our Operations team then promoted the primary database back to production, alleviating these issues. There was no impact to S3 services during this event.

Between 20:30 UTC and 20:44 UTC on 2024-08-07, a third event occurred when an automation cluster was not being detected by our automation service, causing a small decrease in accepted traffic to our S3 vaults. Our Operations team then recreated and redeployed this cluster, fully restoring the S3 service.

Posted Aug 16, 2024 - 16:10 UTC

Resolved
This incident has been resolved.
Posted Aug 07, 2024 - 13:38 UTC
Monitoring
We have identified and resolved the issue.
Access to our Management Console is back to normal and fully operational.
We will continue to monitor our services.
Posted Aug 07, 2024 - 04:06 UTC
Investigating
We are currently investigating intermittent issues affecting access to the Wasabi Management Console.
Our team is working to resolve this issue. There is no impact on S3 API calls.
Posted Aug 07, 2024 - 03:46 UTC
This incident affected: Wasabi Account Control API, Wasabi Account Control Manager Console, and Wasabi Management Console.