Resolved -
All galaxy services affected by AWS outage should be operational at this point.
Oct 20, 22:50 UTC
Monitoring -
AWS services are slowly getting restored. We are actively monitoring our services to ensure they recover as well
Oct 20, 20:51 UTC
Update -
We have put in some temporary mitigations to help alleviate the cluster startup problems in us east 1. At the same time we are still actively monitoring the AWS outage. Along with cluster startup, we think customers using galaxy telemetry and managed data ingestion may also experience data staleness.
We are working with AWS to mitigate this and will provide more updates when there is new information to share.
Oct 20, 19:08 UTC
Update -
We are still affected by residual AWS outages, with many Galaxy functions affected (e.g., creating new clusters, result-set caching, modifying RBAC permissions, etc.). We continue to work with AWS to address the problems and will update the status here when we have new information to share.
Oct 20, 15:18 UTC
Update -
Although some AWS services have been restored, others continue to fail, and we are still impacted. In particular, any Galaxy service tied to the AWS region us-east-1 is currently not functioning properly. This includes creating new clusters and modifying RBAC permissions.
We will continue to work with AWS to restore our full functionality and provide updates here.
Oct 20, 13:10 UTC
Identified -
We are currently experiencing a service outage caused by an ongoing incident affecting our cloud provider, Amazon Web Services (AWS). Our teams are actively monitoring the situation and are in close contact with AWS as they work to resolve the issue. We will continue to provide updates and confirm once all services are fully restored.
Oct 20, 09:42 UTC