Last checked: 30 seconds ago
Get notified about any outages, downtime or incidents for CircleCI and 1800+ other cloud vendors. Monitor 10 companies, for free.
Outage and incident data over the last 30 days for CircleCI.
Join OutLogger to be notified when any of your vendors or the components you use experience an outage. It's completely free and takes less than 2 minutes!
Sign Up NowOutlogger tracks the status of these components for Xero:
Component | Status |
---|---|
Artifacts | Active |
Billing & Account | Active |
CircleCI Insights | Active |
CircleCI Releases | Active |
CircleCI UI | Active |
CircleCI Webhooks | Active |
Docker Jobs | Active |
Machine Jobs | Active |
macOS Jobs | Active |
Notifications & Status Updates | Active |
Pipelines & Workflows | Active |
Runner | Active |
Windows Jobs | Active |
CircleCI Dependencies | Active |
AWS | Active |
Google Cloud Platform Google Cloud DNS | Active |
Google Cloud Platform Google Cloud Networking | Active |
Google Cloud Platform Google Cloud Storage | Active |
Google Cloud Platform Google Compute Engine | Active |
mailgun API | Active |
mailgun Outbound Delivery | Active |
mailgun SMTP | Active |
OpenAI | Active |
Upstream Services | Active |
Atlassian Bitbucket API | Active |
Atlassian Bitbucket Source downloads | Active |
Atlassian Bitbucket SSH | Active |
Atlassian Bitbucket Webhooks | Active |
Docker Authentication | Active |
Docker Hub | Active |
Docker Registry | Active |
GitHub API Requests | Active |
GitHub Git Operations | Active |
GitHub Packages | Active |
GitHub Pull Requests | Active |
GitHub Webhooks | Active |
GitLab | Active |
View the latest incidents for CircleCI and check for official updates:
Description: This incident has been resolved.
Status: Resolved
Impact: Critical | Started At: Aug. 5, 2024, 1:21 p.m.
Description: ## Summary On August 1, 2024, from 00:07 UTC to 05:20 UTC, some customers experienced delays when retrieving workflow status, rerunning workflows, and canceling jobs and workflows. This was due to a code change deployed on July 31, 2024, at 12:03 UTC, which modified how workflow status is calculated. At 00:07 UTC on August 1, 2024, specific conditions triggered an unanticipated computational demand, leading to customer impact. After investigating, we identified the issue at 04:55 UTC and reverted the faulty code. Customer impact was fully mitigated around 05:29 UTC. We appreciate our customers' patience and understanding as we worked to resolve this incident. [The original status page can be found here](https://status.circleci.com/incidents/63xvc5hp49yk). ## What Happened On July 31, 2024, at 12:03 UTC, we deployed a change to the workflow status calculation. This change passed all necessary tests to ensure that the correct statuses were calculated. However, it was unknown at the time that, under specific conditions, it could significantly increase computational demand on our system. Approximately 12 hours later, on August 1, 2024, at 00:07 UTC, these conditions were met, leading to a spike in CPU utilization across our service. As a result, the service scaled up the number of pods, which increased the number of connections used by one of our databases. At 01:13 UTC, our database team was alerted to database connection saturation and began an initial investigation. By 02:30 UTC, after identifying the correct service and the extent of customer impact, the responsible engineering team was paged to assist with the issue. Between 02:42 UTC and 04:30 UTC, we attempted several mitigation steps to relieve database saturation and reduce CPU utilization, including rolling the pods and reducing the number of replicas. However, these steps did not have the desired effect. At that point, we decided to perform CPU profiling on some of the pods. The profiler revealed an issue with the workflow status calculation function, pointing to one of the recently merged commits as the source of the problem. At 04:55 UTC, we reverted that change, and the customer impact ceased around 05:29 UTC. ## Future Prevention and Process Improvement We made strategic adjustments to our Horizontal Pod Autoscaler \(HPA\) settings for the service, ensuring that scaling is better aligned with our database capabilities. This adjustment will further enhance the stability and reliability of the service. We have thoroughly analyzed and addressed the underlying performance issue in the workflow status calculation, as well as identified which conditions that could potentially lead to similar issues. To prevent this we have implemented additional tests specifically designed to catch these scenarios. These tests ensure that any changes to workflow status calculation, are rigorously vetted before deployment, reducing the risk of recurrence. Additionally, we have also fine-tuned our monitoring systems to provide earlier notifications to our team in the event of increased latency in our read APIs. This adjustment will enable us to respond more quickly to potential issues, reducing the likelihood of significant customer impact.
Status: Postmortem
Impact: Minor | Started At: Aug. 1, 2024, 3:09 a.m.
Description: The issue causing intermittent 500 errors with our API has now been resolved. We appreciate your patience and understanding as we worked through this incident. Please reach out to our support team for any further questions or if you experience any further issues.
Status: Resolved
Impact: Minor | Started At: July 31, 2024, 6:32 p.m.
Description: Today from 16:05 UTC to 16:31 UTC we saw an issue where users making a GET call the /api/v2/runner endpoint would see a 301 error. A patch was made to address the issue and we are no longer seeing this behavior.
Status: Resolved
Impact: Minor | Started At: July 31, 2024, 4 p.m.
Description: This issue is now resolved. We restored user accounts for users with email/password logins after an error during a database maintenance operation. As a result, users with email-based logins will need to reset their passwords on next login.
Status: Resolved
Impact: Major | Started At: July 29, 2024, 8:08 p.m.
Join OutLogger to be notified when any of your vendors or the components you use experience an outage or down time. Join for free - no credit card required.