Last checked: 9 minutes ago
Get notified about any outages, downtime or incidents for Opsgenie and 1800+ other cloud vendors. Monitor 10 companies, for free.
Outage and incident data over the last 30 days for Opsgenie.
Join OutLogger to be notified when any of your vendors or the components you use experience an outage. It's completely free and takes less than 2 minutes!
Sign Up NowOutlogger tracks the status of these components for Xero:
Component | Status |
---|---|
Public Website | Active |
EU | Active |
Alert Flow | Active |
Alert REST API | Active |
Configuration REST APIs | Active |
Email Notification Delivery | Active |
Heartbeat Monitoring | Active |
Heartbeat REST API | Active |
Incident Flow | Active |
Incident REST API | Active |
Incoming Call Routing | Active |
Incoming Email Service | Active |
Incoming Integration Flow | Active |
Logs | Active |
Mobile Application | Active |
Mobile Notification Delivery | Active |
Opsgenie Actions | Active |
Outgoing Integration Flow | Active |
Pricing & Billing | Active |
Reporting & Analytics | Active |
Signup, Login & Authorization | Active |
SMS Notification Delivery | Active |
Voice Notification Delivery | Active |
Web Application | Active |
US | Active |
Alert Flow | Active |
Alert REST API | Active |
Configuration REST APIs | Active |
Email Notification Delivery | Active |
Heartbeat Monitoring | Active |
Heartbeat REST API | Active |
Incident Flow | Active |
Incident REST API | Active |
Incoming Call Routing | Active |
Incoming Email Service | Active |
Incoming Integration Flow | Active |
Logs | Active |
Mobile Application | Active |
Mobile Notification Delivery | Active |
Opsgenie Actions | Active |
Outgoing Integration Flow | Active |
Pricing & Billing | Active |
Reporting & Analytics | Active |
Signup, Login & Authorization | Active |
SMS Notification Delivery | Active |
Voice Notification Delivery | Active |
Web Application | Active |
View the latest incidents for Opsgenie and check for official updates:
Description: Between 2024-06-20 22:04 UTC to 2024-06-20 22:28 UTC, we experienced intermittent issue for users to access the services for some Atlassian Cloud customers. The issue has been resolved and the service is operating normally.
Status: Resolved
Impact: None | Started At: June 21, 2024, 12:17 a.m.
Description: ### Summary On June 3rd, between 09:43pm and 10:58 pm UTC, Atlassian customers using multiple product\(s\) were unable to access their services. The event was triggered by a change to the infrastructure API Gateway, which is responsible for routing the traffic to the correct application backends. The incident was detected by the automated monitoring system within five minutes and mitigated by correcting a faulty release feature flag, which put Atlassian systems into a known good state. The first communications were published on the Statuspage at 11:11pm UTC. The total time to resolution was about 75 minutes. ### **IMPACT** The overall impact was between 09:43pm and 10:17pm UTC, with the system initially in a degraded state, followed by a total outage between 10:17pm and 10:58pm UTC. _The Incident caused service disruption to customers in all regions and affected the following products:_ * Jira Software * Jira Service Management * Jira Work Management * Jira Product Discovery * Jira Align * Confluence * Trello * Bitbucket * Opsgenie * Compass ### **ROOT CAUSE** A policy used in the infrastructure API gateway was being updated in production via a feature flag. The combination of an erroneous value entered in a feature flag, and a bug in the code resulted in the API Gateway not processing any traffic. This created a total outage, where all users started receiving 5XX errors for most Atlassian products. Once the problem was identified and the feature flag updated to the correct values, all services started seeing recovery immediately. ### **REMEDIAL ACTIONS PLAN & NEXT STEPS** We know that outages impact your productivity. While we have several testing and preventative processes in place, this specific issue wasn’t identified because the change did not go through our regular release process and instead was incorrectly applied through a feature flag. We are prioritizing the following improvement actions to avoid repeating this type of incident: * Prevent high-risk feature flags from being used in production * Improve the policy changes testing * Enforcing longer soak time for policy changes * Any feature flags should go through progressive rollouts to minimize broad impact * Review the infrastructure feature flags to ensure they all have appropriate defaults * Improve our processes and internal tooling to provide faster communications to our customers We apologize to customers whose services were affected by this incident and are taking immediate steps to address the above gaps. Thanks, Atlassian Customer Support
Status: Postmortem
Impact: Major | Started At: June 3, 2024, 11:11 p.m.
Description: ### Summary On June 3rd, between 09:43pm and 10:58 pm UTC, Atlassian customers using multiple product\(s\) were unable to access their services. The event was triggered by a change to the infrastructure API Gateway, which is responsible for routing the traffic to the correct application backends. The incident was detected by the automated monitoring system within five minutes and mitigated by correcting a faulty release feature flag, which put Atlassian systems into a known good state. The first communications were published on the Statuspage at 11:11pm UTC. The total time to resolution was about 75 minutes. ### **IMPACT** The overall impact was between 09:43pm and 10:17pm UTC, with the system initially in a degraded state, followed by a total outage between 10:17pm and 10:58pm UTC. _The Incident caused service disruption to customers in all regions and affected the following products:_ * Jira Software * Jira Service Management * Jira Work Management * Jira Product Discovery * Jira Align * Confluence * Trello * Bitbucket * Opsgenie * Compass ### **ROOT CAUSE** A policy used in the infrastructure API gateway was being updated in production via a feature flag. The combination of an erroneous value entered in a feature flag, and a bug in the code resulted in the API Gateway not processing any traffic. This created a total outage, where all users started receiving 5XX errors for most Atlassian products. Once the problem was identified and the feature flag updated to the correct values, all services started seeing recovery immediately. ### **REMEDIAL ACTIONS PLAN & NEXT STEPS** We know that outages impact your productivity. While we have several testing and preventative processes in place, this specific issue wasn’t identified because the change did not go through our regular release process and instead was incorrectly applied through a feature flag. We are prioritizing the following improvement actions to avoid repeating this type of incident: * Prevent high-risk feature flags from being used in production * Improve the policy changes testing * Enforcing longer soak time for policy changes * Any feature flags should go through progressive rollouts to minimize broad impact * Review the infrastructure feature flags to ensure they all have appropriate defaults * Improve our processes and internal tooling to provide faster communications to our customers We apologize to customers whose services were affected by this incident and are taking immediate steps to address the above gaps. Thanks, Atlassian Customer Support
Status: Postmortem
Impact: Major | Started At: June 3, 2024, 11:11 p.m.
Description: This incident has been resolved.
Status: Resolved
Impact: Minor | Started At: April 30, 2024, 12:52 p.m.
Description: This incident has been resolved.
Status: Resolved
Impact: Minor | Started At: April 30, 2024, 12:52 p.m.
Join OutLogger to be notified when any of your vendors or the components you use experience an outage or down time. Join for free - no credit card required.