Company Logo

Is there an Rollbar outage?

Rollbar status: Systems Active

Last checked: 33 seconds ago

Get notified about any outages, downtime or incidents for Rollbar and 1800+ other cloud vendors. Monitor 10 companies, for free.

Subscribe for updates

Rollbar outages and incidents

Outage and incident data over the last 30 days for Rollbar.

There have been 2 outages or incidents for Rollbar in the last 30 days.

Severity Breakdown:

Tired of searching for status updates?

Join OutLogger to be notified when any of your vendors or the components you use experience an outage. It's completely free and takes less than 2 minutes!

Sign Up Now

Components and Services Monitored for Rollbar

Outlogger tracks the status of these components for Xero:

API Tier (api.rollbar.com) Active
Rollbar Docs Active
rollbar.min.js Active
SCIM and SSO Active
Web App (rollbar.com) Active
Mailgun Outbound Delivery Active
Mailgun SMTP Active
Core Processing Pipeline Active
iOS Symbolication pipeline Active
Proguard processing pipeline Active
Source map symbolication pipeline Active
Component Status
API Tier (api.rollbar.com) Active
Rollbar Docs Active
rollbar.min.js Active
SCIM and SSO Active
Web App (rollbar.com) Active
Active
Mailgun Outbound Delivery Active
Mailgun SMTP Active
Active
Core Processing Pipeline Active
iOS Symbolication pipeline Active
Proguard processing pipeline Active
Source map symbolication pipeline Active

Latest Rollbar outages and incidents.

View the latest incidents for Rollbar and check for official updates:

Updates:

  • Time: Feb. 27, 2024, 12:38 a.m.
    Status: Resolved
    Update: This incident has been resolved.
  • Time: Feb. 27, 2024, 12:38 a.m.
    Status: Resolved
    Update: This incident has been resolved.
  • Time: Feb. 26, 2024, 11:31 p.m.
    Status: Monitoring
    Update: A fix has been implemented and we are monitoring the results.
  • Time: Feb. 26, 2024, 11:31 p.m.
    Status: Monitoring
    Update: A fix has been implemented and we are monitoring the results.
  • Time: Feb. 26, 2024, 11:10 p.m.
    Status: Identified
    Update: We are continuing to work on a fix for this issue.
  • Time: Feb. 26, 2024, 11:10 p.m.
    Status: Identified
    Update: We are continuing to work on a fix for this issue.
  • Time: Feb. 26, 2024, 11 p.m.
    Status: Identified
    Update: We are continuing to work on a fix for this issue.
  • Time: Feb. 26, 2024, 11 p.m.
    Status: Identified
    Update: We are continuing to work on a fix for this issue.
  • Time: Feb. 26, 2024, 10:14 p.m.
    Status: Identified
    Update: We are experiencing a processing delay in our occurrence pipeline. We are investigating actively and will post when we have identified the issue.
  • Time: Feb. 26, 2024, 10:14 p.m.
    Status: Identified
    Update: We are experiencing a processing delay in our occurrence pipeline. We are investigating actively and will post when we have identified the issue.

Updates:

  • Time: March 28, 2024, 3:17 a.m.
    Status: Postmortem
    Update: # **Summary of the Incident and Impact** On February 3rd, 2024, between 03:37 and 06:35 PST Rollbar experienced a platform outage affecting the Web Application \([rollbar.com](http://rollbar.com)\) and Platform API \([api.rollbar.com](http://api.rollbar.com)\) servers.  The cause of these outages can be traced to an automated update by our Google Cloud Platform to Rollbar’s GKE \(Google Kubernetes Engine\) Clusters. Following this incident, the trailing-12-month uptime of the API tier as measured by our external monitoring service is 99.92%. The upgrade removed firewall rules necessary for health checks originating from Google Cloud Application Load Balancers \(ALBs\) required to be able to send traffic to application servers. Our default network firewall security posture is very strict and removal of rules has significant consequences as we disallow all IP traffic on the relevant ports. The removal of these firewall rules resulted in the inability of workloads on the GKE clusters to communicate with the ALBs thus causing the load balancers to register all workloads as unhealthy. Initially, it was unclear what had happened as no code changes had been deployed by Rollbar nor were changes made directly to any infrastructure. Not knowing that the firewall rules had been eliminated, we attempted to restart applications and create new load balancers from roughly 03:37 to 05:08am. At 05:08 a support ticket was created with Rollbar’s cloud services provider, and Google to help resolve the issue.  At 05:11 engineers from the cloud services provider, Google, and Rollbar teleconferenced to try to discuss the issue. After 75 minutes on the support call, the cloud services provider and Google were able to determine that the firewall rules had been removed due to the GKE upgrade. Starting at 06:28, Rollbar created new firewall rules and resolved the issues with load balancer health thus restoring service for the Platform API & Web Application. By 06:35, all services were fully restored. **Timeline:** * Feb 3 03:37 PST - Both the Platform API and Web Application stop responding * 03:37-05:08 PST - Attempts to remedy through restarts and creating new load balancers fails * 05:08 PST - Critical support ticket created with our cloud support provider * 05:11 PST - Teleconference call initiated with cloud services provider, Google, & Rollbar engineers * 06:28 PST - New firewall rules recommended and added for the Web Application’s ALB * 06:30 PST - Web Application became available * 06:32 PST - New firewall rules recommended and added for the Platform API’s ALB * 06:35 PST - Platform API became available # **Follow-up Actions** To mitigate future risks and avoid similar incidents, we have undertaken the following actions: * In order to avoid the deletion of necessary firewall rules, we have created our own firewall rules rather than relying on automatically-created rules. * We have incorporated notifications on GKE updates into our internal application performance graphs to note when these occur to help in the future when diagnosing issues.
  • Time: Feb. 3, 2024, 2:48 p.m.
    Status: Resolved
    Update: We are continuing to monitor for any further issues.
  • Time: Feb. 3, 2024, 2:37 p.m.
    Status: Monitoring
    Update: We are continuing to monitor for any further issues.
  • Time: Feb. 3, 2024, 2:36 p.m.
    Status: Monitoring
    Update: A fix has been implemented and we are monitoring the results.
  • Time: Feb. 3, 2024, 2:35 p.m.
    Status: Identified
    Update: We are continuing to work on a fix for this issue.
  • Time: Feb. 3, 2024, 2:33 p.m.
    Status: Identified
    Update: We are continuing to work on a fix for this issue.
  • Time: Feb. 3, 2024, 2:27 p.m.
    Status: Identified
    Update: The issue has been identified and a fix is being implemented.
  • Time: Feb. 3, 2024, 11:51 a.m.
    Status: Investigating
    Update: We are experiencing a processing delay in our occurrence pipeline. We are investigating actively and will post when we have identified the issue.

Updates:

  • Time: March 28, 2024, 3:17 a.m.
    Status: Postmortem
    Update: # **Summary of the Incident and Impact** On February 3rd, 2024, between 03:37 and 06:35 PST Rollbar experienced a platform outage affecting the Web Application \([rollbar.com](http://rollbar.com)\) and Platform API \([api.rollbar.com](http://api.rollbar.com)\) servers.  The cause of these outages can be traced to an automated update by our Google Cloud Platform to Rollbar’s GKE \(Google Kubernetes Engine\) Clusters. Following this incident, the trailing-12-month uptime of the API tier as measured by our external monitoring service is 99.92%. The upgrade removed firewall rules necessary for health checks originating from Google Cloud Application Load Balancers \(ALBs\) required to be able to send traffic to application servers. Our default network firewall security posture is very strict and removal of rules has significant consequences as we disallow all IP traffic on the relevant ports. The removal of these firewall rules resulted in the inability of workloads on the GKE clusters to communicate with the ALBs thus causing the load balancers to register all workloads as unhealthy. Initially, it was unclear what had happened as no code changes had been deployed by Rollbar nor were changes made directly to any infrastructure. Not knowing that the firewall rules had been eliminated, we attempted to restart applications and create new load balancers from roughly 03:37 to 05:08am. At 05:08 a support ticket was created with Rollbar’s cloud services provider, and Google to help resolve the issue.  At 05:11 engineers from the cloud services provider, Google, and Rollbar teleconferenced to try to discuss the issue. After 75 minutes on the support call, the cloud services provider and Google were able to determine that the firewall rules had been removed due to the GKE upgrade. Starting at 06:28, Rollbar created new firewall rules and resolved the issues with load balancer health thus restoring service for the Platform API & Web Application. By 06:35, all services were fully restored. **Timeline:** * Feb 3 03:37 PST - Both the Platform API and Web Application stop responding * 03:37-05:08 PST - Attempts to remedy through restarts and creating new load balancers fails * 05:08 PST - Critical support ticket created with our cloud support provider * 05:11 PST - Teleconference call initiated with cloud services provider, Google, & Rollbar engineers * 06:28 PST - New firewall rules recommended and added for the Web Application’s ALB * 06:30 PST - Web Application became available * 06:32 PST - New firewall rules recommended and added for the Platform API’s ALB * 06:35 PST - Platform API became available # **Follow-up Actions** To mitigate future risks and avoid similar incidents, we have undertaken the following actions: * In order to avoid the deletion of necessary firewall rules, we have created our own firewall rules rather than relying on automatically-created rules. * We have incorporated notifications on GKE updates into our internal application performance graphs to note when these occur to help in the future when diagnosing issues.
  • Time: Feb. 3, 2024, 2:48 p.m.
    Status: Resolved
    Update: We are continuing to monitor for any further issues.
  • Time: Feb. 3, 2024, 2:37 p.m.
    Status: Monitoring
    Update: We are continuing to monitor for any further issues.
  • Time: Feb. 3, 2024, 2:36 p.m.
    Status: Monitoring
    Update: A fix has been implemented and we are monitoring the results.
  • Time: Feb. 3, 2024, 2:35 p.m.
    Status: Identified
    Update: We are continuing to work on a fix for this issue.
  • Time: Feb. 3, 2024, 2:33 p.m.
    Status: Identified
    Update: We are continuing to work on a fix for this issue.
  • Time: Feb. 3, 2024, 2:27 p.m.
    Status: Identified
    Update: The issue has been identified and a fix is being implemented.
  • Time: Feb. 3, 2024, 11:51 a.m.
    Status: Investigating
    Update: We are experiencing a processing delay in our occurrence pipeline. We are investigating actively and will post when we have identified the issue.

Updates:

  • Time: Jan. 25, 2024, 3:22 p.m.
    Status: Resolved
    Update: This incident has been resolved.
  • Time: Jan. 25, 2024, 3:22 p.m.
    Status: Resolved
    Update: This incident has been resolved.
  • Time: Jan. 25, 2024, 3:22 p.m.
    Status: Monitoring
    Update: A fix has been implemented and we are monitoring the results.
  • Time: Jan. 25, 2024, 3:22 p.m.
    Status: Monitoring
    Update: A fix has been implemented and we are monitoring the results.
  • Time: Jan. 25, 2024, 3:22 p.m.
    Status: Identified
    Update: The issue has been identified and a fix is being implemented.
  • Time: Jan. 25, 2024, 3:22 p.m.
    Status: Identified
    Update: The issue has been identified and a fix is being implemented.
  • Time: Jan. 25, 2024, 3:15 p.m.
    Status: Investigating
    Update: We are continuing to investigate this issue.
  • Time: Jan. 25, 2024, 3:15 p.m.
    Status: Investigating
    Update: We are continuing to investigate this issue.
  • Time: Jan. 25, 2024, 3:14 p.m.
    Status: Investigating
    Update: We are experiencing a processing delay in our occurrence pipeline. We are investigating actively and will post when we have identified the issue.
  • Time: Jan. 25, 2024, 3:14 p.m.
    Status: Investigating
    Update: We are experiencing a processing delay in our occurrence pipeline. We are investigating actively and will post when we have identified the issue.

Updates:

  • Time: Jan. 8, 2024, 10:36 p.m.
    Status: Resolved
    Update: This incident has been resolved.
  • Time: Jan. 8, 2024, 10:35 p.m.
    Status: Monitoring
    Update: We are continuing to monitor for any further issues.
  • Time: Jan. 8, 2024, 10:09 p.m.
    Status: Monitoring
    Update: We are continuing to monitor for any further issues.
  • Time: Jan. 8, 2024, 9:42 p.m.
    Status: Monitoring
    Update: We are continuing to monitor for any further issues.
  • Time: Jan. 8, 2024, 9:24 p.m.
    Status: Monitoring
    Update: A fix has been implemented and we are monitoring the results.
  • Time: Jan. 8, 2024, 8:56 p.m.
    Status: Investigating
    Update: We are currently investigating this issue.

Check the status of similar companies and alternatives to Rollbar

Smartsheet
Smartsheet

Systems Active

ESS (Public)
ESS (Public)

Systems Active

ESS (Public)
ESS (Public)

Systems Active

Cloudera
Cloudera

Systems Active

New Relic
New Relic

Systems Active

Boomi
Boomi

Systems Active

AppsFlyer
AppsFlyer

Systems Active

Imperva
Imperva

Systems Active

Bazaarvoice
Bazaarvoice

Issues Detected

Optimizely
Optimizely

Systems Active

Electric
Electric

Systems Active

ABBYY
ABBYY

Systems Active

Frequently Asked Questions - Rollbar

Is there a Rollbar outage?
The current status of Rollbar is: Systems Active
Where can I find the official status page of Rollbar?
The official status page for Rollbar is here
How can I get notified if Rollbar is down or experiencing an outage?
To get notified of any status changes to Rollbar, simply sign up to OutLogger's free monitoring service. OutLogger checks the official status of Rollbar every few minutes and will notify you of any changes. You can veiw the status of all your cloud vendors in one dashboard. Sign up here