Is there a Rollbar outage?

Rollbar status: Systems Active

Last checked: 7 minutes ago

Get notified about any outages, downtime or incidents for Rollbar and 1800+ other cloud vendors. Monitor 10 companies, for free.

Rollbar outages and incidents

Outage and incident data over the last 30 days for Rollbar.

There has been 1 outage or incident for Rollbar in the last 30 days.

Components and Services Monitored for Rollbar

OutLogger tracks the status of these components for Rollbar:

API Tier (api.rollbar.com) Active
Rollbar Docs Active
rollbar.min.js Active
SCIM and SSO Active
Web App (rollbar.com) Active
Mailgun Outbound Delivery Active
Mailgun SMTP Active
Core Processing Pipeline Active
iOS Symbolication pipeline Active
Proguard processing pipeline Active
Source map symbolication pipeline Active

Latest Rollbar outages and incidents.

View the latest incidents for Rollbar and check for official updates:

Updates:

  • Time: May 15, 2023, 10:11 a.m.
    Status: Resolved
    Update: This incident has been resolved.
  • Time: May 15, 2023, 9 a.m.
    Status: Identified
    Update: We are experiencing a processing delay in our occurrence pipeline. We have identified the issue and are working on a solution.

Updates:

  • Time: May 15, 2023, 8:13 a.m.
    Status: Resolved
    Update: This incident has been resolved.
  • Time: May 15, 2023, 7:40 a.m.
    Status: Monitoring
    Update: Our pipeline is still draining its queues after the fix. We are continuing to monitor for any further issues.
  • Time: May 15, 2023, 6:59 a.m.
    Status: Monitoring
    Update: A fix has been implemented and we are monitoring the results.
  • Time: May 15, 2023, 6:34 a.m.
    Status: Identified
    Update: We are experiencing a processing delay in our occurrence pipeline. We have identified the issue and are working on a fix.
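
The duration of each May 15 incident can be worked out from its first and last update timestamps. The following is a minimal sketch, assuming the timestamps keep the format shown above ("May 15, 2023, 6:34 a.m." or "May 15, 2023, 9 a.m.") and an English locale; the function names `parse_update_time` and `minutes_between` are illustrative and not part of any OutLogger or Rollbar tooling.

```python
from datetime import datetime


def parse_update_time(text: str) -> datetime:
    """Parse a timestamp such as 'May 15, 2023, 6:34 a.m.' or 'May 15, 2023, 9 a.m.'."""
    # Normalize 'a.m.'/'p.m.' to 'AM'/'PM' so that strptime's %p can read them.
    normalized = text.replace("a.m.", "AM").replace("p.m.", "PM")
    # Try the form with minutes first, then the hour-only form.
    for fmt in ("%B %d, %Y, %I:%M %p", "%B %d, %Y, %I %p"):
        try:
            return datetime.strptime(normalized, fmt)
        except ValueError:
            continue
    raise ValueError(f"unrecognized timestamp: {text!r}")


def minutes_between(first: str, last: str) -> int:
    """Whole minutes elapsed between the first and the last update."""
    delta = parse_update_time(last) - parse_update_time(first)
    return int(delta.total_seconds() // 60)


# Incident identified at 6:34 a.m. and resolved at 8:13 a.m.
print(minutes_between("May 15, 2023, 6:34 a.m.", "May 15, 2023, 8:13 a.m."))  # 99
# Incident identified at 9 a.m. and resolved at 10:11 a.m.
print(minutes_between("May 15, 2023, 9 a.m.", "May 15, 2023, 10:11 a.m."))  # 71
```

These figures measure the time between status updates only; the actual start of each processing delay may have preceded the first update.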

Updates:

  • Time: April 28, 2023, 8:40 p.m.
    Status: Postmortem
    Update: # Incident Report: Web Outage at Rollbar on April 28th, 2023

      ## Summary of Incident and Impact

      Between 3:45am and 4:19am Pacific Time, Rollbar experienced an outage in its web application due to forced upgrades by our Cloud Provider on a node-pool used exclusively by the web application. The configuration of the web application was overly-specific, which, when combined with the simultaneous updating of the node-pool by the Cloud Provider, resulted in Kubernetes being unable to schedule the pods. Consequently, the web application experienced slow performance from 3:45am until 4:00am, and it was rendered completely unavailable from 4:06am to 4:19am. The issue was resolved by updating the tolerations and taints for the workload to allow it to use a more diverse set of pools within our cluster.

      ## Detailed Account of the Incident

      At 3:45am PT, the web application began experiencing slow performance due to the Cloud Provider initiating forced upgrades on a node-pool dedicated to the web application. As a result of the overly-specific configuration of the web application and the simultaneous updating of the entire node-pool by the Cloud Provider, Kubernetes was unable to schedule the necessary pods.

      The slow performance persisted until 4:00am, when the web application's availability began to degrade further. By 4:06am, the web application was completely unavailable. In order to resolve the issue, the team updated the tolerations and taints for the workload, allowing it to utilize a broader range of pools in our cluster. This action successfully resolved the problem, and the web application was restored to full functionality by 4:19am PT.

      ## Follow-Up Actions

      To mitigate the risk of future outages and ensure the continued stability of the platform, the following actions are being implemented:

      1. Removal of the dedicated node-pool for the web application: This action has already been completed, allowing the web application to utilize a more diverse range of node-pools and preventing a single point of failure.
      2. Improvements to monitoring and alerting: Updates to our monitoring and alerting systems will be made to better detect and manage scheduling issues in Kubernetes, ultimately improving our response time to potential issues.
      3. Enhancements to the web application's auto-scaling and alerting: Work is underway to improve the auto-scaling capabilities of the web application, with a focus on directly tying these improvements into alerting systems for better responsiveness and reliability.
  • Time: April 28, 2023, 3:08 p.m.
    Status: Resolved
    Update: The web tier was inaccessible from approximately 3:45am PT until 4:25am PT today. The outage was a result of maintenance performed by our cloud provider. The outage was resolved when we adjusted our configuration to account for the maintenance.
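
The scheduling failure described in the postmortem can be illustrated with a toy model. The sketch below is a heavy simplification and not Rollbar's actual configuration or the real Kubernetes scheduler: it assumes a pod can land on a node only if the node is not cordoned for upgrade, the node belongs to the pod's required pool (if any), and the pod tolerates every taint on the node. All node, pool, and taint names are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Node:
    name: str
    pool: str
    taints: frozenset = frozenset()
    cordoned: bool = False  # True while the node is being upgraded


@dataclass(frozen=True)
class Pod:
    name: str
    tolerations: frozenset = frozenset()
    required_pool: Optional[str] = None  # None means any pool is acceptable


def candidate_nodes(pod: Pod, nodes: List[Node]) -> List[str]:
    """Names of nodes that this simplified model would let the pod run on."""
    return [
        node.name
        for node in nodes
        if not node.cordoned
        and (pod.required_pool is None or node.pool == pod.required_pool)
        and node.taints <= pod.tolerations  # pod tolerates every taint on the node
    ]


# Hypothetical cluster: the whole dedicated "web" pool is being upgraded at once.
nodes = [
    Node("web-1", pool="web", taints=frozenset({"dedicated=web"}), cordoned=True),
    Node("web-2", pool="web", taints=frozenset({"dedicated=web"}), cordoned=True),
    Node("general-1", pool="general"),
]

# Pod pinned to the dedicated pool: nothing is available during the upgrade.
pinned_pod = Pod("web-app", frozenset({"dedicated=web"}), required_pool="web")
print(candidate_nodes(pinned_pod, nodes))  # []

# Same pod without the pool restriction: it can fall back to another pool.
relaxed_pod = Pod("web-app", frozenset({"dedicated=web"}))
print(candidate_nodes(relaxed_pod, nodes))  # ['general-1']
```

In this model the pinned pod has no candidate nodes while the whole pool is cordoned, which mirrors the single point of failure the follow-up actions aim to remove.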

Check the status of similar companies and alternatives to Rollbar

Smartsheet: Systems Active
ESS (Public): Systems Active
Cloudera: Systems Active
New Relic: Systems Active
Boomi: Systems Active
AppsFlyer: Systems Active
Imperva: Systems Active
Bazaarvoice: Issues Detected
Optimizely: Systems Active
Electric: Systems Active
ABBYY: Systems Active

Frequently Asked Questions - Rollbar

Is there a Rollbar outage?
The current status of Rollbar is: Systems Active
Where can I find the official status page of Rollbar?
The official status page for Rollbar is here
How can I get notified if Rollbar is down or experiencing an outage?
To get notified of any status changes to Rollbar, simply sign up for OutLogger's free monitoring service. OutLogger checks the official status of Rollbar every few minutes and will notify you of any changes. You can view the status of all your cloud vendors in one dashboard. Sign up here
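
The change detection behind this kind of notification can be sketched as a comparison of two status snapshots taken a few minutes apart. This is only an illustration, not OutLogger's implementation: the function name `status_changes` is hypothetical, the component names are taken from the list above, and the "Degraded" label is an assumed example of a non-active status.

```python
from typing import Dict, List, Tuple


def status_changes(
    previous: Dict[str, str], current: Dict[str, str]
) -> List[Tuple[str, str, str]]:
    """Return (component, old_status, new_status) for every component that changed."""
    changes = []
    # Cover components that appear in either snapshot, in a stable order.
    for component in sorted(set(previous) | set(current)):
        old = previous.get(component, "Unknown")
        new = current.get(component, "Unknown")
        if old != new:
            changes.append((component, old, new))
    return changes


previous_check = {
    "Web App (rollbar.com)": "Active",
    "Core Processing Pipeline": "Active",
}
latest_check = {
    "Web App (rollbar.com)": "Active",
    "Core Processing Pipeline": "Degraded",  # assumed status label
}

for component, old, new in status_changes(previous_check, latest_check):
    print(f"{component}: {old} -> {new}")
# Core Processing Pipeline: Active -> Degraded
```

A notifier would run a check like this on each poll and send an alert only when the returned list is non-empty.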