Rollbar Status: Check if Rollbar down or having an outage.

Rollbar outages and incidents

Outage and incident data over the last 30 days for Rollbar.

There have been 1 outages or incidents for Rollbar in the last 30 days.

Severity Breakdown:

None: 0

Minor: 1

Major: 0

Critical: 0

Tired of searching for status updates?

Join OutLogger to be notified when any of your vendors or the components you use experience an outage. It's completely free and takes less than 2 minutes!

Components and Services Monitored for Rollbar

Outlogger tracks the status of these components for Xero:

API Tier (api.rollbar.com) Active

Rollbar Docs Active

rollbar.min.js Active

SCIM and SSO Active

Web App (rollbar.com) Active

External notification services

Mailgun Outbound Delivery Active

Mailgun SMTP Active

Processing pipeline

Core Processing Pipeline Active

iOS Symbolication pipeline Active

Proguard processing pipeline Active

Source map symbolication pipeline Active

Component	Status
API Tier (api.rollbar.com)	Active
Rollbar Docs	Active
rollbar.min.js	Active
SCIM and SSO	Active
Web App (rollbar.com)	Active
External notification services	Active
Mailgun Outbound Delivery	Active
Mailgun SMTP	Active
Processing pipeline	Active
Core Processing Pipeline	Active
iOS Symbolication pipeline	Active
Proguard processing pipeline	Active
Source map symbolication pipeline	Active

Latest Rollbar outages and incidents.

View the latest incidents for Rollbar and check for official updates:

Increased occurrence processing pipeline latency

Description: This incident has been resolved.

Status: Resolved

Impact: None | Started At: May 15, 2023, 9 a.m.

Updates:

Time: May 15, 2023, 10:11 a.m.

Status: Resolved

Update: This incident has been resolved.
Time: May 15, 2023, 9 a.m.

Status: Identified

Update: We are experiencing a processing delay in our occurrence pipeline. We have identified the issue and are working on a solution.

Increased occurrence processing pipeline latency

Description: This incident has been resolved.

Status: Resolved

Impact: None | Started At: May 15, 2023, 6:34 a.m.

Updates:

Time: May 15, 2023, 8:13 a.m.

Status: Resolved

Update: This incident has been resolved.
Time: May 15, 2023, 7:40 a.m.

Status: Monitoring

Update: Our pipeline is still draining its queues after the fix. We are continuing to monitor for any further issues.
Time: May 15, 2023, 6:59 a.m.

Status: Monitoring

Update: A fix has been implemented and we are monitoring the results.
Time: May 15, 2023, 6:34 a.m.

Status: Identified

Update: We are experiencing a processing delay in our occurrence pipeline. We are identified the issue and are working on a fix.

Increased occurrence processing pipeline latency

Description: This incident has been resolved.

Status: Resolved

Impact: None | Started At: May 15, 2023, 6:34 a.m.

Updates:

Time: May 15, 2023, 8:13 a.m.

Status: Resolved

Update: This incident has been resolved.
Time: May 15, 2023, 7:40 a.m.

Status: Monitoring

Update: Our pipeline is still draining its queues after the fix. We are continuing to monitor for any further issues.
Time: May 15, 2023, 6:59 a.m.

Status: Monitoring

Update: A fix has been implemented and we are monitoring the results.
Time: May 15, 2023, 6:34 a.m.

Status: Identified

Update: We are experiencing a processing delay in our occurrence pipeline. We are identified the issue and are working on a fix.

Web tier outage

Description: # Incident Report: Web Outage at Rollbar on April 28th, 2023 ## Summary of Incident and Impact Between 3:45am and 4:19am Pacific Time, Rollbar experienced an outage in its web application due to forced upgrades by our Cloud Provider on a node-pool used exclusively by the web application. The configuration of the web application was overly-specific, which, when combined with the simultaneous updating of the node-pool by the Cloud Provider, resulted in Kubernetes being unable to schedule the pods. Consequently, the web application experienced slow performance from 3:45am until 4:00am, and it was rendered completely unavailable from 4:06am to 4:19am. The issue was resolved by updating the tolerances and taints for the workload to allow it to use a more diverse set of pools within our cluster. ## Detailed Account of the Incident At 3:45am PT, the web application began experiencing slow performance due to the Cloud Provider initiating forced upgrades on a node-pool dedicated to the web application. As a result of the overly-specific configuration of the web application and the simultaneous updating of the entire node-pool by the Cloud Provider, Kubernetes was unable to schedule the necessary pods. ‌ The slow performance persisted until 4:00am, when the web application's availability began to degrade further. By 4:06am, the web application was completely unavailable. In order to resolve the issue, the team updated the tolerances and taints for the workload, allowing it to utilize a broader range of pools in our cluster. This action successfully resolved the problem, and the web application was restored to full functionality by 4:19am PT. ## Follow-Up Actions To mitigate the risk of future outages and ensure the continued stability of the platform, the following actions are being implemented: ‌ 1. Removal of the dedicated node-pool for the web application: This action has already been completed, allowing the web application to utilize a more diverse range of node-pools and preventing a single point of failure. 2. Improvements to monitoring and alerting: Updates to our monitoring and alerting systems will be made to better detect and manage scheduling issues in Kubernetes, ultimately improving our response time to potential issues. 3. Enhancements to the web application's auto-scaling and alerting: Work is underway to improve the auto-scaling capabilities of the web application, with a focus on directly tying these improvements into alerting systems for better responsiveness and reliability.

Status: Postmortem

Impact: None | Started At: April 28, 2023, 11:30 a.m.

Updates:

Time: April 28, 2023, 8:40 p.m.

Status: Postmortem

Update: # Incident Report: Web Outage at Rollbar on April 28th, 2023 ## Summary of Incident and Impact Between 3:45am and 4:19am Pacific Time, Rollbar experienced an outage in its web application due to forced upgrades by our Cloud Provider on a node-pool used exclusively by the web application. The configuration of the web application was overly-specific, which, when combined with the simultaneous updating of the node-pool by the Cloud Provider, resulted in Kubernetes being unable to schedule the pods. Consequently, the web application experienced slow performance from 3:45am until 4:00am, and it was rendered completely unavailable from 4:06am to 4:19am. The issue was resolved by updating the tolerances and taints for the workload to allow it to use a more diverse set of pools within our cluster. ## Detailed Account of the Incident At 3:45am PT, the web application began experiencing slow performance due to the Cloud Provider initiating forced upgrades on a node-pool dedicated to the web application. As a result of the overly-specific configuration of the web application and the simultaneous updating of the entire node-pool by the Cloud Provider, Kubernetes was unable to schedule the necessary pods. ‌ The slow performance persisted until 4:00am, when the web application's availability began to degrade further. By 4:06am, the web application was completely unavailable. In order to resolve the issue, the team updated the tolerances and taints for the workload, allowing it to utilize a broader range of pools in our cluster. This action successfully resolved the problem, and the web application was restored to full functionality by 4:19am PT. ## Follow-Up Actions To mitigate the risk of future outages and ensure the continued stability of the platform, the following actions are being implemented: ‌ 1. Removal of the dedicated node-pool for the web application: This action has already been completed, allowing the web application to utilize a more diverse range of node-pools and preventing a single point of failure. 2. Improvements to monitoring and alerting: Updates to our monitoring and alerting systems will be made to better detect and manage scheduling issues in Kubernetes, ultimately improving our response time to potential issues. 3. Enhancements to the web application's auto-scaling and alerting: Work is underway to improve the auto-scaling capabilities of the web application, with a focus on directly tying these improvements into alerting systems for better responsiveness and reliability.
Time: April 28, 2023, 3:08 p.m.

Status: Resolved

Update: The web tier inaccessible from approximately 3:45am PT until 4:25am PT today. The outage was a result of maintenance performed by our cloud provider. The outage was resolved when we adjusted our configuration to account for the maintenance.

Web tier outage

Status: Postmortem

Impact: None | Started At: April 28, 2023, 11:30 a.m.

Updates:

Time: April 28, 2023, 8:40 p.m.

Status: Postmortem

Update: # Incident Report: Web Outage at Rollbar on April 28th, 2023 ## Summary of Incident and Impact Between 3:45am and 4:19am Pacific Time, Rollbar experienced an outage in its web application due to forced upgrades by our Cloud Provider on a node-pool used exclusively by the web application. The configuration of the web application was overly-specific, which, when combined with the simultaneous updating of the node-pool by the Cloud Provider, resulted in Kubernetes being unable to schedule the pods. Consequently, the web application experienced slow performance from 3:45am until 4:00am, and it was rendered completely unavailable from 4:06am to 4:19am. The issue was resolved by updating the tolerances and taints for the workload to allow it to use a more diverse set of pools within our cluster. ## Detailed Account of the Incident At 3:45am PT, the web application began experiencing slow performance due to the Cloud Provider initiating forced upgrades on a node-pool dedicated to the web application. As a result of the overly-specific configuration of the web application and the simultaneous updating of the entire node-pool by the Cloud Provider, Kubernetes was unable to schedule the necessary pods. ‌ The slow performance persisted until 4:00am, when the web application's availability began to degrade further. By 4:06am, the web application was completely unavailable. In order to resolve the issue, the team updated the tolerances and taints for the workload, allowing it to utilize a broader range of pools in our cluster. This action successfully resolved the problem, and the web application was restored to full functionality by 4:19am PT. ## Follow-Up Actions To mitigate the risk of future outages and ensure the continued stability of the platform, the following actions are being implemented: ‌ 1. Removal of the dedicated node-pool for the web application: This action has already been completed, allowing the web application to utilize a more diverse range of node-pools and preventing a single point of failure. 2. Improvements to monitoring and alerting: Updates to our monitoring and alerting systems will be made to better detect and manage scheduling issues in Kubernetes, ultimately improving our response time to potential issues. 3. Enhancements to the web application's auto-scaling and alerting: Work is underway to improve the auto-scaling capabilities of the web application, with a focus on directly tying these improvements into alerting systems for better responsiveness and reliability.
Time: April 28, 2023, 3:08 p.m.

Status: Resolved

Update: The web tier inaccessible from approximately 3:45am PT until 4:25am PT today. The outage was a result of maintenance performed by our cloud provider. The outage was resolved when we adjusted our configuration to account for the maintenance.

Check the status of similar companies and alternatives to Rollbar

Smartsheet

Systems Active

ESS (Public)

Systems Active

ESS (Public)

Systems Active

Cloudera

Systems Active

New Relic

Systems Active

Boomi

Systems Active

AppsFlyer

Systems Active

Imperva

Systems Active

Bazaarvoice

Issues Detected

Optimizely

Systems Active

Electric

Systems Active

ABBYY

Systems Active

Frequently Asked Questions - Rollbar

Is there a Rollbar outage?

The current status of Rollbar is: Systems Active

Where can I find the official status page of Rollbar?

The official status page for Rollbar is here

How can I get notified if Rollbar is down or experiencing an outage?

To get notified of any status changes to Rollbar, simply sign up to OutLogger's free monitoring service. OutLogger checks the official status of Rollbar every few minutes and will notify you of any changes. You can veiw the status of all your cloud vendors in one dashboard. Sign up here

Is there an Rollbar outage?

Rollbar status: Systems Active

Rollbar outages and incidents

There have been 1 outages or incidents for Rollbar in the last 30 days.

Severity Breakdown:

Tired of searching for status updates?

Components and Services Monitored for Rollbar

External notification services

Processing pipeline

Latest Rollbar outages and incidents.

Increased occurrence processing pipeline latency

Updates:

Increased occurrence processing pipeline latency

Updates:

Increased occurrence processing pipeline latency

Updates:

Web tier outage

Updates:

Web tier outage

Updates:

Check the status of similar companies and alternatives to Rollbar

Smartsheet

ESS (Public)

ESS (Public)

Cloudera

New Relic

Boomi

AppsFlyer

Imperva

Bazaarvoice

Optimizely

Electric

ABBYY

Frequently Asked Questions - Rollbar

Is there a Rollbar outage?

Where can I find the official status page of Rollbar?

How can I get notified if Rollbar is down or experiencing an outage?

Start monitoring now!