Get notified about any outages, downtime or incidents for AskNicely and 1800+ other cloud vendors. Monitor 10 companies, for free.
Outage and incident data over the last 30 days for AskNicely.
Join OutLogger to be notified when any of your vendors or the components you use experience an outage. It's completely free and takes less than 2 minutes!
Sign Up Now

OutLogger tracks the status of these components for AskNicely:
| Component | Status |
| --- | --- |
| AskNicely Application | Active |
View the latest incidents for AskNicely and check for official updates:
Description: We experienced a brief period of application unavailability due to malicious requests causing high server load. This issue has been resolved.
Status: Resolved
Impact: None | Started At: Jan. 25, 2021, 12:30 p.m.
Description: We experienced an application outage starting at 11:20 AM due to a database issue. This incident was resolved within an hour.
Status: Resolved
Impact: Major | Started At: Aug. 24, 2020, midnight
Description: All AskNicely services are back to normal.
Status: Resolved
Impact: Major | Started At: Feb. 21, 2019, 2:51 a.m.
Description:

## The 502 error

Today a number of customers may have experienced a 502 error and were not able to access the AskNicely platform. We are super proud of the platform we have built, and when we let our customers down, we know we need to do a better job; it really hurts. We are sorry you were not able to access our platform. Very sorry. We have a fantastic engineering team, and over the next week we will be focusing on our infrastructure to help minimise outages like the one you may have seen today.

## What went wrong

AskNicely is built on AWS (Amazon), an amazing platform which allows us to scale our solution very easily. Today we hit an issue with extremely heavy load on our USA database server (RDS). The symptoms we saw:

* 502 error rates
* Load balancer errors: "unhealthy web server in load balancer pool"
* Database load in RDS going from under 5% to 100% in a matter of seconds. Very abnormal.
* Our 502 error page did not tell our customers what was happening, nor link to our status page. Bad.

## What went right

We have extensive monitoring on AskNicely, and some fantastic services that we love kicked in as soon as they detected something abnormal. The services we use today:

* [PagerDuty.com](http://PagerDuty.com) We love PagerDuty: the mobile app, email, SMS and automated phone calls for alerting, plus auto-escalation policies to other team members.
* [Datadog.com](http://Datadog.com) provides us with detailed metrics around our application performance and servers. We send a massive amount of data back to Datadog, and it's a valuable asset that we use for real-time monitoring and debugging.
* [Loggly.com](http://Loggly.com) All our log files and error logs are managed in Loggly. We can easily visualise and quantify requests from customers in seconds using their powerful log query tool.
* [NewRelic.com](http://NewRelic.com) provides incredibly detailed analysis of which parts of our application are used the most, how well that code is performing and which parts of the code are the slowest. It also monitors how long our application takes to load for our customers. We absolutely love NewRelic, and it is our litmus test to see whether our code changes have resolved our issues or not.
* [Slack.com](http://Slack.com) It makes it so easy for our team to stay on the same page and communicate instantly, no matter where we are in the world.
* [Statuspage.io](http://Statuspage.io) You can find a link to our status page from the [www.asknicely.com](http://www.asknicely.com) homepage and our 404 pages.

## What we discovered

During this time, we came under very heavy API load from one customer. Normally our API rate limiter would kick in and prevent any single customer from causing an outage, but due to the size of this customer's dataset, our API was too slow to respond to all their requests, causing massive congestion. Our API rate limiter is tuned for the number of requests, not the time taken to process a request.
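As an aside for readers, here is a minimal sketch of what a rate limiter budgeted on processing time, rather than raw request count, could look like. The class name, thresholds and the `handle_api_request` wrapper are illustrative assumptions, not AskNicely's actual code.

```python
import time
from collections import defaultdict

class TimeBudgetLimiter:
    """Per-tenant limiter that budgets processing time instead of request count."""

    def __init__(self, budget_seconds=5.0, window_seconds=60.0):
        self.budget = budget_seconds      # processing time allowed per window
        self.window = window_seconds
        self.usage = defaultdict(list)    # tenant_id -> [(timestamp, cost), ...]

    def allow(self, tenant_id):
        """True if the tenant still has processing-time budget left in the window."""
        now = time.monotonic()
        recent = [(t, c) for t, c in self.usage[tenant_id] if now - t < self.window]
        self.usage[tenant_id] = recent
        return sum(c for _, c in recent) < self.budget

    def record(self, tenant_id, elapsed_seconds):
        """Charge the actual time a completed request took to process."""
        self.usage[tenant_id].append((time.monotonic(), elapsed_seconds))


limiter = TimeBudgetLimiter()

def handle_api_request(tenant_id, handler):
    """Reject requests once a tenant has used up its time budget, so one tenant
    with slow requests cannot saturate the shared database."""
    if not limiter.allow(tenant_id):
        return 429, "Rate limit exceeded"
    start = time.monotonic()
    try:
        return 200, handler()
    finally:
        limiter.record(tenant_id, time.monotonic() - start)
```

Under this scheme, a tenant whose requests are slow to process exhausts its budget after a few calls, even if the raw request count stays low.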
## What we did

We have a number of strategies that we use to scale our platform. One strategy allows us to move a single customer from one database host (RDS instance) to another. Once we isolated the issue, this customer was moved to their own database instance. The AskNicely application instantly became responsive and all our server metrics returned to what we would consider normal parameters. We have also worked on several bottlenecks, including:

* Autoscaling our primary USA database server: we have tripled the capacity of this server, in both size and dedicated IOPS.
* We have increased our Redis instance, which provides a powerful and fast caching service for parts of the application, to 6x its previous capacity.
* We have changed several variables on our RDS instance to allow higher loads.
* We have added another application server to the server pool.

## What we are planning to do

* Add detailed API monitoring - time, frequency, tenant and database.
* Improve our API rate limiter.
* Refactor the API code that caused us issues, and most likely refactor a particular query that caused the heavy load on our database.
* Provide a way to gracefully degrade AskNicely so that core/key services are not affected.
* Improve our 502 error page to link to our StatusPage so we can give our customers more timely updates.

Again, we are sorry, and we are working hard to rectify these issues.

John // CTO and co-founder, AskNicely
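The per-customer database move described under "What we did" above amounts to a routing-table change: most tenants share a default RDS instance, and a heavy tenant can be remapped to a dedicated one. A minimal sketch of that idea follows; the hostnames, tenant IDs and helper name are hypothetical, not AskNicely's actual configuration.

```python
# Hypothetical tenant-to-database routing table. Hostnames and tenant IDs are
# illustrative only; a real deployment would load this mapping from config.

DEFAULT_DSN = "shared-rds.us-east-1.example.com/asknicely"

# Tenants that have been migrated off the shared RDS instance onto their own.
TENANT_DSN_OVERRIDES = {
    "tenant-42": "dedicated-rds-01.us-east-1.example.com/asknicely",
}

def dsn_for_tenant(tenant_id: str) -> str:
    """Resolve which database host should serve a given tenant's queries."""
    return TENANT_DSN_OVERRIDES.get(tenant_id, DEFAULT_DSN)

# Isolating a noisy tenant then becomes a configuration change: add an override
# entry, and new connections for that tenant go to the dedicated instance,
# leaving load on the shared pool unaffected.
```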
Status: Postmortem
Impact: None | Started At: Oct. 8, 2018, 3:13 p.m.
Description: We’ve identified endpoints that were not properly rate limited and, when receiving a high volume of traffic, were causing infrastructure issues. We’re working on rolling out better rate-limiting coverage to prevent further outages.
Status: Postmortem
Impact: Major | Started At: Oct. 5, 2018, 3:33 p.m.
Join OutLogger to be notified when any of your vendors or the components you use experience an outage or downtime. Join for free - no credit card required.