Last checked: 2 minutes ago
Get notified about any outages, downtime or incidents for Inspectorio and 1800+ other cloud vendors. Monitor 10 companies, for free.
Outage and incident data over the last 30 days for Inspectorio.
Join OutLogger to be notified when any of your vendors or the components you use experience an outage. It's completely free and takes less than 2 minutes!
Sign Up NowOutlogger tracks the status of these components for Xero:
Component | Status |
---|---|
Coverage Website | Active |
Production Management (PM) | Active |
Integration API | Active |
Web | Active |
Quality Risk Management (QRM) | Active |
Integration API | Active |
Mobile API | Active |
Web | Active |
Responsible Sourcing & Compliance (RSC) / Digital Assets (DA) | Active |
Integration API | Active |
Mobile API | Active |
Web | Active |
View the latest incidents for Inspectorio and check for official updates:
Description: On Thursday December 17th from 01:00 to 02:40 \(UTC\) the Inspectorio Platform experienced a partial service disruption affecting the authentication and authorisation services availability. During that period, a limited number of users experienced issues with ability to login to Rise & Sight products \(Web and Mobile applications\). We met Service Level Agreement requirements for incident resolution and Uptime SLA. Functionalities Impacted During that period the following functionalities were impacted: * Login functionality to SIGHT * Login functionality to RISE All remaining functionalities of the Inspectorio platform were not impacted by this incident including the live chat support channel. There were no data loss or corruption. **Incident Monitoring** During the incident period our logging system registered a spike of 100% CPU utilization of our Redis Memory Store from 01:00 UTC which last till 02:40 UTC and caused Service Unavailability errors. **Incident Cause Overview** The incident was caused by the issue in the latest optimization functionality released for Rise product \(to prevent duplicate update request of an organization integration request\). As soon as we started to receive a batch update to the Rise via Direct API it led to the 100% CPU utilization of our Redis Memory store which triggered deterioration of Redis availability for the Platform services in part of authorization and authentication. **Incident Recovery** When our team identified the outage of the Authorization & Authentication components through our internal monitoring system, we pinpointed the Redis service component to be the origin of the issue. We first attempt to scale up the Redis and trigger the vertical scaling of this service. We recognized that action had no impact on CPU usage improvements. As per our protocol, we performed a rollback of the latest release of Rise functionality to the previous version. This resolved the issue and Redis CPU and other parameters were all back to normal. The QA team performed a full check of the Sight & Rise functionality and confirmed full recovery of the Platform. **Improvement Action Plan** We aim to implement following actions to prevent such incidents and minimize & mitigate possible risks as consequences: 1. Set up a topology of Redis isolated instances to improve stability and high availability of the system as an outage resilience configuration 2. Deprecate Direct API usage as it has significantly less advanced performance control functionality and might impact our internal services directly. Should ask our clients to plan migration to Integration API solution \(complete for Sight but still used for Rise\)
Status: Postmortem
Impact: Critical | Started At: Dec. 17, 2020, 1 a.m.
Description: [https://status.inspectorio.com/incidents/c679hc2b32sc](https://status.inspectorio.com/incidents/c679hc2b32sc)
Status: Postmortem
Impact: Critical | Started At: Dec. 17, 2020, 1 a.m.
Description: This incident has been resolved.
Status: Resolved
Impact: Maintenance | Started At: June 19, 2020, 7 p.m.
Description: Operations on the PO and Booking components have been restored. All operations are now normal. Thank you for your patience as we worked to resolve this issue. While the issue has been resolved, we will continue to monitor to ensure there are no further issues. If you have any further questions or concerns, please contact our support team at [email protected].
Status: Resolved
Impact: Major | Started At: June 12, 2019, 3:38 a.m.
Description: Our team has successfully resolved all issues and normal operations have been restored. Please feel free to proceed with starting and continuing inspections in addition to uploading inspection reports. Please contact our support team at [email protected] with any further questions or concerns.
Status: Resolved
Impact: Critical | Started At: May 30, 2019, 9:02 a.m.
Join OutLogger to be notified when any of your vendors or the components you use experience an outage or down time. Join for free - no credit card required.