Last checked: 7 minutes ago
Get notified about any outages, downtime or incidents for Intuiface and 1800+ other cloud vendors. Monitor 10 companies, for free.
Outage and incident data over the last 30 days for Intuiface.
Join OutLogger to be notified when any of your vendors or the components you use experience an outage. It's completely free and takes less than 2 minutes!
Sign Up NowOutlogger tracks the status of these components for Xero:
Component | Status |
---|---|
API Explorer | Active |
Management Console | Active |
Web Triggers | Active |
Analytics | Active |
Charts & Dashboards | Active |
Data Processing | Active |
Headless CMS | Active |
Headless CMS Base Servers | Active |
Headless CMS Media Repository | Active |
Headless CMS Web UI | Active |
Interface Asset services | Active |
System Information - GeoIP | Active |
Weather | Active |
View the latest incidents for Intuiface and check for official updates:
Description: This incident has been resolved. Post-mortem analysis will be published later this week.
Status: Resolved
Impact: None | Started At: Feb. 17, 2020, 8 a.m.
Description: ## What Happened On 13-December \(a Friday\), after midnight \(UTC\), an Intuiface employee using API Explorer reported an outage. The Intuiface Server Team also received automatic alerts about the lack of server response. These outages occurred several times until the morning of Monday, 16-December. On 16-December, the Server Team performed root cause analysis, during which it was discovered that the outage issue mainly affected Intuiface employees. Further analysis identified the root cause of performance degradation as an unused but resource-costly feature only available to Intuiface employees. Non-employees encountered API Explorer outages as a side effect. ## Corrective Actions On the morning of 17-December, a plan was executed to remove the unused feature and rebuild the server. The rebuilt server was tested and ready at 2PM UTC. In addition, to proactively compensate for possible performance issues in the future, the host supporting the server was accelerated by a factor of two. ## Remediation Plans Moving forward, five optimization options have been identified, with implementation occurring over the coming weeks. These optimizations will tune request storage to accommodate an ever-growing database. These requests are used by the machine learning recommendation algorithm in API Explorer.
Status: Postmortem
Impact: Minor | Started At: Dec. 16, 2019, 7:11 p.m.
Description: ## Post mortem analysis of the incident: * Immediately after the incident of 5-November between 8AM-11AM UTC, we needed to clean up the database and its storage on the disk. * We deleted old records contained within the largest table \(removing records older than 16 months instead of 18 months\). This deletion consumed a significant amount of journal storage \(approx. 15GB per million of records\), exhausting all available disk storage. * At around 1:30 PM UTC the database locked and entered automatic recovery mode. * We detected this issue three minutes later and immediately disabled the web server to limit the risk of paid Players and Composers stopping or refusing to launch. NB: Free Player and Composer remained unusable following remediation of the first incident * The database was repaired and placed back online after 30 minutes. * The web server was restarted and everything became fully operational. **NOTE**: At no time was any customer data – experiences or analytics – ever at risk. ## Remediation plans: * In the following days, the offending table will be carefully and slowly cleaned in small chunks to avoid the disk storage issue. * In the following weeks, we will migrate that table to a different database to avoid similar issues from occurring again.
Status: Postmortem
Impact: Critical | Started At: Nov. 5, 2019, 1:36 p.m.
Description: ## Analysis of incident: * On 22-October we upgraded our database to enhance Intuiface user security. The upgrade went well but after the server was back in production, we detected some database timeouts. Investigation revealed they were due to some severely altered – thus fragmented – indexes, causing performance degradation. * We reindexed almost all but two tables on 24-October without any issue. Reindexing of the two largest tables was deferred to an already scheduled maintenance activity on 5-November. * On 5-November at around 6:15 AM UTC, the first table was reindexed without any issue. The second table was then reindexed and this operation generated a very large amount of data. At around 8:00 AM UTC the disk went full and the database locked, preventing any further updates to the database tables. At this point, we initiated a rollback of the index. * Until 8:50 AM UTC the web services for license check and activation responded with an error every time a Player or Composer attempted to retrieve or activate a license. This affected all free and licensed Players and Composers during startup. It also affected some running, licensed Players that happened to run a license check \(which normally occurs once a day\). It was also no longer possible to connect to the My Intuiface website \([my.intuiface.com](http://my.intuiface.com)\) or to the Intuiface Support site. * At 8:50 AM UTC, we neutralized the web services for license check and activation so they would no longer reply with an error. This enabled licensed Players and Composers to start correctly. Free Players and Composers could still not launch as they require a successful license check. * At 10:57 AM UTC, the table index was fully rolled-back, making the database accessible and enabling us to begin cleanup in order to free disk storage. * At 11:00 AM UTC the server was back online again. Player and Composer could check their licenses and users could again connect on [my.intuiface.com](http://my.intuiface.com) or support site. * After 10 minutes of monitoring, we considered the issue as fixed at 11:13 AM UTC **NOTE**: At no time was any customer data – experiences or analytics – ever at risk. ## Remediation plans: * We will ensure that database failure or inaccessibility – a very rare event that occurred only 5 hours in total over the last 3 years, an availability rate of better than 99.98% - will no longer impact running, paid Players nor the launch of paid Players and Composers. * We will enhance our ability to inform users and customers about issues and remediation steps in real-time, both via our website \([www.intuiface.com](http://www.intuiface.com)\) and through our [status.intuiface.com](http://status.intuiface.com) server monitoring website. \(NOTE: It is already possible to use our server monitoring site to sign up for notifications about ongoing issues.\) * We will also take action to better test maintenance operations, even if in this case it was hard to predict that it could lead to a major event. \(The faulty reindexing consumed more than 140GB of disk space for a single table!\) * An internal review of the incident is currently underway to identify and take the appropriate actions.
Status: Postmortem
Impact: Major | Started At: Nov. 5, 2019, 8:20 a.m.
Description: The issue was resolved.
Status: Resolved
Impact: Critical | Started At: Sept. 24, 2019, 9:41 p.m.
Join OutLogger to be notified when any of your vendors or the components you use experience an outage or down time. Join for free - no credit card required.