We are investigating reports of elevated projection error rates.
Incident Report for Serialized
Postmortem

Background

We’re sorry for yesterday’s incident, which affected the Serialized Dashboard and the Projections API. We noticed elevated error rates in the Projections API in the morning, and later in the day the Dashboard was affected as well.

While some of our systems were affected, your aggregates and events were completely unaffected by this incident.

Problem identification

After digging through the logs, we discovered that a retry mechanism in the projection processing did not work as expected. A couple of weeks ago, we introduced a new batch processing mechanism to increase the processing speed of projections. Unfortunately, the existing retry mechanism was not compatible with the new batch handling. This caused partial updates to projection data, which in turn made the Dashboard unavailable, since it also depends on projections.
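
To illustrate the failure mode described above, here is a minimal Python sketch of a retry that treats a batch as all-or-nothing. It is not Serialized's actual code: the names `TransientError`, `apply_batch_atomically`, and `handler`, and the in-memory dictionary state, are all assumptions made for the example. The idea is that each attempt works on a copy of the projection state, so a failure partway through a batch cannot leave a partial update behind.

```python
import copy
import time


class TransientError(Exception):
    """Hypothetical error type for failures that are worth retrying."""


def apply_batch_atomically(state, events, handler, max_attempts=3, delay_seconds=0.0):
    """Apply every event in the batch to the projection state, or none of them.

    Assumption for this sketch: `state` is an in-memory object and
    `handler(state, event)` mutates it in place.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")

    for attempt in range(1, max_attempts + 1):
        # Work on a copy so the original state is never partially updated.
        working_copy = copy.deepcopy(state)
        try:
            for event in events:
                handler(working_copy, event)
        except TransientError:
            if attempt == max_attempts:
                raise  # give up; the original state is still untouched
            time.sleep(delay_seconds)
            continue  # retry the whole batch from a fresh copy
        return working_copy  # success: the caller swaps in the new state
```

A caller would replace the stored projection with the returned state only after the whole batch has succeeded. Retrying from a fresh copy also avoids applying the same event twice when a batch is retried.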

We discovered that several projects were affected by the problem, and we decided to rebuild the affected projections to avoid any data inconsistencies.

Technical measures

  • Error handling in the retry mechanism was improved
  • Affected projections were rebuilt and projection data was verified
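
As a rough illustration of what rebuilding and verifying a projection can look like in an event-sourced system, the Python sketch below replays events from an empty state and compares the result with the stored projection. The function names, the dictionary-based state, and the `handler` signature are hypothetical and do not describe Serialized's internal tooling.

```python
def rebuild_projection(events, handler):
    """Replay all events from scratch to produce a fresh projection state."""
    state = {}
    for event in events:
        handler(state, event)  # assumed to mutate `state` in place
    return state


def find_inconsistent(stored, event_log, handler):
    """Return the ids of projections whose stored state differs from a rebuild.

    `stored` maps projection id -> stored state, and `event_log` maps
    projection id -> ordered list of events (both assumptions of this sketch).
    """
    return sorted(
        projection_id
        for projection_id, events in event_log.items()
        if stored.get(projection_id) != rebuild_projection(events, handler)
    )
```

Because events are the source of truth, a projection that fails this comparison can be replaced with the rebuilt state.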

Conclusion

We know how much you rely on Serialized for your projects and businesses to succeed. We’re passionate about the availability of our services and the correctness of your data. We will continue to analyze this event and continuously improve to serve you better and earn the trust you place in us.

Posted Sep 20, 2022 - 09:55 CEST

Resolved
This incident has been resolved.
Posted Sep 19, 2022 - 21:03 CEST
Monitoring
A fix has been implemented and we are monitoring the results.
Posted Sep 19, 2022 - 20:58 CEST
Update
We have identified the problem and a fix is being rolled out. The dashboard is now operational again.
Posted Sep 19, 2022 - 20:14 CEST
Identified
The issue has been identified and a fix is being implemented.
Posted Sep 19, 2022 - 18:50 CEST
Update
We are continuing to investigate this issue.
Posted Sep 19, 2022 - 12:45 CEST
Update
We're seeing high error rates in the Projections API. We're investigating the issue and working hard on finding the root cause.
Posted Sep 19, 2022 - 12:44 CEST
Update
We are continuing to investigate this issue.
Posted Sep 19, 2022 - 12:43 CEST
Investigating
We are currently investigating this issue.
Posted Sep 19, 2022 - 12:43 CEST
This incident affected: Serialized Dashboard and Projections.