CyberGrants Performance

Postmortem

Issue Summary:

On Sunday 5th October 2025, from approximately 04:28 to 05:48 ET, some CyberGrants users were unable to use the full functionality of the product. The same issue recurred later that day from 21:55 to 08:47 ET the following day, Monday 6th October. The situation was resolved when upon investigation, a series of massive reports, concurrently running, were manually stopped allowing affected servers to return to their normal operational state.

Root Cause:

Some Application servers became overloaded after a massive report, that would take well over an hour to complete, was requested multiple times in rapid succession over a prolonged period. For comparison, the vast majority of other reports are less than a thousandth of the size of this one. The report in question is an anomaly and consumes a significant amount of memory and CPU power. Requesting the report over and over, each request coming after a fraction of the time it takes to complete it, caused server after server to become completely utilized and the request, as far as the requestor was able to see, time out, although each request was still running in the background.

Prevention:

  • The ability to request this report in the form it was, has been removed
  • Any report that would consume over a certain amount of compute power and therefore an unusual length of time to complete, is being prevented from running concurrently for the same user
Posted Oct 10, 2025 - 09:17 EDT

Resolved

The issue has been resolved and an RCA will be posted later this week. Thank You.
Posted Oct 06, 2025 - 16:53 EDT

Update

We will continuing to monitor performance for the remainder of the day to ensure there is no regression. Please continue to monitor this page for further updates,
Posted Oct 06, 2025 - 13:04 EDT

Monitoring

Service performance has returned to normal. We’re continuing to investigate the root cause and are closely monitoring for any recurrence. Further updates will be posted on this page.
Posted Oct 06, 2025 - 08:32 EDT

Update

We are continuing to investigate a partial outage impacting CyberGrants. You may experience failed requests, timeouts, or intermittent errors. We’re actively mitigating and expect improvement shortly. Further updates will be posted on this page.
Posted Oct 06, 2025 - 08:15 EDT

Investigating

Degraded performance detected. You may notice slower processing times or intermittent errors. We’re actively mitigating and expect improvement shortly. Further updates will be posted on this page
Posted Oct 06, 2025 - 06:55 EDT
This incident affected: CyberGrants.