Performance Event: 2023-02-28
Issue: Issues reported within Aware Online Testing with submitting assessments including prolonged loading times.
Event start time: 10:00 AM CST
Event end time: 11:41 AM CST
Status: Resolved; continued monitoring.
Summary of most recent events
An influx of traffic to online testing spiked the CPU and memory on containers in the pool before it could automatically scale. This added traffic caused increased database connectivity as well and spiked all Aurora clusters connections to over 1000. Increasing the number of containers from 90 to 150 stabilized the system. Performance was monitored throughout the day.
Immediate next steps for Eduphoria
- Server capacity has been permanently increased to better accommodate demand spikes
- Engineering teams created additional alerts to enhance server monitoring
- Continued optimization of Aware Online Testing scalability; additional details pending