You are an AI security engineer responsible for a customer-facing LLM that summarizes financial documents. An internal red team is running a 60-minute timed exercise to find jailbreaks, data exfiltration paths, and abuse vectors. You have access to the system prompt, the output filter, the rate limiter, and the model's tool configuration.
Each round, the red team tries a new attack class. You have minutes to respond. Over-blocking damages user trust. Under-blocking ships a vulnerability. Your job is the right defense at the right layer.
This scenario tests defensive design under time pressure. The skills carry to real AI red-team engagements where the team has hours, not days, to harden a production system before launch.
Time-pressured. A live threat actor panel updates every few seconds with new actions you must address.
Step timers count down. Color shifts and pulse cues warn at 25%, 10%, and 5% time remaining. Score decays over time.