Visual / Systems Simulation

Power Grid Monitor

The task is a dashboard for a regional power grid, with power plants, substations, live readings, alarms, and outage checks. The numbers and map should change together, like a real monitoring screen rather than a static diagram.

Prompt

Build an interactive 2D SCADA-style monitoring dashboard for a regional power grid with 12 substations, 3 generation plants (1 nuclear, 1 gas, 1 wind farm with 40 turbines), and transmission lines. Simulate real-time telemetry via a producer-consumer pattern: a backend simulator produces load/voltage/frequency data at 1Hz,...
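The producer-consumer pattern the prompt asks for can be sketched as follows. This is a minimal illustration, not the benchmark's reference implementation: the names `Reading`, `produce`, `consume`, and `run_demo` are hypothetical, and the value ranges (50 Hz nominal frequency, 230 kV nominal voltage, 20 to 120 MW load) are assumptions chosen only to make the example concrete.

```python
import queue
import random
import threading
import time
from dataclasses import dataclass


@dataclass
class Reading:
    """One telemetry sample for one substation (hypothetical schema)."""
    substation_id: int
    load_mw: float
    voltage_kv: float
    frequency_hz: float


def produce(out, ticks, interval_s=1.0, substations=12, seed=42):
    """Producer: emit one reading per substation per tick (1 Hz by default)."""
    rng = random.Random(seed)
    for _ in range(ticks):
        for sid in range(1, substations + 1):
            out.put(Reading(
                substation_id=sid,
                load_mw=rng.uniform(20.0, 120.0),   # assumed load range
                voltage_kv=rng.gauss(230.0, 2.0),   # assumed nominal voltage
                frequency_hz=rng.gauss(50.0, 0.02), # assumed nominal frequency
            ))
        time.sleep(interval_s)
    out.put(None)  # sentinel: tells the consumer the stream has ended


def consume(inp):
    """Consumer: keep the latest reading per substation until the sentinel."""
    latest = {}
    while True:
        item = inp.get()
        if item is None:
            break
        latest[item.substation_id] = item
    return latest


def run_demo(ticks=3, interval_s=1.0):
    """Run the producer in a background thread and consume on the caller's thread."""
    q = queue.Queue(maxsize=100)  # bounded queue applies back-pressure
    producer = threading.Thread(target=produce, args=(q, ticks, interval_s), daemon=True)
    producer.start()
    latest = consume(q)
    producer.join()
    return latest
```

Calling `run_demo(ticks=3)` runs for about three seconds and returns a dictionary holding the most recent reading for each of the 12 substations. A dashboard would typically replace `consume` with a loop that pushes each reading to the UI instead of collecting it.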

Max tokens: 100K
temperature: 0
top_p: 1
seed: 42
presence_penalty: 0
frequency_penalty: 0
Reasoning effort: High
Execution: Single-shot via API

Fortytwo Prime

Fortytwo

PASS 5 / 5

Fortytwo Prime passes all five Power Grid criteria. It renders a visible power-grid topology with three generation sources and interactive telemetry, projects node and line failures, encodes MW flow as line thickness, raises real-time alarms, and responds to generation trips with changes in output, reserves, and frequency.
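One way to read the MW-weighted line thickness criterion is a linear mapping from power flow to stroke width. The sketch below is an assumption about how such an encoding could work, not a description of either model's output. The function name `line_width_px` and the pixel bounds are hypothetical.

```python
def line_width_px(flow_mw, max_flow_mw, min_px=1.0, max_px=8.0):
    """Map a line's power flow (MW) to a stroke width in pixels.

    Uses the magnitude of the flow, so direction does not affect width,
    and clamps flows above max_flow_mw to the maximum width.
    """
    if max_flow_mw <= 0:
        return min_px  # no meaningful scale; fall back to the thinnest line
    fraction = min(abs(flow_mw) / max_flow_mw, 1.0)
    return min_px + fraction * (max_px - min_px)
```

With the default bounds, a line carrying half of the maximum flow is drawn 4.5 px wide, and an idle line stays visible at 1 px.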

vs
Qwen 3.7 Plus

Alibaba

MIXED 0 / 5

Qwen 3.7 Plus renders a topology with 3 generator nodes, 12 substations, live telemetry, alarms, and an N-1 report after selecting an asset and running analysis. But the grid starts under-generated at low frequency, many assets show 0 MW on one side of their labels, line thickness is not a convincing MW encoding, the load-spike button throws a runtime error, and there are no generation-trip controls.
