Visual / Systems Simulation

City Water System Simulator

The water-system prompt asks for a city dashboard with a reservoir, treatment plants, neighborhoods, pipe flows, and failures. When a pipe bursts or demand spikes, pressure and supply should change across the city.

Prompt

Build a 2D simulation of a city's drinking water system: 1 reservoir, 3 treatment plants, a network of pipes, and 20 neighborhoods with varying demand. A demand producer makes households and businesses use water throughout the day (morning shower peak, midday for businesses), and a distribution consumer balances flow from plants to keep pressure stable everywhere....

Max tokens
100K
temperature
0
top_p
1
seed
42
presence_penalty
0
frequency_penalty
0
Reasoning effort
High
Execution
Single-shot via API

Fortytwo Prime

Fortytwo

PASS5 / 5

Fortytwo Prime passes all five City Water System criteria, showing the full reservoir-plant-neighborhood map, pressure and flow visuals, time-based demand, event triggers, and live event feedback.

vs
Qwen 3.7 PlusAlibaba
MIXED1 / 5

Qwen 3.7 Plus renders a pipe network with three treatment plants, neighborhood gauges, flow animation, and the required heatwave, pipe-burst, and plant-offline buttons. However, the reservoir is not clearly represented as a map element or level indicator, the system starts around 2 PSI despite full reservoir and online plants, and event effects are shallow: demand and average pressure change, but the map/telemetry do not clearly show a coherent plant-offline or reservoir response.

Model verdicts

Have a complex task to evaluate?

Request a custom evaluation for your use case.

Request a demo →