Visual / Systems Simulation

City Water System Simulator

The water-system prompt asks for a city dashboard with a reservoir, treatment plants, neighborhoods, pipe flows, and failures. When a pipe bursts or demand spikes, pressure and supply should change across the city.

Prompt

Build a 2D simulation of a city's drinking water system: 1 reservoir, 3 treatment plants, a network of pipes, and 20 neighborhoods with varying demand. A demand producer makes households and businesses use water throughout the day (morning shower peak, midday for businesses), and a distribution consumer balances flow from plants to keep pressure stable everywhere....

Max tokens
100K
temperature
0
top_p
1
seed
42
presence_penalty
0
frequency_penalty
0
Reasoning effort
High
Execution
Single-shot via API

Fortytwo Prime

Fortytwo

PASS5 / 5

Fortytwo Prime passes all five City Water System criteria, showing the full reservoir-plant-neighborhood map, pressure and flow visuals, time-based demand, event triggers, and live event feedback.

vs
DeepSeek V4 FlashDeepSeek
MIXED2 / 5

DeepSeek V4 Flash renders a live water-system dashboard with reservoir level, plant status, neighborhood pressures, and the required event buttons. After triggering heatwave, pipe burst, and plant-offline events, the reservoir, pressure readings, alerts, and some map colors update. However, the main map is badly clipped and overcrowded, so the 20 neighborhoods and all three plants are not cleanly visible on the map, and pipe thickness, direction, and per-neighborhood gauges are only partially legible.

Model verdicts

Have a complex task to evaluate?

Request a custom evaluation for your use case.

Request a demo →