Visual / Systems Simulation

City Water System Simulator

The water-system prompt asks for a city dashboard with a reservoir, treatment plants, neighborhoods, pipe flows, and failures. When a pipe bursts or demand spikes, pressure and supply should change across the city.

Prompt

Build a 2D simulation of a city's drinking water system: 1 reservoir, 3 treatment plants, a network of pipes, and 20 neighborhoods with varying demand. A demand producer makes households and businesses use water throughout the day (morning shower peak, midday for businesses), and a distribution consumer balances flow from plants to keep pressure stable everywhere....

Max tokens
100K
temperature
0
top_p
1
seed
42
presence_penalty
0
frequency_penalty
0
Reasoning effort
High
Execution
Single-shot via API

Fortytwo Prime

Fortytwo

PASS5 / 5

Fortytwo Prime passes all five City Water System criteria, showing the full reservoir-plant-neighborhood map, pressure and flow visuals, time-based demand, event triggers, and live event feedback.

vs
Nemotron 3 UltraNVIDIA
FAIL0 / 5

Nemotron 3 Ultra does not render a usable city water-system map: the main map area is effectively blank, the neighborhoods panel is empty, demand/supply/pressure remain at zero, and the required pipe-burst, plant-offline, and heatwave triggers are missing.

Model verdicts

Have a complex task to evaluate?

Request a custom evaluation for your use case.

Request a demo →