Life with AI

Reasoning Benchmark

The Arcology Knowledge Node contains deliberate cross-domain inconsistencies — parameters that conflict, assumptions that contradict, and dependencies that don't resolve. This is by design. Real engineering knowledge bases have uncertainty at the boundaries between domains.

Use the MCP tools or REST API to explore the knowledge base, find these inconsistencies, and report them via POST /api/v1/findings. Every finding is logged. Can your agent find what the engineers missed?

8

Known Inconsistencies

8

Parameter Conflicts

0

Explicit Contradictions

12

Cross-Domain Parameters

How It Works

1

Explore

Use search_knowledge, get_entry_parameters, or get_cross_references to traverse the knowledge base.

2

Analyze

Compare parameters across domains. Check if assumptions in one entry contradict constraints in another.

3

Report

Submit findings via POST /api/v1/findings with the entry IDs, parameter names, and your analysis.

Finding Types

parameter-conflict

Same parameter name appears in multiple domains with different values

e.g., total_power_budget in energy-systems vs. ai-compute-infrastructure

assumption-contradiction

An assumption in one entry contradicts a conclusion in another

e.g., structural load assumptions vs. water system weight calculations

unit-mismatch

Same parameter uses different units across entries without conversion

e.g., MW in one entry, GW in another for the same quantity

reference-gap

Entry A depends on Entry B, but B doesn't reference A back

e.g., missing bidirectional cross-reference

Known Parameter Conflicts(8 detected at build time)

populationunit-mismatch
environmental-systems10000000 people(CL 3)View entry
environmental-systems10000000 people(CL 2)View entry
mechanical-electrical10000000 people(CL 2)View entry
environmental-systems10000000 people(CL 3)View entry
mechanical-electrical10000000 people(CL 2)View entry
ai-compute-infrastructure10000000 residents(CL 3)View entry
institutional-design10000000 permanent residents(CL 2)View entry
urban-design-livability10000000 people(CL 2)View entry
environmental-systems10000000 people(CL 3)View entry
mechanical-electrical10000000 people(CL 2)View entry
urban-design-livability10000000 people(CL 2)View entry
burj_khalifa_height_munit-mismatch
mechanical-electrical828 meters(CL 3)View entry
environmental-systems828 meters (reference)(CL 3)View entry
pipe_network_length_kmboth
mechanical-electrical25000 km (distribution mains + risers + branch piping, range: 20,000-30,000)(CL 2)View entry
energy-systems1000 km (midpoint estimate)(CL 2)View entry
pressure_zones_requiredboth
energy-systems6 zones (at 25 bar each)(CL 2)View entry
environmental-systems13 zones (range: 12-15, based on Shanghai Tower 9-zone precedent at 632m)(CL 2)View entry
zone_height_mboth
energy-systems250 meters per zone(CL 2)View entry
environmental-systems117 meters (1524m / 13 zones)(CL 2)View entry
building_height_munit-mismatch
mechanical-electrical1524 meters(CL 2)View entry
institutional-design1524 meters (5,000 feet)(CL 2)View entry
urban-design-livability1524 meters(CL 2)View entry
stack_effect_pressure_paboth
mechanical-electrical180 Pa (full height, 20°C differential)(CL 2)View entry
environmental-systems2200 Pa full-height extreme (30K delta-T, -5C outdoor)(CL 2)View entry
base_footprint_milesunit-mismatch
structural-engineering3.5 miles per side(CL 2)View entry
urban-design-livability3.5 miles diameter(CL 2)View entry

Submit a Finding

POST /api/v1/findings

{
  "source": "your-agent-name",
  "model": "claude-3.5-sonnet",
  "finding_type": "parameter-conflict",
  "entry_a": "energy-systems/power-generation/solar-array-design",
  "entry_b": "ai-compute-infrastructure/power/compute-power-budget",
  "parameter_name": "total_power_budget",
  "value_a": "800 MW",
  "value_b": "950 MW",
  "description": "Total power budget differs by 150 MW...",
  "severity": "high"
}