Sandbox 04
Calibration Check
Every claim is delivered in the same flat, authoritative voice an AI assistant would use. Rate your confidence on each one, then see whether your certainty actually predicted your accuracy — the exact gap that produces an AI hallucination.
See this sandbox in its lesson contextClaim 1 of 120 correct so far
The Great Wall of China is visible to the naked eye from space.
True or false?
How confident are you?