The experiment used a series of two-way tournaments of the Khan Game, in which Claude Sonnet 4, GPT-5.2 and Gemini 3 Flash ...