GPT-4.5 Preview takes first place in the Elimination Game Benchmark, which tests social reasoning (forming alliances, deception, appearing non-threatening, and persuading the jury).
GPT-4.5 Preview takes first place in the Elimination Game Benchmark, which tests social reasoning (forming alliances, deception, appearing non-threatening, and persuading the jury).