artificial Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure /u/zero0_one1 January 22, 2025 January 22, 2025 submitted by /u/zero0_one1 [link] [comments]