Hacker News

Multi-Agent Step Race Benchmark: LLM Collaboration and Deception Under Pressure

by zone411on 1/22/2025, 4:17:47 PM with 2 comments

by celerrimuson 1/22/2025, 6:41:22 PM
interesting results, thank you!