VideoGameBench from Princeton: Can vision-language models play 90s video games?

  • Wow, so without scaffolding the LLMs can't solve any of these games... Super cool work!