balrog-ai/BALROG
Benchmark for agentic LLM and VLM reasoning on games

View on index · View in 3D Map
// SURVEILLANCE FEED
Discovered repositories from the open source frontier
Benchmark for agentic LLM and VLM reasoning on games

View on index · View in 3D Map