All the AI labs are talking about achieving "Artificial General Intelligence" but while I've seen a huge amount of progress, I am not able to point any of the AIs at a game on Steam and have the AI win the game.
In order to help motivate people to push the boundaries of what's possible, I'm putting $10k on the line to have an AI that autonomously plays Gambonanza, without any prior knowledge and beat it.
Finishing a video game seems like a good next step for AIs. It tests the ability to understand what's on screen, figuring out game mechanics, developing short term strategies and long term planning, working with incomplete information, deciding where to spend time...
Gambonanza seems like a good fit for the challenge. I beat it in around two hours the first time playing casually. This is turn by turn with no time limit. There is a very predictable set of screens with elements at fixed positions and animations don't matter. The progression is very easily measured. Winning requires building an actual strategy that is run dependent as there is an element of chance, but a good human can easily beat it every time.
Rules
- The game for the challenge is Gambonanza. A turn by turn chess-inspired roguelite.
- Note: if the game rules and strategies gets ingested in the model training datasets, I reserve the right to switch to another similar game.
- Once the game starts, the AI has to be 100% autonomous without any human input at all.
- The challenge is complete when the game shows the win screen (at the standard difficulty).
- You need to run the game once. If the game exits, the challenge has failed.
- The AI can only interact with the game through reading pixels and executing mouse inputs. It cannot read / write the game memory.
- You can implement any program, agentic loop, prompts... before hand as long as it doesn't encode game-specific logic.
- The AI can search the internet as long as it doesn't see any game-specific logic.
- Searching for how to build a chess optimizer is fine.
- Searching for strategies on how to beat the specific game mechanics is not allowed.
- The AI can write code and execute it.
How to submit
You need to provide:
- A screenshot of the win screen.
- The entire conversation, including subagents.
- Some kind of trace of what happened during the winning run. Can be screenshots at each step, a list of game states, a series of actions...
- Any code / assets that were generated that were used in the winning run.
- The initial set of prompts, agents, programs...
Send an email to [email protected] with the above and I'll review the findings. If I believe that it follows the rules, I will award the $10k bounty.
There is no expiry date on the challenge (with the caveat that I don't know what my financial situation will be many years in the future and may not be in a position to pay).
Disclaimer: this challenge is not affiliated with my employer nor the game developer (I just happen to like the game!).