All the AI labs are talking about achieving "Artificial General Intelligence", but while I've seen a huge amount of progress, I still can't point any of the AIs at a game on Steam and have it win the game.

To help motivate people to push the boundaries of what's possible, I'm putting $10k on the line for an AI that autonomously plays Gambonanza, without any prior knowledge, and beats it.

Finishing a video game seems like a good next step for AIs. It tests the ability to understand what's on screen, figure out game mechanics, develop short-term strategies and long-term plans, work with incomplete information, decide where to spend time...

Gambonanza seems like a good fit for the challenge. I beat it in around two hours the first time, playing casually. It is turn by turn with no time limit. There is a very predictable set of screens with elements at fixed positions, and animations don't matter. Progress is very easy to measure. Winning requires building an actual strategy that is run-dependent, as there is an element of chance, but a good human player can easily beat it every time.

Rules

  • The game for the challenge is Gambonanza, a turn by turn chess-inspired roguelite.
    • Note: if the game rules and strategies get ingested into model training datasets, I reserve the right to switch to another similar game.
  • Once the game starts, the AI has to be 100% autonomous without any human input at all.
  • The challenge is complete when the game shows the win screen (at the standard difficulty).
  • The game must be launched only once. If the game exits, the challenge has failed.
  • The AI can only interact with the game through reading pixels and executing mouse inputs. It cannot read / write the game memory.
  • You can implement any program, agentic loop, prompts... beforehand as long as it doesn't encode game-specific logic.
  • The AI can search the internet as long as it doesn't see any game-specific logic.
    • Searching for how to build a chess optimizer is fine.
    • Searching for strategies on how to beat the specific game mechanics is not allowed.
  • The AI can write code and execute it.
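
To make the interaction constraint above concrete, here is a minimal Python sketch of the kind of loop the rules allow: pixels in, mouse clicks out, nothing else. Everything in it is an assumption made for illustration and not part of the challenge: the names `GameIO`, `Click`, `run_agent` and `FakeGame` are hypothetical, the "screenshot" is just a nested list of integers, and the stand-in game is a toy that is won after a fixed number of clicks. A real entry would plug an actual screen-capture and mouse-input backend behind the same two methods, and the model would decide the actions.

```python
from dataclasses import dataclass
from typing import Callable, List, Protocol, Tuple

# Hypothetical stand-in for a screenshot: rows of pixel values.
Pixels = List[List[int]]


@dataclass(frozen=True)
class Click:
    """A single mouse click at screen coordinates (x, y)."""
    x: int
    y: int


class GameIO(Protocol):
    """The only two channels allowed by the rules (names are assumptions)."""

    def read_pixels(self) -> Pixels: ...

    def click(self, action: Click) -> None: ...


def run_agent(
    io: GameIO,
    choose_action: Callable[[Pixels, List[Click]], Click],
    is_win_screen: Callable[[Pixels], bool],
    max_steps: int,
) -> Tuple[bool, List[Click]]:
    """Observe, decide, click, and record every action taken.

    Returns (won, trace). No human input happens inside the loop.
    """
    trace: List[Click] = []
    for _ in range(max_steps):
        frame = io.read_pixels()
        if is_win_screen(frame):
            return True, trace
        action = choose_action(frame, trace)
        io.click(action)
        trace.append(action)
    return False, trace


class FakeGame:
    """Toy stand-in for a real game: it is 'won' after N clicks."""

    def __init__(self, clicks_to_win: int) -> None:
        self.remaining = clicks_to_win

    def read_pixels(self) -> Pixels:
        # The whole 'screen' is one pixel holding the remaining click count.
        return [[self.remaining]]

    def click(self, action: Click) -> None:
        self.remaining = max(0, self.remaining - 1)


if __name__ == "__main__":
    won, trace = run_agent(
        FakeGame(clicks_to_win=3),
        choose_action=lambda frame, trace: Click(0, 0),
        is_win_screen=lambda frame: frame[0][0] == 0,
        max_steps=10,
    )
    print(won, len(trace))
```

Recording each action as it is taken is a deliberate choice in this sketch: the list of actions can double as the run trace that a submission has to include.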

How to submit

You need to provide:

  • A screenshot of the win screen.
  • The entire conversation, including subagents.
  • Some kind of trace of what happened during the winning run. This can be screenshots at each step, a list of game states, a series of actions...
  • Any generated code / assets that were used in the winning run.
  • The initial set of prompts, agents, programs...

Send an email to [email protected] with the above and I'll review the findings. If I believe that it follows the rules, I will award the $10k bounty.

There is no expiry date on the challenge (with the caveat that I don't know what my financial situation will be many years in the future and may not be in a position to pay).

Disclaimer: this challenge is not affiliated with my employer nor the game developer (I just happen to like the game!).
