tl;dr I developed an AI solution (MCTS + NN) inspired by AlphaZero for playing Ultimate Tic-Tac-Toe in the browser. You can try it yourself here: https://uttt.ai.
Why?
Ever since I started working in Machine Learning 5 years ago, I have always wanted to do a cool project for my portfolio. Reading scientific papers gave me plenty of ideas, but only after I read the AlphaZero preprint did I know: this is it!
AlphaZero is the third paper in the AlphaGo, AlphaGo Zero, AlphaZero, MuZero sequence. In the AlphaZero paper, DeepMind generalizes the previous work so that the AI can learn through self-play to master not only Go, but also Chess and Shogi. I had read the previous papers, but it was AlphaZero specifically that sparked my imagination, probably because I love simple and elegant engineering solutions, and AlphaZero is mostly about that.
I discovered Ultimate Tic-Tac-Toe and implemented AlphaZero in early 2018. After a few weeks of work I realized it was not going to be an easy ride. There were two major problems that essentially made me forget about this project for a long time.
Firstly, although Ultimate Tic-Tac-Toe (UTTT) looks easier than Chess or Go, it is still quite a challenging game. The average length of a UTTT game is somewhere between 40 and 50 plies, and the average number of legal actions per position is around 7 (my estimate from self-play data). That is a difficult setup for a side project. One of the key factors behind AlphaZero's success is massive computing power (5,000 TPU v1 chips for self-play and 64 TPU v2 chips for training). I had to figure out a much cheaper way to develop a genuinely good AI within my personal budget.
Secondly, when I envisioned deploying AlphaZero in the browser, I had zero knowledge of web development and frontend in general, which meant I had to find time to learn it. That is not easy when you already have a full-time job and other things going on in your life. I put the whole project on hold and told myself: "maybe one day there will be a better time for this..."
Fast-forward to 2021. I left my job and decided to spend a year on a career break, pursuing my interests. I realized that I finally had enough time and resources to conquer this project. I learned the basics of web browsers, HTML, CSS, JavaScript and React. I bought a desktop PC. I incrementally redesigned the AlphaZero self-play training into something that could run on my own machine. I evaluated the AI and confirmed it is superior to the existing implementations you can access online (other websites and mobile apps). I built a React app, tested it and finally deployed it this week: https://uttt.ai.
Differences from the original AlphaZero:
- Much smaller Policy-Value Network architecture designed specifically for playing Ultimate Tic-Tac-Toe in the browser, with only 5 million parameters (20 MB): source code
- Total separation of the self-play data generation process from the Policy-Value Network training (offline RL). This was a crucial change: there is no way I could have succeeded with a single script that implements online RL and runs for 10 weeks on my desktop. It had to be broken down into more manageable stages.
- More MCTS simulations per position for training (self-play data quality over quantity).
- The initial self-play dataset was generated from pure MCTS simulations (random playouts are faster and better than predictions from a randomly initialized Policy-Value Network).
- Search simulations are synchronous, single-threaded and sequential.
- Enabled data augmentation by flipping the board during Policy-Value Network training.
- The value target for the MSE loss function is defined as the root's mean state value rather than the game outcome.
- Masked KL divergence loss for the policy head instead of Cross Entropy loss.
- An auxiliary policy-head loss for predicting action values alongside the action logits (the last three points are sketched in code right after this list).
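To make the last three points concrete, here is a minimal PyTorch sketch of what such a combined loss can look like. The tensor names, shapes and the unweighted sum of the three terms are my illustrative assumptions, not the exact code from the repository:

```python
import torch
import torch.nn.functional as F

# Illustrative shapes: batch of B positions, 81 possible actions in UTTT.
# policy_logits, action_value_pred: (B, 81) network outputs
# value_pred: (B,) state-value head output
# legal_mask: (B, 81) boolean, True for legal actions
# target_policy: (B, 81) MCTS visit-count distribution (zero on illegal actions)
# target_value: (B,) root's mean state value from search (not the game outcome)
# target_action_values: (B, 81) per-action values estimated by MCTS

def training_loss(policy_logits, value_pred, action_value_pred,
                  legal_mask, target_policy, target_value, target_action_values):
    # Mask illegal actions before the softmax so they get zero probability.
    masked_logits = policy_logits.masked_fill(~legal_mask, float("-inf"))
    log_probs = F.log_softmax(masked_logits, dim=-1)

    # KL divergence between the MCTS policy and the network policy,
    # accumulated only over legal actions.
    kl = target_policy * (torch.log(target_policy.clamp_min(1e-8)) - log_probs)
    policy_loss = kl.masked_fill(~legal_mask, 0.0).sum(dim=-1).mean()

    # MSE against the root's mean state value rather than the final game outcome.
    value_loss = F.mse_loss(value_pred, target_value)

    # Auxiliary head: regress per-action values, again masked to legal actions.
    av_sq_err = (action_value_pred - target_action_values) ** 2
    aux_loss = av_sq_err.masked_fill(~legal_mask, 0.0).sum(dim=-1).mean()

    return policy_loss + value_loss + aux_loss
```

How the three terms are weighted against each other is a tuning choice; the sketch uses an unweighted sum for simplicity.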
There is no external benchmark to compare my solution with, so I came up with my own evaluation setup. Details are at https://github.com/ar-nowaczynski/uttt#evaluation. The main selling point is that the final Policy-Value Network checkpoint with 1k simulations is much better and faster than MCTS with random playouts and 10M simulations (a 4-order-of-magnitude difference). In other words, the Policy-Value Network learned useful information about Ultimate Tic-Tac-Toe, enabling better and faster evaluations.
I haven't found any other publicly available AI for Ultimate Tic-Tac-Toe that can beat https://uttt.ai. The best one I found online is https://www.theofekfoundation.org/games/UltimateTicTacToe/, which implements MCTS with random rollouts and some custom modifications (source code: https://github.com/The-Ofek-Foundation/UltimateTicTacToe/blob/master/script.js). It keeps up with uttt.ai for the first 10-15 moves, but eventually makes some mistakes and loses the game. Sometimes the game ends in a draw.
Various technical details and takeaways from the project:
https://uttt.ai is built using React + onnxruntime and deployed as an Azure Static Web App (shout-out to Microsoft for providing a great service).
The Policy-Value Network runs in the browser on your device using the WebAssembly backend, so it utilizes only the CPU. I wanted to use the WebGL backend, which enables GPU access, but it doesn't support the ConvTranspose2D layer yet. Either that gets added, or I have to rewrite and retrain the Policy-Value Network without ConvTranspose2D.
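If I ever go the retraining route, a common workaround is to replace each ConvTranspose2d with fixed upsampling followed by a regular convolution, both of which are broadly supported. A minimal PyTorch sketch of the idea (the channel counts and scale factor here are illustrative, not my actual architecture):

```python
import torch.nn as nn

# Common substitute for ConvTranspose2d: nearest-neighbor upsampling
# followed by a regular Conv2d. Channel counts and scale are illustrative.
upsample_block = nn.Sequential(
    nn.Upsample(scale_factor=2, mode="nearest"),
    nn.Conv2d(in_channels=64, out_channels=32, kernel_size=3, padding=1),
)
```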
To learn web dev & frontend I read the entire https://javascript.info/ course twice, watched plenty of DevEd videos (https://www.youtube.com/c/DevEd) and implemented many small throwaway projects.
My computing hardware for developing this project was: Intel i7-10700K with 8 cores x 3.80GHz, 2 x RTX 2080 Ti, 64 GB RAM.
The ONNX format is great. You can load a PyTorch model in JavaScript (via https://onnxruntime.ai/) very easily!
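For context, the PyTorch side of that hand-off is a single export call. Here is a minimal, self-contained sketch with a tiny stand-in network; the real Policy-Value Network, its input shape and the tensor names all differ:

```python
import torch
import torch.nn as nn

# Tiny stand-in network just to demonstrate the ONNX export call;
# the actual uttt.ai architecture and tensor layout are different.
class TinyPolicyValueNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(4, 8, kernel_size=3, padding=1)
        self.policy = nn.Linear(8 * 9 * 9, 81)
        self.value = nn.Linear(8 * 9 * 9, 1)

    def forward(self, x):
        h = torch.relu(self.conv(x)).flatten(1)
        return self.policy(h), self.value(h).squeeze(-1)

model = TinyPolicyValueNet().eval()
dummy_input = torch.zeros(1, 4, 9, 9)  # (batch, planes, rows, cols)
torch.onnx.export(
    model, dummy_input, "policy_value_net.onnx",
    input_names=["board"],
    output_names=["policy_logits", "value"],
)
```

The resulting .onnx file is what onnxruntime loads in the browser.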
torch.jit and libtorch are brilliant tools for using a PyTorch model in C++!
https://uttt.ai works best on desktops and gaming laptops. On my desktop: 75 simulations/sec; on my laptop: 70 sims/sec; on my phone: 2.5 sims/sec.
I created a video showing AI self-play with 100,000 simulations: https://www.youtube.com/watch?v=oqbHx3NSzaY. I'm thinking about recording another video with all the games from the NMCTS2 10k vs MCTS 10M evaluation, to show how MCTS is dominated by NMCTS2.
If you don't know the Ultimate Tic-Tac-Toe game rules: https://uttt.ai/rules
My strategy for playing Ultimate Tic-Tac-Toe (learned from the AI) is as follows:
- start in the center square of the center subgame (undoubtedly the best move, unless you want to surprise the opponent with something weird)
- the 'O' response is to push the play into a corner subgame, so let the next 8 moves be played in the corner subgames
- when 'O' breaks out of the corner subgames, jump between the side subgames (these are the least useful to take, but you still have to be careful not to mess up there)
- maintain the overall balance on the board and wait for the opponent's mistake (the game is a marathon, not a sprint)
- think twice before sending your opponent to a finished subgame (being able to choose any move in any unfinished subgame is very powerful; see the sketch below for how this rule works)
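For anyone new to the rules, here is a tiny sketch of the move-generation logic that last tip refers to. The data layout is my illustrative choice, not the representation used in the repository:

```python
# Minimal UTTT legal-move generation, illustrating the rule behind the
# last tip: if your opponent is sent to a finished subgame, they may play
# in ANY unfinished subgame instead.

def legal_moves(boards, finished, last_move):
    """boards[s][c]: cell c of subgame s ('.', 'X' or 'O');
    finished[s]: True if subgame s is already won or drawn;
    last_move: (subgame, cell) of the previous move, or None at the start."""
    if last_move is not None:
        target = last_move[1]  # the cell you play in picks the next subgame
        if not finished[target]:
            return [(target, c) for c in range(9) if boards[target][c] == "."]
    # First move, or the target subgame is finished: play anywhere that's open.
    return [(s, c) for s in range(9) for c in range(9)
            if not finished[s] and boards[s][c] == "."]
```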
source code: https://github.com/ar-nowaczynski/uttt
twitter thread: https://twitter.com/ArNowaczynski/status/1469318918837870593