Watch an AI bot play Go [video]

  • Very cool. I wrote a Go-playing game for the Apple II, which I sold.

    I have recently become interested again in Go programming. I have started setting up a framework that uses GnuGo to generate training data for a deep learning network, starting with a tiny 7x7 board just to see whether the general approach works.

    If it works (at all), then I will post the entire project to GitHub.
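
    For anyone curious what such a harness might look like: here is a minimal sketch of my own (not the poster's code) that drives GnuGo through its GTP mode and records the moves of one self-play game on a 7x7 board. It assumes gnugo is on the PATH; the record format is just one plausible choice.

      # Minimal sketch: drive GnuGo over GTP to record a self-play game on a
      # 7x7 board. Assumes `gnugo` is on the PATH; everything else here is
      # illustrative, not taken from any existing project.
      import subprocess

      def gtp(proc, command):
          """Send one GTP command and return GnuGo's response text."""
          proc.stdin.write(command + "\n")
          proc.stdin.flush()
          lines = []
          while True:
              line = proc.stdout.readline()
              if line.strip() == "":       # GTP responses end with a blank line
                  break
              lines.append(line.strip())
          return lines[0].lstrip("= ")     # drop the "= " success prefix

      def self_play_game(board_size=7, max_moves=100):
          proc = subprocess.Popen(["gnugo", "--mode", "gtp"],
                                  stdin=subprocess.PIPE,
                                  stdout=subprocess.PIPE, text=True)
          gtp(proc, "boardsize %d" % board_size)
          gtp(proc, "clear_board")
          moves, colour = [], "black"
          for _ in range(max_moves):
              move = gtp(proc, "genmove " + colour)   # GnuGo picks its move
              moves.append((colour, move))
              # Two consecutive passes end the game.
              if len(moves) >= 2 and all(m.upper() == "PASS"
                                         for _, m in moves[-2:]):
                  break
              colour = "white" if colour == "black" else "black"
          gtp(proc, "quit")
          proc.wait()
          return moves

    Each recorded (colour, move) pair can then be replayed to reconstruct the board position before the move, giving one (position, next move) training example per move.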

  • What's interesting to me about this is the visual pattern recognition -- rather than just scaling up a giant Monte Carlo search, it plays the way humans (presumably) do, by searching for patterns, which is very cool.

    I was just this weekend going through my go library, and most of the books I have focus on precisely this sort of pattern training. In fact, there's a whole branch of study in go, called "tesuji" in Japanese: essentially, brilliant moves that come out of bad-looking patterns.

    The idea is that you lull your opponent into complacency with your bad-looking pattern, then "BAM", a surprise comes out. My guess is that an RNN could be discovering new tesuji, which would be super cool. If the RNN can look ahead deep enough, then I would think that's a path forward against strong players.

    There's also the reverse, anti-suji: moves that look brilliant but are really terrible.

  • There is something captivating about watching two master AIs battling it out on the Go board; it's almost art, in a way ;)

    Facebook's AI lab will probably reveal more technical details at NIPS 2015. Here's the workshop on Reasoning, Attention, Memory (RAM) they are organizing, which will address some of the challenges in moving from classification and prediction tasks to simple cognition:

    http://www.thespermwhale.com/jaseweston/ram/

  • From the short description, it looks like a project from Facebook about training a deep network to play the game. The resulting bot was quite strong (or at least, quite strong for something that doesn't use Monte Carlo tree search). Without any more details, though, it's hard to tell.

  • "strong human player." Like 1 Dan strong or like 6 kyu strong? It's a meaningless statement.

  • Go should just be labeled Gogame in contrast to Golang.

  • He used Monte Carlo methods rather than DNNs, but this is a nice talk on computer Go by Zach Tellman:

    https://www.youtube.com/watch?v=v5dYE0CMmHQ

  • Btw, why is go-the-game capitalised? We don't capitalise chess, do we?

  • After a few months of playing ... it's as good as a very strong human player.

    This is potentially a really big breakthrough if true, as Go is computationally much harder than chess. Hardly anyone in the West knows about Go, though, so don't expect any headlines.

  • Does "To help teach systems how to plan, Facebook's AI research team has created an AI bot to play the board game Go. " actually teach AI how to "plan" at anything beyond board games?

  • This Facebook video suggests the bot has had months of either training or development, which sounds like it might be learning by playing; reinforcement learning in this space could be very interesting. More details would be good.

    The two breakthrough papers using deep convnets to play Go trained the networks on expert games to predict the expert's next move.

    This is made possible by the huge archives of expert play from online Go servers, which have reached big-data sizes in the last few years.

    Teaching Deep Convolutional Neural Networks to Play Go - Clark & Storkey, Edinburgh Informatics, 2015. http://arxiv.org/abs/1412.3409

    Move Evaluation in Go Using Deep Convolutional Neural Networks - Maddison, Huang, Silver & Sutskever, DeepMind, 2015. http://arxiv.org/abs/1412.6564

    David Silver also has earlier work on reinforcement learning and Go: Reinforcement Learning of Local Shape in the Game of Go - Silver et al., 2007. https://scholar.google.co.uk/citations?view_op=view_citation...

    Go is scored by the territory you occupy and surround, plus the opponent's pieces you capture.
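
    As a simplified illustration of how that counting works, here is a sketch of my own using area-style scoring (it ignores dead-stone removal, which real rulesets handle before counting): stones count for their owner, and an empty region counts only if all the stones bordering it are one colour.

      # Area-style scoring sketch: score = own stones on the board, plus empty
      # regions bordered only by one colour. `board` maps (row, col) to 'b',
      # 'w' or '.' for every point. Illustrative only; ignores dead stones.
      def area_score(board, size):
          score = {"b": 0, "w": 0}
          for colour in board.values():
              if colour in score:
                  score[colour] += 1        # stones count for their owner
          seen = set()
          for point, colour in board.items():
              if colour != "." or point in seen:
                  continue
              # Flood-fill this empty region, noting which colours border it.
              region, borders, stack = set(), set(), [point]
              while stack:
                  r, c = stack.pop()
                  if (r, c) in region:
                      continue
                  region.add((r, c))
                  for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
                      if 0 <= nr < size and 0 <= nc < size:
                          neighbour = board[(nr, nc)]
                          if neighbour == ".":
                              stack.append((nr, nc))
                          else:
                              borders.add(neighbour)
              seen |= region
              if len(borders) == 1:         # territory needs a single bordering colour
                  score[borders.pop()] += len(region)
          return score

      # Tiny 2x2 example: black stones on top, black territory below.
      tiny = {(0, 0): "b", (0, 1): "b", (1, 0): ".", (1, 1): "."}
      print(area_score(tiny, 2))            # {'b': 4, 'w': 0}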

    The current state-of-the-art programs use Monte Carlo tree search, with each move evaluated by random playouts. This suits Go well because a playout gives a reasonable comparative estimate of the territory that will result from a move.

    Since the Go search tree branches heavily (~200 branches per play), tree search takes some time and cannot be exhaustive.
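
    To make the random-playout idea concrete, here is a rough sketch of flat Monte Carlo move selection (my own illustration: the game object with legal_moves()/play()/winner() is an assumed interface, and real engines grow a UCT tree rather than sampling every move independently).

      # Flat Monte Carlo move selection: try each legal move, estimate it by
      # the win rate of uniformly random playouts to the end of the game, and
      # keep the best. The `game` interface is assumed for illustration.
      import random

      def random_playout(game, colour):
          """Play random legal moves to the end; True if `colour` wins."""
          g = game.copy()
          while not g.is_over():
              g.play(random.choice(g.legal_moves()))
          return g.winner() == colour

      def monte_carlo_move(game, colour, playouts_per_move=100):
          best_move, best_rate = None, -1.0
          for move in game.legal_moves():
              g = game.copy()
              g.play(move)
              wins = sum(random_playout(g, colour)
                         for _ in range(playouts_per_move))
              rate = wins / playouts_per_move
              if rate > best_rate:
                  best_move, best_rate = move, rate
          return best_move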

    The expert-move-predicting convnets provide very good moves very quickly: in a single forward pass they produce, for every point on the board, the probability that the expert would play there.

    Most current development centers on using the convnets to prune the search trees of the random-playout MCTS engines.
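
    For a feel of what a move-prediction net computes, here is a toy sketch of my own (a single randomly initialised convolution plus a softmax, in plain numpy): far simpler than the ~12-layer trained networks in the papers above, but it shows how one forward pass yields a probability for every point on the board.

      # Toy "policy net": one convolution over board feature planes, then a
      # softmax giving a probability per intersection in a single forward
      # pass. Untrained and far smaller than the networks in the papers.
      import numpy as np

      SIZE = 19
      PLANES = 3   # e.g. own stones, opponent stones, empty points

      def conv2d_same(x, kernels):
          """x: (PLANES, SIZE, SIZE); kernels: (PLANES, k, k) -> (SIZE, SIZE)."""
          k = kernels.shape[-1]
          pad = k // 2
          xp = np.pad(x, ((0, 0), (pad, pad), (pad, pad)))
          out = np.zeros((SIZE, SIZE))
          for r in range(SIZE):
              for c in range(SIZE):
                  out[r, c] = np.sum(xp[:, r:r + k, c:c + k] * kernels)
          return out

      def move_probabilities(board_planes, kernels):
          logits = conv2d_same(board_planes, kernels).ravel()
          exp = np.exp(logits - logits.max())   # numerically stable softmax
          return (exp / exp.sum()).reshape(SIZE, SIZE)

      # Usage: random (untrained) weights on an empty board.
      np.random.seed(0)
      kernels = np.random.randn(PLANES, 5, 5)
      board = np.zeros((PLANES, SIZE, SIZE))
      board[2] = 1.0                            # the "empty" plane is all ones
      probs = move_probabilities(board, kernels)  # sums to 1 over 361 points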

    Go is useful from an AI perspective because it has a huge state space and its patterns are subtle, with long-term implications. No computer can yet beat the best humans, so it is an area where natural intelligence still beats artificial.

    Yet the convnets are not doing planning, only predicting what an expert human would do next; the planning is implicit in the dataset rather than part of the convnet itself (though the internals of deep nets are mysterious, so who knows).

    The Computer Go mailing list is very lively and full of the creators of the best programs, who hold regular computer tournaments against each other. http://computer-go.org/mailman/listinfo/computer-go

    The Facebook video shows GnuGo as the opposing player. GnuGo is not the best computer player, which suggests Facebook's player is not yet as strong as the convnets. The video also does not say how deep the tree-search playouts are, i.e. what level GnuGo is playing at, so this cannot really be compared to recent work.

    Hopefully Facebook will publish more details soon; if their system is learning through play, the results may well be interesting.

    Remi Coulom's excellent slides detail the Monte Carlo revolution in Go programs. http://www.remi-coulom.fr/JFFoS/JFFoS.pdf

    AFAIK, MCTS with random playouts and hand-coded heuristics is still SOTA.

    When I visited the Edinburgh Uni Informatics Department in the summer, convnet Go was in active development.