Post

How to get URL link on X (Twitter) App

On the Twitter thread, click on or icon on the bottom
Click again on or Share Via icon
Click on Copy Link to Tweet
Paste it above and click "Unroll Thread"!
More info at Twitter Help

Bram Cohen🌱

@bramcohen

Jul 9, 2020 • 15 tweets • 3 min read • Read on X

Scrolly

And now some thoughts on a possible approach to making a deep learning Chess engine (thread)

What I'm about to say is completely speculative and may completely fail, have already been thought of before, or both, but it's an interesting thought I wan to get out

Going over how engines work there are a few misconceptions and a lot of possible mixing and matching which could be done. PUCT looks an awful lot like alpha-beta, and in fact the two could work together by, for example, having a project to generate a Chess opening book by

Having a single machine which is doing PUCT for the book from the starting position and handing out leaf positions to volunteer machines on the internet to evaluate with an A/B engine (or NN, lots of mixing and matching is possible)

Feel free to go ahead and build that idea as a project if you like it. Also notably PUCT isn't what most people think of as MCTS, in that it doesn't necessarily include many deep runouts and can even be implemented completely deterministically

General commentary aside I'd like to draw attention to the Stoofvlees approach, which is that it trains a neural network to make the same moves as humans have in grandmaster games. It's fairly competitive even though it's had much less work put into it than

Leela, which is currently the best neural network engine, so the approach shows promise. The obvious drawback to the approach is that it's tied to reference games, so it requires getting those games, lacks the elegance and adaptability of being able to work things out from

scratch, and is in some sense limited to the quality of play in those games. Here's an idea for how to fix that, which pushes the edges of what it means for an engine to be 'zero' but still qualifies: Start with a database of games, for Chess using a bunch of

human grandmaster games would be a good start (hence the 'not exactly zero' comment). Then run a completely untrained engine some number of moves deep to get board evaluations for every position in every one of those games. These will be very bad evaluations but

at least differentiate won and lost positions. Then train the neural network to match those evaluations directly. Then run the full engine with the new evaluation network on all the positions again to get new evaluations. Then train for a new network, run a few moves deep, etc.

Intuitively what hopefully happens is that the first pass is only a few moves deep, then the next one is a few more moves deep, then a few more than that, etc. This seems to have much more direct training than working off human games, because an eval is a much more specific

piece of information than a move preference, and the step of training the neural network doesn't have to work through a bunch of conditionals, it just has to train on given inputs and outputs.

There is the hazard that the engine might memorize its own self-reinforced evals, but that can be fixed by dividing the set of positions in two and alternating which set the games are played on with each training run.

This approach would also work very well with volunteer machines on the internet, because they can calculate evals and relatively high depths resulting in very little bandwidth needed for the amount of computation accessed.

If anyone is interested in actually building this, or any related idea inspired by this, please do so. Reports of people already having tried it and either succeeding or failing are welcome as well

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

Read 10 tweets

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Share this page!

Enter URL or ID to Unroll

Bram Cohen🌱

Try unrolling a thread yourself!

More from @bramcohen

Bram Cohen🌱

Bram Cohen🌱

Bram Cohen🌱

Bram Cohen🌱

Bram Cohen🌱

Bram Cohen🌱

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!