labml.ai

@labmlai

Jun 22, 2021 • 8 tweets • 5 min read

Learning to play Kuhn Poker with Monte Carlo Counterfactual Regret Minimization (MC-CFR) in #python

📝 Code/Tutorial: nn.labml.ai/cfr/index.html

This isn't deep learning, but it should be interesting if you do machine learning, like incomplete-information games, or play #poker.

🧵👇

2/8) Kuhn Poker is a simple 2-player betting game with three cards (A, K, Q). A single card is dealt to each player. Players take turns betting chips, and the player with the higher card wins the chips. If a player folds, the other player wins the chips.
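
A minimal sketch of these rules as a payoff function. The thread does not state chip amounts, so the one-chip ante and one-chip bet below are assumptions (they are the usual textbook Kuhn Poker values), and the history encoding (`p` for pass/fold, `b` for bet/call) and function names are hypothetical, chosen only for illustration.

```python
# Card ranks: Q < K < A.
RANK = {"Q": 0, "K": 1, "A": 2}

# Histories at which the hand is over.
# "p" = pass (check or fold), "b" = bet (bet or call).
TERMINAL = {"pp", "bp", "bb", "pbp", "pbb"}


def is_terminal(history):
    """True if no further action is possible after `history`."""
    return history in TERMINAL


def payoff_first_player(card1, card2, history):
    """Chips won (positive) or lost (negative) by the first player.

    Assumes each player antes 1 chip and a bet adds 1 more chip.
    """
    if not is_terminal(history):
        raise ValueError(f"{history!r} is not a terminal history")
    if history == "bp":    # second player folded to a bet
        return 1
    if history == "pbp":   # first player folded to a bet
        return -1
    # Showdown: 1 chip at stake after two passes, 2 after a bet and a call.
    stake = 1 if history == "pp" else 2
    return stake if RANK[card1] > RANK[card2] else -stake
```

For example, `payoff_first_player("A", "K", "bb")` is the case where the first player bets with an ace and is called by a king.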

👇

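3/8) CFR approximates the Nash equilibrium through self-play. In each iteration, it calculates the regret of following the current strategy instead of playing each action, and accumulates these regrets over iterations. Then it updates the strategy with regret matching, which uses only the positive accumulated regrets:

strategy(action) = positive regret of action / total positive regret of all actions

If no action has positive regret, the strategy falls back to a uniform distribution.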
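
The regret matching update can be sketched as below. This is an illustrative helper, not code from the linked tutorial; the function name and the dictionary representation are assumptions. It clips negative cumulative regrets to zero before normalizing and falls back to a uniform distribution when nothing is positive, which is the standard form of regret matching.

```python
def regret_matching(cumulative_regret):
    """Turn cumulative regrets {action: regret} into a strategy.

    Each action gets probability proportional to its positive regret.
    If no action has positive regret, play uniformly at random.
    """
    positive = {a: max(r, 0.0) for a, r in cumulative_regret.items()}
    total = sum(positive.values())
    if total > 0:
        return {a: r / total for a, r in positive.items()}
    n = len(cumulative_regret)
    return {a: 1.0 / n for a in cumulative_regret}


if __name__ == "__main__":
    # "p" = pass, "b" = bet (hypothetical action labels).
    print(regret_matching({"p": 3.0, "b": 1.0}))
    print(regret_matching({"p": -1.0, "b": -2.0}))
```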

👇

4/8) The average of the strategies across iterations, not the final strategy, gets close to the Nash equilibrium as we iterate.

A Nash equilibrium is a state where no player can increase their expected payoff by changing only their own strategy.
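
The averaging in 4/8 can be sketched as a running sum per information set. This is a minimal illustration rather than the linked implementation: the class name `AverageStrategy` is hypothetical, and weighting each iteration by the player's own probability of reaching the information set is an assumption (it is the weighting commonly used in CFR).

```python
class AverageStrategy:
    """Accumulates per-iteration strategies and reports their average."""

    def __init__(self, actions):
        self.actions = list(actions)
        self.strategy_sum = {a: 0.0 for a in self.actions}

    def accumulate(self, strategy, reach_probability=1.0):
        """Add one iteration's strategy, weighted by reach probability."""
        for a in self.actions:
            self.strategy_sum[a] += reach_probability * strategy[a]

    def average(self):
        """Normalize the running sums into a probability distribution."""
        total = sum(self.strategy_sum.values())
        if total > 0:
            return {a: s / total for a, s in self.strategy_sum.items()}
        # Nothing accumulated yet: fall back to uniform.
        return {a: 1.0 / len(self.actions) for a in self.actions}


if __name__ == "__main__":
    avg = AverageStrategy("pb")
    avg.accumulate({"p": 1.0, "b": 0.0})
    avg.accumulate({"p": 0.0, "b": 1.0})
    print(avg.average())
```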

👇

5/8) The strategy is a function of the "information set" and gives a probability distribution over actions. An "information set" is the part of the game state that’s visible to the player: all states the player cannot tell apart belong to the same information set.
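
For Kuhn Poker, one way to represent an information set is the acting player's own card plus the public betting history. The sketch below assumes that encoding; the function name, the `p`/`b` action labels, and the example probabilities are hypothetical and only illustrate the shape of the data.

```python
def info_set_key(cards, history):
    """Key for the information set of the player about to act.

    `cards` is (first player's card, second player's card) and `history`
    is a string of actions so far ("p" = pass, "b" = bet). The key holds
    only what the acting player can see: their own card and the history.
    """
    player = len(history) % 2  # players alternate, first player starts
    return cards[player] + history


# A strategy maps each information set to a distribution over actions.
# The numbers here are placeholders, not a solved strategy.
example_strategy = {
    "K": {"p": 1.0, "b": 0.0},
    "Ap": {"p": 0.0, "b": 1.0},
}

if __name__ == "__main__":
    # The second player holds A in both deals and sees the same history,
    # so both deals fall into the same information set.
    print(info_set_key(("K", "A"), "p"), info_set_key(("Q", "A"), "p"))
```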

👇

6/8) Our implementation is accompanied by a lengthy introduction to CFR and MCCFR. The MCCFR implementation is decoupled from the game-specific Kuhn Poker code, and we will add a Leduc Poker implementation soon.
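
To make the ideas in this thread concrete, here is a compact, self-contained sketch of CFR with chance sampling (one random deal per iteration, a simple Monte Carlo variant) for Kuhn Poker. It is not the linked implementation and is not abstracted from the game the way that one is described; all names are hypothetical, and the one-chip ante and one-chip bet are assumptions.

```python
import random

ACTIONS = "pb"      # p = pass (check/fold), b = bet (bet/call)
CARD_NAMES = "QKA"  # the index doubles as the rank: Q < K < A


class Node:
    """Regret and strategy accumulators for one information set."""

    def __init__(self):
        self.regret_sum = [0.0, 0.0]
        self.strategy_sum = [0.0, 0.0]

    def strategy(self):
        """Regret matching over positive cumulative regrets."""
        positive = [max(r, 0.0) for r in self.regret_sum]
        total = sum(positive)
        return [p / total for p in positive] if total > 0 else [0.5, 0.5]

    def average_strategy(self):
        total = sum(self.strategy_sum)
        if total > 0:
            return [s / total for s in self.strategy_sum]
        return [0.5, 0.5]


def cfr(cards, history, reach, nodes):
    """Expected payoff for the player to act at `history`.

    `reach[i]` is the probability that player i's own actions led here.
    """
    player = len(history) % 2
    opponent = 1 - player
    # Terminal payoffs, from the point of view of `player`.
    if history in ("bp", "pbp"):         # the opponent folded
        return 1.0
    if history in ("pp", "bb", "pbb"):   # showdown
        stake = 1.0 if history == "pp" else 2.0
        return stake if cards[player] > cards[opponent] else -stake

    node = nodes.setdefault(CARD_NAMES[cards[player]] + history, Node())
    strategy = node.strategy()
    utils = [0.0, 0.0]
    for a, action in enumerate(ACTIONS):
        next_reach = list(reach)
        next_reach[player] *= strategy[a]
        # Zero-sum: the child's value belongs to the other player, so negate.
        utils[a] = -cfr(cards, history + action, next_reach, nodes)
    node_util = sum(s * u for s, u in zip(strategy, utils))
    for a in range(2):
        # Regret is weighted by the opponent's reach probability.
        node.regret_sum[a] += reach[opponent] * (utils[a] - node_util)
        # The average strategy is weighted by the player's own reach.
        node.strategy_sum[a] += reach[player] * strategy[a]
    return node_util


def train(iterations, seed=0):
    """Run CFR and return (average strategies, mean first-player payoff)."""
    rng = random.Random(seed)
    nodes = {}
    total = 0.0
    for _ in range(iterations):
        cards = rng.sample(range(3), 2)  # chance sampling: one random deal
        total += cfr(cards, "", [1.0, 1.0], nodes)
    average = {k: n.average_strategy() for k, n in sorted(nodes.items())}
    return average, total / iterations


if __name__ == "__main__":
    average, value = train(20000)
    for key, (p, b) in average.items():
        print(f"{key:>3}  pass={p:.2f}  bet={b:.2f}")
    print(f"mean payoff for the first player: {value:.3f}")
```

The known game value of Kuhn Poker for the first player is -1/18 (about -0.056), so the printed mean payoff should land in that neighborhood, though a sampled run is noisy. Dominated choices are the easiest to check by eye: a player holding Q should almost never call a bet, and a player holding A should almost always call.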

👇

7/8) Here’s the Kuhn Poker experiment: nn.labml.ai/cfr/kuhn/index…

Colab Notebook with some visualizations:
colab.research.google.com/github/lab-ml/…

Results in @weights_biases:
wandb.ai/vpj/kuhn_poker…

👇

8/8) This implementation is based on a tutorial @vpj wrote about a year ago:
github.com/vpj/poker/blob…

We will add implementations for Leduc Poker and more efficient variants of CFR, such as public chance sampling (PCS), if you find this useful.
