Here are my experiences learning about RL over the past 3 months! ♥️ Hopefully, this will be the most beginner-friendly guide out there.

blog: gordicaleksa.medium.com/how-to-get-sta…

@DeepMind @OpenAI Image
I've tried to give you all of the tips and tricks I could think of both for more productive learning in general and stuff specific to the RL field.
The structure of the blog:
1) Intro
2) RL 101 (getting you exposed to the terminology)
3) Cool things about RL (awesome RL apps!)
4) RL is not just roses
5) Getting started with RL
6) Going deeper - reading papers
7) Implementing an RL project from scratch
8) Related subfields
I've also structured the resources in a way that's as linear as possible.

This is the longest blog I wrote so far. Hope you find it useful!

#deeplearning #reinforcementlearning

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Aleksa Gordić

Aleksa Gordić Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @gordic_aleksa

9 May
I just open-sourced my implementation of the original @DeepMind's DQN paper! But this time it's a bit different!

There are 2 reasons for this, see the thread.

GitHub: github.com/gordicaleksa/p…

#rl #deeplearning
1) This time the project is still not completely ready**. I'm yet to achieve the published results - so I encourage you to contribute!

Many of you have been asking me whether you can work on a project with me and I'll finally start doing it that way - from now onwards. ❤
2) This repo has the ambition to grow and become the go-to resource for learning RL. So collaborators are definitely welcome as I won't always have the time myself.

** main reasons are:
a) I was very busy over the last 2 weeks
b) It currently takes ~5 days to fully train DQN
Read 7 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!

:(