Saurav Jain (Open Source + Communities) Profile picture
Jul 19, 2021 โ€ข 9 tweets โ€ข 3 min read โ€ข Read on X
๐—ฅ๐—ฒ๐—ถ๐—ป๐—ณ๐—ผ๐—ฟ๐—ฐ๐—ฒ๐—บ๐—ฒ๐—ป๐˜ ๐—น๐—ฒ๐—ฎ๐—ฟ๐—ป๐—ถ๐—ป๐—ด ๐Ÿค–

-A type of Machine Learning (What is it?)
-It's working
-Real-life Applications
-A book for learning RL
-Limitations

Let's learn about all these things in this thread!!

๐Ÿงต๐Ÿ‘‡
๐—ช๐—ต๐—ฎ๐˜ ๐—ถ๐˜€ ๐—ฅ๐—ฒ๐—ถ๐—ป๐—ณ๐—ผ๐—ฟ๐—ฐ๐—ฒ๐—บ๐—ฒ๐—ป๐˜ ๐—น๐—ฒ๐—ฎ๐—ฟ๐—ป๐—ถ๐—ป๐—ด?

- A machine learning area.

-In RL an ML model is trained to take a sequence of decisions.

-If the model takes a correct decision it will get awarded otherwise negative points will be given.

Perfect Explanation๐Ÿ‘‡
๐—ช๐—ผ๐—ฟ๐—ธ๐—ถ๐—ป๐—ด

Environment: Physical world in which the agent operates

State: Current situation of the agent

Reward: Feedback from the env.

Policy: Method to map agentโ€™s state to actions

Value: Future reward that an agent would receive by taking an action in particular state
Letโ€™s take the game of PacMan where the goal of the agent (PacMan) is to eat the food in the grid while avoiding the ghosts on its way.

The grid world is the interactive environment for the agent.
PacMan receives a reward for eating food and punishment if it gets killed by the ghost (loses the game).

The states are the location of PacMan in the grid world and the total cumulative reward is PacMan winning the game.
๐—ฅ๐—ฒ๐—ฎ๐—น-๐—น๐—ถ๐—ณ๐—ฒ ๐—ฎ๐—ฝ๐—ฝ๐—น๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป๐˜€

-Building AI for playing games.
-Robot Navigation
-Industrial Automation
-Dialog Agents(Text, Speech)
-Online Stock Trading
๐—•๐—ผ๐—ผ๐—ธ ๐—ณ๐—ผ๐—ฟ ๐—น๐—ฒ๐—ฎ๐—ฟ๐—ป๐—ถ๐—ป๐—ด ๐—ฅ๐—Ÿ

A good book to learn about Reinforcement Learning is "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew G. Barto

You can download/buy the book from this link! ๐Ÿ‘‡

๐Ÿ”— incompleteideas.net/book/the-book-โ€ฆ
๐—Ÿ๐—ถ๐—บ๐—ถ๐˜๐—ฎ๐˜๐—ถ๐—ผ๐—ป๐˜€

-Training process can be time-consuming

-Can do many mistakes while learning

-Too much reinforcement learning can lead to an
overload of states, which can diminish the results.

-Reinforcement learning needs a lot of data and a lot of computation.
Hey All,

Hope I am able to make things easy to understand for you :)

If you like my content please retweet the first tweet of the thread!!

Sorry for not putting a thread yesterday, was busy with some personal stuff ๐Ÿ˜…

โ€ข โ€ข โ€ข

Missing some Tweet in this thread? You can try to force a refresh
ใ€€

Keep Current with Saurav Jain (Open Source + Communities)

Saurav Jain (Open Source + Communities) Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @Sauain

Feb 25, 2023
How to write beautiful Python code? ๐Ÿ’ฏ

Learn about PEP 8 Convention ๐Ÿงต๐Ÿ‘‡
๐—ฃ๐—˜๐—ฃ-๐Ÿด

-Want to write an "Eye-pleasing" Python code ๐Ÿ ๐Ÿ‘€?

-PEP is a thing you MUST know about while learning or writing a python code

-Python Enhancement Proposal

-Basically a guide to "๐—›๐—ผ๐˜„ ๐˜๐—ผ ๐˜„๐—ฟ๐—ถ๐˜๐—ฒ ๐—ฎ ๐—ฏ๐—ฒ๐—ฎ๐˜‚๐˜๐—ถ๐—ณ๐˜‚๐—น ๐—ฝ๐˜†๐˜๐—ต๐—ผ๐—ป ๐—ฐ๐—ผ๐—ฑ๐—ฒ?"
We all know that besides having logic and coding in any programming language, there is an important factor that is "coding style."

-How are we writing comments?
-How are we giving spaces?
-How to import libraries?
-etc

These things are very important to writing beautiful code!
Read 12 tweets
Feb 24, 2023
Six websites that will keep you updated about the latest trends in Python ๐Ÿ

Thread ๐Ÿงต๐Ÿ‘‡
1. @realpython

The best site for Python. It helped me a lot personally in the learning process.

๐Ÿ”— realpython.com
2. Python Blogs

A dedicated platform for learners and professionals for Python

๐Ÿ”— pythonblogs.com
Read 8 tweets
Feb 23, 2023
Five FREE Certifications which you can learn and add to your Resume ๐Ÿ˜‰๐Ÿ’ฏ

- Python
- Chatbot Development
- Machine Learning
- NLP

Thread ๐Ÿงต๐Ÿ‘‡
1. Data Flair

20+ Hours Course to learn Python Step by Step with a Certificate on the completion

๐Ÿ”— data-flair.training/python-course/
2. Programming Essentials in Python ( CISCO )

This course is a great place to learn Python โ€“ no prior programming knowledge is required. Completing the course earns you a Statement of Achievement.

๐Ÿ”— netacad.com/courses/prograโ€ฆ
Read 7 tweets
Nov 21, 2022
Programming is interesting if you are learning it from good resources.

5 courses that helped me learn programming concepts. ๐Ÿงต
1. CS50 by Harvard

Build strong basics of Computer Science and fundamentals.

2. @freeCodeCamp

Basics of programming by freecodecamp.

Read 7 tweets
Nov 15, 2022
Learning Classification in Supervised Machine Learning from Scratch ๐Ÿงต๐Ÿ‘‡
Classification, by name, means to classify things into sub-categories.

For example, our computers can differentiate between red and blue, cat and dog, pencil and pen, and so on.
Classification is the problem of identifying to which of a set of categories (subpopulations), a new observation belongs, based on a training set of data containing observations and whose categories membership is known.
Read 8 tweets
Nov 2, 2022
Five FREE courses for Machine Learning, Artificial Intelligence, NLP, Web Development, and Computer Programming from world-class universities like Stanford, MIT, Harvard, etc. ๐Ÿงต
1. Machine Learning by Stanford University

This is a playlist of all the lectures taken by Prof. Andrew Ng in Stanford's ML course.

๐Ÿ”—
2. Harvard CS 50

Learn the basics of computer science from Harvard University. This is CS50, an introduction to the intellectual enterprises of computer science and the art of programming.

๐Ÿ”—
Read 7 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(