Santiago Profile picture
20 Feb, 20 tweets, 3 min read
25 popular libraries and frameworks for building machine and deep learning applications.

Covering:

▫️ Data analysis and processing
▫️ Visualizations
▫️ Computer Vision
▫️ Natural Language Processing
▫️ Reinforcement Learning
▫️ Optimization

A mega-thread.

🐍 🧵👇
(1 / 25) TensorFlow

TensorFlow is an end-to-end platform for machine learning. It has a comprehensive, flexible ecosystem of tools and libraries to build and deploy machine learning-powered applications.
(2 / 25) Keras

Keras is a highly-productive deep learning interface running on top of TensorFlow. It provides essential abstractions and building blocks for developing and shipping machine learning solutions with high iteration velocity.
(3 / 25) PyTorch

PyTorch is a machine and deep learning framework used primarily for natural language processing and computer vision applications. In the community, PyTorch has grown as a research-first library.
(4 / 25) FastAI

FastAI is a deep learning library providing high and low-level components to achieve state-of-the-art results in standard deep learning domains. FastAI sits on top of the PyTorch framework.
(5 / 25) Apache MXNet

MXNet is a lean, flexible and scalable deep learning framework suited for flexible research, prototyping, and production of deep learning applications.
(6 / 25) NumPy

NumPy is a tool for scientific computing and performing high-performance general-purpose array operations.

(7 / 25) Scikit-Learn

Scikit-Learn is a robust machine learning library featuring a wide range of supervised and unsupervised learning algorithms.
(8 / 25) Pandas

Pandas is a high-performance, easy-to-use library for processing and analyzing relational and structured data.

(9 / 25) Pandarallel

Padarallel is a library that allows the parallelization and speed up of computations when using pandas.
(10 / 25) SciPy

A library containing efficient mathematical routines as linear algebra, interpolation, optimization, integration, and statistics.

(11 / 25) OpenCV

A computer vision and machine learning library providing a common infrastructure for computer vision applications.
(12 / 25) Statsmodels

Statsmodels is a library providing easy computations for descriptive statistics and estimation and inference for statistical models. It also allows for conducting statistical tests and statistical data exploration.
(13 / 25) Matplotlib

A library designed for the exploration and visualization of data. It's the standard plotting library in Python.

(14 / 25) Seaborn

A data visualization library based on Matplotlib, providing a high-level interface for drawing graphics.
(15 / 25) Streamlit

Streamlit is a library that makes it easy to create and share beautiful, custom web applications for machine learning and data science.

(16 / 25) Pyevolve

Pyevolve is a complete genetic algorithm framework to solve optimization problems.
(17 / 25) PyBrain

Modular machine learning library containing algorithms for neural networks, reinforcement learning, unsupervised learning, and evolution.

(18 / 25) Open AI Gym

Gym is a toolkit for developing and comparing reinforcement learning algorithms.
(19 / 25) NLTK

NLTK is a suite of libraries for the symbolic and statistical processing of natural language. NLTK is used mainly to support the research and teaching of NLP.
(20 / 25) Stanza

Stanza is a natural language analysis package containing tools to process and analyze human languages. It brings state-of-the-art NLP pre-trained models supporting 66 languages.
(21 / 25) spaCy

spaCy specializes in processing and analyzing data in NLP, and it's built specifically for production use.

(22 / 25) Pattern

Pattern is a web mining module with NLP capabilities and machine learning model implementations.
(23 / 25) TextBlob

TextBlob provides an easy-to-use interface to the NLTK library, covering everyday NLP tasks ranging from parts-of-speech tagging to sentiment analysis and language translation to text classification.
(24 / 25) Gensim

Gensim is a topic modeling NLP library designed to process raw, unstructured digital texts using unsupervised machine learning algorithms.
(25 / 25) Transformers

Transformers is a high-performance library providing thousands of pre-trained models to perform natural language processing tasks in over 100+ languages.

👇
Building machine learning alone in your basement sucks.

Follow me, and every week I'll help you navigate this thing from doing it, failing at it, learning, and fixing it.

We are going places together!

🦕

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Santiago

Santiago Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @svpino

21 Feb
Go to college. Send your kids. Celebrate those that make it happen.

College is a good thing. If you can afford it, do it.

Most people telling you that college sucks went to college. Most people that didn't go whish their kids would.

🧵👇
You won't replace college with YouTube videos, or reading books, or following tutorials.

Some people may. Most people won't.

Yes, the knowledge is all out there, but college is just not about learning new things.

👇
College doesn't guarantee you a job but look at the statistics of median income and unemployment among those that went and those that didn't.

The numbers should tell us something.

👇 Image
Read 7 tweets
19 Feb
I'm sad to watch many developers working 80-hour weeks to get one inch ahead of everyone else.

And yet, they are missing the biggest opportunity of their lives right under their noses.

🧵👇
Today, you don't leap ahead by learning another framework, watching another tutorial, or building another web page.

That's incremental improvement. Important, but not extraordinary.

👇
Hours don't mean anything, and everything you add to your portfolio will be obsolete in the next couple of years.

What's really going to move the needle is the impact of your work. It's how you change and influence those around you.

👇
Read 10 tweets
18 Feb
You gotta think about this one carefully!

Imagine you go to the doctor and get tested for a rare disease (only 1 in 10,000 people get it.)

The test is 99% effective in detecting both sick and healthy people.

Your test comes back positive.

Are you really sick? Explain below 👇
The most complete answer from every reply so far is from Dr. Lena. Thanks for taking the time and going through it!

You can get the answer using Bayes' theorem, but let's try to come up with it in a different —maybe more intuitive— way.

👇
Read 9 tweets
17 Feb
Imagine a model to predict customers who will unsubscribe from your service.

You want to incentivize them with $10 because they will cost you $100 if they churn.

Look at the attached confusion matrix showing that the model is only 77% accurate.

Is this model good enough?
I love this question because it puts a couple of things in perspective:

1. A model that doesn't look too good by the numbers.

2. A business case that can use a less-than-ideal solution to solve the problem.
There's only one question about this problem: how many people will not churn if they get the incentive?

We don't know, but we can play out different scenarios and see what happens.
Read 9 tweets
16 Feb
Imagine that you ask a yes/no question to 1,000 people, and each person answers correctly 51% of the time.

You count the different answers and pick the most common one.

How likely are you to end up with the correct answer?

🧵👇
Don't feel bad if you think it's 51% — We all did!

If every person answers independently from the rest, you'll end up with the correct answer ~75% of the time.

And if you ask 10,000 people, the chance of getting it right goes up to ~97%!

Mind-blowing, right?

(2 / 6)
If you care about the math behind this, take a look at the attached expression. (But it doesn't matter if you don't.)

This is what's important:

The law of large numbers ensures that we get more correct answers as we ask more people.

(3 / 6)
Read 7 tweets
15 Feb
I have not seen any proof that Twitter "kills your content" if you include links to your tweets.

Here is the result of a very unscientific experiment: comparing my top 10 tweets with and without links.

If you have something concrete, please let me know.
This is anecdotal evidence at best.

It doesn't prove that Twitter doesn't mess with your links, but it does suggest that —if anything is going on— it is much more subtle than what some believe.

I haven't found any documentation either.
This is what I do know:

Breaking the links that you add to your tweets is self-serving: it makes it worse for the people who follow you. They can't just click to get the content.

I can't see how this will make your content better in any way.
Read 4 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!