A list of the most useful #Python libraries you can use for #SEO right now. 🐍

This updated thread will tell you the main libraries for #DataScience and #NLP that you should consider. Use them in your workflow! 🧡
Numpy & Pandas: the foundations for data analysis, just learn them.

Without these 2 libraries, you cannot do Data Science at all. Good knowledge of Pandas can get you quite far.
Advertools: the best SEM library out there and for SEO too. It’s very useful for crawling, log file analysis, analyzing SERPs and querying the Knowledge Graph.

The ideal Swiss-knife you need in your arsenal.
Ecommercetools: The ideal package for analyzing eCommerce data and getting access to some useful NLP functions.

It’s a rare jewel in your collection that is very handy for technical SEO and e-commerce as well.
Requests: Make HTTPS requests via Python, essential for web scraping.

Sure, there are alternatives but you should learn them. It's very important and a lot of your initial work will require this library.
urllibb: for working with URLs. It should be part of your arsenal.

Take some time to study all the options and possible use cases.
BeautifulSoup: a library to extract data from HTML/XML files, used in combination with scraping libraries to convert data into Python objects.

One of the first ones you’ll probably learn in your Python journey. You can also try Scrapy as an alternative.
Matplotlib/Seaborn/Plotly: you need some sort of visualization and these libs are here to help you.

You can start with Seaborn which is easier to use. DataViz is an important topic and you should value it.
NLTK/spaCy: work with human language toanalyzee text data and get insightsintot the nuances of our language.

This is necessary to get your hands dirty with text data.
Querycat: an awesome library with few functions but with extreme use thanks to association rule mining and BERT/umap.

It's one of my favorite libraries in general, I love it.

It's useful for visualizing losses in impressions over time.
Sklearn: Another staple for Machine Learning, some models are pretty useful for SEO so understanding the basics is highly beneficial.

I don't think you really need it, it can be extremely beneficial though.
Transformers: Pretrained models to handle a wide range of tasks. Essential for NLP!

This library is essential for the most advanced tasks and quite reliable too. I highly suggest you check my other thread:

sentence_transformers: Python framework for state-of-the-art sentence, text and image embeddings.

Use it for keyword clustering and other text-related tasks. It's one of my most used libraries right now.
Trafilatura: download, parse and scrape web page, definitely useful and I have used it quite a lot now!

This is a good library when you don't want to spend too much time on cleaning the HTML elements of a page.

It can be extremely solid for some projects.
Streamlit/Dash: interactive web applications are very useful, needless to say they can boost your communication by a lot.

Learning something like Streamlit is useful in the SEO community because we consider it the standard for web apps.
PyCaret: if you already are on the ML side you already know this, but for those who don't, it's an awesome library that goes from data manipulation to model deployment.

I don't use it quite often but it's extremely powerful and definitely an important library.
searchconsole: Use this library to import your data from the GSC API. It's easy to set up and once you do it there should be no problems.

This is part of an ideal workflow where you query data and then perform some data manipulation. Extremely recommended.
BERTopic: one of my most used NLP libraries and for good reasons. I dedicated an entire thread on the topic:

scattertext: library for finding distinguishing terms in corpora and displaying them in an interactive HTML scatter plot.

Short example from the official docs (github.com/JasonKessler/s…). Image
openpyxl: if you have to work with Excel data and create spreadsheets.

There are other libraries but I prefer to use this one. It's quite nice and it works well for most of the tasks.
In my opinion, you should start with scraping and data analysis. Then, you can move to NLP libraries and check some popular use cases.

Clustering would be one of the most common because it's super common and useful.
Sticking to the mainstream libraries is necessary to get access to "better" documentation. My suggestion is to try alternatives and always look for new opportunities across the web.

Be sure to always do your research, you could find that perfect library for your needs.
Shorter thread today because I was sooooo busy. Like and retweet if you like it, more to come in the next few days!
openai: Technically another super versatile library for NLP tasks.

I am not using it quite often, although it's extremely powerful for content outlines and much more!

β€’ β€’ β€’

Missing some Tweet in this thread? You can try to force a refresh
γ€€

Keep Current with Marco Giordano πŸ‡ΊπŸ‡¦

Marco Giordano πŸ‡ΊπŸ‡¦ Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @GiordMarco96

Apr 29
Some #SEO concepts that are not so common across mainstream resources.

They will probably become more known with time (I hope).

This is an updated thread 🧡
"Proper" Content Marketing. Most of us work with content and have a deep interest in content, to some extent.

Content Marketing is a separate subject but it overlaps with some SEO areas.

On-Page optimization is usually presented as a checklist of instructions.
Great content doesn't follow a boring checklist.

Even the process of optimization is not as simple as before.

This is quite hard to explain!
Read 32 tweets
Apr 27
There are many similarities between #SEO and what many Data professionals do.

We can say that in both cases we are talking about relatively new fields.

This thread talks about striking commonalities between SEO and #DataScience. 🧡
Both are new jobs, there are no clear boundaries and they are often confused with other professions.

SEOs overlap with content marketers, webmasters and marketing managers.

Data Scientists are often confused with Analysts, ML Engineers and even Data Engineers.
What was said in the tweet above is of course wrong.

Still, an SEO can indeed be a content marketer or a manager too!

A Data Analyst can do other tasks that are oriented toward Machine Learning.

So yes, it depends on the company and the scope of the projects.
Read 35 tweets
Apr 26
How can you apply #Python for #SEO to get outstanding results?

I will show you some examples of what you can do without spending money.

This thread is all about Python and quick wins and insights 🧡
Disclaimer: I am not listing source codes here, I will make separate threads later.

The goal of this thread is to give you the footprint and the direction.

Scraping - It's no secret that Python can be used for this purpose.

There are many types of projects, in this case, the most important revolving around sitemaps.

You can get a lot of information from sitemaps alone because they store time data.

And this means...
Read 29 tweets
Apr 14
Today I want to try something different. I've just skimmed through a wonderful book that can help a lot of #SEO professionals.

The book is neither about SEO nor Digital Marketing.

A thread to review a book and comment on its insights. 🧡
You can buy it from this super convenient bundle.

The title is "Analytical Skills for AI & Data Science" and it's one of the best reads I've ever had.

humblebundle.com/books/ai-machi…
Data are more and more important as time goes by...

They are everywhere and their use is almost infinite.

Data quality is not the only problem, today we are going to talk about decision-making and convincing stakeholders.
Read 30 tweets
Apr 13
One of my favorite topics is learning. This is giving me several benefits during my #SEO career.

How can we learn faster and better?

This thread will clarify some concepts and teach you that steady progress > perfection. 🧡
Let's start by saying that you don't need to learn everything.

Learning things by heart is overrated in the digital era, you have a PC for that stuff.

Don't waste your time with the useless details.
Some of you may be shocked, but you don't really need to learn all the human knowledge.

This is what schools unsuccessfully teach, learn by heart and forget logic.

This is the most toxic argument you can ever make.
Read 33 tweets
Apr 12
I've written an article (tweet below) about Semantic #SEO for Level343. This is more for business owners and not for professionals.

Anyway, here is another thread about Semantic SEO 🧡
Before I start, the article is right here:

level343.com/how-to-use-sem…
Semantic SEO is my favorite part when it's possible to implement it.

It's very suitable for large brands and if you are starting to document your knowledge on the web.

If you already have a lot of documents, this is a good chance to start your journey.
Read 22 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(