What is Natural Language Processing (#NLP) and why it is so important for #SEO?

There are some clear benefits but first we need some clear explanations.

This thread shows you why this subject can improve your results and the role it plays in modern search engines. 🧵
NLP is a branch of Machine Learning that enables machines to understand and process natural language, i.e. what we humans say.

Our language is complex, ambiguous and sometimes misleading, it's a tough job for a machine!
More specifically, it's a subset of Computer Science, Linguistics and Artificial Intelligence.

Computational Linguistics is another subject closer to Linguistics than it is to Engineering. You can think of NLP as the other way around.

Source: direct.mit.edu/coli/article/4…
Modern search engines like Google are powered by such technologies and understanding some key NLP concepts can increase your chance of ranking.

In the last few years we have seen the rise of new methodologies and perspectives that totally differ from the past.
If you read some Google patents, you can quickly notice how some topics are frequent and why people emphasize the role of topical clusters.

On a higher level you would need to treat your website as a big Knowledge Graph. Cover topics extensively and achieve authority.
Google is able to process what you write and can understand its sentiment.

While the sentiment of a text is not a ranking factor, some may even argue that it can play a role in what you see on SERPs.

This is NLU, not NLP but still useful to know.
Keyword density is useless. Instead of counting keywords focus on syntax and entities in your text. Google can detect named entities in a text and understand their role in a sentences too.

That's why you should be specific and accurate while you write.
My article is longer, why is it not first?

There are many answers, one could be that your sentence structure is not so clear or you are diluting your content.

If you don't go straight to the point, don't expect people to read your blog posts. Google knows that very well too.
The list of NLP tasks is long, you should mainly care about:

- POS Tagging
- Stemming/Lemmatization
- Named Entity Recognition (NER)
- Word Sense Disambiguation
- Entity Linking
- Semantic role labelling
- Question answering

I will cover them in future threads. >>>
>>> Just recall for now that there are different tasks that you can execute with some NLP knowledge.

You can evaluate your own text and look at competitors with a more critical eye.

If you are just starting you'll also encounter stop-word removal and tokenization.
Machines don't actually understand what you say. They eat numbers and perform some operations to process human language.

State-of-the-art models are pretty good at it and the best is yet to come.
Google applies BERT models to snippets to understand if they're good enough for the user. To win them, you should understand what HTML/structured data components your competitors are using.

Bring more facts and have a crystal clear language. Zero fluff.
Be careful tho, we are not optimizing for BERT, it doesn't make any sense to say such a thing.

We are just improving our syntax to provide better and clearer answers to the user's needs.

It's not a game to trick search engines, it's adding more value.
One of the most powerful concepts you may have heard is "semantic triple".

A triple can be described as the minimum unity to encode data into the semantic web.

Triples are made up by a "subject–predicate–object" structure.

👉 John was born in 1945.

de.marklogic.com/blog/making-ne…
Entities are then connected via relationships. You can express them via semantic triples.

You are starting to connect the dots. Information in the semantic web is ruled by such triples.

Knowing the main entities you have to cover in a page is a method of page optimization.
The entire goal of learning more #NLP is to create pages that are easily understandable by machines.

Build a "content network" in order to be recognized by Google as an authority on a given topic.

This doesn't substitute your value proposition or push you to fake content.
Several triples connected via relationships can form a Knowledge Graph.

It is a structure to represent knowledge that understands machine language in the form of standards.

Therefore, order is not enough, there should be some ratio behind a graph.
Speaking of which, Knowledge Graphs are organized according to ontologies.

They define what exists in your domain and their properties. This will be the topic for a future thread.
Google introduced the Knowledge Graph in 2012. Nowadays it's one of the hottest #SEO topics and for good reasons.

Having a website with proper connections between entities is the key.

Read more: holisticseo.digital/theoretical-se…
LSI keywords don't work at all. It's the usual #SEO myth created by people with no clear understanding of search engines.

More here: seobythesea.com/2021/12/what-a…
Did you know that Google doesn't scrape questions? They generate them!

You can do the same to add value to your content strategy.

Source: patents.google.com/patent/US92137…
RankBrain is an algorithm used to understand search intent. It is one of the most important factors affecting ranking.

That said, you should have understood that focusing on single words or keywords is super useless.

Intent, entities and order are your new friends.
Why did I add "order"? Because having clear sentences is mandatory and you need some degree of methodology to make it work.

Being methodical is a huge help for such technical topics.
All of this fits into the Semantic Web.

You can think of it as the proposal of structuring and tagging web pages for machine understanding. This is done via a set of standards.

And that's why we talked about Knowledge Graphs before.
And that's one more reason to start creating topical maps with tools like Xmind/MindMeister/Miro.

You can think of your content strategy as a way to build an imaginary Knowledge Graph and fit it into your website.

Clustering keywords is an enormous help for this.
You can be more proactive and combine what you know with #Python libraries.

Some examples include:
- NER on competitors page
- SERP analysis
- N-grams analysis
- Semantic clustering
- Smarter internal linking
- Knowledge Graphs

...and much more
Some indirect potential benefits of NLP are:

- More understanding of Google patents
- Booming market
- New SEO perspectives
- Better topical maps
I will try to cover more and more topics in my threads.

If you want to support me, be sure to follow me and like/retweet to spread awareness about these topics.

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Marco Giordano 🇺🇦

Marco Giordano 🇺🇦 Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @GiordMarco96

Mar 4
I updated my Google Colab file about Google Search Console data analysis in #Python. Now you have more plotting options for query count.

Future versions will include improved clustering and other types of in-depth analyses. 🧵

colab.research.google.com/drive/1hXuKuf8…
If you want a preview you can read what's inside this article:

oncrawl.com/technical-seo/…
Querycat is a good way to cluster but I am starting to prefer other solutions.

Like this one from @LeeFootSEO

searchenginejournal.com/semantic-keywo…
Read 7 tweets
Mar 3
Some personal considerations about the new trends in #SEO and the influence of coding and data in my journey.

This is a personal thread focused on explaining how different subjects can influence you 🧵
I started with #Python relatively early, I was into R before. The concept doesn't change either, they are just tools.

I decided to get into coding because I felt it was my route. I am improving everyday but I am still far from the biggest names in the industry or elsewhere.
I've always noticed that data are still misused by companies and there is a lot of misinformation.

Think about all the people using Excel as a database or SEO case studies with super weak proofs.

This is just the tip of the iceberg.
Read 25 tweets
Mar 2
Some unusual ways for having content ideas in the #SEO world.

A thread to uncover creative and unusual ways of content creation with some examples and proofs 🧵
As usual, listen to your customers and have a dedicated notebook or journal.

All the information will be used to produce FAQs or discover pain points.

Super effective when you have no data or you have trouble understanding a niche.
Forums and social media offer great insights for trends and for people's opinions on a topic.

Leverage Google Discover and Google News for fast-paced content and try to create evergreen content for recurring problems.

Check this tool for Reddit: highervisibility.com/free-seo-tools…
Read 18 tweets
Feb 27
Some important #Python concepts that you need to understand, especially if you are an #SEO Specialist and have no clue about this sort of stuff.

You have to do some research, this is just an introduction.

A thread containing the essential that you need for Python 🧵
Lists. Used to contain multiple elements in a single variable, including different types, i.e. numbers with letters etc.

You will often work with lists as they are everywhere.

E.g.

example_list = ["dog", "cat"]
Sets. You probably studied them at school, I am quite sure.

It's a collection of items without duplicates. Therefore, if you want to remove duplicates from a list you can convert to a set first and then retransform into a list.
Read 23 tweets
Feb 25
Some of the best tools that can help you as an #SEO Specialist.

They are mostly related to digital marketing and/or copywriting, so don't expect the usual stuff.

A thread about improving productivity and your organization with tools, including personal advices 🧵
Notion. A workplace solution to have almost anything in one place.

Great for having order in your life and combine different templates, such as lists, tables and so on.

I am using it to define future plans or create new templates.
Google Docs/Sheets. I prefer them over the MS counterparts because they are more focused on collaboration.

Moreover, you have a better integration with Google APIs, which is great for SEO.

Google for the win, I guess.
Read 20 tweets
Feb 24
Some unusual soft skills or underrated stuff that can be beneficial for #SEO Specialists (and others).

This is a thread that will cover not-so discussed topics...

mostly because we are too busy thinking that leadership and communication are the only good ones 🧵
Coldness. Super underrated imho, the ability of staying focused on something is essential.

This is what I learned after many times I couldn't solve certain problems.
Persuasion. No, this is not just communication. It's the art of convincing someone.

It can be used for good and bad things, it's up to you.
Read 25 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(