One of the most useful #NLP libraries for #SEO in #Python is certainly BERTopic.
I will show you its benefits, why it's so powerful and simple to use in this thread 🧵
BERTopic is the easy and comfortable way of using advanced linguistics models without writing too much code.
That's why it's so powerful and reliable.
Although this library wasn't built with SEO in mind, it's clearly super versatile for us.
It's a way to flatten the steep learning curve that such topics possess.
We're focusing on the implementation itself rather than the theory. >>>
>>> This doesn't imply that you don't have to study the models! You should mature an understanding of the high level overview and the parameters.
It's very unlikely to have good results without tuning your models.
If you are like me and want to focus on NLP and Data Science this is the right way to go. Transformers and recent models are way better than older ones and are able to capture the semantic nature of words.
This is not possible with a traditional clustering technique.
Some terms you need to know are:
Embeddings - think of them as representing words in math language, i.e. vectors
Topic modelling - identifying topics in a set of documents
Transformers - Deep Learning models based on attention
>>>
>>> These are very broad definitions to get you started, do your research.
The idea here is to have the minimum level to get started with BERTopic.
You can find all you need in this link, just follow the instructions.
Yes, you can apply this idea to GSC data as well! I am working on it as well, it just takes time to properly clean data, as it is very hard in some niches.
Visualizing topics is a great way to spot similarity among clusters. This is crucial for large websites or when you have no clue what a new domain is about.
Use this info as a hint on what to topically improve and to see your topical authority. >>>
>>> However, do recall that it's computationally expensive to process all those GSC data for medium websites, imagine for big ones!
There are plenty of topic modelling techniques and you have to get a basic understanding of transformers.
You have way too many options at first, just go through the docs and apply what you can. It will take time but it's totally worth it.
In alternative, you can check this Medium article by the author.
Here you get a "manual" implementation of some feature.