Scraping is super important for a lot of tasks.

#SEO Specialists usually crawl their websites but scraping is not always one of the tools in their arsenal.

A short thread on what you can do with scraping and what's the difference with crawling. 🧵
In plain English, crawling is the act of discovering URLs on a website, so following the links on a page is extremely important.

Scraping is the extraction of data from a website, not all of them of course, just those of interest to achieve a goal.
As we know, crawling is aimed at creating indices of pages.

We care about scraping because we may be interested in extracting specific information from a page.
Now, let's move on to some SEO use cases:

- Check price updates
- Get sitemaps/RSS feeds for content strategy
- Competitor monitoring
- Data analysis and getting data in general
I check sitemaps or feeds to understand the publishing rate of my competitors.

You can also get the latest news and much more, actually.

holisticseo.digital/python-seo/con…
No need to guess content velocity, just check it.

If they don't have a sitemap, things get way more complex.

And here you can just scrape by folder or find a way to create your custom scraper.
You may also want to scrape pages to apply NLP after.

Example: Scrape a page and then analyze the entities.

Very simple and effective, I will post an example in the next threads.
I know what you're thinking, scraping can be used for less ethical purposes.

Many still view it with negative meaning, and I don't blame them either.

Anyway, it's an important task that can open you many opportunities!
And yes, this can be used to get data that you'll use for Programmatic SEO.

I am not an expert on that SEO branch but ofc it works!

Gathering data is one of the use cases I mentioned before.
Follow me for threads, tips and case studies (coming soon) about SEO, content and #Python/data.

If you liked this thread, consider liking and retweeting it!

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Marco Giordano 🇺🇦

Marco Giordano 🇺🇦 Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @GiordMarco96

May 17
There are some factors to take into account when creating content.

Content outlines, briefs and templates give you a good way to minimize misunderstandings.

A short thread about processes in #SEO content creation.
Let's explain what all these terms are plus some examples.

Content briefs are essentially an overview, you define the purpose, audience and general guidelines for an article.

A good template is this one (by @danielkcheung):

@danielkcheung So briefs are your war strategy, and what you need to succeed.

The example I provided before is a very good one.

One problem is when you have to work with 100+ articles and of course, no way I am going to do it 100 times.
Read 14 tweets
May 13
A quick recap for those #SEO professionals still lacking some motivation to study Semantic SEO.

The future of search engines lies in innovation, we're talking about corporations, after all.

This thread will show you some concepts and valid reasons. 🧵
Google is already using such technologies and we know it for sure.

Even if we were given the benefit of doubt, there wouldn't be any discussion either.

Innovation drives profit and we know that Google wants good-enough search results.

You cannot optimize your content for BERT/MUM/any other NLP algorithm.

The reason is quite simple, these models/algos are trying to understand and replicate how we humans interact.

Nonetheless, you can use a better syntax and put entities in a good position.
Read 32 tweets
May 11
Some other tips that I think are good for aspirant #SEO Specialists.

This will be a short thread for beginners or anyone that wants to start exploring the SEO realm. 🧵
Learn some popular tools but acknowledge that they don't give you a competitive advantage as a professional.

Everyone can buy them, it's how you use them that makes a difference.

I suggest you start with Semrush/Ahrefs and Screaming Frog/Sitebulb.
Experience different flavors of SEO and pick your favorite.

Content, PR outreach or Technical are just some of the choices you have.

Pick whatever suits you the best, don't over-optimize your career.
Read 18 tweets
May 9
I've spent years studying content and understanding how to improve it.

This short thread is about common #SEO (and non) misconceptions that block your traffic growth. 🧵
The first mistake I see is being picky. Great idea, don't write listicles, comparisons and reviews so that your competitors can take it all!

A business blog/website is for what people want, not for your ego.
Lack of internal links. The capital sin of content maintenance, forgetting to link to your old content.

Viceversa is also true, you should always link to your newly-published articles.
Read 15 tweets
May 7
How to differentiate yourself as an #SEO professional?

What I've learned so far can be your fortune, who knows.

A short thread about YOUR value proposition. 🧵
Focus on one area of SEO rather than doing it all.

It's easier to pick something you like and collaborate with others to solve problems.

I picked content, data and strategy because I feel comfortable with them.
Ignore trends if they are not suitable for what you want to do.

Of course, you have to be updated but you don't need to learn new skills if they're totally unrelated to what you want to achieve.
Read 10 tweets
May 6
A quick recap on why coding (#Python) may help some #SEO professionals or some people pursuing their goals.

A short thread for those folks looking for motivation 🧵
🐍 Scrape competitors to get their headings and optimize accordingly.

Check their sitemaps/RSS feeds to find articles and understand their content frequency.
🐍 Analyze SERPs and find keywords with the same pages.

Analyze titles, get the most common words and visualize them.
Read 8 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(