@si1very Yes/No/Maybe?

The distinction is important, because a penalty is an applied negative.
Getting hit with a penalty for things like mass-cookie-cutter or stolen content is vastly different to G filtering out X of Y pages.

>>>
@si1very >>>

Then there's the issues of vocabulary/ & specificity.
* Internal duplication
* External duplication
* Copied/Stolen content
* Highly similar content
* Cookie-cutter-content
* Boilerplate content
* Low "original" content
* Mashups
* Enhanced mashups

Each is different.

>>>
@si1very >>>

If you've got 50+ pages, and have some canonical issues (such as protocol, domain/TLD, paths/queries etc.) - it's problematic.

1) Diluted and/or Dispersed values typically lead to lower rankings.
2) Auto-canonical/Filter may lead to fluctuating rankings

>>>
@si1very >>>

But typically G will resolve to one version and push that (in most cases).
These can be fixed with Hard/Soft canonical methods (301s, CLE/CLR etc.)

But that's not quite the same as external dupes - esp. with things like syndicated content,
which you seldom control!

>>>
@si1very >>>

For external dupes - you may well be stuffed (better off to alter your version (and learn Not to syndicate full/exact versions! (Not even if they provide a link and use a CLE/CLR!))

Then there's nigh-identical and cookie-cutter,
again ... different!

>>>
@si1very >>>

Nigh identical and cookie-cutter are often both intentional - and serve similar aims...
... to get multiple "versions" ranking and earning traffic.

The main differences are the quality, the quantity, and the type.

>>>
@si1very >>>

There's seldom a good reason to have multiple versions of an informational piece.
Typically highly-similar/nigh-identical applies to "products" and "services".

Legitimately - you may have numerous versions
(such as different colour iPads, with different RAM etc.)

>>>
@si1very >>>

If done properly - these are canonicalised.
G will then attempt to show the most relevant out of the set to match a query.
(if someone searches for "car sales in X", G may show your X page, even if the designated canonical is the Y page!).

>>>
@si1very >>>

Cookie-cutter pages though are often in greater quantity, with only the "keyword" different.
Worse, the quality of the content is often lower ... G doesn't want it, and may penalise it, or the "quality" is marked down (lower crawl/index rates).

>>>
@si1very >>>

Then there's the confusion over things like "boilerplate".
Having a % of content that is identical across pages is seldom a cause for concern.

Really, the only time it's an issue is if you have little other content (and/or it's not original/unique).

>>>
@si1very >>>

Then we have "mashups".
Be it your own content, or from external sites ... multiple bits of content that can be found elsewhere.

Sometimes this can be good for users (curated, relevant, useful) ... but often it's subpar, and if G see's poor reactions, it tends to die

>>>
@si1very >>>

Google's reaction to "duplicates",
and the impact it can/does have on:
* indexing
* crawling
* ranking
can and does vary - depending on the nature, quantity and quality.

>>>
@si1very >>>

Having less than optimal value dispersal
is not the same as
G refusing to show a % of your site,
or
G removing a % of your site from it's index.

Different results, caused by (slightly) different problems, with different solutions.

>>>
@si1very >>>

So … yes - in some cases, it's moot.
In others, it's an important distinction.

What matters is understanding the problem,
knowing the cause
and applying the correct solution.

It may require canonicalization, or altering content,
or simply deleting spammy garbage.

>>>
@si1very >>>

And the only way that's going to happen is if people use correct descriptions.
Things like "boilerplate" aren't duplication.
Highly similar/expanded mashups aren't duplication.
Even cookie-cutter-content isn't technically "duplication" (as they aren't identical).

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Lyndon NA (Darth Autocrat)

Lyndon NA (Darth Autocrat) Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @darth_na

19 Nov 21
1/🧵
🚨::: Online Reputation Management :::🚨

A topic that is often ignored,
despite the huge influence that reputation plays in marketing,
the impact it has on sales,
and what a PITA it is for SEO.

🧵>>>

Please Retweet

#ORM #OnlineReputationManagement #SEO #DarthAutocrat Image : Header text reads &...
2/*

ORM is technically part of Digital Public Relations (Digital PR (DPR)).

I say “technically”, because, in most cases, those in that field seem more interested in “link acquisition” than handling public relations/perception?

>>>
3/*

Online Reputation is obviously the digital/internet aspect of a company’s (or individuals/groups) reputation.

It not only has the same issues that the Real World presents,
but has some unique difficulties due to the nature of digital, and search.
Read 45 tweets
14 Nov 21
If you've done the research,
you should have a set of
Who, What, Where, When & Why

Those should make a large % of the existing consumers.

But few people go to the effort of:
1) Figuring an earlier "when", and the "where" that goes with it
2) Looking at sibling/cousin paths
Proper (and continuous!) research should yield insights into motivation/cause, and locations.
You should be able to utilise "personas" (or demo-/firmo- and psycho-graphics) to locate additional locations, probable channels/sources.

Failing that, use Search for questions!
Search for the same things your consumers do,
and you'll likely find where they go for info ... and where you should be!
(providing answers, running ads, providing sponsorships etc.)

Do searches for Product/Service -brand, and see what comes up. Or +Comp. brand!
Read 6 tweets
12 Nov 21
1/🧵
🚨::: Keywords, and Beyond! :::🚨

It’s a standard industry term.
Almost everyone that has heard of SEO knows of “keywords”.

The problem is, the general perception of keywords is out of date!

Worse - there’s no alternative, and few additional terms!

#SEO #DarthAutocrat Image : Heading : “Keywords, and Beyond!” Darth Vader in
2/*

Originally, Keywords were THE thing.
Meta Keywords and string matching.

Other SEs came along, things evolved, Meta-Keywords basically died.

Yet the term remained.
Though how they are used has evolved,
the way they are used for research hasn’t really.
3/*

As competition for “keywords” got harder,
new terms came to the fore:
* Head term
* Longtail
* And then Mid-tail joined in
As more businesses went online, and more sites, pages and content appeared - it became harder to rank for the shorter “keywords”.
Read 24 tweets
10 Nov 21
.
8 things to think about - SEO for Startups.

Solid thinking and reasoning on SEO for new businesses.

More than worth a read!

#SEO #StartUps
>>>
>>>

I'd also throw in:

+ Get TSL sorted and have the HTTPS on launch!

+ Get your Google Business listing sorted ASAP

+ When looking at TLDs for Domain Names, check for confusion points (same name, different TLD etc.)

>>>
>>>

+ Sort the HTTP > HTTPS out, and pick either www or non-www - then get the 301s sorted out from day one.

+ Own your Name! Make sure you own a domain with your Brand, and you have social profiles for it (same for unique product names etc.).
Same for Directories.

>>>
Read 5 tweets
5 Nov 21
1/🧵
::: Google do NOT use CTR as a Ranking Factor! :::
:::But does that mean they don't use Clicks? :::

This is a long running subject,
and one that get's muddied by poor phrasing,
almost every time :(

🧵>>>

#SEO #CTR #DarthAutocrat
2/*

Let's be clear here:

* GA BR (Google Analytics Bounce Rate)
* GA ToP (Google Analytics Time on Page)
* GA ToS (Google Analytics Time on Site)
* SERP CTR (Search Engine Result Page Click Through Rate)

are NOT (direct/indirect) ranking factors!

>>>
3/*

For starters, only a % of sites use Google Analytics,
so there'd be a Huge data hole.

And each of them are ambiguous/noisy,
with various reasons for whatever value,
(a page with the weather - quick visit, leave - does not mean the page sucks or is irrelevant!)

>>>
Read 30 tweets
22 Oct 21
17/🧵
Part 2
🚨::: Internal Links - What, Why, When, Where and How ::: 🚨

(Part 1 = )

The second Thread covers the 2 bits you are likely more interested in:
* SEO
* Optimisation
of Internal links.

Please Retweet

>>>

#SEO #InternalLinks
18/*
:: SEO ::
Finally - the bits you’ve been waiting for!
SEO aspects of Internal Links:
* Crawling
* Indexing
* Ranking
* Hybrid

>>>
19/*
Crawling :
SE’s (can/do/may) use links for URL discovery - they note the destination of the URL and add it to a crawl queue.

(They may also note “patterns”, and attempt to “guess” URLs, sometimes looking at JS etc. to do so.)

>>>
Read 35 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(