
Antoine Eripret

@antoineripret

Sep 9, 2021 • 16 tweets • 7 min read

🪦 Resurrect a domain in under an hour 🪦

In today's thread, I'll explain how you can resurrect a domain with thousands of pieces of content in very little time.

This can be useful for catastrophic migrations, but also for expired domains.

I already explained part of the process in another thread (https://twitter.com/antoineripret/status/1409462180462817283?s=09), but today I'll go into more detail and use a real example.

The example: a subdomain of Michelin, the French tire manufacturer. They decided to remove this subdomain a couple of months ago.

Let's assume we want to bring this domain back to life.

Step 1: Query the archive.org API

I won't repeat what's already in the other thread. You can also read aeripret.com/es/extraer-url… where I explain everything.

You'll end up with a table containing each original URL and the URL of its latest snapshot on https://t.co/ORlt4dS4F8.
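A minimal sketch of this step, assuming the Wayback Machine CDX endpoint is the archive.org API in question. The function names and the chosen query parameters (`matchType`, `fl`, `filter`) are illustrative assumptions, not the code from the thread. It builds the query, then keeps only the most recent snapshot for each original URL.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: the Wayback Machine CDX endpoint is used to list snapshots.
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_url(host):
    """Build a CDX query listing every successful capture for one host."""
    params = {
        "url": host,
        "matchType": "host",          # every URL captured on this host
        "output": "json",
        "fl": "original,timestamp",   # only the two fields we need
        "filter": "statuscode:200",   # ignore redirects and errors
    }
    return f"{CDX_ENDPOINT}?{urlencode(params)}"


def latest_snapshots(rows):
    """Map each original URL to the URL of its most recent snapshot.

    `rows` is the decoded JSON answer: a list of lists whose first row
    holds the field names.
    """
    latest = {}
    for original, timestamp in rows[1:]:
        # Timestamps are fixed-width digit strings, so string comparison works.
        if timestamp > latest.get(original, ""):
            latest[original] = timestamp
    return {
        original: f"https://web.archive.org/web/{timestamp}/{original}"
        for original, timestamp in latest.items()
    }


def fetch_latest_snapshots(host):
    """Query the API (network access required) and return the table."""
    with urlopen(build_cdx_url(host), timeout=60) as response:
        return latest_snapshots(json.load(response))
```

Calling `fetch_latest_snapshots` with the host you want to recover would return a dictionary that plays the role of the table described above.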

Step 2: Extract the content

In this case, we care about two parts:

1. (red) the introduction of the content
2. (orange) the content itself

Both can be easily identified by a class:

This part will be done in Python, saving the content as Markdown.

It can be done in other ways, but:

1. It's more complex / slower (in my opinion)
2. It isn't reproducible

I'll leave you the commented code so the logic is easy to follow.

Not every piece of content can be extracted because, in some cases, the only archive.org snapshot is a page stating that the domain no longer exists.

Obviously, I'm not interested in those cases, so my logic accounts for this "problem".
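The extraction logic could look roughly like the sketch below. It is not the code from the thread: it uses only the standard library's `html.parser` rather than a dedicated scraping library, the class names passed in are hypothetical, and it assumes well-nested markup. A snapshot that lacks one of the expected classes (for instance a "domain no longer exists" page) yields `None` and is skipped. The conversion of the extracted fragments to Markdown is not shown.

```python
import html
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not change the depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class ClassExtractor(HTMLParser):
    """Collect the inner HTML of the first element carrying a given class."""

    def __init__(self, class_name):
        super().__init__()
        self.class_name = class_name
        self.depth = 0      # > 0 while inside the target element
        self.found = False  # True once the target element has been closed
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if self.found:
            return
        if self.depth == 0:
            classes = (dict(attrs).get("class") or "").split()
            if self.class_name in classes and tag not in VOID_TAGS:
                self.depth = 1
            return
        self.parts.append(self.get_starttag_text())
        if tag not in VOID_TAGS:
            self.depth += 1

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <img ... /> are copied as-is.
        if self.depth > 0 and not self.found:
            self.parts.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if self.depth == 0 or self.found or tag in VOID_TAGS:
            return
        self.depth -= 1
        if self.depth == 0:
            self.found = True
        else:
            self.parts.append(f"</{tag}>")

    def handle_data(self, data):
        if self.depth > 0 and not self.found:
            self.parts.append(html.escape(data, quote=False))


def extract_sections(html_text, class_names):
    """Return {class_name: inner_html}, or None if any section is missing."""
    sections = {}
    for name in class_names:
        parser = ClassExtractor(name)
        parser.feed(html_text)
        parser.close()
        if not parser.found:
            return None  # unusable snapshot: skip this URL
        sections[name] = "".join(parser.parts).strip()
    return sections
```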

You can open a few files to check that the content was saved correctly.

In my case, both the content and the images show up fine.

Step 3: Convert the Markdown to plain HTML

Next, we convert our Markdown to plain HTML. So why did we go through Markdown first? We already had HTML 🤷‍♂️

Because this way we make sure we end up with clean HTML, that is, without classes, <div> elements, etc.

In Python, this operation is fairly easy (see devdungeon.com/content/conver…).

That's the kind of thing I meant when I said it's easier to do it this way than all by hand.

Step 4: Download all the images

By default, the images in our content are now hosted on archive.org. We can download them and upload them to our own server.

First, we need to get the URLs of all the images in our content.

Then we try to download them (we'll have to upload them to our server manually over FTP afterwards, but that takes 5 minutes with FileZilla).
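Both halves of this step can be sketched with the standard library alone. The helper names are hypothetical and this is one possible approach, not the thread's original code. The first function lists every distinct `src` found in the HTML documents. The second tries each download and returns only the ones that succeeded, so that failures can be handled later. It names each file after the last path segment of its URL, which assumes those names do not collide.

```python
import os
from html.parser import HTMLParser
from urllib.parse import urlsplit
from urllib.request import urlretrieve


class ImageCollector(HTMLParser):
    """Record the src attribute of every <img> tag."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.sources.append(src)


def collect_image_urls(html_documents):
    """Return the distinct image URLs found across all documents, in order."""
    urls = []
    for document in html_documents:
        collector = ImageCollector()
        collector.feed(document)
        collector.close()
        for src in collector.sources:
            if src not in urls:
                urls.append(src)
    return urls


def download_images(urls, target_dir):
    """Try every download; return {url: local filename} for successes only."""
    os.makedirs(target_dir, exist_ok=True)
    downloaded = {}
    for url in urls:
        filename = os.path.basename(urlsplit(url).path)
        if not filename:
            continue
        try:
            urlretrieve(url, os.path.join(target_dir, filename))
        except (OSError, ValueError):
            continue  # unreachable or invalid URL: leave it out of the result
        downloaded[url] = filename
    return downloaded
```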

Step 5: Modify the HTML

With the images downloaded:

1. Update the src attributes in the HTML generated earlier so they use the new URL
2. Remove the images that could not be downloaded.

Between having no image and having one that doesn't load, which do you prefer?
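A rough sketch of those two rules, assuming the HTML is simple enough (as produced by a Markdown converter) for a regular expression to find `<img>` tags reliably. The `downloaded` mapping (old URL to local file name) and the `base_url` parameter are hypothetical names chosen for the illustration.

```python
import re

# Matches an <img> tag and captures the value of its src attribute.
IMG_TAG = re.compile(r'<img\b[^>]*?\ssrc="([^"]*)"[^>]*>', re.IGNORECASE)


def rewrite_images(html_text, downloaded, base_url):
    """Point downloaded images at base_url and drop the ones that failed."""

    def replace(match):
        filename = downloaded.get(match.group(1))
        if filename is None:
            return ""  # the image could not be downloaded: remove the tag
        tag = match.group(0)
        start = match.start(1) - match.start()
        end = match.end(1) - match.start()
        # Swap only the src value, leaving the other attributes untouched.
        return tag[:start] + base_url + filename + tag[end:]

    return IMG_TAG.sub(replace, html_text)
```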

We'll apply a very similar change to the internal links:

1. If a piece of content could not be recovered from archive.org, we remove the internal links pointing to it

2. We replace the internal links pointing to https://t.co/ORlt4dS4F8 with the real URL we'll use
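The link clean-up can be sketched in the same spirit. Assumptions: snapshot links carry a Wayback Machine prefix of the form `https://web.archive.org/web/<timestamp>/`, the HTML is simple enough for a regular expression, and `url_map` (original URL to new URL, recovered content only) is a hypothetical structure built in the earlier steps. Links to other sites are left untouched, and rewritten links keep only their `href` attribute.

```python
import re
from urllib.parse import urlsplit

# Assumed shape of the prefix the Wayback Machine adds to archived links.
WAYBACK_PREFIX = re.compile(r"^(?:https?://web\.archive\.org)?/web/\d+[a-z_]*/")
LINK = re.compile(r'<a\b[^>]*?\shref="([^"]*)"[^>]*>(.*?)</a>',
                  re.IGNORECASE | re.DOTALL)


def rewrite_links(html_text, old_host, url_map):
    """Fix internal links: remap recovered targets, unwrap the missing ones."""

    def replace(match):
        original = WAYBACK_PREFIX.sub("", match.group(1))
        if urlsplit(original).hostname != old_host:
            return match.group(0)  # not an internal link: leave as-is
        if original in url_map:
            return f'<a href="{url_map[original]}">{match.group(2)}</a>'
        return match.group(2)      # target not recovered: keep the text only

    return LINK.sub(replace, html_text)
```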

Following this logic, in very little time you have:

1. All the available content downloaded (as clean HTML)
2. The available images on your FTP, or removed from the content
3. Internal linking with no 404s

Using wpallimport.com, you can import everything in 2 minutes.

This process may look complex and intimidating (especially if you're not comfortable with Python), but:

1. It works perfectly; I've already used it several times without any problem

2. It saves a lot of time because it's repeatable and scalable.


