Yohan Profile picture
May 3, 2024 13 tweets 4 min read Read on X
Meta is using Facebook and mobile phone data to produce super-granular household wealth estimates.

Here’s what you need to know about the Relative Wealth Index: Image
The Relative Wealth Index (RWI) is a geospatial measure, which shows wealth disparities within countries.

Main features:

• It's open source
• It measures 'asset-based' wealth
• It covers 135 countries in the world
• Data is provided at 2.4km resolution
1. Methodolgy—Data Inputs:

It uses 'ground-truth' survey data from @DHSprogram (covering 1.4m households across 67,000 villages in 56 countries).

It then uses a range of alternative data:

• mobile phone data
• topographic maps
• aggregated Facebook connectivity data Image
@DHSprogram It also uses daytime satellite imagery, that's processed in a way I outline here:
@DHSprogram 2. Methodology—ML Models:

Machine learning (gradient boosting) is used to train a model on the relationship between the alternative data and survey data.

This is then used to predict wealth (out of sample) for 2.4km x 2.4km grids across 135 countries. Image
@DHSprogram Based on this, the RWI:

• is a relative index within each country at the time of the survey.
• has a mean value of zero and a standard deviation of one.

The scores cannot be compared across countries or over time.
@DHSprogram Model accuracy:

The ML model explains 72% of the variation in wealth, as measured with independent census data from 15 countries.

However, when compared to coordinate-level data collected by the Nigerian government, the model explained 50% of variation (at the grid-level). Image
@DHSprogram However there are a number of things to be aware of:

• larger errors in regions far from survey areas
• the model's accuracy is higher when data is aggregated to the local-government level
@DHSprogram Further questions on accuracy:

A study in 2023 looked at the RWI in Indonesia.

It found that using the RWI to pinpoint the poorest 14% of the population. showed mixed results.

The error rate was high—50.65%.

I.e. half of Indonesia's poorest regions were incorrectly identified Image
@DHSprogram And surprisingly, some areas RWI labeled as poorest were among the wealthiest. Image
@DHSprogram Takeaway:

Meta's RWI is a novel way of estimating asset-wealth at a household level.

But it still has limitations.

And this is the key message when seeking granular insights into wealth & GDP:

Nothing's perfect—it's just important to know what limitations each dataset has.
@DHSprogram If you're interested in other ways of measuring local GDP, check out this post:

And give us a follow @yohaniddawela for more breakdowns on geospatial and economics topics.
@DHSprogram Interested in going deeper?

I provide more in-depth tutorials and analyses in my newsletter.

You can subscribe here: yohan.so
Image

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Yohan

Yohan Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @yohaniddawela

Jul 28
Most countries don't publish official sub-national population data.

Luckily, there are several geospatial population datasets we can use instead.

Here's a list of them (I wish I knew about 5 years ago): Image
1. WorldPop (@WorldPopProject) provides data on:

• population counts
• population density
• population by age and sex

Data is from 2000-2020 and available at 100m or 1km resolution.

Link: worldpop.orgImage
@WorldPopProject WorldPop have recently been extending this out to 2030.

This is still in beta, but you can find the data here:
Read 12 tweets
Jun 19
Meta is known for Facebook, WhatsApp and Instagram.

But did you know they provide a range of free geospatial datasets for researchers?

These include granular measures of household wealth, population, and network access.

Here's what you need to know about it: Image
1. Meta provides granular estimates of household wealth for low and middle income countries.

Read more about it here:
2. They identified 'at-risk' populations during the pandemic:
Read 12 tweets
Jun 17
It can be a nightmare to find official sub-national shapefiles.

Luckily, there are a number of sources that make the job easier: Image
1. GADM

The most widely used source of global shapefiles is the Database of Global Administrative Areas (GADM).

For countries like the UK, it provides boundaries down to the admin 4 level.

Link: gadm.orgImage
2. Geoboundaries

Geoboundaries from @aiddata provide another global source of admin boundaries.

They've got shapefiles for all countries in the world as well.

I typically use this to double-check GADM boundaries.

Link: geoboundaries.orgImage
Read 9 tweets
Jun 16
One of the biggest traps in geospatial analysis?

Ecological Fallacy.

It can turn a map into a misleading story.

Here's what you need to know about it: Image
In simple terms, ecological fallacy is drawing conclusions about individuals from data that were aggregated over areas (e.g. counties, districts, grids). Source: ScienceUpFirst
Why does this matter?

Aggregate data mix many influences.

When you average values, opposing patterns can cancel out or intensify.

This hides what happens at the person-level. Image
Read 13 tweets
Jun 6
What happens when a scientist pays $300 for 50 citations?

A study reveals a growing black market of citation mills, and why it threatens the credibility of science. Image
To test the Google Scholar system, researchers created a fake academic and paid $300 to buy 50 citations.

Within 40 days, citations started appearing in Google Scholar, published in journals with impact factors as high as 4.79. Image
They found that these citations often came from “special issues” in journals.

These are issues where peer review was minimal or absent.

These special issues published everything from fake news to Islamic psychology, all in journals supposedly about chemistry.
Read 13 tweets
Jun 3
We track poverty too slowly to respond effectively.

But a new open-source tool now lets us map wealth in near real-time using satellite imagery.

Here’s what you need to know: Image
Back in 2016, Stanford's AI Lab pioneered a new method of measuring economic activity using daytime satellite images. Image
However, a paper in @Res_Elements's SoftwareX extends this methodology to provide time-series wealth estimates at a granular level: Image
Read 12 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(