Yohan Profile picture
May 3, 2024 13 tweets 4 min read Read on X
Meta is using Facebook and mobile phone data to produce super-granular household wealth estimates.

Here’s what you need to know about the Relative Wealth Index: Image
The Relative Wealth Index (RWI) is a geospatial measure, which shows wealth disparities within countries.

Main features:

• It's open source
• It measures 'asset-based' wealth
• It covers 135 countries in the world
• Data is provided at 2.4km resolution
1. Methodolgy—Data Inputs:

It uses 'ground-truth' survey data from @DHSprogram (covering 1.4m households across 67,000 villages in 56 countries).

It then uses a range of alternative data:

• mobile phone data
• topographic maps
• aggregated Facebook connectivity data Image
@DHSprogram It also uses daytime satellite imagery, that's processed in a way I outline here:
@DHSprogram 2. Methodology—ML Models:

Machine learning (gradient boosting) is used to train a model on the relationship between the alternative data and survey data.

This is then used to predict wealth (out of sample) for 2.4km x 2.4km grids across 135 countries. Image
@DHSprogram Based on this, the RWI:

• is a relative index within each country at the time of the survey.
• has a mean value of zero and a standard deviation of one.

The scores cannot be compared across countries or over time.
@DHSprogram Model accuracy:

The ML model explains 72% of the variation in wealth, as measured with independent census data from 15 countries.

However, when compared to coordinate-level data collected by the Nigerian government, the model explained 50% of variation (at the grid-level). Image
@DHSprogram However there are a number of things to be aware of:

• larger errors in regions far from survey areas
• the model's accuracy is higher when data is aggregated to the local-government level
@DHSprogram Further questions on accuracy:

A study in 2023 looked at the RWI in Indonesia.

It found that using the RWI to pinpoint the poorest 14% of the population. showed mixed results.

The error rate was high—50.65%.

I.e. half of Indonesia's poorest regions were incorrectly identified Image
@DHSprogram And surprisingly, some areas RWI labeled as poorest were among the wealthiest. Image
@DHSprogram Takeaway:

Meta's RWI is a novel way of estimating asset-wealth at a household level.

But it still has limitations.

And this is the key message when seeking granular insights into wealth & GDP:

Nothing's perfect—it's just important to know what limitations each dataset has.
@DHSprogram If you're interested in other ways of measuring local GDP, check out this post:

And give us a follow @yohaniddawela for more breakdowns on geospatial and economics topics.
@DHSprogram Interested in going deeper?

I provide more in-depth tutorials and analyses in my newsletter.

You can subscribe here: yohan.so
Image

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Yohan

Yohan Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @yohaniddawela

Nov 28, 2025
Mapping an entire country’s agriculture typically requires massive computational resources.

However, researchers have just mapped one country's cropland in 16 hours for only $313.40.

Here's the breakdown: Image
A new preprint from researchers evaluates the utility of geospatial embeddings for cropland mapping in Togo: Image
So, what are these embeddings?

They are derived from Geospatial Foundation Models (GeoFMs).

The researchers focused on two specifically:

• Presto (low compute requirements)
• AlphaEarth (already generated globally)

These models compress complex sensor data into actionable features.
Read 11 tweets
Oct 23, 2025
5 South American countries just discovered they're wasting $15-23 billion on duplicate renewable energy infrastructure.

A new paper examines the solution: Image
A new study in Nature Communications examines Argentina, Brazil, Chile, Paraguay, and Uruguay's electricity systems through 2050: Image
The researchers ran 80 different scenarios testing:

• Full vs limited regional coordination
• 90% emissions cuts vs no climate policy
• Different wind turbine types
• Solar tracking technologies

They used an open-source model called GridPath to optimise both generation and transmission.
Read 12 tweets
Oct 14, 2025
Asia will bear 65% of global mangrove losses by 2100, whilst OECD countries face just 3%.

A new study reveals the massive inequality in climate impacts on coastal ecosystems.

Here's what you need to know: Image
A new study in Environmental Research: Climate presents the first global analysis of how warming seas threaten mangrove restoration efforts: Image
The researchers analysed mangrove cover data across 1,533 locations worldwide from 1996 to 2020.

They examined how climate variables and economic development influence mangrove area.

Their approach uses panel data analysis to isolate the causal effects of temperature and GDP. Image
Read 12 tweets
Sep 9, 2025
Everyone is talking about Zarr.

ESA is adopting it and others are testing it.

Does this mean the end of Cloud Optimized GeoTIFFs?

Here is what you need to know: Image
ESA recently announced Zarr as the new format for Sentinel-1, 2 and 3.

USGS has benchmarked it for Landsat’s archive.

But many in the community are asking: does this mean the end of COG? Image
What are the basics?

• Zarr: best for large, n-dimensional data cubes (e.g. climate models, satellite time series, weather).

• COG: best for 2D rasters like imagery or elevation, especially when you need wide compatibility with existing tools.
Read 13 tweets
Sep 3, 2025
Turns out there are some pretty big issues with DHS data.

A new study finds massive subnational differences in data quality across 35 African countries.

Here's the breakdown: Image
A new study in Nature Communications, analyses geocoded DHS data at a 5km resolution.

It highlights serious concerns for health and development policymaking: Image
The researchers focus on three types of data errors:

• Incomplete age (missing birth month or year)
• Age heaping (ages ending in 0 or 5)
• Flagged HAZ (missing or implausible child height data)

These are widely used indicators of data quality.
Read 13 tweets
Aug 29, 2025
Air pollution is usually blamed for lung and heart disease.

But new clinical data shows it may also drive diabetes.

Here’s what you need to know: Image
The researchers combined:

• Outpatient clinical records from the Italian Association of Diabetologists (AMD)
• Municipality-level pollution exposure data from ISPRA, Italy’s environmental protection agency

This gave them a unique dataset of pollution and diabetes at the local level.Image
The AMD dataset is pretty powerful:

• Covers ~300 diabetes centres across all 20 Italian regions
• Half of all diabetes outpatients in Italy
• Based on clinical records, not self-reported cases

This makes it far more reliable than survey-based data.
Read 13 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(