Paul Novosad Profile picture
Jun 15, 2023 29 tweets 11 min read Read on X
📣 New working paper on residential segregation in India. We’ve been working for 5 years on this.

8 facts about residential segregation in India, from new administrative data. The situation is not great 🧵 1/N Image
We built a national neighborhood-level dataset covering all India, 2011–13.

It’s super local. A neighborhood = ~700 people, 1.5m in the country.

Data are from ~2012, this is about historical patterns, not the current govt.

3/N
Fact 1: India is very segregated.

Urban places are about as segregated as rural places, for Scheduled Castes. For Muslims, segregation is worse in cities. The graph shows the Dissimilarity Index for cities and subdistricts. ImageImage
Fact 2: Scheduled Castes and Muslims are about as segregated as Black people in U.S. cities. Note this dissimilarity graph is slightly different from the prior, b/c we limit to cities >100k to match U.S. Census definitions. 5/N Image
Fact 3: Muslims are more likely to live in highly segregated neighborhoods.

26% of urban Muslims live in neighborhoods that are >80% Muslim.

17% of urban SCs live in neighborhoods that are >80% SC.

Numbers in rural areas are similar. 6/N Image
Fact 4: Cities replicate the social environments of their hinterlands. Districts with segregated villages have segregated cities. 7/N Image
The existence of segregation is not surprising to people who study and spend time in Indian cities.

Some of these descriptive facts have been noted by @RaphaelSusewind, @nav_bharathi, @deepak_malghan, among others. 8/N
But maybe groups choose to live together — does it matter?

Let’s look at service delivery in these neighborhoods, starting with secondary schools 👇 9/N
Fact 5: Public services in cities are less likely to be found in neighborhoods with many SCs and Muslims.

A 100% Muslim neighborhood is only half as likely to have a secondary school as a neighborhood with no Muslims. 10/N Image
When we look at SCs, moderate SC neighborhoods are doing ok, but the most segregated neighborhoods again are less likely to have secondary schools. 11/N Image
We are comparing SC, Muslim, and integrated neighborhoods, *within the same city*.

This kind of granular data has not been available before. If you look at geographic aggregates, like districts, you will find a different (misleading) story. 12/N
This graph shows school access vs. SC share at aggregate levels.

The story is positive for SCs: states, districts, and towns with more SCs all have more secondary schools, maybe because policies have targeted schools to high-SC regions.

But … Image
Once you look at school allocation *across* blocks/neighborhoods *within* towns, most of that advantage disappears.

For whatever reason, within cities and towns, the most segregated SC neighborhoods have the fewest secondary schools. 14/N Image
Let’s look at the same analysis for Muslims.

States, districts, and towns with high Muslim shares do not have any particular advantage or disadvantage when it comes to school access. 15/N Image
But across neighborhoods, the results are stark. Muslim neighborhoods are *much less likely* to have public secondary schools. 16/N Image
We ran the same analysis for a wide range of public services: primary schools, health clinics, water and electricity infrastructure, closed drainage. 18/N
The result is systematic: within cities, neighborhoods with high SC and Muslim shares have much worse public services—look at the rightmost red bar. The other services are in Figs 5–7 of the paper. paulnovosad.com/pdf/india-segr… ImageImageImageImage
Fact 5: Kids are worse off in segregated neighborhoods.

Young people have over a full year less education in fully segregated SC and Muslim neighborhoods. The graph shows outcomes for 17–18 year olds. 20/N Image
Fact 6: Kids from *all social groups* are worse off in segregated neighborhoods.

The neighborhood effect explains about half of the group disadvantage. In predicting your education, where you live is just as important as your social group. Image
Fact 7: The broad regional patterns do not stand out. Segregated and integrated cities appear throughout the country. 22/N ImageImage
Fact 8: Things might be getting better over time (but we’re not sure).

Younger cities are less segregated, even taking into account their size. 23/N ImageImage
This could mean (a) modern settlement patterns are less segregated (good news, we conjecture); or (b) cities get more segregated over time.

It’s hard to tell, without time series data. 24/N
Ellie Baker and @tobylunt made a fantastic NYT-style visualization of the key results of the paper: devdatalab.org/segregation

We wrote a fact sheet for the media: paulnovosad.com/pdf/segregatio…

More details in the full paper: paulnovosad.com/pdf/india-segr…
25/N
We’re working on a big data release from this paper, which will be posted soon.
26/N
Some additional details:

Core data are from SECC (2012) and Economic Census (2013), linked at enumeration block level.

The EC records public facilities, the SECC social group, infrastructure, wealth, and education.
27/N
This is a descriptive paper and we do not take a stand on the causes of segregation and unequal service access.

Sorting across neighborhoods and unequal service allocation choices are both likely to play roles.

28/N
We don't study Scheduled Tribes, because the paper is focused on cities, and only 4% of members of Scheduled Tribes live in cities.

Given the data available to us, it was not possible to identify neighborhoods that are predominantly OBC. 29/N
While it would be nice to think that Kerala has zero segregation, in fact we don't have neighborhood SECC data for Kerala, so we can't measure segregation.

We made a bad color choice for missing! 30/N

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Paul Novosad

Paul Novosad Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @paulnovosad

Dec 21, 2025
I'm enjoying this @alexolegimas suggestion so much.

In addition to being fascinating (who knew C. Elegans had so much in common with a Roomba), the author @maxsbennett has such a fine way with words.

Some examples in thread 1/ Image
Fungal spores are all around us, patiently waiting for something to die.

I love this image, it's so dark, and inevitable. Take your time, we can wait.

2/ Image
Radially symmetrical animals have only one opening—a mouth-butt if you will.

I'll never look at coral the same way.

3/ Image
Read 4 tweets
Dec 15, 2025
Here's an underrated theory paper on college admissions.

When the stakes get high, we end up selecting too many people who are good at gaming the system, and too few who are actually good. Image
Once you realize that gaming the system is a dimension of talent, you notice that many students at elite schools have this in spades.

They are the ones who protest every grade, invoke every appeal and accommodation, send lengthy emails about why you should be lenient with them.
Read 4 tweets
Nov 14, 2025
👀👀

Very very interesting — data on disability accommodations in college, at last!

Many of us have suspected that high-income students are benefiting the most from disability accommodations. Some answers!

🧵 as I read 1/
Data is from a single (unnamed I think) state school. Big enough to be representative.

Though I suspect behavior at elite schools might be different — I am afraid elite admissions selects (among other things!) for the people who are good at scheming these things.

2/
The income result is not totally clear, and not huge.

Omitted group is HH income < $50k.

Most likely groups to get accommodations are the very poor (<$50k) and the rich (>$200k).

Kind of parallels elite college admissions — upper middle class gets the least help.

3/ Image
Read 20 tweets
Nov 4, 2025
What happens when online job applicants start using LLMs? It ain't good.

1. Pre-LLM, cover letter quality predicts your work quality, and a good cover gets you a job
2. LLMs wipe out the signal, and employer demand falls
3. Model suggests high ability workers lose the most

1/n Image
In April 2023, made it possible for workers to use AI in their cover letters.

Employers can't see if they used the tool.

Time spent to submit an application goes down, with a big increase in apps that took <30 seconds 2/ freelancer.comImage
A measure of cover letter quality — how much the freelancer's email is customized to the specific job post — goes way up. 3/ Image
Read 15 tweets
Oct 14, 2025
In the 1980s, Nestlé was moving into infant formula markets in low-income countries.

With each new market entry, moms switched from breastfeeding to unclean water, and infant mortality increased substantially; 200,000 excess deaths per year. 1/n Image
Amazing — Nestlé saleswomen dressed in nurses uniforms to pitch their product to women in hospitals right after delivery.

Marketing memos were very direct on this — "medical staff are more likely to influence mothers with regard to the [best food] for their babies" 2/n Image
Infant formula is classified as food—not medication—so companies can make whatever claims they want about the effects of their product — improves sleep, increases intelligence, etc. wow 3/n Image
Read 4 tweets
Sep 3, 2025
David Roodman writes: consumers of economic research are more truth-seeking than the producers.

Here is Roodman's recap of his re-analysis of a set of papers on temperature and judge decision-making.

The comment process is not working that well!

Gory details in 🧵 1/ Image
Here is the Gelman blog post:
statmodeling.stat.columbia.edu/2025/09/02/who…

Here is @davidroodman's reanalysis:
econstor.eu/handle/10419/3… 2/
Compare the authors' published comment in AEJ:Applied ("results qualitatively unchanged"), with @davidroodman's summary of what they did.

I'm sorry, what?? They had to drop China (25% of the data) to save the result, and don't state it in the comment abstract?? 3/ Image
Image
Read 9 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(