Susan Athey Profile picture
Sep 14, 2020 13 tweets 6 min read Read on X
@otis_reid Matrix factorization: Panel data can be thought of as a matrix. A necessary condition for being able to do prediction is that there is some structure --something about the row and column of an entry is informative about the outcome. 1/n
@otis_reid One way to describe the amount of structure is the quality of approximation you can get with a low rank matrix. An NxT matrix of rank k can be written as product of two latent factor matrices with k factors: [Nxk] X [kxT]. 2/n
@otis_reid Fixed effect models impose low rank structure with very strong functional form (outcome is sum of unit, time effects). Not usually the best way to approx a given matrix with a limited number of parameters. Matrix factorization finds good approximation in data-driven way. 3/n
@otis_reid If outcome=smoking, matrix is states X yrs, latent unit characteristics =share of pop in each demographic (smoking highly correlated w/ age, ethnicity). Smoking in state/year is dot product of “share of state pop in each demographic” and “smoking rate for demo in this year.” 4/n
@otis_reid If outcome=purchase, matrix is people X products, latent unit characteristics=preferences for product attributes, latent product characteristics=product attributes. Latent product attribute could represent “organic.” 5/n
@otis_reid In smoking ex., we figure out how state outcomes move together over time. Without directly observing demographics, we can infer the factors that lead to co-movements, and if I see some states at a point in time, can infer outcomes for others at that time. 6/n
@otis_reid In shopping ex., I learn from correlation structure in purchase behavior. One person purchases organic tomatoes and lettuce, another purchases organic lettuce and cucumber. I predict the first person more likely to buy organic cucumber. 7/n
@otis_reid Even if matrix sparse (mostly 0’s), so fixed effects hard to estimate, can still find good low-rank approximation if structure is present in the data. Chains of people buying overlapping products informative. 8/n
@otis_reid I have some lecture notes for a master’s class (not as polished as I’d like, and stealing liberally from others) here that may help build intuition for different ways to look at a matrix: drive.google.com/drive/u/0/fold… 9/n
@otis_reid You can see Guido teach this at the AEA website here aeaweb.org/conference/con… , and slides are here: drive.google.com/drive/u/0/fold… 10/n
@otis_reid Slides build intuition about regression in panels. Do you regress final pd outcomes on prior pd outcomes; observation is unit? Regress target unit outcomes on other units in the same period; obs. is time pd (synth control)? Matrix compl. works if N>T or T>N, good in middle. 11/n
@otis_reid I also have applications to shopping and discrete choice, in these papers: arxiv.org/abs/1906.02635 arxiv.org/abs/1711.03560 and also see slides here: drive.google.com/drive/u/0/fold… 12/n
@otis_reid The shopping papers show how modern matrix factorization can be combined with structural models, and indeed there is a long history in marketing/IO and also in time series econ of using latent factor models, just typically fewer factors. 13/n

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Susan Athey

Susan Athey Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @Susan_Athey

May 17, 2022
In most econ applications, clustering std errors is a problem of design, not sampling. What does that mean? Clustering is needed if treatment assignment is correlated within clusters, not because some clusters were sampled and others were not. 1/8 @jmwooldridge
After 5 yrs of iterating on math, simulations, writing, we updated our working paper on clustering. Lots of time working on single formulation that captures intermediate cases, simple after-the-fact, math matches intuition. arxiv.org/abs/1710.02926
Key to state clearly the thought experiment–what type of uncertainty are std errors supposed to capture? If we have observations on individuals from all 50 states, the problem is NOT that there are states that weren’t sampled & we have uncertainty about unseen states!
3/8
Read 8 tweets
Mar 3, 2021
Just published in @ScienceMag

We estimate the value of vaccine capacity, and suggest policies and investments that governments could make right now to accelerate vaccine availability and end the COVID-19 pandemic sooner.

science.sciencemag.org/lookup/doi/10.….

Thread/
1/ Each month, COVID-19 costs hundreds of thousands of lives and hundreds of billions of dollars in short-run GDP. That’s before accounting for missed schooling and other healthcare.
This means there are huge benefits to accelerating vaccination.
2/ Our latest analysis estimates $17.4 trillion of benefit for first 3bn courses, and $1trn in global benefit from expanding annual capacity by another 1bn courses now.
Read 34 tweets
Nov 9, 2020
We (@ArkhangelskyD, David Hirshberg, Guido Imbens, Stefan Wager and I) posted a major revision of our paper, Synthetic Difference-in-Differences, to arxiv. The paper estimates treatment effects when units are followed over time, and some receive a treatment in later periods. 1/n
Our SDID method combines weighting and outcome modeling for a form of double-robustness that improves on synthetic controls and standard DID in many settings. arxiv.org/abs/1812.09970 2/n
We include a worked out application as well as data-driven simulations designed to highlight strengths and weaknesses of alternative methods. 3/n
Read 16 tweets
Mar 7, 2020
Starting an open-ended conversation for faculty who are taking their courses online in a hurry. What are tips and tricks? What challenges are you facing? #econtwitter
I recently had to give some lectures remotely. I used powerpoint, taking screen shots of pdfs and pasting onto ppt where needed. Ppt has very, very easy to use feature where you record a video w/ webcam, separate videos for each slide, and they autoplay in present mode.
The file size was huge but ppt has option to compress them. I still had to break up ppt's into smaller chunks. I liked recording separately slide by slide and being able to rearrange and edit modularly, could change slide after finishing video, etc.
Read 15 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(