@otis_reid Matrix factorization: Panel data can be thought of as a matrix. A necessary condition for being able to do prediction is that there is some structure --something about the row and column of an entry is informative about the outcome. 1/n
@otis_reid One way to describe the amount of structure is the quality of approximation you can get with a low rank matrix. An NxT matrix of rank k can be written as product of two latent factor matrices with k factors: [Nxk] X [kxT]. 2/n
@otis_reid Fixed effect models impose low rank structure with very strong functional form (outcome is sum of unit, time effects). Not usually the best way to approx a given matrix with a limited number of parameters. Matrix factorization finds good approximation in data-driven way. 3/n
@otis_reid If outcome=smoking, matrix is states X yrs, latent unit characteristics =share of pop in each demographic (smoking highly correlated w/ age, ethnicity). Smoking in state/year is dot product of “share of state pop in each demographic” and “smoking rate for demo in this year.” 4/n
@otis_reid If outcome=purchase, matrix is people X products, latent unit characteristics=preferences for product attributes, latent product characteristics=product attributes. Latent product attribute could represent “organic.” 5/n
@otis_reid In smoking ex., we figure out how state outcomes move together over time. Without directly observing demographics, we can infer the factors that lead to co-movements, and if I see some states at a point in time, can infer outcomes for others at that time. 6/n
@otis_reid In shopping ex., I learn from correlation structure in purchase behavior. One person purchases organic tomatoes and lettuce, another purchases organic lettuce and cucumber. I predict the first person more likely to buy organic cucumber. 7/n
@otis_reid Even if matrix sparse (mostly 0’s), so fixed effects hard to estimate, can still find good low-rank approximation if structure is present in the data. Chains of people buying overlapping products informative. 8/n
@otis_reid I have some lecture notes for a master’s class (not as polished as I’d like, and stealing liberally from others) here that may help build intuition for different ways to look at a matrix: drive.google.com/drive/u/0/fold… 9/n
@otis_reid Slides build intuition about regression in panels. Do you regress final pd outcomes on prior pd outcomes; observation is unit? Regress target unit outcomes on other units in the same period; obs. is time pd (synth control)? Matrix compl. works if N>T or T>N, good in middle. 11/n
@otis_reid The shopping papers show how modern matrix factorization can be combined with structural models, and indeed there is a long history in marketing/IO and also in time series econ of using latent factor models, just typically fewer factors. 13/n
• • •
Missing some Tweet in this thread? You can try to
force a refresh
In most econ applications, clustering std errors is a problem of design, not sampling. What does that mean? Clustering is needed if treatment assignment is correlated within clusters, not because some clusters were sampled and others were not. 1/8 @jmwooldridge
After 5 yrs of iterating on math, simulations, writing, we updated our working paper on clustering. Lots of time working on single formulation that captures intermediate cases, simple after-the-fact, math matches intuition. arxiv.org/abs/1710.02926
Key to state clearly the thought experiment–what type of uncertainty are std errors supposed to capture? If we have observations on individuals from all 50 states, the problem is NOT that there are states that weren’t sampled & we have uncertainty about unseen states!
3/8
We estimate the value of vaccine capacity, and suggest policies and investments that governments could make right now to accelerate vaccine availability and end the COVID-19 pandemic sooner.
1/ Each month, COVID-19 costs hundreds of thousands of lives and hundreds of billions of dollars in short-run GDP. That’s before accounting for missed schooling and other healthcare.
This means there are huge benefits to accelerating vaccination.
2/ Our latest analysis estimates $17.4 trillion of benefit for first 3bn courses, and $1trn in global benefit from expanding annual capacity by another 1bn courses now.
We (@ArkhangelskyD, David Hirshberg, Guido Imbens, Stefan Wager and I) posted a major revision of our paper, Synthetic Difference-in-Differences, to arxiv. The paper estimates treatment effects when units are followed over time, and some receive a treatment in later periods. 1/n
Our SDID method combines weighting and outcome modeling for a form of double-robustness that improves on synthetic controls and standard DID in many settings. arxiv.org/abs/1812.09970 2/n
We include a worked out application as well as data-driven simulations designed to highlight strengths and weaknesses of alternative methods. 3/n
Polishing my slides for our panel discussion tomorrow morning about synthetic control in the online Chamberlain econometrics seminar: chamberlainseminar.org
Have a great set of coauthors!
Method is weighted version of difference-in-differences, w/ more weight to units AND time periods similar to the treated units. This yields double robustness: even if outcome model is not linear in unit/time effects, still consistent; if linear model right, weights can be wrong.
Starting an open-ended conversation for faculty who are taking their courses online in a hurry. What are tips and tricks? What challenges are you facing? #econtwitter
I recently had to give some lectures remotely. I used powerpoint, taking screen shots of pdfs and pasting onto ppt where needed. Ppt has very, very easy to use feature where you record a video w/ webcam, separate videos for each slide, and they autoplay in present mode.
The file size was huge but ppt has option to compress them. I still had to break up ppt's into smaller chunks. I liked recording separately slide by slide and being able to rearrange and edit modularly, could change slide after finishing video, etc.