...

Generalized Synthetic Control Method for Causal Inference in Time-Series Cross-Sectional... Yiqing Xu Department of Political Science, Massachusetts Institute of Technology The Problem

by user

on
Category: Documents
33

views

Report

Comments

Transcript

Generalized Synthetic Control Method for Causal Inference in Time-Series Cross-Sectional... Yiqing Xu Department of Political Science, Massachusetts Institute of Technology The Problem
Generalized Synthetic Control Method for Causal Inference in Time-Series Cross-Sectional Data
Yiqing Xu
Department of Political Science, Massachusetts Institute of Technology
Empirical Applications
The Problem
Objective: to identify causal effects with
time-series cross-sectional data
Problems of existing methods:
(1) DID: “parallel trends" may not be true
(2) Synthetic control: one treated unit
treatment
effect
control
T0
−2
Unifying the synthetic control method and
fixed-effect models in a simple framework
X Multiple treated units
X Variable treatment timing
X Conventional standard errors
X Robust DID estimates
X More efficient than the synthetic control method
X Fast and easy to implement
Quantities of Interest
• Individual
treatment effect: δit, ∀i ∈ T
1 P
• SATTt = N
δ
D
i∈T
it
it
tr
• PATTt = E[δit|Dit = 1]
Applying ideas of statistical learning to a
problem of causal inference
Identifying Assumptions
5
10
15
−3
Finding: On average, a high-level official’s visit
almost tripled a firm’s total outstanding loans in a
three-year period.
80
120
160
Case 2. One treated unit: Abadie, Diamond,
Hainmueller (2010): Proposition 99 on tobacco
consumption
40
Treated
Controls
Generalized synthetic control
1970
•I
use a resampling scheme to produce standard
errors for individual treatment effect and SATT .
→ The key is to correctly calculate the out-ofsample prediction error of the counterfaucal for
each treated unit
• When
the treatment group is relatively large and
Assumption 4 holds, inference of PATT can be
made using nonparametric bootstrap
show that in both cases I obtain the
correct coverage rates
1975
1980
1985
1990
1995
2000
1985
1990
1995
2000
Estimated treatment effect (GSynth)
Estimated treatment effect (Synth)
95% CI
1970
1975
1980
Finding: The estimated effect is similar in size to
that in Abadie et. al. though estimation uncertainty
increases.
Extensions
1
When It Doesn’t Work
Conditional ignorability
{Yi(1), Yi(0)}⊥
⊥Di|X, Λ, F
• Predict
the counterfactual for each treated unit in
the post-treatment period
• Take advantage of cross-sectional correlations
• Use pre-treatment periods to select models
Inference
• Simulations
Intuition
0
40
Main Contributions
4 Obtain treatment effect for each treated unit
−5
20
15
3 Predict counterfactuals for the treated units
−10
0
−0.5
−1.0
Important Special Cases
Ft,1 = 1: individual fixed effect
λi,1 = 1, Ft,1 = ηt: time fixed effect
Ft,1 = t: individual-specific linear time trend
Ft,1 = t, Ft,2 = t2: quadratic time trend
1−αt
t
Ft,1 = α , Ft,2 = 1−α : lagged dependent (AR1)
λi,1Ft,1 = Ziγt: effect of pre-treatment covariates
1 Estimate a factor model using the control
group units (Bai 2009)
2 Estimate factor loadings for the treated
units using least squares
Repeat 1-2 to minimize MSPE
−15
−40
+ it
+ it + δitDit
Loop starts: given the number of factors,
Estimates
95% CIs
−80
2.0
1.5
Coefficient
0.0
0.5
1.0
0
λiFt
0
λiFt
Algorithm
unobserved time-varying factors
• λi: individual-specific factor loadings
10
T0
Setup
Yit(0) = βXit +
Yit(1) = βXit +
5
control
T0
• Ft :
0
Coefficient
−1
0
1
control
Diff−in−diffs
−5
2
treated
λ0iFt = λi,1Ft,1 + λi,2Ft,2 + · · · λi,r Ft,r
−10
Generalized Synthetic Control
treated
I investigate the the effect of government officials’
visits to firms in China on firms’ access to loans
during the 2008 global crisis. DID fails because the
“parallel trends" assumption appears implausible.
−15
Case 1. Multiple treated units, variable
treatment timing: officials’ visits on firms’ access
to bank loans
3
treated
A Motivating Case
Estimates
95% CIs
treatment
effect
treatment
effect
∀i
Weak serial dependence of the error terms
3 Regularity conditions
4 Random sampling (if PATT is of interest)
2
• When
the number of pre-treatment periods or the
number of control group units is small
• When time-varying counfounders cannot be
decomposed (e.g., Wit 6= λi × Ft)
• When the SUTVA is violated
Yit(0) = Ziθt + αi + ηt + βXit +
0
λiFt
+ it
- αi and ηt: unit and time fixed effects
- Zi: pre-treatment time-invariant covariates
• The choice of identifying assumptions is the key
• The algorithm can also help choose the model
Fly UP