First, we will load data from Graddy (1995) who observes daily fish prices and quantities from the Fulton Fish Market in NYC.
The goal of this paper is to estimate the demand elasticity of fish prices
$$
Q = \alpha + \delta P + \gamma X + \varepsilon
$$
Since price and quantity are simultaneous equations, we need an instrument that shocks
Load the Fulton Fish Market data from https://github.com/Mixtape-Sessions/Causal-Inference-1/raw/main/Lab/IV/Fulton.dta
-
Run OLS of
qonpcontrolling for indicators for day of the week (Mon,Tue,Wed,Thu). Interpret the elasticity. -
Estimate the Wald estimator using reduced form and first stage regressions with day of week fixed effects using
Stormyas the instrument. Interpret the elasticity. -
Instrument for
pusing the variableStormywhich is an indicator for it being stormy in the past two days. Interpret the elasticity. Compare your answer to the Wald estimator. -
Calculate the Olea and Pfleuger effective F-statistic and interpret the strength of the instrument.
-
Calculate the Anderson-Rubin confidence intervals.
-
What might be a great figure of the reduced form and the first stage separately?
Next, we will turn to the classical IV example of Card (1995). Card aims to estimate the returns to college education, but is worried about omitted variable bias when regressing wages on years of education. In this case, the coefficient on education is likely biased upwards by unobserved ability since it is plausibly positively correlated with education and with wages.
Load the Card data from https://raw.github.com/scunning1975/mixtape/master/card.dta
-
Regress the log wage
lwageon years of educationeduccontrolling for experience (exper), an indicator for being Black (black), an indicator for being in the South (south), an indicator for being married (married), and an indicator for living in an urban area (smsa). Interpret the coefficient on schooling. -
Using the same set of controls, run an instrumental variables regression instrumenting
educbynearc4which is an indicator for being near a 4-year college/university. Interpret the coefficient on schooling -
Calculate the strength of the first stage using the Olea and Pfleuger effective F statistic. Interpret the result.
-
Calculate the Anderson-Rubin confidence intervals and interpret your result.
-
Compare the two results, does the IV estimate move in the direction we predicted above? Use the concept of LATE to describe why the coefficient moved in the direction it did.
-
If your schooling result is a LATE, then what do the five assumptions mean in the context of this study?
-
What might be a great figure of the reduced form and the first stage separately?
Note: for Stata, you can use -weakivtest- for the Olea and Pfleuger effective F and -twostepweakiv- for the Anderson-Rubin confidence intervals. For R, you can use the {} and {} packages
