-
Notifications
You must be signed in to change notification settings - Fork 1
/
19-iv.Rmd
162 lines (89 loc) · 6.29 KB
/
19-iv.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
```{r 19_setup, include=FALSE}
knitr::opts_chunk$set(echo=TRUE, eval=FALSE, fig.align="center")
```
# Instrumental Variables Analysis
## Learning Goals {-}
- Explain the big picture ideas behind how IV studies work in terms of mimicking a randomized experiment
- Evaluate the plausibility of IV assumptions within a data context
<br>
Slides from today are available [here](https://docs.google.com/presentation/d/1Y75rZZE0eYwwiuSUYn5mGZtDvIxilWEhBzeZhemS22g/edit?usp=sharing).
<br><br><br>
## Exercises {-}
### Exercise 1 {-}
In health services research, the distance that one lives from a specialty care provider is often considered as an instrument. In particular, studies have looked at the causal effect of high-level vs low-level neonatal intensive care units (high vs. low level NICUs) on birth outcomes. (High vs. low is differentiated by the amount of technology available and number of babies delivered per year.)
The instrument used in this setting was a *differential distance (excess travel time)*: travel time to the nearest high-level NICU *minus* travel time to the nearest low-level NICU. This instrument was binarized to be $IV = 1$ if this excess travel time was no more than 10 minutes, and $IV = 0$ otherwise. Distance to these health services is known to be a reflection of where mothers live which, in turn, is often a reflection of socioeconomic status indicators.
#### Part a {-}
Evaluate the validity of differential distance as a (conditional) instrument by constructing a causal graph.
- Draw the causal graph.
- Identify a conditioning set such that the instrument would be (conditionally) valid. It's fine if this validity is only an approximate validity.
- Evaluate the degree to which each of the 3 core instrumental assumptions (not the homogeneity or monotonicity assumptions) is satisfied. Make connections between d-separation and the graphical versions of these assumptions.
<br>
#### Part b {-}
Under the monotonicity assumption, to whom does the causal effect estimated by IV generalize? Do you think this effect is of genuine interest in this setting?
<br><br>
### Exercise 2 {-}
Some analysts have believed that it is possible to perform a hypothesis test to test the exclusion restriction assumption by regressing `Y ~ A + IV`. If they obtain a high p-value for the IV coefficient, that must mean that the exclusion restriction is met. However, even if the exclusion restriction is met, this test will likely not turn result in the correct conclusion.
Explain two things: (1) the intuition behind the `Y ~ A + IV` test and (2) why this test will not give the right result (using graph concepts).
<br><br>
### Exercise 3 {-}
While randomized experiments are the gold standard for causal inference when everything runs perfectly, **noncompliance** can be an issue. Noncompliance occurs when an individual does not comply with the treatment that was randomly assigned.
Explain how instrumental variables estimation could be useful in a setting with noncompliance, and state clearly what causal effect is being estimated with the IV approach.
<br><br>
### Exercise 4 {-}
The **strength** of an instrument refers to its degree of association with the treatment of interest. Strong instruments are highly associated with the treatment, and weak instruments have small associations with the treatment.
A characteristic of instrumental variables estimation is that the causal effect estimate has much higher variance for weak instruments than for strong instruments. By looking at the ratio form of the IV estimator, argue why this would be the case. (This can also be argued via thinking about the two-stage least squares approach if you wish.)
<br><br>
### Extra! {-}
If you have time, work on this exercise.
**Before looking at the code below,** plan an answer to the following research question with high-level pseudocode (a high-level description of code that you would write).
> **Research question:** How could we write a simulation study to understand the impact of modeling choices on estimated results in IV settings where there are nonlinear relationships between variables in our causal system?
<br><br><br><br>
Now examine the code below which implements an analysis to investigate the above question. Write a paragraph describing what this simulation study is doing and (preliminary) results from the study.
Code notes:
- The `ivpack` package is used to fit the two-stage least squares model. Specifically, you'll want to look up the documentation for the `ivreg()` function.
- `replicate(n, {code})` repeats `{code}` `n` times.
- `I(x^2)` in a model formula instructs R to add a squared version of `x` to the dataset as another predictor.
```{r}
library(ivpack)
library(dplyr)
library(tidyr)
library(ggplot2)
sim_binary <- function(n, log_odds) {
odds <- exp(log_odds)
p <- odds/(1+odds)
rbinom(n, size = 1, prob = p)
}
simulate_and_estimate <- function(n, iv_strength) {
results <- replicate(100, {
Z <- rnorm(n, 10, 2)
log_odds_IV <- -3 + 0.05*Z + 0.005*(Z^2)
IV <- sim_binary(n, log_odds_IV)
log_odds_A <- -1 + 0.04*Z + 0.004*(Z^2) + iv_strength*IV
A <- sim_binary(n, log_odds_A)
M <- rnorm(n, 4 + 3*IV, 2)
Y <- rnorm(n, 4 + Z + 5*A + 2*(M^2), 2)
sim_data <- data.frame(Z, IV, A, M, Y)
iv_mod <- ivreg(Y ~ A + Z + M|IV + Z + M, data = sim_data)
iv_mod2 <- ivreg(Y ~ A + Z + I(Z^2) + M + I(M^2)|IV + Z + I(Z^2) + M + I(M^2), data = sim_data)
c(iv_strength = iv_strength, estim1 = unname(coefficients(iv_mod)["A"]), estim2 = unname(coefficients(iv_mod2)["A"]))
})
results %>% t() %>% as_tibble()
}
set.seed(451)
n <- 10000
result1 <- simulate_and_estimate(n, iv_strength = 0.1)
result2 <- simulate_and_estimate(n, iv_strength = 0.4)
result3 <- simulate_and_estimate(n, iv_strength = 0.8)
all_results <- bind_rows(result1, result2, result3)
all_results <- all_results %>%
pivot_longer(-iv_strength, names_to = "model_type", values_to = "estimate")
ggplot(all_results, aes(x = model_type, y = estimate)) +
geom_boxplot() +
facet_grid(~iv_strength) +
geom_hline(yintercept = 5, col = "red")
ggplot(all_results, aes(x = model_type, y = estimate)) +
geom_boxplot() +
facet_grid(~iv_strength) +
geom_hline(yintercept = 5, col = "red") +
coord_cartesian(ylim = c(-200,200))
```