# Inference for one proportion {#one-prop}
<!-- Please don't mess with the next few lines! -->
<style>h5{font-size:2em;color:#0000FF}h6{font-size:1.5em;color:#0000FF}div.answer{margin-left:5%;border:1px solid #0000FF;border-left-width:10px;padding:25px} div.summary{background-color:rgba(30,144,255,0.1);border:3px double #0000FF;padding:25px}</style>`r options(scipen=999)`<p style="color:#ffffff">`r intToUtf8(c(50,46,48))`</p>
<!-- Please don't mess with the previous few lines! -->
::: {.summary}
### Functions introduced in this chapter {-}
No new R functions are introduced here.
:::
## Introduction {#one-prop-intro}
Our earlier work with simulations showed us that when the number of successes and failures is large enough, we can use a normal model as our sampling distribution model.
We revisit hypothesis tests for a single proportion, but now, instead of running a simulation to compute the P-value, we take the shortcut of computing the P-value directly from a normal model.
There are no new concepts here. All we are doing is revisiting the rubric for inference and making the necessary changes.
### Install new packages {#one-prop-install}
There are no new packages used in this chapter.
### Download the R notebook file {#one-prop-download}
Check the upper-right corner in RStudio to make sure you're in your `intro_stats` project. Then click on the following link to download this chapter as an R notebook file (`.Rmd`).
<a href = "https://vectorposse.github.io/intro_stats/chapter_downloads/15-inference_for_one_proportion.Rmd" download>https://vectorposse.github.io/intro_stats/chapter_downloads/15-inference_for_one_proportion.Rmd</a>
Once the file is downloaded, move it to your project folder in RStudio and open it there.
### Restart R and run all chunks {#one-prop-restart}
In RStudio, select "Restart R and Run All Chunks" from the "Run" menu.
## Load packages {#one-prop-load}
We load the standard `tidyverse`, `janitor`, and `infer` packages as well as the `openintro` package to access data on heart transplant candidates. We'll include `mosaic` for one spot below when we compare the results of `infer` to the results of graphing a normal distribution using `qdist`.
```{r}
library(tidyverse)
library(janitor)
library(infer)
library(openintro)
library(mosaic)
```
## Revisiting the rubric for inference {#one-prop-rubric}
Instead of running a simulation, we are going to assume that the sampling distribution can be modeled with a normal model as long as the conditions for using a normal model are met.
Although the rubric has not changed, the use of a normal model changes quite a bit about the way we go through the other steps. For example, we won't have simulated values to give us a histogram of the null model. Instead, we'll go straight to graphing a normal model. We won't compute the percent of our simulated samples that are at least as extreme as our test statistic to get the P-value. The P-value from a normal model is found directly from shading the model.
What follows is a fully-worked example of inference for one proportion. After the hypothesis test (sometimes called a one-proportion z-test for reasons that will become clear), we also follow up by computing a confidence interval. **From now on, we will consider inference to consist of a hypothesis test and a confidence interval.** Whenever you're asked a question that requires statistical inference, you should follow both the rubric steps for a hypothesis test and for a confidence interval.
The example below will pause frequently for commentary on the steps, especially where their execution will be different from what you've seen before when you used simulation. When it's your turn to work through another example on your own, you should follow the outline of the rubric, but you should **not** copy and paste the commentary that accompanies it.
## Research question {#one-prop-question}
Data from the Stanford University Heart Transplant Study is located in the `openintro` package in a data frame called `heart_transplant`. From the help file we learn, "Each patient entering the program was designated officially a heart transplant candidate, meaning that he was gravely ill and would most likely benefit from a new heart." Survival rates are not good for this population, although they are better for those who receive a heart transplant. Do heart transplant recipients still have less than a 50% chance of survival?
## Exploratory data analysis {#one-prop-ex-eda}
### Use data documentation (help files, code books, Google, etc.) to determine as much as possible about the data provenance and structure. {#one-prop-ex-documentation}
Start by typing `?heart_transplant` at the Console or searching for `heart_transplant` in the Help tab to read the help file.
##### Exercise 1 {-}
Click on the link under "Source" in the help file. Why is this not helpful for determining the provenance of the data?
Now try to do an internet search to find the original research article from 1974. Why is this search process also not likely to help you determine the provenance of the data?
::: {.answer}
Please write up your answer here.
:::
*****
Now that we have learned everything we can reasonably learn about the data, we print it out and look at the variables.
```{r}
heart_transplant
```
```{r}
glimpse(heart_transplant)
```
Commentary: The variable of interest is `survived`, which is coded as a factor variable with two categories, "alive" and "dead". Keep in mind that because we are interested in survival rates, the "alive" condition will be considered the "success" condition.
There are 103 patients, but we are not considering all these patients. Our sample should consist of only those patients who actually received the transplant. The following table shows that only 69 patients were in the "treatment" group (meaning that they received a heart transplant).
```{r}
tabyl(heart_transplant, transplant) %>%
    adorn_totals()
```
### Prepare the data for analysis. {#one-prop-ex-prepare}
**CAUTION: If you are copying and pasting from this example to use for another research question, the following code chunk is specific to this research question and not applicable in other contexts.**
We need to use `filter` so we get only the patients who actually received the heart transplant.
```{r}
# Do not copy and paste this code for future work
heart_transplant2 <- heart_transplant %>%
    filter(transplant == "treatment")
heart_transplant2
```
Commentary: don't forget the double equal sign (`==`) that checks whether the `transplant` variable is equal to the value "treatment". (See Chapter 5 if you've forgotten how to use `filter`.)
Again, this step isn't something you need to do for other research questions. This question is peculiar because it asks only about patients who received a heart transplant, and that only involves a subset of the data we have in the `heart_transplant` data frame.
### Make tables or plots to explore the data visually. {#one-prop-ex-plots}
Making sure that we refer from now on to the `heart_transplant2` data frame and not the original `heart_transplant` data frame:
```{r}
tabyl(heart_transplant2, survived) %>%
    adorn_totals()
```
## Hypotheses {#one-prop-ex-hypotheses}
### Identify the sample (or samples) and a reasonable population (or populations) of interest. {#one-prop-ex-sample-pop}
The sample consists of `r NROW(heart_transplant2)` heart transplant recipients in a study at Stanford University. The population of interest is presumably all heart transplant recipients.
### Express the null and alternative hypotheses as contextually meaningful full sentences. {#one-prop-ex-express-words}
$H_{0}:$ Heart transplant recipients have a 50% chance of survival.
$H_{A}:$ Heart transplant recipients have less than a 50% chance of survival.
Commentary: It is slightly unusual that we are conducting a one-sided test. The standard default is typically a two-sided test. However, it is not for us to choose: the proposed research question is unequivocal in hypothesizing "less than 50%" survival.
### Express the null and alternative hypotheses in symbols (when possible). {#one-prop-ex-express-math}
$H_{0}: p_{alive} = 0.5$
$H_{A}: p_{alive} < 0.5$
## Model {#one-prop-ex-model}
### Identify the sampling distribution model. {#one-prop-ex-sampling-dist-model}
We will use a normal model.
Commentary: In past chapters, we have simulated the sampling distribution or applied some kind of randomization to simulate the effect of the null hypothesis. The point of this chapter is that we can---when the conditions are met---substitute a normal model to replace the unimodal and symmetric histogram that resulted from randomization and simulation.
### Check the relevant conditions to ensure that model assumptions are met. {#one-prop-ex-ht-conditions}
* Random
- Since the `r NROW(heart_transplant2)` patients are from a study at Stanford, we do not have a random sample of all heart transplant recipients. We hope that the patients recruited to this study were physiologically similar to other heart patients so that they are a representative sample. Without more information, we have no real way of knowing.
* 10%
- `r NROW(heart_transplant2)` patients are definitely less than 10% of all heart transplant recipients.
* Success/failure
$$
np_{alive} = 69(0.5) = 34.5 \geq 10
$$
$$
n(1 - p_{alive}) = 69(0.5) = 34.5 \geq 10
$$
Commentary: Notice something interesting here. Why did we not use the 24 patients who survived and the 45 who died as the successes and failures? In other words, why did we use $np_{alive}$ and $n(1 - p_{alive})$ instead of $n \hat{p}_{alive}$ and $n(1 - \hat{p}_{alive})$?
Remember the logic of inference and the philosophy of the null hypothesis. To convince the skeptics, we must assume the null hypothesis throughout the process. It's only after we present sufficient evidence that we can reject the null and fall back on the alternative hypothesis that encapsulates our research question.
Therefore, under the assumption of the null, the sampling distribution is the *null distribution*, meaning that it's centered at 0.5. All work we do with the normal model, including checking conditions, must use the null model with $p_{alive}= 0.5$.
That's also why the numbers don't have to be whole numbers. If the null states that of the 69 patients, 50% are expected to survive, then we expect 50% of 69, or 34.5, to survive. Of course, you can't have half of a survivor. But these are not *actual* survivors. Rather, they are the expected number of survivors in a group of 69 patients *on average* under the assumption of the null.
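These expected counts are trivial to verify. The following chunk (just a sanity check, not a required rubric step) computes them directly:
```{r}
# Expected counts under the null p = 0.5 (expected, not observed)
# Both must be at least 10 for the success/failure condition
69 * 0.5        # expected survivors
69 * (1 - 0.5)  # expected deaths
```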
## Mechanics {#one-prop-ex-mechanics}
### Compute the test statistic. {#one-prop-ex-compute-test-stat}
```{r}
alive_prop <- heart_transplant2 %>%
    specify(response = survived, success = "alive") %>%
    calculate(stat = "prop")
alive_prop
```
We'll also compute the corresponding z score.
```{r}
alive_z <- heart_transplant2 %>%
    specify(response = survived, success = "alive") %>%
    hypothesize(null = "point", p = 0.5) %>%
    calculate(stat = "z")
alive_z
```
Commentary: The sample proportion code is straightforward and we've seen it before. To get the z score, we also have to tell `infer` what the null hypothesis is so that it knows where the center of our normal distribution will be. In the `hypothesize` function, we tell `infer` to use a "point" null hypothesis with `p = 0.5`. All this means is that the null is a specific point: 0.5. (Contrast this to hypothesis tests with two variables when we had `null = "independence"`.)
We can confirm the calculation of the z score manually. It's easiest to compute the standard error first. Recall that the standard error is
$$
SE = \sqrt{\frac{p_{alive}(1 - p_{alive})}{n}} = \sqrt{\frac{0.5(1 - 0.5)}{69}}
$$
**Remember that we are working under the assumption of the null hypothesis.** This means that we use $p_{alive} = 0.5$ everywhere in the formula for the standard error.
We can do the math in R and store our result as `SE`.
```{r}
SE <- sqrt(0.5*(1 - 0.5)/69)
SE
```
Then our z score is
$$
z = \frac{(\hat{p}_{alive} - p_{alive})}{SE} = \frac{(\hat{p}_{alive} - p_{alive})}{\sqrt{\frac{p_{alive} (1 - p_{alive})}{n}}} = \frac{(0.348 - 0.5)}{\sqrt{\frac{0.5 (1 - 0.5)}{69}}} = -2.53.
$$
Using the values of `alive_prop` and `SE`:
```{r}
z <- (alive_prop - 0.5)/SE
z
```
Both the sample proportion $\hat{p}_{alive}$ (stored above as `alive_prop`) and the corresponding z-score can be considered the "test statistic". If we use $\hat{p}_{alive}$ as the test statistic, then we're considering the null model to be
$$
N\left(0.5, \sqrt{\frac{0.5 (1 - 0.5)}{69}}\right).
$$
If we use z as the test statistic, then we're considering the null model to be the *standard normal model*:
$$
N(0, 1).
$$
The standard normal model is more intuitive and easier to work with, both conceptually and in R. Generally, then, we will consider z as the test statistic so that we can consider our null model to be the standard normal model. For example, knowing that our test statistic is two and a half standard deviations to the left of the null value already tells us a lot. We can anticipate a small P-value leading to rejection of the null. Nevertheless, for this type of hypothesis test, we'll compute both in this section of the rubric.
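To see concretely that the two null models give the same tail area, here is a quick sketch using base R's `pnorm`. (The names `p_hat`, `SE_null`, and `z_val` are introduced here just for illustration; the official `infer` calculation appears in the sections below.)
```{r}
# Sketch: the left-tail area is the same whether we measure p-hat
# against N(0.5, SE) or the z score against N(0, 1)
p_hat <- 24/69
SE_null <- sqrt(0.5 * (1 - 0.5) / 69)
z_val <- (p_hat - 0.5) / SE_null
pnorm(p_hat, mean = 0.5, sd = SE_null)  # tail area under N(0.5, SE)
pnorm(z_val)                            # tail area under N(0, 1)
```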
### Report the test statistic in context (when possible). {#one-prop-ex-report-test-stat}
The test statistic is `r alive_prop %>% pull(1)`. In other words, `r 100 * (alive_prop %>% pull(1))`% of heart transplant recipients were alive at the end of the study.
The z score is `r alive_z %>% pull(1)`. The proportion of survivors is about 2.5 standard errors below the null value.
### Plot the null distribution. {#one-prop-ex-plot_null}
```{r}
alive_test <- heart_transplant2 %>%
    specify(response = survived, success = "alive") %>%
    hypothesize(null = "point", p = 0.5) %>%
    assume(distribution = "z")
alive_test
```
```{r}
alive_test %>%
    visualize() +
    shade_p_value(obs_stat = alive_z, direction = "less")
```
Commentary: In past chapters, we have used the `generate` verb to get many repetitions (usually 1000) of some kind of random process to simulate the sampling distribution model. In this chapter, we have used the verb `assume` instead to assume that the sampling distribution is a normal model. As long as the conditions hold, this is a reasonable assumption. This also means that we don't have to use `set.seed` as there is no random process to reproduce.
Compare the graph above to what we would see if we simulated the sampling distribution. (Now we do need `set.seed`!)
```{r}
set.seed(6789)
alive_test_draw <- heart_transplant2 %>%
    specify(response = survived, success = "alive") %>%
    hypothesize(null = "point", p = 0.5) %>%
    generate(reps = 1000, type = "draw") %>%
    calculate(stat = "prop")
alive_test_draw
```
```{r}
alive_test_draw %>%
    visualize() +
    shade_p_value(obs_stat = alive_prop, direction = "less")
```
This is essentially the same picture, although the model above is centered on the null value 0.5 instead of the z score of 0. This also means that the `obs_stat` had to be the sample proportion `alive_prop` and not the z score `alive_z`.
### Calculate the P-value. {#one-prop-ex-calculate-p}
```{r}
alive_test_p <- alive_test %>%
    get_p_value(obs_stat = alive_z, direction = "less")
alive_test_p
```
Commentary: compare this to the P-value we get from simulating random draws:
```{r}
alive_test_draw %>%
    get_p_value(obs_stat = alive_prop, direction = "less")
```
The values are not exactly the same. And a new simulation with a different seed would likely give another slightly different P-value. The takeaway here is that the P-value itself has some uncertainty, so you should never take the value too seriously.
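If you want to see this uncertainty for yourself, the following sketch reruns the simulation with a different (arbitrarily chosen) seed. Expect a P-value close to, but not exactly equal to, the ones above.
```{r}
# Same null simulation, different seed: the P-value shifts slightly
set.seed(1234)
heart_transplant2 %>%
    specify(response = survived, success = "alive") %>%
    hypothesize(null = "point", p = 0.5) %>%
    generate(reps = 1000, type = "draw") %>%
    calculate(stat = "prop") %>%
    get_p_value(obs_stat = alive_prop, direction = "less")
```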
### Interpret the P-value as a probability given the null. {#one-prop-ex-interpret-p}
The P-value is `r alive_test_p %>% pull(1)`. If there were truly a 50% chance of survival among heart transplant patients, there would only be a `r 100 * (alive_test_p %>% pull(1))`% chance of seeing data at least as extreme as we saw.
## Conclusion {#one-prop-ex-ht-conclusion}
### State the statistical conclusion. {#one-prop-ex-stat-conclusion}
We reject the null hypothesis.
### State (but do not overstate) a contextually meaningful conclusion. {#one-prop-ex-context-conclusion}
We have sufficient evidence that heart transplant recipients have less than a 50% chance of survival.
### Express reservations or uncertainty about the generalizability of the conclusion. {#one-prop-ex-reservations}
Because we know nearly nothing about the provenance of the data, it's hard to generalize the conclusion. We know the data is from 1974, so it's also very likely that survival rates for heart transplant patients then are not the same as they are today. The most we could hope for is that the Stanford data was representative for heart transplant patients in 1974. Our sample size (69) is also quite small.
### Identify the possibility of either a Type I or Type II error and state what making such an error means in the context of the hypotheses. {#one-prop-ex-errors}
As we rejected the null, we run the risk of making a Type I error. It is possible that the null is true and that there is a 50% chance of survival for these patients, but we got an unusual sample that appears to have a much smaller chance of survival.
## Confidence interval {#one-prop-ex-ci}
### Check the relevant conditions to ensure that model assumptions are met. {#one-prop-ex-ci-conditions}
- Random
- Same as above.
- 10%
- Same as above.
- Success/failure
- There were 24 patients who survived and 45 who died in our sample. Both are larger than 10.
Commentary: In the "Confidence interval" section of the rubric, there is no need to recheck conditions that have already been checked. The sample has not changed; if it met the "Random" and "10%" conditions before, it will meet them now.
So why recheck the success/failure condition?
Keep in mind that in a hypothesis test, we temporarily assume the null is true. The null states that $p = 0.5$ and the resulting null distribution is, therefore, centered at $p = 0.5$. The success/failure condition is a condition that applies to the normal model we're using, and for a hypothesis test, that's the null model.
By contrast, a confidence interval is making no assumption about the "true" value of $p$. The inferential goal of a confidence interval is to try to capture the true value of $p$, so we certainly cannot make any assumptions about it. Therefore, we go back to the original way we learned about the success/failure condition. That is, we check the actual number of successes and failures.
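If you want those observed counts in front of you, it's easy to re-run the earlier table here:
```{r}
# Observed successes (alive) and failures (dead) for the CI check
tabyl(heart_transplant2, survived) %>%
    adorn_totals()
```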
### Calculate and graph the confidence interval. {#one-prop-ex-ci-calculate}
```{r}
alive_ci <- alive_test %>%
    get_confidence_interval(point_estimate = alive_prop, level = 0.95)
alive_ci
```
```{r}
alive_test %>%
    visualize() +
    shade_confidence_interval(endpoints = alive_ci)
```
Commentary: when we use a theoretical normal distribution, we have to compute the confidence interval a different way.
When we bootstrapped, we had many repetitions of a process that resulted in a sampling distribution. From all those, we could find the 2.5th percentile and the 97.5th percentile. Although we let the computer do it for us, the process is straightforward enough that we could do it by hand if we needed to. Just put all 1000 bootstrapped values in order, then go to the 25th and 975th position in the list.
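In fact, the percentile mechanics are easy to demonstrate in R. The sketch below reuses the simulated draws from earlier in the chapter. (Those were draws under the null rather than bootstrap resamples, but the sort-and-take-percentiles idea is identical.)
```{r}
# Percentile mechanics: the 2.5th and 97.5th percentiles of 1000
# simulated statistics
alive_test_draw %>%
    summarize(lower = quantile(stat, 0.025),
              upper = quantile(stat, 0.975))
```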
We don't have a list of 1000 values when we use an abstract curve to represent our sampling distribution. Nevertheless, we can find the 2.5th percentile and the 97.5th percentile using the area under the normal curve as we saw in the last two chapters. We can do this "manually" with the `qdist` command, but we need the standard error first.
Didn't we calculate this earlier?
$$
SE = \sqrt{\frac{p_{alive}(1 - p_{alive})}{n}} = \sqrt{\frac{0.5(1 - 0.5)}{69}}
$$
Well...sort of. The value of $p_{alive}$ here is the value of the null hypothesis from the hypothesis test above. *However*, the hypothesis test is done. For a confidence interval, we have no information about any "null" value. There is no null anymore. It's irrelevant.
So what is the standard error for a confidence interval? Since we don't have $p_{alive}$, the best we can do is replace it with $\hat{p}_{alive}$:
$$
SE = \sqrt{\frac{\hat{p}_{alive} (1 - \hat{p}_{alive})}{n}} = \sqrt{\frac{0.3478261 (1 - 0.3478261 )}{69}}.
$$
We can let R do the heavy lifting here:
```{r}
SE2 <- sqrt(alive_prop * (1 - alive_prop) / 69)
SE2
```
And now this number can go into `qdist` as our standard deviation:
```{r}
qdist("norm", p = c(0.025, 0.975), mean = 0.3478261, sd = 0.05733743, plot = FALSE)
```
The numbers above are identical to the ones computed by the `infer` commands.
### State (but do not overstate) a contextually meaningful interpretation. {#one-prop-ex-ci-interpret}
We are 95% confident that the true percentage of heart transplant recipients who survive is captured in the interval (`r 100 * alive_ci$lower_ci`%, `r 100 * alive_ci$upper_ci`%).
Commentary: Note that when we state our contextually meaningful conclusion, we also convert the decimal proportions to percentages. Humans like percentages a lot better.
### If running a two-sided test, explain how the confidence interval reinforces the conclusion of the hypothesis test. {#one-prop-ex-two-sided}
We are not running a two-sided test, so this step is not applicable.
### When comparing two groups, comment on the effect size and the practical significance of the result. {#one-prop-ex-effect}
This is not applicable here because we are not comparing two groups. We are looking at the survival percentage in only one group of patients, those who had a heart transplant.
## Your turn {#one-prop-your-turn}
Follow the rubric to answer the following research question:
Some heart transplant candidates have already had a prior surgery. Use the variable `prior` in the `heart_transplant` data set to determine if fewer than 50% of patients have had a prior surgery. (To be clear, you are being asked to perform a one-sided test again.) **Be sure to use the full `heart_transplant` data, not the modified `heart_transplant2` from the previous example.**
The rubric outline is reproduced below. You may refer to the worked example above and modify it accordingly. Remember to strip out all the commentary. That is just exposition for your benefit in understanding the steps, but is not meant to form part of the formal inference process.
Another word of warning: the copy/paste process is not a substitute for your brain. You will often need to modify more than just the names of the tibbles and variables to adapt the worked examples to your own work. For example, if you run a two-sided test instead of a one-sided test, there are a few places that have to be adjusted accordingly. Understanding the sampling distribution model and the computation of the P-value goes a long way toward understanding the changes that must be made. Do not blindly copy and paste code without understanding what it does. And you should **never** copy and paste text. All the sentences and paragraphs you write are expressions of your own analysis. They must reflect your own understanding of the inferential process.
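As one concrete illustration of such a change (shown with the worked example's objects, since your test below is still one-sided): a two-sided test would use `direction = "two-sided"` when computing the P-value.
```{r}
# Illustration only: a two-sided P-value for the worked example above
alive_test %>%
    get_p_value(obs_stat = alive_z, direction = "two-sided")
```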
**Also, so that your answers here don't mess up the code chunks above, use new variable names everywhere. In particular, you should use `prior_test` (instead of `alive_test`) to store the results of your hypothesis test. Make other corresponding changes as necessary, like `prior_test_p` instead of `alive_test_p`, for example.**
##### Exploratory data analysis {-}
###### Use data documentation (help files, code books, Google, etc.) to determine as much as possible about the data provenance and structure. {-}
::: {.answer}
Please write up your answer here.
```{r}
# Add code here to print the data
```
```{r}
# Add code here to glimpse the variables
```
:::
###### Prepare the data for analysis. [Not always necessary.] {-}
::: {.answer}
```{r}
# Add code here to prepare the data for analysis.
```
:::
###### Make tables or plots to explore the data visually. {-}
::: {.answer}
```{r}
# Add code here to make tables or plots.
```
:::
##### Hypotheses {-}
###### Identify the sample (or samples) and a reasonable population (or populations) of interest. {-}
[Remember that you are using the full `heart_transplant` data, so your sample size should be larger here than in the example above.]
::: {.answer}
Please write up your answer here.
:::
###### Express the null and alternative hypotheses as contextually meaningful full sentences. {-}
::: {.answer}
$H_{0}:$ Null hypothesis goes here.
$H_{A}:$ Alternative hypothesis goes here.
:::
###### Express the null and alternative hypotheses in symbols (when possible). {-}
::: {.answer}
$H_{0}: math$
$H_{A}: math$
:::
##### Model {-}
###### Identify the sampling distribution model. {-}
::: {.answer}
Please write up your answer here.
:::
###### Check the relevant conditions to ensure that model assumptions are met. {-}
[Remember that you are using the full `heart_transplant` data, so the number of successes and failures will be different here than in the example above.]
::: {.answer}
Please write up your answer here. (Some conditions may require R code as well.)
:::
##### Mechanics {-}
[Be sure to use `heart_transplant` everywhere and not `heart_transplant2`!]
###### Compute the test statistic. {-}
::: {.answer}
```{r}
# Add code here to compute the test statistic.
```
:::
###### Report the test statistic in context (when possible). {-}
::: {.answer}
Please write up your answer here.
:::
###### Plot the null distribution. {-}
::: {.answer}
```{r}
# Add code here to plot the null distribution.
```
:::
###### Calculate the P-value. {-}
::: {.answer}
```{r}
# Add code here to calculate the P-value.
```
:::
###### Interpret the P-value as a probability given the null. {-}
::: {.answer}
Please write up your answer here.
:::
##### Conclusion {-}
###### State the statistical conclusion. {-}
::: {.answer}
Please write up your answer here.
:::
###### State (but do not overstate) a contextually meaningful conclusion. {-}
::: {.answer}
Please write up your answer here.
:::
###### Express reservations or uncertainty about the generalizability of the conclusion. {-}
::: {.answer}
Please write up your answer here.
:::
###### Identify the possibility of either a Type I or Type II error and state what making such an error means in the context of the hypotheses. {-}
::: {.answer}
Please write up your answer here.
:::
##### Confidence interval {-}
###### Check the relevant conditions to ensure that model assumptions are met. {-}
::: {.answer}
Please write up your answer here. (Some conditions may require R code as well.)
:::
###### Calculate the confidence interval. {-}
::: {.answer}
```{r}
# Add code here to calculate the confidence interval.
```
:::
###### State (but do not overstate) a contextually meaningful interpretation. {-}
::: {.answer}
Please write up your answer here.
:::
###### If running a two-sided test, explain how the confidence interval reinforces the conclusion of the hypothesis test. [Not always applicable.] {-}
::: {.answer}
Please write up your answer here.
:::
###### When comparing two groups, comment on the effect size and the practical significance of the result. [Not always applicable.] {-}
::: {.answer}
Please write up your answer here.
:::
## Conclusion {#one-prop-conclusion}
When certain conditions are met, we can use a theoretical normal model---a perfectly symmetric bell curve---as a sampling distribution model in hypothesis testing. Because this does not require drawing many samples, it is faster and cleaner than simulation. Of course, on modern computing devices, drawing even thousands of simulated samples is not very time consuming, and the code we write doesn't really change much. Given the additional success/failure condition that has to be met, it's worth considering the pros and cons of using a normal model instead of simulating the sampling distribution. Similarly, confidence intervals can be obtained directly from the percentiles of the normal model without the need to obtain bootstrapped samples.
### Preparing and submitting your assignment {#one-prop-prep}
1. From the "Run" menu, select "Restart R and Run All Chunks".
2. Deal with any code errors that crop up. Repeat steps 1--2 until there are no more code errors.
3. Spell check your document by clicking the icon with "ABC" and a check mark.
4. Hit the "Preview" button one last time to generate the final draft of the `.nb.html` file.
5. Proofread the HTML file carefully. If there are errors, go back and fix them, then repeat steps 1--5 again.
If you have completed this chapter as part of a statistics course, follow the directions you receive from your professor to submit your assignment.