-
Notifications
You must be signed in to change notification settings - Fork 0
/
final_writeup.Rmd
274 lines (188 loc) · 26.2 KB
/
final_writeup.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
---
title: "Household Income and Transportation Usage"
author: "Jasmine Mangat"
date: "May 9 2021"
abstract: "With rising concerns over the impact of climate change on our planet, specifically urban communities, there has been a recent push in designing cities to be more eco-friendly through investing in public transportation and creating neighborhoods more accessible to bicyclists and pedestrians. However, these changes may be feeding into another urban issue that has been making headlines: gentrification. Coined as 'eco-gentrification,' researchers are pointing to the inherent issues with touting the benefits of sustainable urban planning without considering equity (Rice). Through this study, I focus on the current state of transportation options available to those living and working in the Boston–Cambridge–Newton, MA–NH Metropolitan Statistical Area depending on their income level and where they have to travel to work. This study could potentially play a role in studies relating to transportation accessibility and its potential role in promoting eco-gentrification in urban areas."
font-family:
output: pdf_document
---
```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = FALSE)
library(haven)
library(ipumsr)
library(tidyverse)
library(broom)
library(knitr)
library(nnet)
library(ggplot2)
library(reshape2)
library(dplyr, warn.conflicts = FALSE)
```
# Introduction
Transportation is one of the key factors which influences urban planning because of its power to connect different people and industries throughout a given area. A study in Brazil conducted interviews with the poorer population in a given city and found how unaffordable and unexpansive transportation systems can inhibit access to jobs, businesses, and other resources that are too far by foot or bicycles. Although American and Brazilian land models differ, this phenomenon is important to consider when considering how gentrification can often lead to people being kicked out of their neighborhoods into those "less-desirable" and "inaccessible."
Transportation has also been a topic of interest in the recent surge in the sustainable and smart city movement which helps promote more climate-conscience and data-driven urban planning. However, recent studies have found that efforts such as more sustainable and cheaper transportation systems such as bikes may lead to increased housing prices (Rice). Thus, eco-gentrification points out how increased sustainable transportation accessibility efforts may be targeting may lead to less accessibility overall to a city's overall resources for low-income populations.
Through this study, I look at how different household incomes travel from their homes to work while controlling for different living and working locations in a metropolitan area. Through this study, we hope to gain a better understanding of how transportation choices among lower and upper-income groups might differ depending on their travel needs. While I do not consider transportation accessibility directly, I provide suggestions on how this study can be extended to do so. Additionally, I will be discussing the nuances that come from studies involving transportation choices and accessibility due to various hidden variables and flawed assumptions that are often made.
# Data
```{r, include=FALSE}
ddi <- read_ipums_ddi("usa_00003.xml")
acs_data3 <- read_ipums_micro(ddi)
```
```{r, include = FALSE}
acs_data3_boston_metro <- acs_data3 %>% filter(COUNTYICP == 0250 | COUNTYICP == 0230 | COUNTYICP == 0210 | COUNTYICP == 0090 | COUNTYICP == 0170 | COUNTYICP == 0150 | COUNTYICP == 0170)
acs_data3_boston_metro <- acs_data3 %>%
filter(COUNTYICP == 0250 | COUNTYICP == 0230 | COUNTYICP == 0210 | COUNTYICP == 0090 | COUNTYICP == 0170 | COUNTYICP == 0150 | COUNTYICP == 0170, METRO > 1 & METRO <= 4, PWTYPE == 1 | PWTYPE == 4 | PWTYPE == 5)
acs_data3_boston_metro <- rbind(acs_data3_boston_metro %>%
filter(TRANWORK >= 31 & TRANWORK <= 60), acs_data3_boston_metro %>%
filter(TRANWORK >= 10 & TRANWORK < 31 & CARPOOL != 0)) %>%
mutate(TRANWORK_SM = case_when(
TRANWORK < 31 & CARPOOL == 1 ~ "Private vehicle",
TRANWORK < 31 & CARPOOL == 2 ~ "Carpool",
TRANWORK < 50 & TRANWORK >= 31 ~ "Public transport",
TRANWORK == 50 ~ "Bicycle",
TRANWORK == 60 ~ "Walked Only"
)) %>%
mutate(METRO_NAME = case_when(
METRO == 2 ~ "In metropolitan area: in central/principal city",
METRO == 3 ~ "In metropolitan area: Not in central/principal city",
METRO == 4 ~ "In metropolitan area: Central/principal city status indeterminable (mixed)"
), PWTYPE_NAME = case_when(
PWTYPE == 1 ~ "In metropolitan area: in central/principal city",
PWTYPE == 4 ~ "In metropolitan area: Not in central/principal city",
PWTYPE == 5 ~ "In metropolitan area: Central/principal city status indeterminable (mixed)",
PWTYPE == 0 ~ "N/A or abroad",
PWTYPE == 9 ~ "Metropolitan status indeterminable (mixed)"), HOUSE_TO_WORK = case_when(
(METRO == 2 | METRO == 4) & (PWTYPE == 1 | PWTYPE == 5) ~ 1,
(METRO == 2 | METRO == 4) & (PWTYPE == 4) ~ 2,
METRO == 3 & (PWTYPE == 1 | PWTYPE == 5) ~ 3,
METRO == 3 & PWTYPE == 4 ~ 4
),
HOUSE_TO_WORK_NAME = case_when(
(METRO == 2 | METRO == 4) & (PWTYPE == 1 | PWTYPE == 5) ~ "City to city",
(METRO == 2 | METRO == 4) & (PWTYPE == 4) ~ "City to suburbs",
METRO == 3 & (PWTYPE == 1 | PWTYPE == 5) ~ "Suburbs to city",
METRO == 3 & PWTYPE == 4 ~ "Suburbs to suburbs"
),
HHINCOME_BRACKET = case_when(
HHINCOME < 25000 ~ 1,
HHINCOME >= 25000 & HHINCOME < 85000 ~ 2,
HHINCOME >= 85000 ~ 3
), CITY_TO_CITY_VAL = (METRO == 2 | METRO == 4) & (PWTYPE == 1 | PWTYPE == 5), CITY_TO_NOT_CITY_VAL = (METRO == 2 | METRO == 4) & (PWTYPE == 4), NOT_CITY_TO_CITY_VAL = METRO == 3 & (PWTYPE == 1 | PWTYPE == 5), NOT_CITY_TO_NOT_CITY_VAL = METRO == 3 & PWTYPE == 4, CITY_TO_CITY = case_when(
CITY_TO_CITY_VAL == TRUE ~ 1,
CITY_TO_CITY_VAL == FALSE ~ 0), CITY_TO_NOT_CITY = case_when(
CITY_TO_NOT_CITY_VAL == TRUE ~ 1,
CITY_TO_NOT_CITY_VAL == FALSE ~ 0), NOT_CITY_TO_CITY = case_when(
NOT_CITY_TO_CITY_VAL == TRUE ~ 1,
NOT_CITY_TO_CITY_VAL == FALSE ~ 0), NOT_CITY_TO_NOT_CITY = case_when(
NOT_CITY_TO_NOT_CITY_VAL == TRUE ~ 1,
NOT_CITY_TO_NOT_CITY_VAL == FALSE ~ 0
), WHITE = case_when(
RACE == 1 ~ 1,
RACE == 2 ~ 0
))
acs_data3_boston_metro <- acs_data3_boston_metro %>% mutate(HHINCOME_10000 = HHINCOME / 10000, INCTOT_1000 = INCTOT / 1000) %>% filter(HHINCOME_10000 >= 0) %>% drop_na(HOUSE_TO_WORK_NAME)
```
I use a 2019 sample from the American Community Survey consisting of 43,951 observations. The sample consists of people who live and work in the Boston–Cambridge–Newton, MA–NH Metropolitan Statistical Area which encompasses eastern Massachusetts and southern New Hampshire through Suffolk, Norfolk, Plymouth, Middlesex, Essex, Rockingham, and Strafford counties. I chose this area because although it is considered to be one of the wealthiest parts of the two states, studies have shown that it is also the area with the most income inequality (Sommeiller). This area is also dominated by the MBTA in terms of commuting to, from, and within the city of Boston and surrounding towns. However, this area also includes counties such as Rockingham and Strafford which do not have as much access to MBTA services such as the Commuter Rail. However, since Metropolitan Statistical Areas are often defined by commuting patterns, Rockingham and Strafford residents and workers may still travel to and from the city. Thus, this sample should be able to capture a diverse set of transportation methods.
My sample focuses on the following variables: HHINCOME, METRO, PWTYPE, TRANWORK, and CARPOOL. HHINCOME gives the total household income of the individual observed. I do not include other family members of the individual in the final list of observations. My final model uses a new variable called HHINCOME_10000 which divides each number in HHINCOME by 10000 dollars. I filtered out any values below 0 to avoid negative incomes and also any values above 500 due to outliers which can be seen in Table 1 below.
METRO is a categorical variable that indicates whether or not a person lives in a metropolitan area and if so, which part. I only include those living in metropolitan areas which include the following categories: central/principal city, not in central/principal city, and central/principal city status indeterminable (mixed). PWTYPE is another categorical variable similar to METRO for where the observed individual works. Again, I only include individuals working in metropolitan areas within the same categories as METRO. I combine METRO and PWTYPE using a new variable called HOME_TO_WORK_NAME to establish four different types of geographical travel patterns: "City to city," "City to suburbs," "Suburbs to city," and "Suburbs to suburbs." The first location indicates where the person lives while the second indicates where they work. The mixed category in METRO and PWTYPE was included within those who lived in the city and those who worked in the city respectively. A city's transportation system, including Boston's, often extends outside of the city itself into surrounding areas. So, these areas, have access to many of the transportation options available to those living within the city.
Finally, TRANWORK indicates which mode of transportation the observed individual uses to get from home to work while CARPOOL shows whether or not the person carpools to work. I combine a more filtered down version of TRANWORK and CARPOOL into TRANWORK_SM. All modes of transportation involving privately owned vehicles such as cars or vans and those who do not carpool are combined into the "Private vehicle" category in TRANWORK_SM. I create a separate category for "Carpool." All public transportation options are combined into the "Public transport" category. I also include categories for "Bicycle" and "Walk only."
All NA individuals with NA values in any of these categories were removed.
Below, I included a graph to show the distribution of observations across different incomes on the x-axis in different transportation types on the y-axis while differentiating between HOUSE_TO_WORK_NAME.
```{r, echo=FALSE}
table1 <- acs_data3_boston_metro %>% ggplot() + geom_jitter(aes(x = HHINCOME_10000, y = TRANWORK_SM, color = factor(HOUSE_TO_WORK_NAME))) + labs(title = "Figure 1: Observations Before Filtering HHINCOME_10000")
acs_data3_boston_metro <- acs_data3_boston_metro %>% filter(HHINCOME_10000 < 500)
table2 <- acs_data3_boston_metro %>% ggplot() + geom_jitter(aes(x = HHINCOME_10000, y = TRANWORK_SM, color = factor(HOUSE_TO_WORK_NAME))) + labs(title = "Figure 2: Observations After Filtering HHINCOME_10000")
table1
table2
```
Figure 1 shows a large gap between people with an HHINCOME_10000 value of aroudn 170 and those with a value of 1000. These points made up 965 of the observations. Due to the small amount of points compared to the total number of observations that lie around 1000 for HHINCOME_10000, I exluded these points from my final list of observations. Figure 2 then gives a much more understandable depiction of the observations.
```{r, echo=FALSE}
summary_stat <- data.frame(HHINCOME_10000 = acs_data3_boston_metro$HHINCOME_10000)
summary(summary_stat) %>% kable(caption = "Table 1: Summary Statistics for HHINCOME_10000")
```
```{r, include=FALSE}
table2 <- acs_data3_boston_metro %>% group_by(HOUSE_TO_WORK_NAME) %>% summarise(Total = n())
table3 <- acs_data3_boston_metro %>% group_by(TRANWORK_SM) %>% summarise(Total = n())
```
```{r, echo=FALSE}
table2 %>% kable(caption="Table 2: Count for HOUSE_TO_WORK_NAME")
table3 %>% kable(caption="Table 3: Count for TRANWORK_SM")
```
# Model and Hypothesis
I will use multinomial logistic regression where the dependent variable is TRANWORK_SM. The independent variables are HHINCOME_10000 and HOME_TO_WORK_NAME. I also included an interaction term between the two independent variables because I suspect that the impact of household income on transportation choice will differ depending on people's geographic travel patterns.
$$log(\pi_j/\pi_1) = \beta_0 + \beta_1HHINCOME\_10000 + \beta_2HOME\_TO\_WORK\_NAME +$$ $$\beta_3HHINCOME\_10000*HOME\_TO\_WORK\_NAME + \epsilon_i$$
$$\pi_1,\pi_2,\pi_3,\pi_4,\pi_5$$ = probability of Individual, probability of Carpool, probability of Public Transit, probability of Walking, probability of Biking
"Private vehicle" will be my baseline dependant variable for comparison since this is the most common transportation type in this metropolitan area. Since HOME_TO_WORK_NAME is a categorical variable, I will be treating "Suburb to suburb" as my reference category.
I hypothesize that as household income increases, I will see that the probability of using bikes or walking to work will increase when traveling within the city because wealthier people, who tend to work in wealthy areas of a city, can afford to live nearby. I suspect that biking and walking will have very low probabilities when looking at travel within the suburbs or between the suburbs and the city among all income groups. Due to more accessibility to bikes and walking, I suspect that the probability of using public transportation to get to work will decrease as income increases within the city.
It is worth noting that after dividing up HHINCOME_10000 into three income brackets (described in the Results section of this paper), I found that lower, middle and upper-income people tend to have 2.97, 2.49, and 2.81 private vehicles respectively. However, Among all travel patterns, I still suspect that the probability of carpooling will decrease and the probability of using private vehicles will increase are income increases. This is because higher-income people can afford services like car repair and gas more frequently. So, although many lower-income people do seem to have access to cars, they might be less willing to use them daily.
Additionally, I hypothesize that there is a higher probability for higher income brackets to use public transportation to move between the city and the suburbs due to the higher cost of using services such as the commuter rail as compared to the subway within the city.
# Results
```{r, include = FALSE}
acs_data3_boston_metro$TRANWORK_SM2 <- relevel(factor(acs_data3_boston_metro$TRANWORK_SM), ref = "Private vehicle")
acs_data3_boston_metro <- acs_data3_boston_metro %>% mutate(HOUSE_TO_WORK_NAME = HOUSE_TO_WORK_NAME %>% fct_relevel("Suburbs to suburbs"))
```
Below, I will report the log odds the result from the model to analyze the signs and significance.
```{r, include = FALSE}
cont_logit <- multinom(TRANWORK_SM2 ~ HHINCOME_10000 + HOUSE_TO_WORK_NAME + HHINCOME_10000*HOUSE_TO_WORK_NAME, data = acs_data3_boston_metro)
```
```{r, echo=FALSE}
tidy(cont_logit) %>% kable(caption="Table 4: Computed log odds from multinomial logistic regression")
```
It is interesting to note that all HHINCOME_10000 variables besides the one for "Walking Only" are insignificant. However, I still kept this variable in my model since I expected that as income grows, people are less likely to use other forms of transportation compared with private vehicles that are individually driven due to more access to these vehicles. Other insignificant variables include the category "Suburbs to city" using bicycle. However, the negative sign of the coefficient is expected since the relative odds of using a bike as opposed to a vehicle to travel from the suburbs to the city in reference to traveling just within the suburbs should be lower. However, if we look at the interaction term between Suburbs to city and HHINCOME_10000 for the bicycle, we see another insignificant coefficient with a positive sign indicating that the relative odds of using a bike in this situation increases as unexpected income increases.
We also see man insignificant variables stem from the Carpool option. The signs of the coefficients are expected, however. Carpooling may overall be preferred over using private vehicles when traveling from the suburbs to the city in reference to traveling through just the suburbs. For the interaction terms, it makes sense why carpooling will overall have lower relative odds of being picked over private vehicles as income increases due to increased access.
With the choice of public transit, we see that the interaction term between City to suburbs and household income is insignificant. However, the sign of the coefficient is not unexpected. One could reason that the less robust transportation systems in the suburbs may not be enough to get someone home from the city and thus, the relative odds of using a private vehicle is higher in reference to traveling just within the suburbs.
There are also many variables for the walking choice that are insignificant. Again, the signs seem to match my hypotheses and not seem unexpected. Besides City city traveling, it makes sense why the relative odds of walking in all other travel patterns would be lower than using a private vehicle in reference to traveling within the suburbs. The positive sign for the interaction term between income and City to suburbs is unexpected, however.
Since more of these insignificant variables have signs that make sense in the context of my hypothesis and the data, I will keep them in my model. Having insignificant variables is surprising to me overall due to the large sample size and relatively low standard error.
Since odds are often difficult to interpret, I instead calculated predicted probabilities on a new dataset I created. This dataset consists of data points for each whole value between 0 and 170 to represent 10000 dollar increases in household income for each increase in the value. Each of these discrete values is paired with every combination of TRANWORK_SM and HOUSE_TO_WORK_NAME.
Below, there is a chart that shows the averaged predicted probabilities for each value of the continuous variable HHINCOME_10000 within each category of HOUSE_TO_WORK_NAME.
Table 5:
```{r, echo=FALSE}
dwrite <- data.frame(HOUSE_TO_WORK_NAME = rep(c("City to city", "City to suburbs", "Suburbs to city", "Suburbs to suburbs"), each = 171), HHINCOME_10000 = rep(c(0:170), 4))
predicted_prob_hhincome <- cbind(dwrite, predict(cont_logit, newdata = dwrite, "probs"))
by(predicted_prob_hhincome[, 3:7], predicted_prob_hhincome$HOUSE_TO_WORK_NAME, colMeans)
```
Looking at the average household income may not be that useful, however. We can breakdown our predicted probabilities into lower (less than 30,000 dollars), middle (between 30,000 and 90,000 dollars), and upper (above 90,000 dollars) income brackets (Sommeilier).
Table 6: Lower Income Bracket
```{r, echo=FALSE}
dwrite_lower <- data.frame(HOUSE_TO_WORK_NAME = rep(c("City to city", "City to suburbs", "Suburbs to city", "Suburbs to suburbs"), each = 31), HHINCOME_10000 = rep(c(0:30), 4))
predicted_prob_hhincome_lower <- cbind(dwrite_lower, predict(cont_logit, newdata = dwrite_lower, "probs"))
by(predicted_prob_hhincome_lower[, 3:7], predicted_prob_hhincome_lower$HOUSE_TO_WORK_NAME, colMeans)
```
Table 7: Middle Income Bracket
```{r, echo=FALSE}
dwrite_middle <- data.frame(HOUSE_TO_WORK_NAME = rep(c("City to city", "City to suburbs", "Suburbs to city", "Suburbs to suburbs"), each = 61), HHINCOME_10000 = rep(c(30:90), 4))
predicted_prob_hhincome_middle <- cbind(dwrite_middle, predict(cont_logit, newdata = dwrite_middle, "probs"))
by(predicted_prob_hhincome_middle[, 3:7], predicted_prob_hhincome_middle$HOUSE_TO_WORK_NAME, colMeans)
```
Table 8: Upper Income Bracket
```{r}
dwrite_upper <- data.frame(HOUSE_TO_WORK_NAME = rep(c("City to city", "City to suburbs", "Suburbs to city", "Suburbs to suburbs"), each = 81), HHINCOME_10000 = rep(c(90:170), 4))
predicted_prob_hhincome_upper <- cbind(dwrite_upper, predict(cont_logit, newdata = dwrite_upper, "probs"))
by(predicted_prob_hhincome_upper[, 3:7], predicted_prob_hhincome_upper$HOUSE_TO_WORK_NAME, colMeans)
```
To analyze the overall trends among different income brackets, I will analze results from a visual representation of these predicted probabilities.
```{r, echo = FALSE}
lpp2 <- melt(predicted_prob_hhincome, id.vars = c("HOUSE_TO_WORK_NAME", "HHINCOME_10000"), value.name = "probability")
# head(lpp)
# predict(cont_logit, newdata = dwrite, "probs")
ggplot(lpp2, aes(x = HHINCOME_10000, y = probability, colour = HOUSE_TO_WORK_NAME)) + geom_line() + facet_grid(variable ~ ., scales = "free") + labs(title = "Figure 3: Predicted Probabilities Separated by TRANWORK_SM")
filtered_lpp2 <- lpp2 %>% filter(variable != "Private vehicle")
ggplot(lpp2, aes(x = HHINCOME_10000, y = probability, colour = variable)) + geom_line() + facet_grid(vars(HOUSE_TO_WORK_NAME), scales = "free") + labs(title = "Figure 4: Predicted Probabilities Separated by HOUSE_TO_WORK_NAME")
```
# Discussion and Conclusion
As discussed earlier, many of the coefficents in this model are insignificant but have the expected directions. Thus, I will be referring to Figures 3 and 4 to analyze the model across different income brackets rather than looking at specific predicted probabilities provided in Tables 5 through 8.
From Figure 4, we see the probability of using of private vehicles when traveling from the city to the suburbs and within the suburbs as income increases as expected.
However, it was unexpected that lower-income brackets seem to be using private vehicles more when traveling from the suburbs to the city or within the city itself.
What is even more interesting is the upper-income bracket's increased probability in using public transportation when traveling from the suburbs to the city compared to the lower and middle-income brackets in Figure 3. In Figure 4, we see that at a certain income level within the higher income bracket, people have a higher probability of using public transit over private vehicles while private vehicles remain the most likely to be used when traveling from the city to the suburbs. On the other hand, the increased probability of using public transportation within the city is expected among the upper-income bracket as explained in the Model and Hypothesis section. With further study, the low probability of using public transportation in the city or from the suburbs to the city among lower-incomes could potentially indicate a lack of accessibility to public transportation for lower incomes.
Walking seems to increase among higher income brackets within the city, potentially indicating increased access to industries and businesses due to home location. However, walking seems to be less preferred compared to individual vehicles and public transportation among higher income brackets.
Finally, as hypothesized, lower-income brackets have the highest probability of using carpooling and the probability decreases as household income increases. We see a less drastic dip in probability as income brackets increase for carpooling when looking at travel within the suburbs only which makes sense due to the lower accessibility to public transit, walking, and biking.
The expected trends for walking and biking further follow what past studies have suspected about the increased accessibility that the wealthy have to businesses within the city when living in it while poorer communities may have to take farther commutes for which biking or walking may be too strenuous (Rice). It could be interesting to further investigate where city biking services are more prominent to see whether they are mainly targeted toward wealthier neighborhoods to promote the notion of sustainability. One could conduct a similar study to look at walkway renovations to study in which neighborhoods walking has become more accessible overtime.
I would also be interested in investigating public transit accessibility in terms of location and price. The increased probability of using public transportation over private vehicles among the wealthy while private vehicles remain the main source of transit among lower classes could be explained by a lack of public transportation services among poorer suburbs and/or lack of affordability.
It should be noted, however, that the probabilities among the lower income brackets may be deflated since they make up a smaller proportion of the population.
It also seems as if increased income may not have as large of an impact on the use of private vehicles. Although having a larger income indeed means that one can afford to use their vehicles daily, public transportation into and in the city and more sustainable forms of transportation such as walking and biking may be more attractive and perhaps more convenient.
This last point brings up another important limitation in this study. I do not establish a metric by which I can judge how people make decisions when it comes to using one transportation type over the other. It could be assumed that each individual is making the "best" choice available to them due to factors such as finances and convenience. However, this is not always the case. It would not be surprising that public transportation simply is not an option in certain poor areas. Thus, even if for some individuals, public transit is their "best" option, they simply would not be able to use it.
To try to combat this issue, a future study could potentially create an "accessibility index" to evaluate the accessibility to "affordable" and "convenient" transportation types in different communities. This index could be compared with similar graphs and charts shown in this study to infer why might people be making the choices they are and whether or not their choice is truly the "best" one available to them.
# Bibliography
Lima, Jassica, et al. “Income vs. Travel Time: Why Do the Poorest and Richest Travel Fastest in Northeastern Brazil?” ScienceDirect, 10 July 2016.
Dargay, Joyce, and Jos Van Ommeren. “The Effect of Income on Commuting Time - an Analysis Based on Panel Data.” ECONSTOR, 23 Aug. 2005.
Sommeiller, Estelle, and Mark Price. “The New Gilded Age Income Inequality in the U.S. by State, Metropolitan Area, and County.” Economic Policy Institute, 19 July 2018.
Rice, Jennifer L., et al. “CONTRADICTIONS OF THE CLIMATE-FRIENDLY CITY: New Perspectives on Eco-Gentrification and Housing Justice.” INTERNATIONAL JOURNAL OF URBAN AND REGIONAL RESEARCH.