From 2722ca35062d5ab60e89c5eb7156f380ffc7ccab Mon Sep 17 00:00:00 2001 From: Nissim Lebovits <111617674+nlebovits@users.noreply.github.com> Date: Wed, 13 Dec 2023 22:08:05 -0500 Subject: [PATCH] Built site for gh-pages --- .nojekyll | 2 +- final.html | 6479 ++++++++++++++++++++++++++++++++++++++++++++++++++-- index.html | 6479 ++++++++++++++++++++++++++++++++++++++++++++++++++-- 3 files changed, 12587 insertions(+), 373 deletions(-) diff --git a/.nojekyll b/.nojekyll index d5adabd..da9d188 100644 --- a/.nojekyll +++ b/.nojekyll @@ -1 +1 @@ -ef6deae7 \ No newline at end of file +d1a4c137 \ No newline at end of file diff --git a/final.html b/final.html index 535b7b9..adbfc4c 100644 --- a/final.html +++ b/final.html @@ -2979,6 +2979,6152 @@ + + + + + + + + + + + + + + + + + + + + @@ -3053,7 +9199,7 @@

SmartZoning® Documentation

This model and web application prototype were developed for MUSA508, a Master of Urban Spatial Analytics class focused on predictive public policy analytics at the University of Pennsylvania.

1 Background

-

Growth is critical for a city to continue to densify and modernize. The benefits of growth range from increased public transit use to updating the built environment to be more climate resilient. Growth fuels development and vice versa. Philadelphia is 6th largest city in the US, yet ranks [42nd in cost of living] (https://www.axios.com/2023/11/09/lowest-highest-cost-of-living-cities-us-map), and so growth is often met with concern. Many residents and preservationists ask: Will growth deteriorate the city’s best features? Will modernization make the city unaffordable to longtime residents?

+

Growth is critical for a city to continue to densify and modernize. The benefits of growth range from increased public transit use to updating the built environment to be more climate resilient. Growth fuels development and vice versa. Philadelphia is 6th largest city in the US, yet ranks 42nd in cost of living, and so growth is often met with concern. Many residents and preservationists ask: Will growth deteriorate the city’s best features? Will modernization make the city unaffordable to longtime residents?

Balancing growth with affordability is a precarious task for Philadelphia. To date, politicians favor making exceptions for developers parcel-by-parcel rather than championing a citywide smart growth strategy. Zoning advocates need better data-driven tools to broadcast the benefits of a smart growth approach, a planning framework that aims to maximize walkability and transit use to avoid sprawl, that also demonstrates how parcel-by-parcel, or spot zoning, creates unmet development pressure that can drive costs. Designed to support smart growth advocacy, SmartZoning is a prototype web tool that identifies parcels under development pressure with conflicting zoning. Users can strategically leverage the tool to promote proactive upzoning of high-priority parcels, aligning current zoning more closely with anticipated development. This approach aims to foster affordable housing in Philadelphia, addressing one of the city’s most pressing challenges.

Smart Growth meets SmartZoning®

@@ -3113,14 +9259,14 @@

-
- +
+

3.2 Feature Engineering by Time and Space

-

To better understand the relationship between time-space lag and permit count,… Notably…

+

New construction exhibits sizable spatial and temporal autocorrelation. In other words, there is a strong relationship between the number of permits in a given block group and the number of permits in neighboring block groups; as well as between the number of permits issued in a block group in a given year and the number of permits issued in that same block group in the previous year. To account for these relationships, we engineer new features, including both space and time lags. We note that all of these engineered features have strong correlation coefficients with our dependent variable, permits_count, and p-values indicating that these relationships are statistically significant.

@@ -3159,6 +9305,8 @@

4.1.2 VIF

To ensure that our predictive model does not have multicollinearity, or multiple values telling the same story about permit counts, we use the VIF test. The table below lists each variables’s VIF score. Variables that have over a 5 are considered to potentially have some multicollinearity, and those over 10 certainly need to be flagged. Generally the council district and zoning overlays such as historic districts may be conflicting.

+

Based on VIF, we drop:

+

hist_dist_na 33.120644 district8 32.900255 district4 32.164688 district9 30.834357 district5 29.393431 district3 29.092806 district2 28.580678 district1 27.199582 district7 26.741373 hist_dist_historic_street_paving_thematic_district 26.735386 district6 19.912382 overlay_fne 15.601603 overlay_ne 11.179327 overlay_nis 7.717070 overlay_ndo 6.867713 overlay_fdo 6.574022 overlay_edo 5.595256 overlay_vdo 5.400210

@@ -3607,7 +9755,7 @@

-

+

@@ -3620,7 +9768,7 @@

To to identify spatial clusters, or hotspots, in geographic data, we performed a Local Moran’s I test. It assesses the degree of spatial autocorrelation, which is the extent to which the permit counts in a block group tend to be similar to neighboring block group. We used a p-value of 0.1 as our hotspot threshold.

-

+

Emergeging hotspots…? If I can get it to work.

@@ -3635,19 +9783,19 @@

-

+

-

+

Our OLS model exhibits a Mean Absolute Error (MAE) of 2.66, a decent performance for a model of its simplicity. However, its efficacy is notably diminished in critical domains where optimization is imperative. Consequently, we intend to enhance the predictive capacity by incorporating more pertinent variables and employing a more sophisticated modeling approach.

-

+

-

We find that our OLS model has an MAE of only MAE: 2.66–not bad for such a simple model! Still, it struggles most in the areas where we most need it to succeed, so we will try to introduce better variables and apply a more complex model to improve our predictions.

+

We find that our OLS model has an MAE of only MAE: 2.68–not bad for such a simple model! Still, it struggles most in the areas where we most need it to succeed, so we will try to introduce better variables and apply a more complex model to improve our predictions.

4.3.2 Random Forest

@@ -3655,16 +9803,16 @@

Compared to the OLS model, the relationship between predicted vs actual permits…

-

+

-

+

Compared to the OLS Model, the Random Forest Model has a similar error distribution however, it exhibits a MAE of….

-

+

@@ -3679,16 +9827,16 @@

Having settled on our model features and tuning, we now validate on 2023 data.

-

+

-

+

-

+

-

We return an MAE of MAE: 2.2.

+

We return an MAE of MAE: 2.19.

6 Discussion

@@ -3697,7 +9845,7 @@

-

+

@@ -3706,20 +9854,20 @@

The constructed boxplot, categorizing observations based on racial composition, indicates that the random forest model generalizes effectively, showcasing consistent and relatively low absolute errors across majority non-white and majority white categories. The discernible similarity in error distributions suggests that the model’s predictive performance remains robust across diverse racial compositions, affirming its ability to generalize successfully.

-

+

We find that error is not related to affordability and actually trends downward with percent nonwhite. (This is probably because there is less total development happening there in majority-minority neighborhoods to begin with, so the magnitude of error is less, even though proportionally it might be more.) Error increases slightly with total pop. This makes sense–more people –> more development.

Our analysis reveals that the error is not correlated with affordability and demonstrates a downward trend in conjunction with the percentage of the nonwhite population. This observed pattern may be attributed to the likelihood that majority-minority neighborhoods experience a comparatively lower volume of overall development, thereby diminishing the absolute magnitude of error, despite potential proportional increases. Additionally, there is a slight increase in error with the total population, aligning with the intuitive expectation that higher population figures correspond to more extensive development activities.

-

+

-

How does this generalize across council districts? Don’t forget to refactor

+

How does this generalize across council districts?

-

+

@@ -3730,15 +9878,15 @@

-

+

We can extract development predictions at the block level to these parcels and then visualize them by highest need.

-
- +
+

Furthermore, we can identify properties with high potential for assemblage, which suggests the ability to accomodate high-density, multi-unit housing.

@@ -3759,377 +9907,336 @@

868 -27.06613 +27.94353 3 1615 ICMX 1548 -27.06613 +27.94353 3 2736 IRMX 1587 -27.06613 +27.94353 3 2804 IRMX 3420 -27.06613 +27.94353 3 6405 RSA5 4667 -27.06613 +27.94353 3 9661 RSA5 9169 -27.06613 +27.94353 4 20073 ICMX -1768 -22.24860 -3 -3128 -IRMX - - -3640 -22.24860 -3 -6901 -ICMX - - 7517 -21.08143 +22.04183 3 16717 RSA5 -3934 -20.84390 +1768 +21.73393 3 -7646 -ICMX +3128 +IRMX -12326 -20.84390 -4 -25776 -RSA5 +3640 +21.73393 +3 +6901 +ICMX 4957 -20.67827 +21.68943 3 10410 ICMX 4958 -20.67827 +21.68943 3 10411 RSA5 4959 -20.67827 +21.68943 3 10412 ICMX 5245 -20.67827 +21.68943 3 11160 RSA5 -4460 -17.31087 +3934 +17.81977 3 -9093 -RSA5 +7646 +ICMX -7726 -15.16207 -3 -17168 -ICMX +12326 +17.81977 +4 +25776 +RSA5 13578 -15.12210 +16.39950 3 27869 IRMX +4460 +16.21347 +3 +9093 +RSA5 + + +7726 +16.00803 +3 +17168 +ICMX + + 5088 -15.00973 +14.95410 3 10759 IRMX 4512 -14.95163 +14.36920 5 9243 IRMX 6014 -14.95163 +14.36920 6 13057 ICMX 3041 -12.82333 +14.17370 3 5568 ICMX 9842 -12.82333 +14.17370 3 21369 RSA5 9843 -12.82333 +14.17370 3 21370 ICMX 9845 -12.82333 +14.17370 3 21372 RSA5 -7833 -12.25843 -3 -17408 -RSA5 - - -3957 -11.49060 -3 -7704 -IRMX - - 6645 -11.22550 +12.52903 3 14648 ICMX 7280 -11.22550 +12.52903 3 16179 RSA5 9912 -11.22550 +12.52903 3 21527 ICMX +7833 +11.57457 +3 +17408 +RSA5 + + +3957 +11.13063 +3 +7704 +IRMX + + 2138 -10.88253 +11.07510 4 3744 IRMX 8143 -10.71940 +11.04757 3 18031 RSD3 8656 -10.71940 +11.04757 3 19076 RSA3 9409 -10.71940 +11.04757 4 20534 RSA2 10175 -10.71940 +11.04757 3 22002 RSD1 12605 -10.71940 +11.04757 3 26247 RSD1 4146 -10.39053 +10.92453 3 8265 IRMX 5108 -10.39053 +10.92453 4 10795 IRMX + +1536 +10.63863 +4 +2715 +ICMX + + +2422 +10.63863 +5 +4284 +IRMX + + +2941 +10.63863 +4 +5351 +RSA5 + + +10490.1 +10.63863 +3 +22527 +I3 + + +10810 +10.63863 +5 +23106 +ICMX + + +11135.1 +10.63863 +8 +23678 +I3 + -

8 2024 Predictions

-

+

9 Web Application

+

10 Next Steps

11 Appendices

-

- - - -

- - - - - - - -
diff --git a/index.html b/index.html index 535b7b9..adbfc4c 100644 --- a/index.html +++ b/index.html @@ -2979,6 +2979,6152 @@ + + + + + + + + + + + + + + + + + + + + @@ -3053,7 +9199,7 @@

SmartZoning® Documentation

This model and web application prototype were developed for MUSA508, a Master of Urban Spatial Analytics class focused on predictive public policy analytics at the University of Pennsylvania.

1 Background

-

Growth is critical for a city to continue to densify and modernize. The benefits of growth range from increased public transit use to updating the built environment to be more climate resilient. Growth fuels development and vice versa. Philadelphia is 6th largest city in the US, yet ranks [42nd in cost of living] (https://www.axios.com/2023/11/09/lowest-highest-cost-of-living-cities-us-map), and so growth is often met with concern. Many residents and preservationists ask: Will growth deteriorate the city’s best features? Will modernization make the city unaffordable to longtime residents?

+

Growth is critical for a city to continue to densify and modernize. The benefits of growth range from increased public transit use to updating the built environment to be more climate resilient. Growth fuels development and vice versa. Philadelphia is 6th largest city in the US, yet ranks 42nd in cost of living, and so growth is often met with concern. Many residents and preservationists ask: Will growth deteriorate the city’s best features? Will modernization make the city unaffordable to longtime residents?

Balancing growth with affordability is a precarious task for Philadelphia. To date, politicians favor making exceptions for developers parcel-by-parcel rather than championing a citywide smart growth strategy. Zoning advocates need better data-driven tools to broadcast the benefits of a smart growth approach, a planning framework that aims to maximize walkability and transit use to avoid sprawl, that also demonstrates how parcel-by-parcel, or spot zoning, creates unmet development pressure that can drive costs. Designed to support smart growth advocacy, SmartZoning is a prototype web tool that identifies parcels under development pressure with conflicting zoning. Users can strategically leverage the tool to promote proactive upzoning of high-priority parcels, aligning current zoning more closely with anticipated development. This approach aims to foster affordable housing in Philadelphia, addressing one of the city’s most pressing challenges.

Smart Growth meets SmartZoning®

@@ -3113,14 +9259,14 @@

-
- +
+

3.2 Feature Engineering by Time and Space

-

To better understand the relationship between time-space lag and permit count,… Notably…

+

New construction exhibits sizable spatial and temporal autocorrelation. In other words, there is a strong relationship between the number of permits in a given block group and the number of permits in neighboring block groups; as well as between the number of permits issued in a block group in a given year and the number of permits issued in that same block group in the previous year. To account for these relationships, we engineer new features, including both space and time lags. We note that all of these engineered features have strong correlation coefficients with our dependent variable, permits_count, and p-values indicating that these relationships are statistically significant.

@@ -3159,6 +9305,8 @@

4.1.2 VIF

To ensure that our predictive model does not have multicollinearity, or multiple values telling the same story about permit counts, we use the VIF test. The table below lists each variables’s VIF score. Variables that have over a 5 are considered to potentially have some multicollinearity, and those over 10 certainly need to be flagged. Generally the council district and zoning overlays such as historic districts may be conflicting.

+

Based on VIF, we drop:

+

hist_dist_na 33.120644 district8 32.900255 district4 32.164688 district9 30.834357 district5 29.393431 district3 29.092806 district2 28.580678 district1 27.199582 district7 26.741373 hist_dist_historic_street_paving_thematic_district 26.735386 district6 19.912382 overlay_fne 15.601603 overlay_ne 11.179327 overlay_nis 7.717070 overlay_ndo 6.867713 overlay_fdo 6.574022 overlay_edo 5.595256 overlay_vdo 5.400210

@@ -3607,7 +9755,7 @@

-

+

@@ -3620,7 +9768,7 @@

To to identify spatial clusters, or hotspots, in geographic data, we performed a Local Moran’s I test. It assesses the degree of spatial autocorrelation, which is the extent to which the permit counts in a block group tend to be similar to neighboring block group. We used a p-value of 0.1 as our hotspot threshold.

-

+

Emergeging hotspots…? If I can get it to work.

@@ -3635,19 +9783,19 @@

-

+

-

+

Our OLS model exhibits a Mean Absolute Error (MAE) of 2.66, a decent performance for a model of its simplicity. However, its efficacy is notably diminished in critical domains where optimization is imperative. Consequently, we intend to enhance the predictive capacity by incorporating more pertinent variables and employing a more sophisticated modeling approach.

-

+

-

We find that our OLS model has an MAE of only MAE: 2.66–not bad for such a simple model! Still, it struggles most in the areas where we most need it to succeed, so we will try to introduce better variables and apply a more complex model to improve our predictions.

+

We find that our OLS model has an MAE of only MAE: 2.68–not bad for such a simple model! Still, it struggles most in the areas where we most need it to succeed, so we will try to introduce better variables and apply a more complex model to improve our predictions.

4.3.2 Random Forest

@@ -3655,16 +9803,16 @@

Compared to the OLS model, the relationship between predicted vs actual permits…

-

+

-

+

Compared to the OLS Model, the Random Forest Model has a similar error distribution however, it exhibits a MAE of….

-

+

@@ -3679,16 +9827,16 @@

Having settled on our model features and tuning, we now validate on 2023 data.

-

+

-

+

-

+

-

We return an MAE of MAE: 2.2.

+

We return an MAE of MAE: 2.19.

6 Discussion

@@ -3697,7 +9845,7 @@

-

+

@@ -3706,20 +9854,20 @@

The constructed boxplot, categorizing observations based on racial composition, indicates that the random forest model generalizes effectively, showcasing consistent and relatively low absolute errors across majority non-white and majority white categories. The discernible similarity in error distributions suggests that the model’s predictive performance remains robust across diverse racial compositions, affirming its ability to generalize successfully.

-

+

We find that error is not related to affordability and actually trends downward with percent nonwhite. (This is probably because there is less total development happening there in majority-minority neighborhoods to begin with, so the magnitude of error is less, even though proportionally it might be more.) Error increases slightly with total pop. This makes sense–more people –> more development.

Our analysis reveals that the error is not correlated with affordability and demonstrates a downward trend in conjunction with the percentage of the nonwhite population. This observed pattern may be attributed to the likelihood that majority-minority neighborhoods experience a comparatively lower volume of overall development, thereby diminishing the absolute magnitude of error, despite potential proportional increases. Additionally, there is a slight increase in error with the total population, aligning with the intuitive expectation that higher population figures correspond to more extensive development activities.

-

+

-

How does this generalize across council districts? Don’t forget to refactor

+

How does this generalize across council districts?

-

+

@@ -3730,15 +9878,15 @@

-

+

We can extract development predictions at the block level to these parcels and then visualize them by highest need.

-
- +
+

Furthermore, we can identify properties with high potential for assemblage, which suggests the ability to accomodate high-density, multi-unit housing.

@@ -3759,377 +9907,336 @@

868 -27.06613 +27.94353 3 1615 ICMX 1548 -27.06613 +27.94353 3 2736 IRMX 1587 -27.06613 +27.94353 3 2804 IRMX 3420 -27.06613 +27.94353 3 6405 RSA5 4667 -27.06613 +27.94353 3 9661 RSA5 9169 -27.06613 +27.94353 4 20073 ICMX -1768 -22.24860 -3 -3128 -IRMX - - -3640 -22.24860 -3 -6901 -ICMX - - 7517 -21.08143 +22.04183 3 16717 RSA5 -3934 -20.84390 +1768 +21.73393 3 -7646 -ICMX +3128 +IRMX -12326 -20.84390 -4 -25776 -RSA5 +3640 +21.73393 +3 +6901 +ICMX 4957 -20.67827 +21.68943 3 10410 ICMX 4958 -20.67827 +21.68943 3 10411 RSA5 4959 -20.67827 +21.68943 3 10412 ICMX 5245 -20.67827 +21.68943 3 11160 RSA5 -4460 -17.31087 +3934 +17.81977 3 -9093 -RSA5 +7646 +ICMX -7726 -15.16207 -3 -17168 -ICMX +12326 +17.81977 +4 +25776 +RSA5 13578 -15.12210 +16.39950 3 27869 IRMX +4460 +16.21347 +3 +9093 +RSA5 + + +7726 +16.00803 +3 +17168 +ICMX + + 5088 -15.00973 +14.95410 3 10759 IRMX 4512 -14.95163 +14.36920 5 9243 IRMX 6014 -14.95163 +14.36920 6 13057 ICMX 3041 -12.82333 +14.17370 3 5568 ICMX 9842 -12.82333 +14.17370 3 21369 RSA5 9843 -12.82333 +14.17370 3 21370 ICMX 9845 -12.82333 +14.17370 3 21372 RSA5 -7833 -12.25843 -3 -17408 -RSA5 - - -3957 -11.49060 -3 -7704 -IRMX - - 6645 -11.22550 +12.52903 3 14648 ICMX 7280 -11.22550 +12.52903 3 16179 RSA5 9912 -11.22550 +12.52903 3 21527 ICMX +7833 +11.57457 +3 +17408 +RSA5 + + +3957 +11.13063 +3 +7704 +IRMX + + 2138 -10.88253 +11.07510 4 3744 IRMX 8143 -10.71940 +11.04757 3 18031 RSD3 8656 -10.71940 +11.04757 3 19076 RSA3 9409 -10.71940 +11.04757 4 20534 RSA2 10175 -10.71940 +11.04757 3 22002 RSD1 12605 -10.71940 +11.04757 3 26247 RSD1 4146 -10.39053 +10.92453 3 8265 IRMX 5108 -10.39053 +10.92453 4 10795 IRMX + +1536 +10.63863 +4 +2715 +ICMX + + +2422 +10.63863 +5 +4284 +IRMX + + +2941 +10.63863 +4 +5351 +RSA5 + + +10490.1 +10.63863 +3 +22527 +I3 + + +10810 +10.63863 +5 +23106 +ICMX + + +11135.1 +10.63863 +8 +23678 +I3 + -

8 2024 Predictions

-

+

9 Web Application

+

10 Next Steps

11 Appendices

-

- - - -

- - - - - - - -