-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Sidewalk Data Quality #1
Comments
Did manual cleaning of widths data. Example errors:
Used the sf package in R to convert SWK_WIDTH column from text to numeric:
|
O instead of 0 is a major facepalm! Great stuff |
@aseemdeodhar can you install and run https://github.com/pandas-profiling/pandas-profiling on the sidewalk datafile and upload the resulting html here? Thanks! |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
The sidewalks width field is a text field (!) and has to be cleaned up. The vast majority of rows are clean, with only 20 out of ~24000 rows having undecipherable text values. Some entries with widths in the 90s (widths are in feet) seem to be erroneous entries where a decimal point is missing.
The text was updated successfully, but these errors were encountered: