Skip to content

Latest commit

 

History

History
38 lines (21 loc) · 3.14 KB

README.md

File metadata and controls

38 lines (21 loc) · 3.14 KB

Strava Data Tools

Data tools created to investigate how much information is gleanable from Strava. Investigating results from our survey on sentiment and understanding of Privacy in social media spaces related to fitness.

Contains functions to manipulate, visualize, and analyze GPS Data from a Strava Data Dump.

Strava contains Privacy Features to help users hide addresses associated with them (such as home or office addresses), while still being able to participate in sharing routes and interesting activities. While disabling maps on activities prevents any chance of leaked location information, it undermines much of the purpose of fitness applications like Strava, the ability to compete on shared segments, share routes with friends, and post accomplishments.

One notable feature intended to alleviate this concern is the Privacy Zone. This disables location sharing around a certain radius of a marked location. Unfortunately, given enough data, this feature can be easily defeated.

Data

In order to run this code against your own Strava data, request your archive and run the desired programs in the folder that is emailed to you.

Example of Mapped Start Locations

Starting Locations

Example of Start Locations with a Privacy Zone

Starting Locations (Obscured by Privacy Zone)

Determining the Center of a Privacy Zone

Deduced Start Location

Determining Start Locations Inside Offset Privacy Zone

![Starting Locations in Offset Zone](Heat Map.png)

Requires plotly, fitparse, numpy, and pandas.

To extract the compressed activity files, use python 3 ./strava_extract.py -e NOTE that due that some functions involved still use relative paths, so running this from within the working directory containing the data provided by Strava is REQUIRED (I know that's bad, on the TODO list for fixes.)

To extract only the start points from a folder of activity files, use python3 ./strava_extract.py without the -e flag. This will not extract .FIT files, only read from them. The optional -p parameters allows you to load in a .CSV file containing privacy zones, but this requires a google maps token, the file location of which is accessed with the -g flag.

To display the processed points on an interactive map use ./strava_map.py -i <input_file.csv>, which is a CSV file containing coordinates to starting points of activities (optionally considering privacy zones). You can configure certain maps to overlay onto. The default map requires a MapBox token. If no such token is available, use -l open-street-map as your layer.

Note: As MapBox is required for certain visualizations, a mapbox token will be required for these functions.

Note: A Google Maps API token is also required to analyze privacy zones. Strava stores Privacy Zones using text addresses, so we send those to Google Maps to receive coordinates.