Skip to content

QueensleyC/my-udacity-python-project

Repository files navigation

Udacity Programming for DataScience Nanodegree Program

Explore Us Bikeshare Data.

Project Details

This project involves exploring and analyzing a CSV dataset on bike sharing from three different cities in the United States using Python. The analysis is conducted based on user inputs, creating an interactive experience for the audience.

The Datasets

The datasets provided by Motivate, a bike share system provider in the US, contains randomly selected data for the first six months of 2017 for all three cities (Chicago, New York City, Washington). All three of the data files contain the same core six (6) columns:

  • Start Time (e.g., 2017-01-01 00:07:57)
  • End Time (e.g., 2017-01-01 00:20:53)
  • Trip Duration (in seconds - e.g., 776)
  • Start Station (e.g., Broadway & Barry Ave)
  • End Station (e.g., Sedgwick St & North Ave)
  • User Type (Subscriber or Customer)

The Chicago and New York City files also have the following two columns:

  • Gender
  • Birth Year

Data for the first 10 rides in the new_york_city.csv file Close

The original files are much larger and messier, hence they are not used on the project. they can be accessed here (Chicago, New York City, Washington). These files had more columns and they differed in format in many cases. Some data wrangling has been performed to condense these files to the above core six columns.

Computed Statistics Within the Project

  1. Popular times of travel (i.e., occurs most often in the start time)

    • most common month
    • most common day of week
    • most common hour of day
  2. Popular stations and trip

    • most common start station
    • most common end station
    • most common trip from start to end (i.e., most frequent combination of start station and end station)
  3. Trip duration

    • total travel time
    • average travel time
  4. User info

    • counts of each user type
    • counts of each gender (only available for NYC and Chicago)
    • earliest, most recent, most common year of birth (only available for NYC and Chicago)
  5. Graphs

    • Distribution of birth year
    • Count plot of gender
    • Count plot of user type
    • Top 10 most used start stations
    • Top 10 most used end stations

Built With

  • Python
  • Pandas
  • Numpy
  • plotly.express
  • Dash
  • dash_iconify
  • dash_bootstrap_components
  • dash_mantine_components
  • Visual Studio (IDE)

Hosting

Render's Web Service (free tier) is used to host the web version of this project

Used Files

  • chicago.csv
  • new_york_city.csv
  • washington.csv

Good to Know

bikeshare.py creates an interactive experience in the terminal while app.py displays a link in the terminal that creates a web app when clicked (ctrl + click) on

requirements.txt is the file needed for the hosting service render. It contains the libraries used in the making of app.py

App Demo

demo.mp4

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages