This is a joint project created by Heidi, Lotta and Tatu for the course Introduction to Data Science at the University of Helsinki.
Our task was to come up with an inspiring idea to get familiar with different aspects of data science. We decided to create HealthNotebook, which analyses data given by the user and recommends ways to improve sleep. We decided to create an interactive Jupyter notebook since the time was limited. This could be further developed into a web and/or mobile application.
Used datasets:
- Clone this repository to your computer
git clone https://github.com/heksaani/DataScienceProject.git
cd DataScienceProject
- If you wish, you can create a virtual environment (and install
pip
) and then install the required packages using the requirements.txt file
conda create --name ds-project
conda activate ds-project
conda install pip
pip install -r requirements.txt
- Launch the Jupyter notebook and open this project in your browser.
jupyter notebook
When using the notebook you can run the cells by pressing the play button or by pressing shift + enter. We recommend that you run the cells one by one and fill in the questions. Also, remember to push the button 'Submit' after filling in the question boxes.
NOTE: if you are having trouble loading the regression models with pickle
, try running the following files to recreate the models: analysis/linear_regression_covid19_dataset.ipynb
and analysis/linear_regression_uhs_dataset.ipynb
.