jolaya80/Bios259_Project_structure_Practice

Gapminder Project: Reproducible workflow with Pixi

Task description

This project uses the Gapminder dataset to explore global development indicators like life expectancy, GDP per capita, and population.

The workflow includes:

  • Data preprocessing
  • Analysis by continent
  • Visualization of results

Setup Instructions

1. Create a reproducible research project using cookiecutter:

Install cookiecutter and generate the project from the template:

mamba install cookiecutter jinja2-time
cookiecutter gh:bios259/cookie-for-reproducible-research

For more details on installing cookiecutter and creating a project, see the template's GitHub repo: https://github.com/bios259/cookie-for-reproducible-research

2. Copy the necessary files from this repo:

Copy the scripts, data, and pixi.toml from this repo into the generated project so that it can be run as a Pixi project.

Check the pixi.toml file for more details.

3. Install Pixi packages and run the Pixi tasks:

  • Install the Pixi dependencies after copying pixi.toml:
pixi install
  • Preprocess the data:
pixi run preprocess
  • Analyze the data:
pixi run analyze
  • Visualize the results:
pixi run visualize
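
The three tasks above depend on each other's outputs, so it can help to run them in order and stop at the first failure. The following is a minimal sketch, not part of this repo: the function name run_pipeline is hypothetical, and it assumes the task names preprocess, analyze, and visualize are defined in pixi.toml as used in the commands above.

```shell
#!/usr/bin/env bash
# Sketch: run the Pixi tasks in workflow order, stopping at the first failure.
# run_pipeline is a hypothetical helper; the task names are taken from the
# commands in this README and are assumed to be defined in pixi.toml.

run_pipeline() {
    local task
    for task in preprocess analyze visualize; do
        echo "==> pixi run ${task}"
        # Abort as soon as one stage fails so later stages never run on
        # missing or stale inputs.
        pixi run "${task}" || {
            echo "Task '${task}' failed" >&2
            return 1
        }
    done
}
```

After sourcing the file from the project root (and running pixi install once), calling run_pipeline executes all three stages in sequence.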

4. Run with Docker:

Build the Docker image and run the container, mounting the current directory at /app:

docker build -t gapminder-project .
docker run -v "$(pwd)":/app -it gapminder-project

Results

  • Cleaned Data: data/processed/cleaned_gapminder.csv

  • Summary Table: results/tables/continent_summary.csv

  • Figures:

    • Life Expectancy: results/figures/life_expectancy_over_time.png
    • GDP per Capita: results/figures/gdp_per_capita_distribution.png
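
To confirm that a run produced everything listed above, a small check like the following can be used. This is a sketch under assumptions: check_outputs is a hypothetical helper that is not part of this repo, and it only tests that each file exists and is non-empty, not that its contents are correct.

```shell
#!/usr/bin/env bash
# Sketch: verify that the expected result files exist and are non-empty.
# check_outputs is a hypothetical helper; the paths are the ones listed
# under Results. The optional argument is the project root (default: ".").

check_outputs() {
    local root="${1:-.}"
    local missing=0
    local path
    for path in \
        data/processed/cleaned_gapminder.csv \
        results/tables/continent_summary.csv \
        results/figures/life_expectancy_over_time.png \
        results/figures/gdp_per_capita_distribution.png
    do
        # -s is true only if the file exists and has a size greater than zero.
        if [ -s "${root}/${path}" ]; then
            echo "ok       ${path}"
        else
            echo "missing  ${path}" >&2
            missing=1
        fi
    done
    # Non-zero exit status if at least one file is missing or empty.
    return "${missing}"
}
```

Calling check_outputs from the project root after the visualize task returns a non-zero status if any of the four files is absent, which makes it usable as a final step in a script or CI job.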