python_ml_project_template

This is a template for a Python Machine Learning project with the following features:

Weights and Biases support, for experiment tracking and visualization
Hydra support, for configuration management
Pytorch Lightning support, for training and logging

In addition, it contains all the good features from the original version of this repository (and is a proper Python package):

Installable via pip install. Anyone can point directly to this Github repository and install your project, either as a regular dependency or as an editable one.
Uses the new PEP 518, officially-recommended pyproject.toml structure for defining project structure and dependencies (instead of requirements.txt)
Nice, static documentation website support, using mkdocs-material. Structure can be found in docs/
black support by default, which is an opinionated code formatting tool
pytest support, which will automatically run tests found in the tests/ directory
mypy support, for optional typechecking of type hints
pre-commit support, which runs various formatting modifiers on commit to clean up your dirty dirty code automatically.
Github Actions support, which runs the following:
- On a Pull Request: install dependencies, run style checks, run Python tests
- After merge: same a Pull Request, but also deploy the docs site to the projects Github Pages URL!!!!

All that needs doing is replacing all occurances of python_ml_project_template and python-ml-project-template with the name of your package(including the folder src/python_ml_project_template), the rest should work out of the box!

Installation

First, we'll need to install platform-specific dependencies for Pytorch. See here for more details. For example, if we want to use CUDA 11.8 with Pytorch 2.

pip install torch==2.0.1 torchvision==0.15.2 --index-url https://download.pytorch.org/whl/cu118/

Then, we can install the package itself:

pip install -e ".[develop,notebook]"

Then we install pre-commit hooks:

pre-commit install

Docker

To build the docker image, run:

docker build -t <my_dockerhub_username>/python-ml-project-template .

To run the training script locally, run:

WANDB_API_KEY=<API_KEY>
# Optional: mount current directory to run / test new code.
# Mount data directory to access data.
docker run \
    -v $(pwd)/data:/opt/baeisner/data \
    -v $(pwd)/logs:/opt/baeisner/logs \
    --gpus all \
    -e WANDB_API_KEY=$WANDB_API_KEY \
    -e WANDB_DOCKER_IMAGE=python-ml-project-template \
    python-ml-project-template python scripts/train.py \
        dataset.data_dir=/root/data \
        log_dir=/root/logs

To push this:

docker push <my_dockerhub_username>/python-ml-project-template:latest

Using the CI.

Set up pushing to docker:

Put the following secrets in the Github repository:

DOCKERHUB_USERNAME: Your Dockerhub username
DOCKERHUB_TOKEN: Your Dockerhub token

You'll also need to Ctrl-F replace instances of beisner and baeisner with appropriate usernames.

Running on Clusters

Autobot

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.github/workflows		.github/workflows
.vscode		.vscode
cluster		cluster
configs		configs
docs		docs
notebooks		notebooks
scripts		scripts
src/python_ml_project_template		src/python_ml_project_template
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
README.md		README.md
autobot.md		autobot.md
pyproject.toml		pyproject.toml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

python_ml_project_template

Installation

Docker

Using the CI.

Running on Clusters

About

Releases

Packages

Languages

r-pad/python_ml_project_template

Folders and files

Latest commit

History

Repository files navigation

python_ml_project_template

Installation

Docker

Using the CI.

Running on Clusters

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages