Installation (Cross-Platform)

First install Poetry. Open a terminal, cd to the cloned repository directory, and run

poetry install

to setup and install the virtual environment.

Basic Testing

WARNING: Tests will use all your available cores and spew a huge amount of text to your terminal, so only run them if you are prepared.

poetry run pytest

Running the Paper Analyses

Compute Canada

The code to run the analyses here is unfortunately deeply tied up with Compute Canada cluster specifics and SLURM. Normally, once you have activated your virtual environment with e.g. poetry shell, the procedure would be to run:

python analysis/create_jobscripts.py
sbatch analysis/job_scripts/submit_all_downsampling.sh
sbatch analysis/job_scripts/submit_all_feature.sh
sbatch analysis/job_scripts/submit_mlp_downsampling.sh
sbatch analysis/job_scripts/submit_mlp_feature.sh

There are hard-coded switches that modify resource requests and runtimes depending on the cluster, so reproducing this would unfortunately require reading and modifying the code.

Running Locally

Alternately, you can run a specific dataset analysis by specifying command-line arguments and faking the environment. For example (assuming you have run poetry shell):

CC_CLUSTER=niagara \
python analysis/feature_downsampling.py \
  --classifier=lr \
  --dataset=diabetes \
  --kfold-reps=50 \
  --n-percents=200 \
  --results-dir=<your_directory_here> \
  --cpus=8 \
  --pbar

Don't expect this to work on Windows, and it is most likely unfeasible to run all analyses like this on a single machine.

Hypertuning can likely be done locally without too much trouble. For example, to tune the logistic regression classifier, run:

poetry run pytest tests/test_analysis_hypertune.py::test_lr_params

Name		Name	Last commit message	Last commit date
Latest commit History 69 Commits
.vscode		.vscode
data		data
job_scripts		job_scripts
results		results
scripts		scripts
src		src
tests		tests
written/methods		written/methods
.flake8		.flake8
.gitignore		.gitignore
README.md		README.md
install.sh		install.sh
mypy.ini		mypy.ini
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
python_install.sh		python_install.sh
upload_data.sh		upload_data.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Installation (Cross-Platform)

Basic Testing

Running the Paper Analyses

Compute Canada

Running Locally

About

Releases

Packages

Languages

stfxecutables/ec_downsampling_analysis

Folders and files

Latest commit

History

Repository files navigation

Installation (Cross-Platform)

Basic Testing

Running the Paper Analyses

Compute Canada

Running Locally

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages