Running things at the potsdam university cluster

This is better explained in my blog: https://xarxax.xyz/training-ai-models-potsdam-university/

To run with gpu an apptainer container you should

Build the image.

apptainer build img.sif recipe.def

Run the image with --nv flag

apptainer run --nv img.sif

This last step is done in slurm.job so if you just sbatch slurm.job while in your cluster you should be fine.

If you want to see how your task is doing, as Uni Potsdam says you can check the Grafana.

Comfy setup aliases for working in your machine but running things on the cluster (you can add this to your .bashrc):

#VARIABLES
export YOUR_CLUSTER_USERNAME="yourusername"
export project="/example_apptainer"#the shortcuts only work if the project is in your home folder
export CLUSTER_LOGIN="[email protected]"
export PATH_IN_CLUSTER="/work/$YOUR_CLUSTER_USERNAME/"

#SCRIPTS
alias ssh_uni="ssh -X $CLUSTER_LOGIN"
alias update_example_apptainer="rsync -av -e ssh --exclude='*.pyc' --exclude='.git' --exclude='*/generated_models/*' $HOME/$project $CLUSTER_LOGIN:$PATH_IN_CLUSTER "
alias reverse_update_example_apptainer="rsync -av -e ssh --exclude='*.pyc' --exclude='.git*' --exclude='*generate_model.py' --exclude='*.sif' --exclude='*.bin' --exclude='*.pt'  $CLUSTER_LOGIN:$PATH_IN_CLUSTER/$project $HOME  "

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
generated_models		generated_models
.gitignore		.gitignore
README.md		README.md
generate_model.py		generate_model.py
recipe.def		recipe.def
requirements.txt		requirements.txt
slurm.job		slurm.job

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Running things at the potsdam university cluster

About

Releases

Packages

Languages

xarxaxdev/example_apptainer

Folders and files

Latest commit

History

Repository files navigation

Running things at the potsdam university cluster

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages