Based upon https://github.com/runpod/serverless-workers/blob/main/workers/huggingface-transformers/README.md
Select one of the following models to build:
falcon-7b-instruct
falcon-40b-instruct
starcoder
wizardcoder
docker build --build-arg MODEL_NAME={model name} -t repo/image_name:tag .
Prebuilt Docker images are also available in the packages of this repository.
The build fails intermittently. The cause is unknown.
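Since the cause of the intermittent build failures is not known, one possible workaround is to re-run the build a few times. The following is a minimal sketch, not part of this repository: the `retry` function is a hypothetical helper, and the attempt count of 3 is an arbitrary assumption. It only helps if the failure is transient.

```shell
# Hypothetical helper (not part of this repository): run a command up to
# max_attempts times and stop at the first success.
retry() {
  local max_attempts="$1"
  shift
  local attempt=1
  while [ "$attempt" -le "$max_attempts" ]; do
    if "$@"; then
      return 0
    fi
    echo "attempt ${attempt}/${max_attempts} failed: $*" >&2
    attempt=$((attempt + 1))
  done
  return 1
}

# Example usage (image name and tag are placeholders, as in the command above):
# retry 3 docker build --build-arg MODEL_NAME=starcoder -t repo/image_name:tag .
```

The example call is left commented out so the snippet can be sourced without starting a build.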
Running a model requires that the model's git repository is downloaded to the root of a network volume.
The folder name has to match the model name.
For example, when using the starcoder model with the network volume mounted at /runpod-volume, the model has to be located at:
/runpod-volume/starcoder
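The layout requirement can be checked before starting a worker. The following is a minimal sketch under stated assumptions: `check_model_dir` is a hypothetical helper that is not part of this repository, and the `VOLUME_ROOT` variable is introduced only so the check can be tried outside RunPod. It defaults to the /runpod-volume mount location.

```shell
# Hypothetical helper (not part of this repository): verify that a folder
# named after the model exists at the root of the volume and is not empty.
check_model_dir() {
  local model_name="$1"
  # VOLUME_ROOT is an assumption for local testing; default is the mount point.
  local volume_root="${VOLUME_ROOT:-/runpod-volume}"
  local model_dir="${volume_root}/${model_name}"

  if [ ! -d "$model_dir" ]; then
    echo "missing: $model_dir" >&2
    return 1
  fi
  # An empty folder suggests the model repository was never downloaded.
  if [ -z "$(ls -A "$model_dir")" ]; then
    echo "empty: $model_dir" >&2
    return 1
  fi
  echo "ok: $model_dir"
}
```

For example, `check_model_dir starcoder` looks for /runpod-volume/starcoder and returns a non-zero status if the folder is missing or empty. It does not verify that the downloaded files are complete.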