Uses the Segment-Anything Model By Meta AI and adds a barebones interface to label images and saves the masks in the COCO format.
Under active development, apologies for rough edges and bugs. Use at your own risk.
- Install Segment Anything on any machine with a GPU. (Need not be the labelling machine.)
- Create a conda environment using
conda env create -f environment.yaml
on the labelling machine (Need not have GPU). - (Optional) Install coco-viewer to scroll through your annotations quickly.
docker run -it -v /home/marcoambrosio/dataset/:/root/dataset --privileged --env=NVIDIA_VISIBLE_DEVICES=all --env=NVIDIA_DRIVER_CAPABILITIES=all --gpus 1 --name salt andreaostuni/salt:salt-cuda-11.8-base /bin/bash
- Setup your dataset in the following format
<dataset_name>/images/*
and create empty folder<dataset_name>/embeddings
.- Annotations will be saved in
<dataset_name>/annotations.json
by default.
- Annotations will be saved in
- Copy the
helpers
scripts to the base folder of yoursegment-anything
folder.- Call
extract_embeddings.py
to extract embeddings for your images. For examplepython3 extract_embeddings.py --dataset-path <path_to_dataset>
- Call
generate_onnx.py
generate*.onnx
files in models. For examplepython3 generate_onnx.py --dataset-path <path_to_dataset> --onnx-models-path <path_to_dataset>/models
- Call
-
Call
segment_anything_annotator.py
with argument<dataset_name>
and categoriescat1,cat2,cat3..
. For examplepython3 segment_anything_annotator.py --dataset-path <path_to_dataset> --categories cat1,cat2,cat3
- There are a few keybindings that make the annotation process fast.
- Click on the object using left clicks and right click (to indicate outside object boundary).
n
adds predicted mask into your annotations. (Add button)r
rejects the predicted mask. (Reject button)a
andd
to cycle through images in your your set. (Next and Prev)l
andk
to increase and decrease the transparency of the other annotations.Ctrl + S
to save progress to the COCO-style annotations file.
-
coco-viewer to view your annotations.
python cocoviewer.py -i <dataset> -a <dataset>/annotations.json
-
Call
coco_to_binary_mask.py
with argument<dataset_name>
. For examplepython3 coco_to_binary_mask.py --dataset-path <path_to_dataset>
. It will create a new foldermasks
in the dataset folder with the binary masks. For now only one binary mask is created with all the segmented regions in the same image. (Multiple categories are not supported yet.)
Follow these guidelines to ensure that your contributions can be reviewed and merged. Need a lot of help in making the UI better.
If you have found a bug or have an idea for an improvement or new feature, please create an issue on GitHub. Before creating a new issue, please search existing issues to see if your issue has already been reported.
When creating an issue, please include as much detail as possible, including steps to reproduce the issue if applicable.
Create a pull request (PR) to the original repository. Please use black
formatter when making code changes.
The code is licensed under the MIT License. By contributing to SALT, you agree to license your contributions under the same license as the project. See LICENSE for more information.