Add multi-points input, foreground/background points input and box input to EfficientSAM model #291

Zhang-Yang-Sustech · 2025-04-23T15:52:05Z

Previously, the EfficientSAM model in the model zoo supported only a single point prompt for inference. This PR changes the point-input dimension to 6 and adds a dynamic padding step in preprocessing, so users can supply up to 6 points to select the object to segment. The model now outputs 3 candidate masks, and postprocessing selects among them to avoid picking a mask that covers background points. The demo also gains foreground/background point input and box input.
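For readers who want a concrete picture of the mask selection described above, here is a minimal sketch. It is not the code in efficientSAM.py: the function name `select_mask`, the logit threshold of 0, and the fallback to the highest-scoring mask are all assumptions made for illustration.

```python
import numpy as np

def select_mask(masks, iou_scores, background_points, threshold=0.0):
    """Hypothetical helper: pick one of the candidate masks.

    masks: (3, H, W) array of mask logits (assumed layout).
    iou_scores: (3,) array of predicted IoU scores.
    background_points: iterable of (x, y) pixel coordinates in mask space.
    Returns (index, binary_mask).
    """
    masks = np.asarray(masks)
    iou_scores = np.asarray(iou_scores)
    binary = masks > threshold  # assumed: logits above 0 are foreground
    _, h, w = binary.shape

    # Visit candidates from highest to lowest predicted IoU.
    order = np.argsort(iou_scores)[::-1]
    for idx in order:
        covers_background = any(
            0 <= int(y) < h and 0 <= int(x) < w and binary[idx, int(y), int(x)]
            for x, y in background_points
        )
        if not covers_background:
            return int(idx), binary[idx]

    # Assumed fallback: every candidate covers a background point,
    # so return the one with the highest score.
    best = int(order[0])
    return best, binary[best]
```

With no background points, this reduces to picking the mask with the highest predicted IoU.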

New model (image_segmentation_efficientsam_ti_2024may.onnx) with new input dimensions:
'batched_images': (1, 3, 1024, 1024); previously (1, 3, 640, 640).
'batched_point_coords': (1, 1, 6, 2); previously (1, 1, 1, 2), which supported only a single point. Up to 6 points are now supported.
'batched_point_labels': (1, 1, 6); previously (1, 1, 1), a single label. Each point now has its own label.
New output dimensions:
'output_masks': (1, 1, 3, 1024, 1024); previously (1, 1, 1, 640, 640), a single mask. The model now outputs three masks, so the best one can be selected to improve results.
'iou_predictions': (1, 1, 3); a new output used to select among the masks by score.
New model file (efficientSAM.py), with a dynamic point-padding method in preprocessing to support multi-point input and a mask selection method in postprocessing.
New demo file (demo.py), with foreground/background point input and box input.
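The point padding in preprocessing can be sketched as follows. This is an illustration under stated assumptions rather than the code in efficientSAM.py: the helper name `pad_prompts` is hypothetical, the labels 1 (foreground) and 0 (background) follow the usual SAM convention and are assumed here, and padding coordinates with zeros is a guess. Only the fixed length of 6 and the padding label of -1 come from this PR.

```python
import numpy as np

MAX_POINTS = 6   # fixed point dimension of the new model input
PAD_LABEL = -1   # label used for padded (unused) point slots

def pad_prompts(points, labels, max_points=MAX_POINTS):
    """Hypothetical helper: pad user prompts to the fixed model input shape.

    points: list of (x, y) coordinates, at most max_points entries.
    labels: one label per point (assumed: 1 = foreground, 0 = background).
    Returns arrays shaped (1, 1, max_points, 2) and (1, 1, max_points).
    """
    points = np.asarray(points, dtype=np.float32).reshape(-1, 2)
    labels = np.asarray(labels, dtype=np.float32).reshape(-1)
    if len(points) != len(labels):
        raise ValueError("each point needs exactly one label")
    if len(points) > max_points:
        raise ValueError(f"at most {max_points} points are supported")

    # Start from fully padded arrays, then copy the real prompts in.
    coords = np.zeros((max_points, 2), dtype=np.float32)  # assumed pad value
    padded_labels = np.full((max_points,), PAD_LABEL, dtype=np.float32)
    coords[: len(points)] = points
    padded_labels[: len(labels)] = labels

    # Add the two leading batch dimensions expected by the model inputs.
    return coords[None, None], padded_labels[None, None]
```

For example, `pad_prompts([(10, 20), (30, 40)], [1, 0])` yields a coordinate array of shape (1, 1, 6, 2) and a label array whose last four entries are -1.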


Zhang-Yang-Sustech added 20 commits April 10, 2024 20:41

a

11fb27e

add efficientsam model and basic demo

6e27053

update license

dc3f586

remove example images

691a559

update readme

a5cc02a

update readme

b0d9d3b

update demo

ffb1bf4

update demo

a48e3f5

update readme

be74b65

update SAM and __init__

7adcf81

update demo and sam

3a0ff63

update label

7d86141

add present gif

52fb290

update readme

d5bc0ce

add efficientSAM gif to readme of opencvzoo

073464f

cv version 4.10.0, remove camera branch

6130312

1. add multi-point inference (max: 6)

4ac7c69

2. add box prompt (drag), add background point (long press) 3. fix model input to 1024*1024 4. pad labels with -1 5. update demo

replace the model with a new model that supports multi-point input, update demo

4185fce

update readme

993eda2

update readme

e2ae8f3
