Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding a maskrcnn_resnet50_fpn_v2 from TorchVision models #1167

Open
medphisiker opened this issue Jan 19, 2023 · 0 comments
Open

Adding a maskrcnn_resnet50_fpn_v2 from TorchVision models #1167

medphisiker opened this issue Jan 19, 2023 · 0 comments
Labels
enhancement New feature or request help wanted Extra attention is needed

Comments

@medphisiker
Copy link

medphisiker commented Jan 19, 2023

🚀 Feature

Hello.

there is new intresting version of Masked RCNN model in TorchVision (link).
maskrcnn_resnet50_fpn_v2 - Improved Mask R-CNN model with a ResNet-50-FPN backbone from the Benchmarking Detection Transfer Learning with Vision Transformers paper.

maskrcnn_resnet50_fpn_v2 model gives effective increase(link) for MS COCO metric in comparision with classic maskrcnn_resnet50_fpn.

image

I see some examples of fine tuning. The code for fine tuning maskrcnn_resnet50_fpn_v2 and maskrcnn_resnet50_fpn are identical.
Ice Vision framework has support for classic TorchVision's maskrcnn_resnet50_fpn fine tuning. It will be great if Ice Vision framework also has support for new TorchVision's maskrcnn_resnet50_fpn_v2.

Describe the solution you'd like
It will be great if Ice Vision framework also has support for new TorchVision's maskrcnn_resnet50_fpn_v2. Also there is an updated version of the these detectors, - FasterRCNN_ResNet50_FPN_V2 and RetinaNet_ResNet50_FPN_V2.

image

Describe alternatives you've considered
Currently, we already have many excellent neural networks for detection in the Ice Vision framework. But it is worth noting that Faster and Masked RCN are multi-stage detectors. Most of the more accurate detectors presented in the framework are single-stage.
In one competition, I used YOLOv7, which had a higher metric on MS COCO for detection (53). But the competitors that used the classic multistage Faster R-CNN won that gives only 37. It turned out that on a dataset with crowded objects, Faster RCNN works better than a single-stage YOLOv7, even though there is a big difference in metrics on MS COCO in the YOLOv7 slider.

@medphisiker medphisiker added enhancement New feature or request help wanted Extra attention is needed labels Jan 19, 2023
@medphisiker medphisiker changed the title Add maskrcnn_resnet50_fpn_v2 from TorchVision models Adding a maskrcnn_resnet50_fpn_v2 from TorchVision models Jan 19, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

1 participant