
Table 4 and Table 5 using COCO pretraining or not? #39

Open

superaha opened this issue Oct 26, 2022 · 4 comments

@superaha

Hi there,

Thank you for sharing the repo. In Table 3, the YouTube-VIS 2019 results are reported for models both with and without COCO pretraining.

What about Tables 4 and 5 for IDOL? I could not find the detailed settings or an explanation for those two results.

Thanks

@timmeinhardt commented Oct 26, 2022

I have asked the same question in a different issue. This line

WEIGHTS: "cocopretrain_R50.pth"

seems to suggest that they used a model pretrained on COCO sequences. But I would appreciate a clarification for the other results as well!
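For context, and assuming the detectron2-style setup this repo builds on, a WEIGHTS entry like the one above is typically consumed as sketched below. This is only an illustration, not the repo's actual training script; the commented config path is hypothetical.

```python
# Minimal sketch, assuming a detectron2-style setup.
# It only shows how a MODEL.WEIGHTS entry such as "cocopretrain_R50.pth" is
# typically loaded before VIS fine-tuning; the commented config path is hypothetical.
from detectron2.config import get_cfg
from detectron2.checkpoint import DetectionCheckpointer
from detectron2.modeling import build_model

cfg = get_cfg()
# cfg.merge_from_file("configs/example_ytvis19.yaml")  # hypothetical config path
cfg.MODEL.WEIGHTS = "cocopretrain_R50.pth"  # the COCO-pretrained checkpoint in question
cfg.MODEL.DEVICE = "cpu"                    # build on CPU just for this illustration

model = build_model(cfg)
# Loads matching weights from the checkpoint; keys that do not match the current
# model (e.g. newly added heads) are reported and skipped by the checkpointer.
DetectionCheckpointer(model).load(cfg.MODEL.WEIGHTS)
```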

@superaha (Author)

Thanks for pointing this out. Let us see if the authors can clarify. @wjf5203

@wjf5203 (Owner) commented Oct 27, 2022

Hi,

Thanks for your attention and for pointing this out.

Let me clarify this. We have at most three training steps for IDOL:

Step 1: pre-training the instance segmentation pipeline on COCO, following all other VIS methods.
Step 2: pre-training IDOL on pseudo key-reference pairs from COCO. (This step forces the model to learn a position-insensitive contrastive embedding that relies on the appearance of the object rather than its spatial position.)
Step 3: finetuning our VIS method IDOL on the VIS datasets (YTVIS19/YTVIS21/OVIS), following all other VIS methods.

So, the main difference is Step 2.
In Tables 3, 4, and 5, all IDOL results marked with $\dagger$ are obtained by Steps 1+2+3; results without $\dagger$ are obtained by Steps 1+3.
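To make Step 2 concrete, the sketch below illustrates the idea only (an illustration under assumptions, not the actual IDOL code; the function names and the exact loss form are assumed): two augmentations of one static COCO image act as a pseudo key-reference pair, and per-instance embeddings are trained so that the same instance matches across the two views by appearance rather than position.

```python
# Sketch of the Step 2 idea (illustration under assumptions, not the actual IDOL code).
import torch
import torch.nn.functional as F

def pseudo_pair(image, augment):
    """Two independently augmented views of a single static image serve as the
    'key' and 'reference' frames of a pseudo video clip."""
    return augment(image), augment(image)

def contrastive_embedding_loss(key_emb, ref_emb, key_ids, ref_ids, temperature=0.1):
    """key_emb: (Nk, D) and ref_emb: (Nr, D) per-instance embeddings from the two
    views; key_ids/ref_ids hold instance identities. Same-identity pairs across
    views are positives, all other pairs are negatives."""
    key_emb = F.normalize(key_emb, dim=1)
    ref_emb = F.normalize(ref_emb, dim=1)
    logits = key_emb @ ref_emb.t() / temperature          # (Nk, Nr) cosine similarities
    pos = (key_ids[:, None] == ref_ids[None, :]).float()  # 1 where identities match
    log_prob = F.log_softmax(logits, dim=1)
    # negative log-likelihood of the positive reference(s) for each key instance
    loss = -(log_prob * pos).sum(dim=1) / pos.sum(dim=1).clamp(min=1)
    return loss.mean()
```

The exact loss used in the paper and repo may pool negatives differently, so treat this only as a reading aid for the "position-insensitive contrastive embedding" claim.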

We will add more detailed experimental settings in the next arXiv version ~

@HanGuangXin

@wjf5203 Hi, so there are two pre-training steps. The first step is on single frames from static COCO images. The second step is on pseudo key-reference pairs.

And I have 3 questions about this:

  1. Why do we have to pre-train on static COCO images first? Why is using only the second step not enough?
  2. Are the provided pre-trained weights from the second step rather than the first? If so, could you provide the trained weights from the first step?
  3. How can I do the first-step pre-training myself? It seems the code only supports the second and third steps.
