EVA-02: Semantic Segmentation on ADE20K using UperNet

Table of Contents

EVA-02: Semantic Segmentation on ADE20K using UperNet

EVA-02 Model Card

model name	init. ckpt	batch size	iter	crop size	seg head dim	mIoU (ss)	mIoU (ms)	config	logs	weight
`eva02_B_ade_seg_upernet_sz512`	`eva02_B_pt_in21k_p14to16`	32	60k	`512x512`	768	55.3	-	link	link	🤗 HF link
`eva02_L_ade_seg_upernet_sz512`	`eva02_L_pt_m38m_p14to16`	16	80k	`512x512`	1024	59.8	60.4	link	link	🤗 HF link
`eva02_L_ade_seg_upernet_sz640`	`eva02_L_pt_m38m_p14to16`	16	80k	`640x640`	1536	60.1	-	link	link	🤗 HF link

Setup

Environment

First, setup EVA-02 pre-training & image classification environment.

Then, install mmcv==1.7.1 and mmsegmentation==0.29.1.

Data

Please prepare ADE20K datasets according to the guidelines in MMSegmentation.

EVA-02 pre-trained weight

model name	#params	MIM pt dataset	MIM pt epochs	weight
`eva02_B_pt_in21k_p14to16`	86M	IN-21K	150	🤗 HF link
`eva02_L_pt_m38m_p14to16`	304M	Merged-38M	56	🤗 HF link

eva02_psz14to16 models interpolate the kernel size of patch_embed from 14x14 to 16x16, and interpolate the pos_embed from 16x16 to 14x14. This is useful for object detection, instance segmentation & semantic segmentation tasks.

Evaluation

Evaluate eva02_B_ade_seg_upernet_sz512 on ADE20K using a single node with 4 gpus.

SEG_CONFIG=configs/eva02/upernet/upernet_eva02_base_12_512_slide_60k.py
EVAL_CKPT=/path/to/eva02_B_ade_seg_upernet_sz512.pth

python -m torch.distributed.launch --nproc_per_node=4 --nnodes=${WORLD_SIZE} --node_rank=${RANK} --master_addr=${MASTER_ADDR} --master_port=11235 \
--use_env test.py --launcher pytorch \
    ${SEG_CONFIG} \
    ${EVAL_CKPT} \
    --eval mIoU


# expected results
Summary:
+-------+-------+-------+
|  aAcc |  mIoU |  mAcc |
+-------+-------+-------+
| 85.41 | 55.25 | 66.81 |
+-------+-------+-------+

Evaluate eva02_L_ade_seg_upernet_sz512 on ADE20K using a single node with 4 gpus.

single-scale evaluation

SEG_CONFIG=configs/eva02/upernet/upernet_eva02_large_24_512_slide_80k.py
EVAL_CKPT=/path/to/eva02_L_ade_seg_upernet_sz512.pth

python -m torch.distributed.launch --nproc_per_node=4 --nnodes=${WORLD_SIZE} --node_rank=${RANK} --master_addr=${MASTER_ADDR} --master_port=11235 \
--use_env test.py --launcher pytorch \
    ${SEG_CONFIG} \
    ${EVAL_CKPT} \
    --eval mIoU


# expected results
Summary:
+-------+-------+-------+
|  aAcc |  mIoU |  mAcc |
+-------+-------+-------+
| 86.67 | 59.77 | 72.05 |
+-------+-------+-------+

multi-scale evaluation

SEG_CONFIG=configs/eva02/upernet/upernet_eva02_large_24_512_slide_ms_eval.py
EVAL_CKPT=/path/to/eva02_L_ade_seg_upernet_sz512.pth

python -m torch.distributed.launch --nproc_per_node=4 --nnodes=${WORLD_SIZE} --node_rank=${RANK} --master_addr=${MASTER_ADDR} --master_port=11235 \
--use_env test.py --launcher pytorch \
    ${SEG_CONFIG} \
    ${EVAL_CKPT} \
    --eval mIoU


# expected results
Summary:                                                                                                                                                                             
+-------+-------+-------+                
|  aAcc |  mIoU |  mAcc |                                                                                                                 
+-------+-------+-------+       
| 86.98 | 60.42 | 72.54 |                                                                                                                 
+-------+-------+-------+

Evaluate eva02_L_ade_seg_upernet_sz640 on ADE20K using a single node with 4 gpus.

SEG_CONFIG=configs/eva02/upernet/upernetpro_eva02_large_24_640_slide_80k.py
EVAL_CKPT=/path/to/eva02_L_ade_seg_upernet_sz640.pth

python -m torch.distributed.launch --nproc_per_node=4 --nnodes=${WORLD_SIZE} --node_rank=${RANK} --master_addr=${MASTER_ADDR} --master_port=11235 \
--use_env test.py --launcher pytorch \
    ${SEG_CONFIG} \
    ${EVAL_CKPT} \
    --eval mIoU


# expected results
Summary:
+-------+-------+-------+
|  aAcc |  mIoU |  mAcc |
+-------+-------+-------+
| 86.83 | 60.05 | 72.17 |
+-------+-------+-------+

Training

Train eva02_B_ade_seg_upernet_sz512 on ADE20K using a single node with 4 gpus.

SEG_CONFIG=configs/eva02/upernet/upernet_eva02_base_12_512_slide_60k.py
PRETRAIN_CKPT=/path/to/eva02_B_pt_in21k_p14to16.pt

python -m torch.distributed.launch --nproc_per_node=4 --nnodes=${WORLD_SIZE} --node_rank=${RANK} --master_addr=${MASTER_ADDR} --master_port=11235 \
--use_env train.py --launcher pytorch \
    ${SEG_CONFIG} \
    --seed 0 --deterministic \
    --options model.backbone.pretrained=${PRETRAIN_CKPT}

Train eva02_L_ade_seg_upernet_sz512 on ADE20K using 2 nodes with 8 gpus.

SEG_CONFIG=configs/eva02/upernet/upernet_eva02_large_24_512_slide_80k.py
PRETRAIN_CKPT=/path/to/eva02_L_pt_m38m_p14to16.pt

python -m torch.distributed.launch --nproc_per_node=8 --nnodes=${WORLD_SIZE} --node_rank=${RANK} --master_addr=${MASTER_ADDR} --master_port=11235 \
--use_env train.py --launcher pytorch \
    ${SEG_CONFIG} \
    --seed 0 --deterministic \
    --options model.backbone.pretrained=${PRETRAIN_CKPT}

Train eva02_L_ade_seg_upernet_sz640 on ADE20K using 2 nodes with 8 gpus.

SEG_CONFIG=configs/eva02/upernet/upernetpro_eva02_large_24_640_slide_80k.py
PRETRAIN_CKPT=/path/to/eva02_L_pt_m38m_p14to16.pt

python -m torch.distributed.launch --nproc_per_node8 --nnodes=${WORLD_SIZE} --node_rank=${RANK} --master_addr=${MASTER_ADDR} --master_port=11235 \
--use_env train.py --launcher pytorch \
    ${SEG_CONFIG} \
    --seed 0 --deterministic \
    --options model.backbone.pretrained=${PRETRAIN_CKPT}

Acknowledgement

EVA-02 semantic segmentation is built with MMSegmentation and BEiT.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
configs		configs
misc		misc
mmcv_custom		mmcv_custom
mmseg		mmseg
tools		tools
.gitignore		.gitignore
README.md		README.md
test.py		test.py
test.sh		test.sh
train.py		train.py
train.sh		train.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EVA-02: Semantic Segmentation on ADE20K using UperNet

EVA-02 Model Card

Setup

Environment

Data

EVA-02 pre-trained weight

Evaluation

Training

Acknowledgement

About

Releases

Packages

Languages

Brilliant-B/EVA02_Seg_Hybrid

Folders and files

Latest commit

History

Repository files navigation

EVA-02: Semantic Segmentation on ADE20K using UperNet

EVA-02 Model Card

Setup

Environment

Data

EVA-02 pre-trained weight

Evaluation

Training

Acknowledgement

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages