Data

Stage 1: Pre-training Dataset

  • Please download the transformed annotations of each dataset from Stage-1 Training Annotations.

  • Please download the images for each dataset from its official source, listed below.

| Data | Source | Data | Source |
|---|---|---|---|
| COCO 2014 | Download | VOCdevkit | Download |
| COCO 2017 | Download | DocBank | Download |
| Visual Genome | Download | DocLayNet | Download |
| Objects365 | Download | PubLayNet | Download |
| OpenImage | Download | CurvedSynText150k | Download |
| V3Det | Download | ICDAR2013 | Download |
| ADE20k | Download | MLT2017 | Download |
| Cityscapes | Download | MLT2019 | Download |
| cocostuff 10k | Download | TotalText | Download |
| cocostuff 164k | Download | AITW | Download |

Important notice: The Visual Genome image directory must contain all VG images, which are distributed in two folders (VG_100K and VG_100K_2). Merge the images from both folders into a single folder.
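The merge described above can be sketched in Python as follows. This is a minimal sketch, not part of the official tooling: the function name `merge_image_folders` and the target folder name are hypothetical, and only the folder names VG_100K and VG_100K_2 come from the notice. It copies files rather than moving them, and it skips any file name that already exists in the target so nothing is overwritten.

```python
import shutil
from pathlib import Path


def merge_image_folders(sources, target):
    """Copy every file from each source folder into one target folder.

    Returns the number of files copied. Files whose name already exists
    in the target are skipped rather than overwritten.
    """
    target = Path(target)
    target.mkdir(parents=True, exist_ok=True)
    copied = 0
    for source in sources:
        for path in sorted(Path(source).iterdir()):
            if not path.is_file():
                continue  # ignore nested directories
            destination = target / path.name
            if destination.exists():
                continue  # keep the first copy, never overwrite
            shutil.copy2(path, destination)
            copied += 1
    return copied


# Example call (the paths are assumptions; adjust them to your own layout):
# merge_image_folders(
#     ["data/vg/VG_100K", "data/vg/VG_100K_2"],
#     "data/vg/images",
# )
```

Copying doubles the disk usage while both copies exist; if space is tight, `shutil.move` could be substituted for `shutil.copy2` once the result has been checked.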

  • In each annotation JSON file, update the image path to reflect the location of the downloaded image data.
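One way to update the image paths in bulk is sketched below. The annotation schema is not described here, so the sketch makes no assumption about key names: it walks the whole JSON structure and rewrites every string that starts with a given prefix. The helper names `replace_prefix` and `update_annotation_file`, and both prefixes, are hypothetical. It also assumes each annotation file is a single JSON document that fits in memory.

```python
import json
from pathlib import Path


def replace_prefix(node, old_prefix, new_prefix):
    """Return a copy of a JSON-like structure with string prefixes swapped."""
    if isinstance(node, dict):
        return {k: replace_prefix(v, old_prefix, new_prefix) for k, v in node.items()}
    if isinstance(node, list):
        return [replace_prefix(v, old_prefix, new_prefix) for v in node]
    if isinstance(node, str) and node.startswith(old_prefix):
        return new_prefix + node[len(old_prefix):]
    return node  # numbers, booleans, None, and non-matching strings


def update_annotation_file(path, old_prefix, new_prefix):
    """Rewrite one annotation JSON file in place."""
    path = Path(path)
    data = json.loads(path.read_text(encoding="utf-8"))
    updated = replace_prefix(data, old_prefix, new_prefix)
    path.write_text(json.dumps(updated, ensure_ascii=False), encoding="utf-8")


# Example call (file name and prefixes are assumptions):
# update_annotation_file("annotations/coco.json", "/old/root/", "/my/data/")
```

Because the match is on any string value, choose an `old_prefix` specific enough that it only matches image paths, and keep a backup of the original annotation files before rewriting them.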

Stage 2: Fine-tuning Dataset

  • Please download the transformed annotations of each dataset from Stage-2 Training Annotations.

  • Please download the images for each dataset from its official source. The data sources added after Stage 1 are listed below:

| Data | Source | Data | Source |
|---|---|---|---|
| OpenPsgGCG | Download | SeeClick | Download |
| GRIT | Download | Multi-Panel | Download |
| Flickr30K | Download | Osprey-724K | Download |
| M6Doc | Download | LaionGPT4v | Download |
| VCR | Download | ShareGPT4v | Download |

  • In each annotation JSON file, update the image path to reflect the location of the downloaded image data.