Skip to content

Latest commit

 

History

History
executable file
·
86 lines (74 loc) · 2.59 KB

File metadata and controls

executable file
·
86 lines (74 loc) · 2.59 KB
image

Solution Formulation

  1. To approach the problem, the first step was to recreate the DANet implementation on RGB images of Cityscapes Dataset
  2. Next, we explored publicly available datasets in the Autonomous Driving Domain which have semantic segmentation ground truth data for RGB images and LIDAR/ Depth data
    • Identified KITTI as a good dataset
  3. Run DANet on the KITTI semantic dataset
  4. Obtain LIDAR data from KITTI-360 along with corresponding segmentation information
  5. Creating our own custom network (a simplified version of DANet) for flexibility of input data
  6. Customizing this network for segmentation on RGB images combined with Depth Data

This repository is adapted from A PyTorch implementation of DANet based on CVPR 2019 paper Dual Attention Network for Scene Segmentation.

Requirements

conda install pytorch torchvision cudatoolkit=10.1 -c pytorch
  • opencv
pip install opencv-python
  • tensorboard
pip install tensorboard
  • pycocotools
pip install git+https://github.com/cocodataset/cocoapi.git#subdirectory=PythonAPI
  • fvcore
pip install git+https://github.com/facebookresearch/fvcore
  • cityscapesScripts
pip install git+https://github.com/mcordts/cityscapesScripts.git
  • detectron2
pip install git+https://github.com/facebookresearch/detectron2.git@master

Datasets

For a few datasets that detectron2 natively supports, the datasets are assumed to exist in a directory called datasets/, under the directory where you launch the program. They need to have the following directory structure:

Expected dataset structure for Cityscapes:

cityscapes/
  gtFine/
    train/
      aachen/
        color.png, instanceIds.png, labelIds.png, polygons.json,
        labelTrainIds.png
      ...
    val/
    test/
  leftImg8bit/
    train/
    val/
    test/

run ./datasets/prepare_cityscapes.py to creat labelTrainIds.png.

Training

To train a model, run

python train_net.py --config-file <config.yaml>

For example, to launch end-to-end DANet training with ResNet-50 backbone on 8 GPUs, one should execute:

python train_net.py --config-file configs/r50.yaml --num-gpus 8

Evaluation

Model evaluation can be done similarly:

python train_net.py --config-file configs/r50.yaml --num-gpus 8 --eval-only MODEL.WEIGHTS checkpoints/model.pth