June 2020
tl;dr: Detached model to perform domain adaptation sim2real.
Two stage model to bridge domain gap. This is very similar to GenLaneNet for 3D LLD prediction. The idea of using semantic segmentation to bridge the sim2real gap is explored in many BEV semantic segmentation tasks such as BEV-Seg, CAM2BEV, VPN.
The first stage model already extracted away domain-dependent features and thus the second stage model can be used as is.
The GT of BEV segmentation is difficult to collect in most domains. The simulated segmentation GT can be obtained in abundance with simulator such as CARLA.
- View transformation: pixel-wise depth prediction
- The first stage generates the pseudo-lidar point cloud, and render it in BEV.
- This is incomplete and may have many void pixels.
- Always choosing the point of lower height.
- The second stage converts the BEV view of pseudo-lidar point cloud to BEV segmentation.
- Fills in the void pixels
- Smooth already predicted segmentation
- During inference, only finetune first stage. Use second stage as is.
- Summary of technical details