4D Net Pytorch Implementation KITTI (RGB+LiDAR)

Weights available at :Link to Weights
- password:4dnetpytorch
This repo is an attempt at implementing 4D Net https://arxiv.org/abs/2109.01066
NO TEMPORAL ELEMENT (RGB+LiDAR only, no Time)
- This repo serves only as a tutorial for myself
- I may have missed out some stuff from the paper
Feel free to download this repo and implement the temporal elements yourself

Visual Results

Evaluation mAP

Evaluation Code from https://github.com/jacoblambert/3d_lidar_detection_evaluation

Model Details

The Model consist of a PointNet Processing model, an RGB Processing Model, PseudoImage Scattering Layer and a Efficient-Det style Single Shot Detector as object detection head
During Training, the Pseudo Images will look like this in Tensorboard and important objects should get more pronounced
For matching the targets to predicted outputs, i used a hungarian matcher used in DETR/Deformable-DETR
Half of the effort here is to let the dataset grab the relavant RGB feature coordinates
- These coordinates are used to grab the CNN features from the RGB Image to create a sepearte Pseudo Image
- This is then concatenated with the LiDAR Point Pillars Pseudo Image later

Anchorbox Calculation

K-Means analysis of ground truth boxes are used
Look at Stats.ipynb

How to Train/Infer

Edit the dataset root location in train_exp_KITTI.py:

    from KITTI_dataset import kitti_dataset,KITTI_collate_fn
    from pillar_models import NET_4D_EffDet
    
    batch_size = 4
    xyz_range = np.array([0,-40.32,-2,80.64,40.32,3])
    xy_voxel_size= np.array([0.16,0.16])
    points_per_pillar = 32
    n_pillars=12000

    dataset = kitti_dataset(root = "/home/conda/RAID_5_14TB/DATASETS/KITTI_dataset/training/" , xyz_range = xyz_range,xy_voxel_size= xy_voxel_size,points_per_pillar = points_per_pillar,n_pillars=n_pillars)
    data_loader_train = DataLoader(dataset, batch_size=batch_size,collate_fn= KITTI_collate_fn, num_workers=8, shuffle=True)

    anchor_dict = np.load("./cluster_kitti_3scales_3anchor.npy",allow_pickle=True).item()
    model = NET_4D_EffDet(anchor_dict,n_classes=4)
    model.cuda()
    for i,(img,(pillars, coord, contains_pillars),(pillar_img_pts,rgb_coors,contains_rgb),targets) in enumerate(data_loader_train):
        pred,_,_= model(img.cuda(),pillars.float().cuda(), coord.cuda(), contains_pillars.cuda(),pillar_img_pts.float().cuda(),rgb_coors.cuda(),contains_rgb.cuda())
     
    print(pred["pred_logits"].shape) #torch.Size([2, 15747, 4])
    print(pred["pred_boxes"].shape) #torch.Size([2, 15747, 7]) #x,y,z,w,l,h,r

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
evaluation_code @ e6b55c0		evaluation_code @ e6b55c0
images		images
.gitmodules		.gitmodules
Benchmark_New.ipynb		Benchmark_New.ipynb
KITTI_dataset.py		KITTI_dataset.py
README.md		README.md
Stats.ipynb		Stats.ipynb
box_ops.py		box_ops.py
calibration_util.py		calibration_util.py
cluster_kitti_3scales_1anchor.npy		cluster_kitti_3scales_1anchor.npy
cluster_kitti_3scales_3anchor.npy		cluster_kitti_3scales_3anchor.npy
deformable_conv.py		deformable_conv.py
detector_models.py		detector_models.py
matcher.py		matcher.py
pillar_models.py		pillar_models.py
point_cloud_ops.py		point_cloud_ops.py
reader.py		reader.py
testset_names.npy		testset_names.npy
train_KITTI.py		train_KITTI.py
train_exp_KITTI.py		train_exp_KITTI.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

4D Net Pytorch Implementation KITTI (RGB+LiDAR)

Visual Results

Evaluation mAP

Model Details

Anchorbox Calculation

How to Train/Infer

About

Releases

Packages

Languages

chanlilong/4D_NET_pytorch

Folders and files

Latest commit

History

Repository files navigation

4D Net Pytorch Implementation KITTI (RGB+LiDAR)

Visual Results

Evaluation mAP

Model Details

Anchorbox Calculation

How to Train/Infer

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages