computer_vision_related
An OpenMMLab toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
A high-fidelity 3D face reconstruction library from monocular RGB image(s)
Optical Flow Estimation using RAFT with PyTorch.
A tool for converting the Waymo dataset format to the KITTI dataset format.
Obtain a bird's-eye view of a scene from a single input image.
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation