- Champaign, IL
- https://hkchengrex.com
-
MMAudio Public
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
-
MS-CLAP Public
Forked from microsoft/CLAPLearning audio concepts from natural language supervision
Python MIT License UpdatedDec 22, 2024 -
CLAP Public
Forked from LAION-AI/CLAPContrastive Language-Audio Pretraining
Python Creative Commons Zero v1.0 Universal UpdatedDec 22, 2024 -
passt_hear21 Public
Forked from kkoutini/passt_hear21Inference code for PaSST, using the HEAR API.
Python UpdatedDec 22, 2024 -
ImageBind Public
Forked from facebookresearch/ImageBindImageBind One Embedding Space to Bind Them All
Python Other UpdatedDec 21, 2024 -
MiVOS Public
[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion. Semi-supervised VOS as well!
-
XMem Public
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
-
Cutie Public
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
-
nitrous-ema Public
Fast and simple post-hoc EMA (Karras et al., 2023) for PyTorch with minimal `.item()` calls. ~78% lower overhead than ema_pytorch.
-
shared-memory-tensor-dataset Public
This repository provides an example of reading from a single shared memory tensor from multiple processes (e.g., with DDP).
-
ema-pytorch Public
Forked from lucidrains/ema-pytorchA simple way to keep track of an Exponential Moving Average (EMA) version of your pytorch model
-
Tracking-Anything-with-DEVA Public
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
-
CascadePSP Public
[CVPR 2020] CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement
-
STCN Public
[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation
-
Scribble-to-Mask Public
[CVPR 2021] MiVOS - Scribble to Mask module
-
Mask-Propagation Public
[CVPR 2021] MiVOS - Mask Propagation module. Reproduced STM (and better) with training code 🌟. Semi-supervised video object segmentation evaluation.
-
vos-benchmark Public
Fast and general video object segmentation evaluation.
-
Grounded-Segment-Anything Public
Forked from IDEA-Research/Grounded-Segment-AnythingGrounded-SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
-
-
-
RAFT Public
Forked from princeton-vl/RAFTPython BSD 3-Clause "New" or "Revised" License UpdatedOct 23, 2021 -
kinetics_to_frames Public
Convert kinetics datasets (or other video datasets) to frames. Support resizing and temporal sampling for space efficiency.
-
-
fbrs_interactive_segmentation Public
Forked from SamsungLabs/fbrs_interactive_segmentation[CVPR2020] f-BRS: Rethinking Backpropagating Refinement for Interactive Segmentation https://arxiv.org/abs/2001.10331
Python Mozilla Public License 2.0 UpdatedAug 10, 2020 -
An implementation of Single View Metrology (Criminisi99) with step-by-step guidance in a Jupyter Notebook.
Jupyter Notebook UpdatedAug 5, 2020 -
STM Public
Forked from seoungwugoh/STMVideo Object Segmentation using Space-Time Memory Networks
-
RGMP Public
Forked from seoungwugoh/RGMPFast Video Object Segmentation by Reference-Guided Mask Propagation
Python UpdatedFeb 18, 2020 -
pythia Public
Forked from facebookresearch/mmfA modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Python Other UpdatedNov 29, 2019 -
TCN Public
Forked from locuslab/TCNSequence modeling benchmarks and temporal convolutional networks
Python MIT License UpdatedNov 24, 2019 -
semseg Public
Forked from hszhao/semsegSemantic Segmentation in Pytorch
Python MIT License UpdatedNov 13, 2019