hkchengrex

↗️

ヽ(*ﾟдﾟ)ノ

Rex Cheng hkchengrex

↗️

ヽ(*ﾟдﾟ)ノ

Ph.D. student at the University of Illinois Urbana-Champaign. From Hong Kong.

459 followers · 57 following

Champaign, IL
https://hkchengrex.com

MMAudio Public

[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

audio computer-vision deep-learning audio-synthesis video-to-audio

Python 547 41 MIT License Updated Dec 22, 2024
MS-CLAP Public
Forked from microsoft/CLAP

Learning audio concepts from natural language supervision

Python MIT License Updated Dec 22, 2024
CLAP Public
Forked from LAION-AI/CLAP

Contrastive Language-Audio Pretraining

Python Creative Commons Zero v1.0 Universal Updated Dec 22, 2024
passt_hear21 Public
Forked from kkoutini/passt_hear21

Inference code for PaSST, using the HEAR API.

Python Updated Dec 22, 2024
ImageBind Public
Forked from facebookresearch/ImageBind

ImageBind One Embedding Space to Bind Them All

Python Other Updated Dec 21, 2024
MiVOS Public

[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion. Semi-supervised VOS as well!

computer-vision deep-learning pytorch segmentation video-segmentation interactive-segmentation video-object-segmentation

Python 469 64 MIT License Updated Nov 15, 2024
XMem Public

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

computer-vision deep-learning pytorch segmentation video-segmentation video-object-segmentation eccv2022

Python 1,787 194 MIT License Updated Nov 15, 2024
Cutie Public

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

computer-vision deep-learning pytorch segmentation video-editing video-segmentation video-object-segmentation

Python 755 76 MIT License Updated Nov 8, 2024
nitrous-ema Public

Fast and simple post-hoc EMA (Karras et al., 2023) for PyTorch with minimal `.item()` calls. ~78% lower overhead than ema_pytorch.

machine-learning pytorch ema

Python 4 MIT License Updated Nov 2, 2024
shared-memory-tensor-dataset Public

This repository provides an example of reading from a single shared memory tensor from multiple processes (e.g., with DDP).

Python 2 Apache License 2.0 Updated Aug 12, 2024
ema-pytorch Public
Forked from lucidrains/ema-pytorch

A simple way to keep track of an Exponential Moving Average (EMA) version of your pytorch model

Python 1 MIT License Updated Aug 6, 2024
Tracking-Anything-with-DEVA Public

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

deep-learning object-tracking video-editing video-segmentation video-object-segmentation iccv2023 open-vocabulary-segmentation

Python 1,292 127 Other Updated Aug 1, 2024
CascadePSP Public

[CVPR 2020] CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement

computer-vision deep-learning pytorch segmentation high-resolution refinement-network cvpr2020

Python 835 93 MIT License Updated May 21, 2024
STCN Public

[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

computer-vision deep-learning pytorch segmentation video-segmentation video-object-segmentation neurips-2021

Python 546 69 MIT License Updated Mar 15, 2024
Scribble-to-Mask Public

[CVPR 2021] MiVOS - Scribble to Mask module

computer-vision deep-learning pytorch segmentation interactive-segmentation cvpr2021

Python 87 15 MIT License Updated Feb 16, 2024
Mask-Propagation Public

[CVPR 2021] MiVOS - Mask Propagation module. Reproduced STM (and better) with training code 🌟. Semi-supervised video object segmentation evaluation.

computer-vision deep-learning pytorch segmentation video-segmentation video-object-segmentation cvpr2021

Python 128 22 MIT License Updated Feb 16, 2024
vos-benchmark Public

Fast and general video object segmentation evaluation.

video-segmentation video-object-segmentation

Python 28 4 MIT License Updated Jan 30, 2024
Grounded-Segment-Anything Public
Forked from IDEA-Research/Grounded-Segment-Anything

Grounded-SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 39 10 Apache License 2.0 Updated Sep 15, 2023
so Public

Stackoverflow answers

Python Updated Aug 27, 2023
davis2016-evaluation Public archive

Python 8 Updated Aug 20, 2023
RAFT Public
Forked from princeton-vl/RAFT

Python BSD 3-Clause "New" or "Revised" License Updated Oct 23, 2021
kinetics_to_frames Public

Convert kinetics datasets (or other video datasets) to frames. Support resizing and temporal sampling for space efficiency.

Python 1 Updated Apr 13, 2021
BlenderVOSRenderer Public

Python 2 GNU General Public License v3.0 Updated Mar 14, 2021
fbrs_interactive_segmentation Public
Forked from SamsungLabs/fbrs_interactive_segmentation

[CVPR2020] f-BRS: Rethinking Backpropagating Refinement for Interactive Segmentation https://arxiv.org/abs/2001.10331

Python Mozilla Public License 2.0 Updated Aug 10, 2020
Single-View-Metrology-Step-By-Step Public

An implementation of Single View Metrology (Criminisi99) with step-by-step guidance in a Jupyter Notebook.

Jupyter Notebook Updated Aug 5, 2020
STM Public
Forked from seoungwugoh/STM

Video Object Segmentation using Space-Time Memory Networks

Python 2 Updated Mar 19, 2020
RGMP Public
Forked from seoungwugoh/RGMP

Fast Video Object Segmentation by Reference-Guided Mask Propagation

Python Updated Feb 18, 2020
pythia Public
Forked from facebookresearch/mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Python Other Updated Nov 29, 2019
TCN Public
Forked from locuslab/TCN

Sequence modeling benchmarks and temporal convolutional networks

Python MIT License Updated Nov 24, 2019
semseg Public
Forked from hszhao/semseg

Semantic Segmentation in Pytorch

Python MIT License Updated Nov 13, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rex Cheng hkchengrex

Block or report hkchengrex

MMAudio Public

MS-CLAP Public

CLAP Public

passt_hear21 Public

ImageBind Public

MiVOS Public

XMem Public

Cutie Public

nitrous-ema Public

shared-memory-tensor-dataset Public

ema-pytorch Public

Tracking-Anything-with-DEVA Public

CascadePSP Public

STCN Public

Scribble-to-Mask Public

Mask-Propagation Public

vos-benchmark Public

Grounded-Segment-Anything Public

so Public

davis2016-evaluation Public archive

RAFT Public

kinetics_to_frames Public

BlenderVOSRenderer Public

fbrs_interactive_segmentation Public

Single-View-Metrology-Step-By-Step Public

STM Public

RGMP Public

pythia Public

TCN Public

semseg Public