-
Megvii
Stars
A generative world for general-purpose robotics & embodied AI learning.
real time face swap and one-click video deepfake with only a single image
CraftsMan: High-fidelity Mesh Generation with 3D Native Diffusion and Interactive Geometry Refiner
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.
An operation trying to do the opposite of F.grid_sample
Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch
[ToG 2024]: DMHomo: Learning Homography with Diffusion Models
[CVPR 2023] L2G-NeRF: Local-to-Global Registration for Bundle-Adjusting Neural Radiance Fields
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
An open-source visual programming environment for battle-testing prompts to LLMs.
Drag & drop UI to build your customized LLM flow
Playing Pokemon Red with Reinforcement Learning
[T-CSVT 2021]: DeepOIS: Gyroscope-Guided Deep Optical Image Stabilizer Compensation
A fast python implementation of Ray Tracing in One Weekend using python and Taichi
🐍 Geometric Computer Vision Library for Spatial AI
The official MegEngine implementation of the ICCV 2021 paper: GyroFlow: Gyroscope-Guided Unsupervised Optical Flow Learning
a pytorch impelementation of ssim to reproduce matlab results
RepVGG: Making VGG-style ConvNets Great Again