Welcome to share CVPR 2025 papers and code #242

Open
amusi opened this issue Feb 27, 2025 · 16 comments

@amusi
Owner

amusi commented Feb 27, 2025

[The format of the issue]
Paper title:
Paper link:
Code link:

@amusi
Owner Author

amusi commented Feb 27, 2025

[Sample]
MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Paper: https://arxiv.org/abs/2407.08083
Code: https://github.com/NVlabs/MambaVision

@Xiangxu-0103

LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes
Paper: https://arxiv.org/abs/2501.04004
Code: https://github.com/Xiangxu-0103/LiMoE
Project: https://ldkong.com/LiMoE

@Epiphqny

PAR: Parallelized Autoregressive Visual Generation
Paper: https://arxiv.org/abs/2412.15119
Code: https://github.com/Epiphqny/PAR
Project: https://epiphqny.github.io/PAR-project/

@MingkunLei

StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements
Paper: https://arxiv.org/abs/2412.08503
Code: https://github.com/Westlake-AGI-Lab/StyleStudio
Project: https://stylestudio-official.github.io/

@2toinf

2toinf commented Feb 28, 2025

Universal Actions for Enhanced Embodied Foundation Models
Paper: https://arxiv.org/abs/2501.10105
Code: https://github.com/2toinf/UniAct
Project: https://2toinf.github.io/UniAct/

@Fediory

Fediory commented Feb 28, 2025

HVI: A New Color Space for Low-light Image Enhancement
Paper: https://arxiv.org/abs/2502.20272
Code: https://github.com/Fediory/HVI-CIDNet
Demo: https://huggingface.co/spaces/Fediory/HVI-CIDNet_Low-light-Image-Enhancement_

@LiewFeng

LiewFeng commented Mar 1, 2025

Paper title: Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
Paper link: https://arxiv.org/abs/2411.19108
Code link: https://github.com/ali-vilab/TeaCache
Project: https://liewfeng.github.io/TeaCache/
Topic: Visual Generation Acceleration

@wbhu

wbhu commented Mar 1, 2025

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Paper: https://arxiv.org/abs/2409.02095
Code: https://github.com/Tencent/DepthCrafter
Project: https://depthcrafter.github.io
Topic: Depth Estimation

@hzxie

hzxie commented Mar 1, 2025

Generative Gaussian Splatting for Unbounded 3D City Generation

Paper: https://arxiv.org/abs/2406.06526
Code: https://github.com/hzxie/GaussianCity
Project: https://haozhexie.com/project/gaussian-city
Hugging Face: https://huggingface.co/spaces/hzxie/gaussian-city
Topic: 3D Generation, 3DGS (Gaussian Splatting)

@callsys

callsys commented Mar 1, 2025

Paper title: DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution
Paper link: https://arxiv.org/abs/2405.16071
Code link: https://github.com/callsys/DynRefer
Topic: Multimodal learning, MLLM

@Junda24

Junda24 commented Mar 3, 2025

Paper title: MonSter: Marry Monodepth to Stereo Unleashes Power
Paper link: https://arxiv.org/abs/2501.08643
Code link: https://github.com/Junda24/MonSter
Topic: stereo matching

@Gaaaavin

Gaaaavin commented Mar 3, 2025

CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
Paper: https://arxiv.org/abs/2411.17820
Project website: https://ai4ce.github.io/CityWalker/
Code: https://github.com/ai4ce/CityWalker
Topic: Embodied AI

@MqLeet

MqLeet commented Mar 4, 2025

ReDDiT: Efficient Diffusion as Low Light Enhancer
Paper: https://arxiv.org/abs/2410.12346
Code: https://github.com/lgz-0713/ReDDiT
Topic: Low-level vision, Image enhancement

@yuanc3

yuanc3 commented Mar 4, 2025

From Poses to Identity: Training-Free Person Re-Identification via Feature Centralization
Paper: https://arxiv.org/abs/2503.00938
Code: https://github.com/yuanc3/Pose2ID
Topic: ReID

@pandayuanyu

Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis

Paper: https://arxiv.org/abs/2412.02168
Code: https://github.com/pandayuanyu/generative-photography
Project Page: https://generative-photography.github.io/project/
Dataset: https://huggingface.co/datasets/pandaphd/camera_settings
Demo: https://huggingface.co/spaces/pandaphd/generative_photography

Topic: Image / Video Generation, Camera Physics

@hyz317

hyz317 commented Mar 6, 2025

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images
Paper: https://arxiv.org/abs/2411.05738
Code: https://github.com/hyz317/StdGEN
Project Page: https://stdgen.github.io/
Huggingface: https://huggingface.co/hyz317/StdGEN
Topic: 3D Generation, Avatar
