Stars
OpenMMLab Detection Toolbox and Benchmark
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
SEED-Story: Multimodal Long Story Generation with Large Language Model
Anole: An Open, Autoregressive, and Native Multimodal Model for Interleaved Image-Text Generation
🦜🔗 Build context-aware reasoning applications
[ICML'24 Oral] "MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions"
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
High-Resolution Image Synthesis with Latent Diffusion Models
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multimodal AI that uses just a decoder to generate both text and images
Chinese version of CLIP, supporting Chinese cross-modal retrieval and representation generation
Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks
Densely Captioned Images (DCI) dataset repository.
(CVPR 2024) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
Anserini is a Lucene toolkit for reproducible information retrieval research
Open source implementation of "Vision Transformers Need Registers"
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding