Skip to content
View invoker-LL's full-sized avatar
  • 17:32 (UTC +08:00)

Block or report invoker-LL

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

391 16 Updated Jan 18, 2025

Spatial Sparse Convolution Library

Python 1,976 370 Updated Dec 15, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 16,008 1,114 Updated Feb 28, 2025

GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.

65 Updated Nov 27, 2024

[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 282 1 Updated Mar 5, 2025

Multimodal Whole Slide Foundation Model for Pathology

Jupyter Notebook 175 19 Updated Feb 14, 2025

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 706 36 Updated Feb 24, 2025

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,601 71 Updated Aug 15, 2024

[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Python 239 7 Updated Jan 22, 2025

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

596 20 Updated Dec 23, 2024

[NeurIPS 2024] Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning

Python 68 4 Updated Feb 11, 2025
Jupyter Notebook 95 10 Updated Jun 27, 2024

Towards a general-purpose foundation model for computational pathology - Nature Medicine

Jupyter Notebook 408 58 Updated Jan 15, 2025

A vision-language foundation model for computational pathology - Nature Medicine

Python 322 28 Updated Jan 15, 2025

The official implementation of GPFM

Python 49 2 Updated Feb 7, 2025

GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI.

41 1 Updated Dec 17, 2024

Code associated to the publication: Scaling self-supervised learning for histopathology with masked image modeling, A. Filiot et al., MedRxiv (2023). We publicly release Phikon 🚀

Jupyter Notebook 148 12 Updated Jan 29, 2024

[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model

Jupyter Notebook 34 1 Updated Nov 10, 2024

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,179 215 Updated Mar 9, 2025

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,422 179 Updated Jan 23, 2025

EVE Series: Encoder-Free Vision-Language Models from BAAI

Python 308 7 Updated Mar 1, 2025

✨✨Latest Advances on Multimodal Large Language Models

14,164 912 Updated Mar 5, 2025

Reasoning with Language Model is Planning with World Model

PDDL 159 19 Updated Aug 25, 2023

[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333

Python 1,089 71 Updated Jan 11, 2024

A method to increase the speed and lower the memory footprint of existing vision transformers.

Python 1,018 71 Updated Jun 17, 2024

A curated list of awesome papers on dataset distillation and related applications.

HTML 1,558 143 Updated Mar 7, 2025

Stanford NLP Python library for Representation Finetuning (ReFT)

Python 1,438 124 Updated Feb 6, 2025

Paper collection about Mamba

7 1 Updated Apr 12, 2024

Fast and memory-efficient exact attention

Python 16,165 1,530 Updated Mar 9, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,071 126 Updated Mar 9, 2025
Next