Commit

Update mllm_papers.md
PromptExpert authored Sep 30, 2024
1 parent 417c022 commit 4c08f82
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions docs/mllm/mllm_papers.md
@@ -5,6 +5,7 @@
- 2024.09 [Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models](https://www.arxiv.org/abs/2409.17146) From the Allen Institute for AI; both the model and the data are open-sourced.
- 2024.09 [MIO: A Foundation Model on Multimodal Tokens](https://arxiv.org/abs/2409.17692)
- 2024.09 [Phantom of Latent for Large Language and Vision Models](https://arxiv.org/abs/2409.14713)
- 2024.09 [Qwen2-VL: Enhancing Vision-Language Model’s Perception of the World at Any Resolution](https://arxiv.org/pdf/2409.12191)
- 2024.09 [Llama 3.2: Revolutionizing edge AI and vision with open, customizable models](https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/)
- 2024.09 [NVLM: Open Frontier-Class Multimodal LLMs](https://arxiv.org/pdf/2409.11402)
- 2024.09 [Viper: Open Mamba-based Vision-Language Models](https://github.com/EvanZhuang/viper/tree/main) The first Mamba-based VLM series.