diff --git a/docs/mllm/mllm_papers.md b/docs/mllm/mllm_papers.md
index 69d3980..3ab969f 100644
--- a/docs/mllm/mllm_papers.md
+++ b/docs/mllm/mllm_papers.md
@@ -1,6 +1,7 @@
 # Selected MLLM Papers (Continuously Updated)
 
 ## Latest Updates
+- 2024.08 [Building and better understanding vision-language models: insights and future directions](https://www.arxiv.org/pdf/2408.12637)
 - 2024.08 [LongVILA: Scaling Long-Context Visual Language Models for Long Videos](https://arxiv.org/abs/2408.10188)
 - 2024.08 [UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling](https://arxiv.org/pdf/2408.04810)
 - 2024.08 [xGen-MM (BLIP-3): A Family of Open Large Multimodal Models](https://www.arxiv.org/abs/2408.08872)