Skip to content

Commit

Permalink
Update mllm_papers.md
Browse files Browse the repository at this point in the history
  • Loading branch information
PromptExpert authored Nov 14, 2024
1 parent e27c05c commit 317b7a0
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions docs/mllm/mllm_papers.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,7 @@

## 最新动态
- 2024.11 [HourVideo: 1-Hour Video-Language Understanding](https://arxiv.org/abs/2411.04998) 李飞飞团队提出长视频理解评测集
- 2024.11 [Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models](https://arxiv.org/abs/2411.04996)
- 2024.11 [MM-EMBED: UNIVERSAL MULTIMODAL RETRIEVAL WITH MULTIMODAL LLMS](https://arxiv.org/pdf/2411.02571) 英伟达提出基于MLLM的通用多模态检索。
- 2024.11 [Attacking Vision-Language Computer Agents via Pop-ups](https://arxiv.org/abs/2411.02391)
- 2024.11 [Know Where You're Uncertain When Planning with Multimodal Foundation Models: A Formal Framework](https://arxiv.org/abs/2411.01639) 提高多模态基础模型在处理不确定性时的能力,从而增强机器人在规划任务中的可靠性。
Expand Down

0 comments on commit 317b7a0

Please sign in to comment.