Skip to content

Commit

Permalink
Update mllm_papers.md
Browse files Browse the repository at this point in the history
  • Loading branch information
PromptExpert authored Nov 7, 2024
1 parent c583e37 commit 8dab3e5
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions docs/mllm/mllm_papers.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,7 @@
多模态交流QQ群: 237976286

## 最新动态
- 2024.11 [MM-EMBED: UNIVERSAL MULTIMODAL RETRIEVAL WITH MULTIMODAL LLMS](https://arxiv.org/pdf/2411.02571) 英伟达提出基于MLLM的通用多模态检索。
- 2024.11 [Attacking Vision-Language Computer Agents via Pop-ups](https://arxiv.org/abs/2411.02391)
- 2024.11 [Know Where You're Uncertain When Planning with Multimodal Foundation Models: A Formal Framework](https://arxiv.org/abs/2411.01639) 提高多模态基础模型在处理不确定性时的能力,从而增强机器人在规划任务中的可靠性。
- 2024.10 [Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities](https://arxiv.org/abs/2410.11190)
Expand Down

0 comments on commit 8dab3e5

Please sign in to comment.