230629 LLaVAR.md

https://arxiv.org/abs/2306.17107

LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding (Yanzhe Zhang, Ruiyi Zhang, Jiuxiang Gu, Yufan Zhou, Nedim Lipka, Diyi Yang, Tong Sun)