publications

* means equal contribution

2024

  1. lmms-eval.png
    LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models
    Kaichen Zhang*Bo Li*Peiyuan Zhang*Fanyi Pu*, Joshua Adrian Cahyono, Kairui HuShuai LiuYuanhan ZhangJingkang YangChunyuan Li, and Ziwei Liu
    2024
  2. worldqa.png
    WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning
    Yuanhan ZhangKaichen ZhangBo LiFanyi Pu, Christopher Arif Setiadharma, Jingkang Yang, and Ziwei Liu
    arXiv preprint arXiv:2405.03272, 2024

2023

  1. OtterHD.png
    OtterHD: A High-Resolution Multi-modality Model
    2023
  2. MIMICIT.png
    MIMIC-IT: Multi-Modal In-Context Instruction Tuning
    2023