Yunlong (Yolo) Tang

Hi there / 你好 / こんにちは :wave: Welcome to my homepage!

website/prof_pic.jpg

My name is Yunlong Tang (). I’m a second-year Ph.D. student in Computer Science at University of Rochester (UR), advised by Prof. Chenliang Xu. I obtained B.Eng. (2019-2023) in Intelligence Science & Technology from Southern University of Science and Technology (SUSTech), with supervision from Prof. Feng Zheng. I’ve interned at ByteDance and Tencent.

My research focuses on Multimodal Learning, especially Video Understanding & Generation. I also have a keen interest in AI-Agents and Computational Arts.

I am actively looking for any collaboration. Please feel free to contact me if you are interested!

News

Dec 09, 2024 Three papers on Video-LLMs has been accepted by AAAI 2025!
Nov 23, 2024 We have released VidComposition, a benchmark to evaluate MLLMs’ understanding of video compositions. 👉 [ Project Page | Paper | Leaderboard ]
Oct 13, 2024 🚀 MMComposition has been publicly released. Read our Paper, check out the latest 🏆Leaderboard, and access the Code to evaluate your own models.
Aug 23, 2024 Introducing CaRDiff, a framework for video saliency prediction using MLLM CoT reasoning and diffusion model.
Aug 05, 2024 🏅 We won the first place in AIM 2024 Challenge on Video Saliency Prediction @ ECCV Workshop! Thanks to Gen Zhan and Li Yang!
👉 More news

Selected Research

* Equal Contribution | † Corresponding Author

  1. AAAI
    cardiff.png
    Yunlong Tang, Gen Zhan, Li Yang, Yiting Liao, and Chenliang Xu
    AAAI Conference on Artificial Intelligence (AAAI), 2025
  2. vidllm_survey.png
    Yunlong Tang*, Jing Bi*, Siting Xu*, Luchuan Song, Susan Liang, Teng Wang, Daoan Zhang, Jie An, Jingyang Lin, Rongyi Zhu, and 10 more authors
    arXiv preprint arXiv:2312.17432, 2023
  3. AAAI
    teaser-avicuna.png
    Yunlong Tang, Daiki Shimada, Jing Bi, Mingqian Feng, Hang Hua, and Chenliang Xu
    AAAI Conference on Artificial Intelligence (AAAI), 2025
  4. AAAI
    v2xum-llama.png
    Hang Hua*Yunlong Tang*, Chenliang Xu, and Jiebo Luo
    AAAI Conference on Artificial Intelligence (AAAI), 2025
  5. ACM MM
    teaser-eagle.png
    Jing Bi, Yunlong Tang, Luchuan Song, Ali Vosoughi, Nguyen Nguyen, and Chenliang Xu
    In Proceedings of the 32nd ACM International Conference on Multimedia (ACM MM), 2024

Misc.


Fun Facts
  • My nickname, YOLO, is a soramimi/mondegreen for Yunlong.
  • I'm a Tech-otaku, ACGN enthusiast, J-Pop fan, and 🚀 e/acc proponent.
  • I have a certain artistic foundation (10+ years of experience in drawing/painting 🎨).
Visitor Map
"What I cannot create, I do not understand."
—— Richard Feynman
Social Links