Yunlong (Yolo) Tang

Hi there / 你好 / こんにちは / Ciallo~(∠・ω< )⌒★ Welcome to my homepage!

I’m a second-year Ph.D. student in the Department of Computer Science at the University of Rochester, advised by Prof. Chenliang Xu.

I obtained my B.Eng. from SUSTech in 2023, under the supervision of Prof. Feng Zheng. I’ve interned at ByteDance and Tencent.

My recent research focuses on multimodal learning, particularly LLMs/VLMs for video understanding. I am also exploring video generation, AI agents, and computational arts.

Please read this [note] if you're interested in research collaboration.

News

May 31, 2025 📐 Introducing MMPerspective, a comprehensive benchmark for MLLMs on perspective understanding.
May 27, 2025 🌟 Started my internship as an Applied Scientist Intern at Amazon in Bellevue, WA.
May 03, 2025 🎉 Our Vid-LLM survey has been accepted by the IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)! 👉 IEEE Xplore | GitHub
Apr 18, 2025 🎉 I have passed my area/qualification exam and become a Ph.D. candidate!
Apr 09, 2025 📷 Caption Anything in Video (CAT-V) has been released 👉 arXiv | GitHub

Selected Research

* Equal Contribution | † Corresponding Author

  1. CVPR
    vidcomposition.png
    Yunlong Tang*, Junjia Guo*, Hang Hua, Susan Liang, Mingqian Feng, Xinyang Li, Rui Mao, Chao Huang, Jing Bi, Zeliang Zhang, and 2 more authors
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
  2. TCSVT
    vidllm_survey.png
    Yunlong Tang*, Jing Bi*, Siting Xu*, Luchuan Song, Susan Liang, Teng Wang, Daoan Zhang, Jie An, Jingyang Lin, Rongyi Zhu, and 10 more authors
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2025
  3. AAAI
    cardiff.png
    Yunlong Tang, Gen Zhan, Li Yang, Yiting Liao, and Chenliang Xu
    Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2025
  4. AAAI
    teaser-avicuna.png
    Yunlong Tang, Daiki Shimada, Jing Bi, Mingqian Feng, Hang Hua, and Chenliang Xu
    Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2025
  5. AAAI
    v2xum-llama.png
    Hang Hua*, Yunlong Tang*, Chenliang Xu, and Jiebo Luo
    Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2025

Misc.


Fun Facts
  • 🎸 I'm an ACG lover, J-Pop fan, and cosplayer.
  • 🎨 I started formal painting training at 4.
  • Preferred names:
    • English: Yolo Y. Tang, Yunlong (Yolo) Tang, Yolo Tang.
    • JP/CH: 唐悠たんヨロたきな (Tanyoro Takina), 唐 悠泷奈/悠泷.
    • Evolution:
      • 云龙 (Yunlong) → Yolo (ヨロ、yoro) → 悠泷
      • 云 (yun) → 悠 (ゆう、yuu) → 悠 (ateji:ヨロ)
      • 龙 → 泷 → 泷奈 → たきな (Takina)
"What I cannot create, I do not understand."
—— Richard Feynman