Xiangyu Zeng,
Kunchang Li,
Chenting Wang,
Xinhao Li,
Tianxiang Jiang,
Ziang Yan,
Songze Li,
Yansong Shi,
Zhengrong Yue,
Yi Wang,
Yali Wang,
Yu Qiao,
Limin Wang
(2025).
TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning.
The Thirteenth International Conference on Learning Representations.