Task preference optimization: improving multimodal large language models with vision task alignment Jan 1, 2025· Ziang Yan , Zhilin Li , Yinan He , Chenting Wang , Kunchang Li , Xinhao Li , Xiangyu Zeng , Zilei Wang , Yali Wang , Yu Qiao · 0 min read Cite URL Type Conference paper Publication Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Last updated on Jan 1, 2025 ← Steady progress beats stagnation: mutual aid of foundation and conventional models in mixed domain semi-supervised medical image segmentation Jan 1, 2025 Taste more, taste better: diverse data and strong model boost semi-supervised crowd counting Jan 1, 2025 →