AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs

2026年5月5日·
Lidong Lu
,
Guo Chen
,
Zhu Wei
,
Zhiqi Li
,
Yicheng Liu
Tong Lu
Tong Lu
· 0 分钟阅读时长
类型
出版物
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)