AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs

May 5, 2026·
Lidong Lu
,
Guo Chen
,
Zhu Wei
,
Zhiqi Li
,
Yicheng Liu
Tong Lu
Tong Lu
· 0 min read
Type
Publication
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)