InternVideo-Next: Towards World-Understanding Video Models
May 5, 2026
Chenting Wang, Yuhan Zhu, Yicheng Xu, Jiange Yang, Ziang Yan, Yali Wang, Yi Wang, Limin Wang
Type: Conference paper
Publication: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Last updated on May 5, 2026
Author profile: Limin Wang, Nanjing University