InternVideo-Next: Towards World-Understanding Video Models

2026年5月5日·

Chenting Wang

,

Yuhan Zhu

,

Yicheng Xu

,

Jiange Yang

,

Ziang Yan

,

Yali Wang

,

Yi Wang

Limin Wang

Limin Wang

· 0 分钟阅读时长

引用 URL

类型

出版物

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

最近更新于 2026年5月5日

Limin Wang

Authors

← DDT: Decoupled Diffusion Transformer 2026年5月5日

Rethinking BCE Loss for Multi-Label Image Recognition with Fine-Tuning 2026年5月5日 →