Long-Context Attention Benchmark: From Kernel Efficiency to Distributed Context Parallelism Apr 24, 2026· Tao Bu , Qiangang Wang , Bowen Zeng , Hanwen Sun , Yunpeng Huang , Chun Cao Jingwei Xu · 0 min read Cite URL Type Conference paper Publication The Fourteenth International Conference on Learning Representations Last updated on Apr 24, 2026 Authors Jingwei Xu Nanjing University School of Computer Science ← DuPO: Enabling Reliable Self-Verification via Dual Preference Optimization Apr 24, 2026 PixNerd: Pixel Neural Field Diffusion Apr 24, 2026 →