PEGRL: Improving Machine Translation by Post-Editing Guided Reinforcement Learning May 8, 2026· Yunzhi Shen , Hao Zhou , Xin Huang , Xue Han , Junlan Feng Shujian Huang · 0 min read Cite URL Type Conference paper Publication Findings of the Association for Computational Linguistics: ACL 2026 Last updated on May 8, 2026 ← How Do Answer Tokens Read Reasoning Traces? Self-Reading Patterns in Thinking LLMs for Quantitative Reasoning May 8, 2026 Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers May 8, 2026 →