Achieving Human Parity on Automatic Chinese to English News Translation
Technical Report 2018
LLM Post-Training · Reasoning · Multilingual · Multimodal
Zhirui Zhang
Researcher and builder across large-model systems, translation, multimodal capability, and deployable intelligence.

Most recently: AI Technical Advisor and entrepreneurial partner at IDEA Research.
Profile
I focus on post-training and reasoning models for large language models, while continuing to build on a long research track in multilingual NLP, machine translation, speech translation, and dialogue systems. Across industry labs and product teams, I have worked on frontier-scale pretraining, general post-training, reasoning-oriented optimization, translation and multilingual capability building, and practical deployment in user-facing systems.
I received my Ph.D. from the University of Science and Technology of China through a joint training program with Microsoft Research Asia. My recent work bridges academic research and real-world model development, with a sustained interest in reliable, iterative, and deployable large-model systems.
Current Focus
LLM Post-Training Reasoning Models Multilingual Modeling Machine Translation Multimodal Capability Building Production Model Systems
Selected Work
Technical Report 2018
EMNLP 2023
ACL-IJCNLP 2021
AAAI 2019
NeurIPS 2020
ICLR 2023
arXiv 2026
NeurIPS 2025
ICML 2025
arXiv 2025
ACL 2024 Findings
Experience
2025.12 - 2026.3
AI Technical Advisor and entrepreneurial partner working on model-and-tool systems for MoonBit and more reliable code intelligence.
2024.3 - 2025.9
Technical expert leading general post-training, reasoning-model exploration, multilingual capability building, and practical large-model deployment.
2023.11 - 2024.3
Algorithm expert contributing to trillion-parameter MoE pretraining, FP8 language-model exploration, and long-context capability validation.
2021.9 - 2023.11
Senior researcher building translation training platforms, interactive translation models, personalized MT, and multilingual research systems.
2019.7 - 2021.8
Algorithm expert for multilingual translation, speech translation, automated training pipelines, and commercial translation services.
2015.7 - 2019.6
Research intern across MSRA and Redmond, working on neural machine translation, dialogue systems, and controllable text generation.
Service & Recognition
Long-term reviewer or program committee member for ACL, EMNLP, NAACL, AAAI, IJCAI, NeurIPS, ICML, and ICLR, with prior service as an ACL area chair.
Writing / Notes
I am setting up a lightweight blog for notes on post-training, translation, multimodal systems, and practical lessons from building deployable model stacks.
Models Translation Multimodal Systems
Contact