Qun Liu, natural language processing, machine translation, pre-trained language models, Chinese information processing

Qun Liu (刘群), Professor, Ph.D.

Affiliation

Huawei Noah's Ark Lab

Position

Chief Scientist of Speech and Language Computing

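Work Experience

  • 2018/07-present: Huawei Noah's Ark Lab, Chief Scientist of Speech and Language Computing
  • 2012/07-2018/06: Dublin City University, Professor
    • 2015/06/30-2015/07/23: Institute of Information Science, Academia Sinica, Taiwan, Visiting Scholar
    • 2015-2018/06: Irish ADAPT Centre, Theme Leader (NLP & MT)
    • 2012/07-2014: Irish CNGL Centre, Theme Leader (NLP & MT)
  • 1992/07-2018/06: Institute of Computing Technology, Chinese Academy of Sciences (part-time from 2012/07), Researcher, Professor (from 2005)
    • 2007/01/01-2007/03/01: New York University, Visiting Scholar
    • 2005/04/01-2005/04/30: Hong Kong Polytechnic University, Visiting Scholar
    • 2004/01/26-2004/02/25: NICT (National Institute of Information and Communications Technology, Japan), Visiting Scholar
    • 2003/11-2018/06: University of Chinese Academy of Sciences (part-time), Professor (from 2005)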

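Education

  • 1999/09-2004/05: Ph.D. (Science), Computer Software, Institute of Computational Linguistics, Peking University
  • 1989/09-1992/07: M.Eng., Computer Applications, Institute of Computing Technology, Chinese Academy of Sciences
  • 1984/09-1989/07: B.Eng., Computer Science and Technology, University of Science and Technology of China

Research Interests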

  • Computational Linguistics
  • Natural Language Processing
  • Large Language Models

Research Topics

My research focuses on the theory, technology, and applications of Natural Language Processing and Machine Translation, including:

  • Morphological Analysis
  • Parsing
  • Semantic Processing
  • Language Modeling
  • Machine Translation
  • Named Entity Recognition and Information Extraction
  • Large Scale Linguistic Resource Construction
  • Evaluation Technologies for NLP & MT
  • Chinese Language Processing
  • Cross-Language/Cross-Standard/Cross-Domain Adaptation for NLP
  • Dialog and Question Answering
  • Language Generation
  • Neural Symbolic Computing

Publications

Please visit the following links for details of my publications:

Code

Talks

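  • 2025/03/26 From ChatGPT to DeepSeek: Current State and Outlook of Large AI Model Technology
    • an invited talk at an AI technology training session, Shanwei
  • 2025/03/26 The Interaction between Artificial Intelligence and Linguistics – a Historical Review and Prospect
    • an invited talk at Symposium on Humanities and Culture, online
  • 2024/11/16 Research Progress in the Evaluation of Large Language Models
    • an invited talk at IMLIP2024 (International Conference on Multilingual Intelligent Information Processing), Beijing
  • 2024/11/02 Neural-Symbolic Computing in the Era of Large Models
    • an invited talk at MLA2024 (22nd Workshop on Machine Learning and Its Applications), Hefei
  • 2024/10/25 From LLM to AGI: What Are We Still Missing?
    • an invited talk at the CNCC 2024 forum "The Evolution Path of Large Models and Superintelligence", Hengdian
  • 2024/07/28 Artificial Intelligence: Wandering Between the Neural and the Symbolic
    • an invited talk at the China National Conference on Computational Linguistics (CCL 2024), Taiyuan
  • 2024/07/06 Forms and Properties of Symbolic Knowledge Representation
    • an invited talk at the CCF Xiuhu Conference "Next-Generation Knowledge Engineering: Memory, Reasoning, and Interpretability", Suzhou
  • 2024/06/30 The Evolving Relationship between Artificial Intelligence and Linguistics: An AI Perspective
    • an invited talk at the Humanities + AI Interdisciplinary Salon, Chaspark (黄大年茶思屋), Fudan University, Shanghai
  • 2024/06/20 An Introduction to Tokenization Techniques for Large Language Models
    • an invited talk at the Large Model Technology Insight Exchange Meeting, Shenzhen
  • 2024/06/06 Core Technologies and Challenges of Huawei's Pangu Large Models
    • an invited talk at the CCF Forum on Large Models, Beijing
  • 2024/05/10 Transformer Plus: A Vision for Future Large AI Model Architectures
    • an invited talk at the Symposium on Artificial Intelligence and Computing Infrastructure, Dongguan
  • 2024/05/08 Beyond Algorithms: Navigating the Data Deluge in AI
    • an invited talk at ICLR 2024 EXPO Talk by Huawei, Vienna, Austria
  • 2024/03/07 Large Language Models - Technology Status, Trends and Impacts
    • an invited talk at Hong Kong Metropolitan University, online
  • 2024/01/20 Reflections on the Current State and Development Trends of Large Language Model Technology
    • an invited talk at the 2023 Juejin Annual Tech Talks, Shenzhen
  • 2023/11/14 Self-improvement and Self-evolving of Large Language Models
    • an invited talk at HKPolyU, Hong Kong
  • 2023/08/16 Large Language Models Integrating Retrieval and Tool Use
    • an invited talk at the Annual Meeting of the Information Retrieval Technical Committee, Chinese Information Processing Society of China, Taiyuan, Shanxi
  • 2023/05/25 Large Language Models: Research and Practice
    • an invited talk at FST Symposium on Science and Technology and Graduation Ceremony, Macau University, Macau
  • 2023/03/24 Research Progress and Outlook for Large Language Models
    • an invited talk at the 2022 Annual Academic Conference of the Chinese Information Processing Society of China, Beijing
  • 2023/02/16 A Technical Analysis of ChatGPT (Video)
    • an invited talk at the Chaspark Forum (黄大年茶思屋), online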
  • 2022/12/11 Rethinking Neural-Symbolic Computing
    • an invited talk at the National Youth Forum on Natural Language Processing for Minority Languages, online
  • 2022/11/27 Integration of Rules into Neural Networks
    • an invited talk at the GAITC 2022 NLP Forum (Global Artificial Intelligence Technology Conference), online
  • 2022/08/21 User Intent Modeling in Dialog Systems
    • an invited talk at CCF Inspiring New Ideas on Dialog Technologies, online
  • 2022/08/10 Making the Most of Language Resources in Language Transfer and Pre-training for Neural Machine Translation
    • an invited talk at the Chinese Conference of Machine Translation (CCMT2022), online
  • 2022/07/20 Large-scale Pre-trained Language Models - Opportunities and Challenges
    • an invited talk at the University of Edinburgh, Edinburgh
  • 2022/06/02 TGEA Datasets and Benchmark Tasks
    • an invited talk at the 2022 BAAI Conference NLP Forum, online
  • 2021/11/28 A Survey of Subword Segmentation Methods in Neural Natural Language Processing
    • an invited talk at the 18th National Symposium on Minority Language Information Processing, online
  • 2021/10/13 Some Thoughts on the Future of Large Pre-trained Models
    • an invited talk at the NLP Forum of the National Conference on Artificial Intelligence (全国人工智能大会), Chengdu, China
  • 2021/09/18 Solving Math Word Problems with Pre-trained Language Models
    • an invited talk at the World Computing Conference, Changsha, China
  • 2021/07/30 Efficient NLP Modeling and Training: Advances in Huawei Noah's Ark Lab
    • an invited talk at the Synced (机器之心) ACL Paper Sharing Meetup, Beijing, China
  • 2021/06/06 Multilingual Pre-trained Language Models: Technology and Applications
    • an invited talk at the Summit Forum on Multilingual Intelligent Information Processing (IMLIP2021), online
  • 2021/06/02 SparTerm: Learning Term-based Sparse Representation
    • an invited talk at the BAAI2021 IR Forum, online
  • 2021/05/15 Benchmarking the Ability of Commonsense Understanding and Reasoning for Pretrained Language Models
    • an invited talk at Chinese Lexical Semantics Workshop (CLSW2021), online
  • 2020/10/31 Research Progress and Trends in Pre-trained Language Models
    • an invited talk at the China National Conference on Computational Linguistics (CCL2020), online
  • 2020/06/18 Research and Practice of Simultaneous Speech Translation in Huawei Noah’s Ark Lab
    • an invited talk at AutoSimTrans Workshop, co-located with ACL 2020, online
  • 2019/11/03 Document-level Machine Translation: the Current State and the Challenges
    • an invited talk at DiscoMT 2019 Workshop, co-located with EMNLP 2019, Hong Kong, China
  • 2019/10/31 Research and Applications of Pre-trained Language Models
    • at the BAAI Conference NLP Forum, Beijing, China
  • 2019/08/24 Deep Learning-Based Natural Language Processing: Where Are the Boundaries?
    • at the 4th Language and Intelligence Summit, Beijing, China
  • 2018/03/21 What has Deep Learning brought to Natural Language Processing?
    • at Deep Learning Meetup Accenture The Dock, Dublin, Ireland
  • 2018/01/29 Neural Machine Translation
    • at DeepHack.Babel, MIPT, Moscow, Russia (online)
  • 2017/10/11 Research and Applications of Machine Translation: Personal Experience
    • at University of Chinese Academy of Sciences, Beijing, China
  • 2016/10/28 Dependency-Based Statistical Machine Translation
    • a tutorial at AMTA 2016, Austin, TX, USA
  • 2016/09/25 The Opportunities of Deep Learning in NLP
    • at ADAPT Industry Showcase, Dublin, Ireland
  • 2016/05/18 Recent Progress in Syntax-based Machine Translation
    • at Nanjing University, Nanjing, China
  • 2016/05/18 A Novel Approach to Dropped Pronoun Translation
    • at Nanjing Normal University, Nanjing, China
  • 2015/07/14 Syntax in Statistical Machine Translation
    • at IIS Academia Sinica, Taipei, Taiwan
  • 2014/08/28 Adaptation for Natural Language Processing
    • at COLING 2014, Dublin, Ireland
  • 2013/04/10 Context-Aware Rule-Selection for SMT
    • at University of Ulster, Northern Ireland
  • 2012/11/5-6 Context-Aware Rule-Selection for SMT
    • at City University of New York (CUNY) and IBM Watson Research Center, New York
  • 2012/11/1 Maximum Rank Correlation Training for SMT
    • at MONOMT Workshop, co-located with AMTA2012, San Diego

Students

Academic Service

  • Journal editorial boards:
    • Transactions of the Association for Computational Linguistics (TACL) (2020-present)
    • Computational Linguistics (Editorial Board Member, 2017-2019)
    • Machine Translation (2011/12-present)
    • ACM Transactions on Asian Language and Information Processing (TALIP) (2010/6-2011/12)
    • Journal of Chinese Information Processing (中文信息学报) (2006-present)
  • Journal reviewing:
    • Computational Linguistics (CL)
    • Transactions of the Association for Computational Linguistics (TACL)
    • ACM Transactions on Asian Language and Information Processing (TALIP)
    • Machine Translation
    • Neural Computing
    • Journal of Artificial Intelligence Research (JAIR)
    • Transactions on Audio, Speech and Language Processing (T-ASL)
    • Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
    • Computer Speech & Language (CSL)
    • 中文信息学报 (Journal of Chinese Information Processing)
    • 计算机学报 (Journal of Computer Science and Technology)
    • 计算机研究与发展 (Journal of Computer Research and Development)
    • 自动化学报 (Acta Automatica Sinica)
  • Conference organization:
    • 2025 NeurIPS (Area Chair)
    • 2025 ICML (Area Chair)
    • 2025 ARR/ACL/EMNLP (Area Chair)
    • 2024 NeurIPS (Area Chair)
    • 2024 ARR/ACL/EMNLP (Area Chair)
    • 2024 ICML (Area Chair)
    • 2023 IJCNLP-AACL (Area Chair)
    • 2023 ICML (Area Chair)
    • 2023 ARR/ACL/EMNLP (Area Chair)
    • 2022 ICLR (Area Chair)
    • 2022 NeurIPS (Area Chair)
    • 2022 COLING (Area Chair)
    • 2022 AutoSimTrans Workshop (Co-organizer)
    • 2022 ICML (Meta-reviewer)
    • 2022 KDD (SPC)
    • 2022 ARR/ACL/EMNLP (Area Chair)
    • 2021 ARR (Area Chair)
    • 2021 EMNLP (Area Chair)
    • 2021 IJCAI (Area Chair)
    • 2021 ICML (Area Chair)
    • 2021 ACL-IJCNLP (Senior Area Chair)
    • 2021 ICLR (Area Chair)
    • 2021 NeurIPS ENLSP Workshop (Co-organizer)
    • 2021 6th Language and Intelligence Summit (Co-chair)
    • 2021 AutoSimTrans Workshop (Co-organizer)
    • 2020 NeurIPS (Area Chair)
    • 2020 AAAI (SPC)
    • 2020 ICLR (Area Chair)
    • 2020 ACL (MT Area Co-chair)
    • 2020 EMNLP (Demonstration Co-chair)
    • 2020 AACL-IJCNLP (Publicity Co-chair)
    • 2020 CNCC Forum on "Pre-trained Language Models: How Much Further Can They Go?" (Chair)
    • 2019 CNCC Forum on "Natural Language Dialogue: Technical Challenges and Application Prospects" (Chair)
    • 2019 IWDP (International Workshop on Discourse Processing) (Co-organizer)
    • 2019 NLPCC (Conference Co-chair)
    • 2018 EMNLP (MT Area Co-chair)
    • 2018 IWDP (International Workshop on Discourse Processing) (Co-organizer)
    • 2018 MLP-MomenT Workshop (Co-chair)
    • 2017 Multi-Lingual Processing (MLP) Workshop (Chair)
    • 2015 DL4MT Winter School (Organizer)
    • 2014 COLING (Tutorial Co-chair)
    • 2013 ACL (Workshop Co-chair)
    • 2011 IUCS (General Chair)
    • 2011 MT SUMMIT (Organization Co-chair)
    • 2010 EMNLP (Area Co-chair)
    • 2010 COLING (Area Co-chair)
    • 2010 CLP (PC Co-chair)
    • 2011 CWMT (Evaluation Organizer)
    • 2009 CWMT (Evaluation Organizer)
    • 2008 CWMT (Evaluation Organizer)
    • 2007 CWMT (Evaluation Organizer)
    • 2006 CWMT (Organizer)
    • 2005 HTDRP Evaluation on Chinese Information Processing and Intelligent Human-machine Interface (Organizer)
    • 2004 HTDRP Evaluation on Chinese Information Processing and Intelligent Human-machine Interface (Organizer)
    • 2003 HTDRP Evaluation on Chinese Information Processing and Intelligent Human-machine Interface (Organizer)
  • Conference reviewing:
    • 2025 MT Summit
    • 2025 COLM
    • 2024 ICLR
    • 2024 COLM
    • 2023 MT Summit
    • 2022 WMT
    • 2022 ICLR
    • 2021 AAAI, NAACL, CCMT, NLPCC, NeurIPS, WMT, CODI
    • 2020 EAMT, EMNLP, CODI, COLING, NLPCC, MT Summit, CCMT, AMTA, AACL, WMT
    • 2019 AAAI, NAACL, ACL, IJCAI, MT Summit, IWSLT
    • 2018 AMTA, NAACL, WILDRE4, IJCAI-ECAI, ACL, COLING, EAMT, NLPCC, WMT, IALP, IWSLT
    • 2017 EACL, EAMT, ACL, WMT, EMNLP, MT Summit, AICS, NLPCC, IJCNLP, IALP, IWSLT, LREC
    • 2016 LREC, NLP4TM, EAMT, IEEE DSC, NAACL, HLT-IA, ACL, WMT, AICS, SLSP, AMTA, EMNLP, IALP, COLING, WAT
    • 2015 ACL-IJCNLP, EAMT, EMNLP, EXPERT, IALP, IJCAI, MT Summit, NAACL-HLT, NLP4TM, NLPCC, RANLP, S2MT, WMT
    • 2014 ACL, AMTA, COLING, EACL, EAMT, EMNLP, IALP, IWSLT, LREC, WMT
    • 2013 ACL, EMNLP, IALP, IJCNLP, MT Summit, NAACL-HLT, PACLIC, SLSP
    • 2012 ACL, AMTA, COLING, NAACL-HLT
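  • Positions in academic organizations:
    • ACL Nomination Committee (Member, 2024, 2025)
    • China Computer Federation Forum on Large Models, CCF-FoLM (Deputy Director, 2024-present)
    • Chinese Association for Artificial Intelligence, Technical Committee on Multilingual Information Processing, CCAI-IMLIP (Deputy Director, 2021-present)
    • ACL SIGHAN (Information Officer of China, 2011/12-2013)
    • ACL (Member, 2007-present)
    • Chinese Information Processing Society of China, CIPSC (Vice President, 2021-present; Council Member, 2016-2021; Executive Council Member, 2011-2016)
    • CIPSC Technical Committee on Computational Linguistics (Member, xxxx-present)
    • CIPSC Technical Committee on Machine Translation (Deputy Director, xxxx-yyyy; Member, yyyy-present)
    • China Computer Federation Technical Committee on Natural Language Processing, CCF-TCNLP (Deputy Director, 2019-present)
    • China Computer Federation Terminology Working Committee (Deputy Director, xxxx-yyyy)
    • China National Committee for Terminology in Science and Technology, Computer Science Subcommittee (Deputy Director, xxxx-yyyy)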

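Awards

  • 2024 AI 2000 Most Influential Scholar Award Honorable Mention in NLP
  • 2023 World Internet Conference Award for Pioneering Science and Technology
  • 2023 IAMT Honor Award
  • 2023 Higher Education Outstanding Scientific Research Output Award (Science and Technology), First Prize in Natural Science
  • 2022 ACL Outstanding Paper Award
  • 2021 ACL Fellow
  • 2019 ACL Best Long Paper Award
  • 2016 CCL NLP-NABD Best Paper Award
  • 2015 State Science and Technology Progress Award, Second Prize
  • 2015 Chinese Institute of Electronics Science and Technology Award, First Prize
  • 2014 CCL NLP-NABD Best Paper Award
  • 2012 Google Research Award
  • 2011 Special Government Allowance
  • 2011 Zhu Li Yuehua Outstanding Teacher Award
  • 2010 Qian Weichang Chinese Information Processing Science and Technology Award, Chinese Information Processing Society of China
  • 2009 Beijing Science and Technology Award, Second Prize
  • 2006 COLING-ACL Meritorious Asian NLP Paper Award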

Email

qun(dot)liu(at)huawei(dot)com

Social Media

Mailing Address

Huawei Tech. Investment Co.,Ltd
Bio-informatics Center
Hong Kong Science Park
Shatin, NT, Hong Kong SAR

Related Links