Yuzhe Gu 顾宇喆

PhD Student

Large Model Center, Shanghai AI Laboratory
School of Electronic Information and Electrical Engineering,
Shanghai Jiao Tong University

Email: guyuzhe@pjlab.org.cn, guyuzhe1116@sjtu.edu.cn;
Google Scholar: Google Scholar Link; Github: Github Link

Biography

I am a PhD student at Shanghai Jiao Tong University, in the joint program at Shanghai AI Laboratory, advised by Wenwei Zhang and Kai Chen. Before that, I received the bachelor degree at Wuhan University in 2024.

My research interests lie primarily in the area of Large Language Model (LLM). I'm focusing on improving the reasoning and knowledge capabilities of LLMs. I also have experience about the reducing hallucination in LLMs, including the annotation, detection and mitigation of hallucinations.

Discussions and cooperations are welcomed!

News

[2026.07] Our paper Intern-S1-MO is accepted by COLM 2026.
[2026.05] We release Intern-S2-Preview, an efficient 35B scientific multimodal foundation model.
[2026.04] Our paper ThoughtFold is accepted by ICML 2026.
[2026.02] We release Intern-S1-Pro, a trillion-scale MoE multimodal scientific reasoning model.
[2025.12] Our system Intern-S1-MO has achieved gold medal in the China Mathematical Olympiad (CMO) 2025.
[2025.07] We release Intern-S1, an advanced open-source scientific multimodal reasoning model.
[2025.07] Our paper OREAL is accepted by COLM 2025.
[2025.01] Our paper Mask-DPO is accepted by ICLR 2025.
[2024.12] We release InternThinker, a powerful reasoning model.
[2024.09] Our paper ANAH-v2 is accepted by NeurIPS 2024.
[2024.05] Our paper ANAH is accepted by ACL 2024.

Projects

Intern-S2-Preview: an efficient 35B scientific multimodal foundation model.
Intern-S1-Pro: a trillion-scale MoE multimodal scientific reasoning model.
Intern-S1-MO: a multi-agent system for olympiad-level mathematical problem solving.
Intern-S1: an advanced open-source scientific multimodal reasoning model.
InternVL3.5: an advanced open-source multimodal model.
InternThinker: a powerful reasoning model.
InternLM: state-of-the-art open-source LLMs varying from 7B to 123B.
Xtuner: a next-generation training engine built for ultra-Large MoE models.
Lagent: a lightweight open-source framework that allows users to efficiently build LLM-based agents.
OpenCompass: a one-stop platform for large model evaluation.

Awards

Gold Medal in the China Mathematical Olympiad (CMO), 2025
Yanbao Scholarship of Shanghai Jiao Tong University (1/197), 2025
Outstanding Undergraduate of Wuhan University, 2024
Lei Jun Excellence Scholarship of Wuhan University, 2024 (Top 0.1% of WHU, 10w RMB)
National First Prize in the China Undergraduate Mathematical Contest in Modeling, 2023
First Prize Excellence Scholarship of Wuhan University, 2021, 2022, 2023

Academic Service

Reviewer for: ICML2025-2026, NeurIPS2025, ICLR2026, COLM2025-2026, CVPR2026, ECCV2026.

Selected Publications

* denotes equal contribution.

Technical Reports

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Intern-S1: A Scientific Multimodal Foundation Model
InternLM2 Technical Report

(Co-) First author Papers

	Intern-S1-MO: Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving Songyang Gao, Yuzhe Gu, Zijian Wu, Lingkai Kong, Wenwei Zhang, Zhongrui Cai, Fan Zheng, Tianyou Ma, Junhao Shen, Haiteng Zhao, Duanyang Zhang, Huilun Zhang, Kuikun Liu, Chengqi Lyu, Yanhui Duan, Chiyu Chen, Ningsheng Ma, Jianfei Gao, Han Lyu, Dahua Lin, Kai Chen The 3rd Conference on Language Modeling (COLM 2026)* [Paper] [Project]
	ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning Ziyan Liu, Xueda Shen, Yuzhe Gu, Songyang Gao, Kuikun Liu, Guangran Cheng, Chengqi Lyu, Dahua Lin, Wenwei Zhang, Kai Chen Forty-third International Conference on Machine Learning (ICML 2026)* [Paper] [Code]
	Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Chengqi Lyu, Songyang Gao, Yuzhe Gu, Wenwei Zhang, Jianfei Gao, Kuikun Liu, Ziyi Wang, Shuaibin Li, Qian Zhao, Haian Huang, Weihan Cao, Jiangning Liu, Hongwei Liu, Junnan Liu, Songyang Zhang, Dahua Lin, Kai Chen The 2nd Conference on Language Modeling (COLM 2025) [Paper] [Code] [Project]
	Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs Yuzhe Gu, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen The Thirteenth International Conference on Learning Representations (ICLR 2025) [Paper] [Code]
	ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models Yuzhe Gu, Ziwei Ji, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024) [Paper] [Code] [Project]
	ANAH: Analytical Annotation of Hallucinations in Large Language Models Ziwei Ji, Yuzhe Gu, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) [Paper] [Code] [Project]
	One more set: Mitigating conflict-based cache side-channel attacks by extending cache set Yuzhe Gu, Ming Tang, Quancheng Wang, Han Wang, Haili Ding Journal of Systems Architecture (JSA) [Paper]

Co-author Papers

	Exploring Visual Pretraining for Learning Language Intelligence Zhonghan Zhao, Yiming Zhang, Wenwei Zhang, Haiteng Zhao, Xingguang Wei, Zhangwei Gao, Kuikun Liu, Yuzhe Gu, Size Wu, Haian Huang, Jianfei Gao, Haijun Lv, Demin Song, Yunhua Zhou, Qipeng Guo, Gaoang Wang, Kai Chen Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2026) [Paper]
	MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization Xiangyu Zhao, Junming Lin, Tianhao Liang, Yifan Zhou, Wenhao Chai, Yuzhe Gu, Weiyun Wang, Kai Chen, Gen Luo, Wenwei Zhang, Junchi Yan, Hua Yang, Haodong Duan, Xue Yang The Fourteenth International Conference on Learning Representations (ICLR 2026) [Paper] [Project]
	The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner Zhouqi Hua, Wenwei Zhang, Chengqi Lyu, Yuzhe Gu, Songyang Gao, Kuikun Liu, Kai Chen The Fourteenth International Conference on Learning Representations (ICLR 2026) [Paper]
	CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward Shudong Liu, Hongwei Liu, Junnan Liu, Linchen Xiao, Songyang Gao, Chengqi Lyu, Yuzhe Gu, Wenwei Zhang, Derek F. Wong, Songyang Zhang, Kai Chen The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025) [Paper] [Project]
	Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning Junhao Shen, Haiteng Zhao, Yuzhe Gu, Songyang Gao, Kuikun Liu, Haian Huang, Jianfei Gao, Dahua Lin, Wenwei Zhang, Kai Chen The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025) [Paper]
	Redeem myself: Purifying backdoors in deep learning models using self attention distillation Xueluan Gong, Yanjiao Chen, Wang Yang, Qian Wang, Yuzhe Gu, Huayang Huang, Chao Shen 44th IEEE Symposium on Security and Privacy (Oakland 2023) [Paper]

	Intern-S1-MO: Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving Songyang Gao, Yuzhe Gu, Zijian Wu, Lingkai Kong, Wenwei Zhang, Zhongrui Cai, Fan Zheng, Tianyou Ma, Junhao Shen, Haiteng Zhao, Duanyang Zhang, Huilun Zhang, Kuikun Liu, Chengqi Lyu, Yanhui Duan, Chiyu Chen, Ningsheng Ma, Jianfei Gao, Han Lyu, Dahua Lin, Kai Chen The 3rd Conference on Language Modeling (COLM 2026)* [Paper] [Project]
	ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning Ziyan Liu, Xueda Shen, Yuzhe Gu, Songyang Gao, Kuikun Liu, Guangran Cheng, Chengqi Lyu, Dahua Lin, Wenwei Zhang, Kai Chen Forty-third International Conference on Machine Learning (ICML 2026)* [Paper] [Code]
	Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Chengqi Lyu, Songyang Gao, Yuzhe Gu, Wenwei Zhang, Jianfei Gao, Kuikun Liu, Ziyi Wang, Shuaibin Li, Qian Zhao, Haian Huang, Weihan Cao, Jiangning Liu, Hongwei Liu, Junnan Liu, Songyang Zhang, Dahua Lin, Kai Chen The 2nd Conference on Language Modeling (COLM 2025) [Paper] [Code] [Project]
	Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs Yuzhe Gu, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen The Thirteenth International Conference on Learning Representations (ICLR 2025) [Paper] [Code]
	ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models Yuzhe Gu, Ziwei Ji, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024) [Paper] [Code] [Project]
	ANAH: Analytical Annotation of Hallucinations in Large Language Models Ziwei Ji, Yuzhe Gu, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) [Paper] [Code] [Project]
	One more set: Mitigating conflict-based cache side-channel attacks by extending cache set Yuzhe Gu, Ming Tang, Quancheng Wang, Han Wang, Haili Ding Journal of Systems Architecture (JSA) [Paper]