Yuzhe Gu 顾宇喆PhD StudentLarge Model Center, Shanghai AI Laboratory
|
![]() |
I am a PhD student at Shanghai Jiao Tong University, in the joint program at Shanghai AI Laboratory, advised by Wenwei Zhang and Kai Chen. Before that, I received the bachelor degree at Wuhan University in 2024.
My research interests lie primarily in the area of Large Language Model (LLM). I'm focusing on improving the reasoning and knowledge capabilities of LLMs. I also have experience about the reducing hallucination in LLMs, including the annotation, detection and mitigation of hallucinations.
Discussions and cooperations are welcomed!
![]() |
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
Chengqi Lyu*, Songyang Gao*, Yuzhe Gu*, Wenwei Zhang*, Jianfei Gao, Kuikun Liu, Ziyi Wang, Shuaibin Li, Qian Zhao, Haian Huang, Weihan Cao, Jiangning Liu, Hongwei Liu, Junnan Liu, Songyang Zhang, Dahua Lin, Kai Chen The 2nd Conference on Language Modeling (COLM 2025) [Paper] [Code] [Project] |
![]() |
Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs
Yuzhe Gu, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen The Thirteenth International Conference on Learning Representations (ICLR 2025) [Paper] [Code] |
![]() |
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models
Yuzhe Gu*, Ziwei Ji*, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024) [Paper] [Code] [Project] |
![]() |
ANAH: Analytical Annotation of Hallucinations in Large Language Models
Ziwei Ji*, Yuzhe Gu*, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) [Paper] [Code] [Project] |
![]() |
One more set: Mitigating conflict-based cache side-channel attacks by extending cache set
Yuzhe Gu, Ming Tang, Quancheng Wang, Han Wang, Haili Ding Journal of Systems Architecture (JSA) [Paper] |
![]() |
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward
Shudong Liu, Hongwei Liu, Junnan Liu, Linchen Xiao, Songyang Gao, Chengqi Lyu, Yuzhe Gu, Wenwei Zhang, Derek F. Wong, Songyang Zhang, Kai Chen The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025) [Paper] [Project] |
![]() |
Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning
Junhao Shen, Haiteng Zhao, Yuzhe Gu, Songyang Gao, Kuikun Liu, Haian Huang, Jianfei Gao, Dahua Lin, Wenwei Zhang, Kai Chen preprint [Paper] |
![]() |
The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner
Zhouqi Hua, Wenwei Zhang, Chengqi Lyu, Yuzhe Gu, Songyang Gao, Kuikun Liu, Kai Chen preprint [Paper] |
![]() |
BackCache: Mitigating contention-based cache timing attacks by hiding cache line evictions
Quancheng Wang, Xige Zhang, Han Wang, Yuzhe Gu, Ming Tang preprint [Paper] |
![]() |
Redeem myself: Purifying backdoors in deep learning models using self attention distillation
Xueluan Gong, Yanjiao Chen, Wang Yang, Qian Wang, Yuzhe Gu, Huayang Huang, Chao Shen 44th IEEE Symposium on Security and Privacy (Oakland 2023) [Paper] |