My name is Jinguo Zhu (祝金国), now is a technical staff at Moonshot AI. Prior to this, I worked as a Young Researcher at Shanghai AI Lab, and I also spent time at CVC Group, Tencent AI Lab. During my Ph.D. studying years, I had a long and enriching journey of internships, especially at SenseTime Research, where I embarked on my path in AI research (thanks to Prof. Jifeng Dai and Dr. Xizhou Zhu for their guidance and mentorship).
My research focuses on the training, evaluation, and data construction of large multimodal models. I have been a core contributor to several major projects, including InternVL Series at Shanghai AI Lab and SEED-X at Tencent AI Lab. Before the current wave of large language models, I authored the Uni-Perceiver Series of work on multimodal generalist models. I consider myself experienced in this field.
I received both my B.S. and Ph.D. degrees in Electrical Engineering from Xi’an Jiaotong University. Though my major was not directly related to AI—it mainly dealt with high-voltage equipment and its maintenance in power systems—I have always aspired to apply AI to these industrial domains, aiming to promote AI for Engineering. I completed my Ph.D. in 2024.09 under the supervision of Prof. Mingzhe Rong and Prof. Xiaohua Wang, after earning my bachelor’s degree in 2019.06.
I am always open to collaboration—whether as a research partner, internship, or in any other form. Please feel free to reach out to me at lechatelia@gmail.com.
🔥 News
- 2025.04 🎉🎉 We released VisuLogic, a benchmark for evaluating the reasoning ability of multimodal models.
- 2025.04: 🎉🎉 Our team released InternVL-3.0.
📝 Publications

Xizhou Zhu, Jinguo Zhu, Hao Li, Xiaoshi Wu, Hongsheng Li, Xiaohua Wang, Jifeng Dai
- Uni-Perceiver is a generalist model for vision and language tasks.
CVPR 2023
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks., Hao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai.NeurIPS 2022
Uni-Perceiver-MoE: Learning sparse generalist models with conditional moes, Jinguo Zhu, Xizhou Zhu, Wenhai Wang, Xiaohua Wang, Hongsheng Li, Xiaogang Wang, Jifeng Dai.CVPR 2021
Complementary Relation Contrastive Distillation, Jinguo Zhu, Shixiang Tang, Dapeng Chen, et al.
🎖 Honors and Awards
- 2018.07 National First Prize in National Undergraduate Electronic Design Contest.
- 2017.07 National First Prize in National Undergraduate Robot Contest (ABU Robocon).
- 2016.12 National First Prize in Mathematics competition of Chinese College Students.
📖 Educations
- 2019.09 - 2024.09, Ph.D., Xi’an Jiaotong University, China.
- 2015.09 - 2019.06, Undergraduate, Xi’an Jiaotong University, China.
💻 Internships
- 2023.06 - 2024.05, AI Lab, Tencent.
- 2021.02 - 2023.05, SenseTime Research & Shanghai AI Laboratory.
- 2020.06 - 2021.02, SCG, SenseTime.