Yuelin Hu | SJTU Media Lab

Yuelin Hu ( 胡玥麟 )

Master Student

Shanghai Jiao Tong University

I am Yuelin Hu, a Research Master student at Shanghai Jiao Tong University (SJTU), specializing in large language models, post-training alignment, and agent-based systems. My academic training combines a strong theoretical foundation with intensive engineering practice, covering machine learning, deep learning, optimization, and AI systems.

I have conducted research and engineering work at multiple leading industrial research labs, including Xiaohongshu Hi Lab, Ant Group MYbank AI Department, and Microsoft Research Asia (MSRA). My experience spans large-scale supervised fine-tuning (SFT), reinforcement learning for LLMs (GRPO, DPO, PPO variants), data quality pipelines, evaluation infrastructure, and end-to-end model deployment on thousand-GPU clusters.

On the research side, I am the first author of multiple papers submitted to or accepted by top-tier conferences such as AAAI, ACL, ICASSP, and ISCAS. My work focuses on stabilizing and improving LLM alignment through principled training control, credit assignment, and agent-based reasoning, including MCTS-enhanced GRPO, adaptive SFT–RL mixing strategies, and domain-aware web agents.

Overall, I am deeply motivated by building reliable, scalable, and well-evaluated intelligent systems, and I aim to continue contributing to both academic research and real-world LLM applications.

Interests

Large Language Models (LLMs)
Reinforcement Learning for Alignment
Agent Systems and Tool-Use RL
Multimodal Intelligence
AI Systems and Evaluation

Education

M.S. in Electronic Information Engineering (SPEIT), Sep. 2023 ~ Feb. 2026 (expected)

Shanghai Jiao Tong University
Exchange M.S. in Computer Science, One Semester Exchange (QS Top 50 Program)

École Polytechnique
B.S. in Electronic Information Engineering (SPEIT), Sep. 2019 ~ Jun. 2023

Shanghai Jiao Tong University