Yuelin Hu

Yuelin Hu ( 胡玥麟 )

Master Student

Shanghai Jiao Tong University

I am Yuelin Hu, a Research Master student at Shanghai Jiao Tong University (SJTU), specializing in large language models, post-training alignment, and agent-based systems. My academic training combines a strong theoretical foundation with intensive engineering practice, covering machine learning, deep learning, optimization, and AI systems.

I have conducted research and engineering work at multiple leading industrial research labs, including Xiaohongshu Hi Lab, Ant Group MYbank AI Department, and Microsoft Research Asia (MSRA). My experience spans large-scale supervised fine-tuning (SFT), reinforcement learning for LLMs (GRPO, DPO, PPO variants), data quality pipelines, evaluation infrastructure, and end-to-end model deployment on thousand-GPU clusters.

On the research side, I am the first author of multiple papers submitted to or accepted by top-tier conferences such as AAAI, ACL, ICASSP, and ISCAS. My work focuses on stabilizing and improving LLM alignment through principled training control, credit assignment, and agent-based reasoning, including MCTS-enhanced GRPO, adaptive SFT–RL mixing strategies, and domain-aware web agents.

Overall, I am deeply motivated by building reliable, scalable, and well-evaluated intelligent systems, and I aim to continue contributing to both academic research and real-world LLM applications.

Interests
  • Large Language Models (LLMs)
  • Reinforcement Learning for Alignment
  • Agent Systems and Tool-Use RL
  • Multimodal Intelligence
  • AI Systems and Evaluation
Education
  • M.S. in Electronic Information Engineering (SPEIT), Sep. 2023 ~ Feb. 2026 (expected)

    Shanghai Jiao Tong University

  • Exchange M.S. in Computer Science, One Semester Exchange (QS Top 50 Program)

    École Polytechnique

  • B.S. in Electronic Information Engineering (SPEIT), Sep. 2019 ~ Jun. 2023

    Shanghai Jiao Tong University