I’m currently a master student at Fudan University at the School of Computer Science of Fudan University. I am a member of Fudan NLP Lab, advised by Prof. Xuanjing Huang (黄萱菁) and Associate Prof. Tao Gui(桂 韬). I got my bachelor’s degree from Tongji University, advised by Associate Prof. Dawei Cheng.
My research interest mainly lies in Multimodal reasoning, LLM Agent, Reinforcement Learning and Efficient AI.
🔥 News
- 2026.1: 🎉🎉 Our paper Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization is accepted by ICLR 2026! See you in Brazil🇧🇷!
- 2025.12: 🎉🎉 Our Agent memory survey Memory in the Age of AI Agents is released on arXiv!
- 2025.5: 🎉🎉 Our paper Masrouter: Learning to route llms for multi-agent systems is accepted by ACL 2025 Main!
💻 Internships
2025.4 - now
Bytedance, Data-Capcut AIGC research
Shanghai, China
📝 Selected Publications
Arxiv 2025

Memory in the Age of AI Agents: A Survey
Yuyang Hu†, Shichun Liu†, Yanwei Yue†, Guibin Zhang†, Boyang Liu,etc.
ICLR 2026

Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization
Boyang Liu†, Yifan Hu†, Senjie Jin†, Shihan Dou, Gonglei Shi, Jie Shao, Tao Gui, Xuanjing Huang
- We propose Aes-R1, a comprehensive aesthetic reasoning framework with reinforcement learning (RL). Concretely, Aes-R1 integrates a pipeline, AesCoT, to construct and filter high-quality chain-of-thought aesthetic reasoning data used for cold-start. After teaching the model to generate structured explanations prior to scoring, we then employ the Relative-Absolute Policy Optimization (RAPO), a novel RL algorithm that jointly optimizes absolute score regression and relative ranking order, improving both per-image accuracy and cross-image preference judgments. Aes-R1 enables MLLMs to generate grounded explanations alongside faithful scores, thereby enhancing aesthetic scoring and reasoning in a unified framework.
|
ACL 2025 Main

Masrouter: Learning to route llms for multi-agent systems
Yanwei Yue†, Guibin Zhang†, Boyang Liu†, Guancheng Wan, Kun Wang, Dawei Cheng, Yiyan Qi
- We first introduce the problem of Multi-Agent System Routing(MASR), which integrates all components ofMAS into a unified routing framework. Toward this goal, we propose MasRouter, the first high-performing, cost-effective, and inductive MASR solution. MasRouter employs collaboration mode determination, role allocation, and LLM routing through a cascaded controller network, progressively constructing a MAS that balances effectiveness and efficiency.
|
📖 Experience
- 2021.9 - 2025.6, B.E. at Tongji University with a major in Data Science and Big Data Technology.