news | Tiezheng Yu

Jan 08, 2026	We release our survey on Agent-as-a-Judge.
Dec 19, 2025	Our paper “The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs” is accepted in Transactions on Machine Learning Research (TMLR).