I am Yafu Li, a postdoctoral researcher at the Chinese University of Hong Kong under the supervision of Prof. Yu Cheng. My research focuses on reasoning, trustworthy AI, and multilinguality.
✨ ✨ ✨
We are looking for interns and joint PhD candidates (with THU, PKU, SJTU, FDU, etc.) to work on cutting-edge research in large language models. If you are interested, please feel free to contact me at yafuly@gmail.com.
- Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation (SpecBench) 📄 Paper | 💻 Code
- Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models (MathIF) 📄 Paper | 💻 Code
- Learning to Reason under Off-Policy Guidance (LUFFY) 📄 Paper | 💻 Code
- A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond 📄 Paper | 💻 Code
- Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback (TPO) 📄 Paper | 💻 Code

