Xiangyu Liu
2104 Brendan Iribe Center
College Park, Maryland 20740
I am a fifth-year PhD student in Computer Science at the University of Maryland, College Park (UMD) since 2021, working under the guidance of Prof. Kaiqing Zhang, and an in-coming research scientist at Google Research, NYC. I have also had the privilege of working with Prof. Furong Huang. Prior to this, I completed my undergraduate studies in Computer Science at Shanghai Jiao Tong University (SJTU) from 2017 to 2021, where I conducted my bachelor thesis research with Prof. Ying Wen.
My research focuses on the foundational aspects of (multi-agent) reinforcement learning (RL), particularly on game-theoretical/strategic (NeurIPS 2021, ICML 2023) and partially observable settings (NeurIPS 2024), as well as their applications on adversarially robust RL (ICLR 2024).
More recently, I have expanded my research to explore (multi-)large language model (LLM) agents interactions (ICLR 2025), leveraging techniques from game theory and online learning.
In the past, I completed research internships at Google Research, Market Algorithms Team, where I worked with Zhe Feng, Aranyak Mehta, and Di Wang from Jun. 2025 to Dec. 2025, as well as Bloomberg AI Research in the summer of 2022.
News
| Feb 26, 2026 | My internship paper at Google Research is available on arxiv, where we have explored scaling inference-time computation via opponent simulation for LLM-based strategic reasoning. |
|---|---|
| May 27, 2025 | Joined Google Research, Mountain View as a research intern, working on LLMs and multi-agents. |
| Apr 18, 2025 | Gave an invited talk at UVA RL Meetup. |
| Jan 22, 2025 | One paper accepted to ICLR 2025 on LLM agents. |
| Sep 25, 2024 | One paper accepted to NeurIPS 2024 on partially observable RL with privileged information. |
| Mar 24, 2024 | Gave a talk at 2024 INFORMS Optimization Society Conference (IOS 2024) on partially obseravble multi-agent RL at Houston, Texas. |
| Jan 16, 2024 | Three papers accepted to ICLR 2024 including one spotlight. |
Selected publications
* denotes equal contribution, † denotes alphabetical order.
- ICML 2023Partially Observable Multi-agent RL with (Quasi-)Efficiency: The Blessing of Information SharingIn International Conference on Machine Learning, 2023
Extended version accepted to SIAM Journal on Control and Optimization (SICON) - NeurIPS 2024Provable Partially Observable Reinforcement Learning with Privileged InformationIn The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
A shorter version also presented at ICML 2024 ARLET workshop. - ICLR 2024Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RLIn The Twelfth International Conference on Learning Representations, 2024
A preliminary version is selected at NeurIPS 2022 Trustworthy and Socially Responsible Machine Learning (TSRML) workshop as an outstanding paper - ICLR 2025Do LLM Agents Have Regret? A Case Study in Online Learning and GamesIn The Thirteenth International Conference on Learning Representations, 2025
- CDC 2025Principled Learning-to-Communicate with Quasi-Classical Information StructuresIn 2025 IEEE 64th Conference on Decision and Control (CDC), 2025
- Axiv 2026Scaling Inference-Time Computation via Opponent Simulation: Enabling Online Strategic Adaptation in Repeated NegotiationarXiv preprint arXiv:2602.19309, 2026
A short version accepted to ICLR 2026 Multi-Agent Learning and Generative AI workshop.
Talks
- Invited Talk at the UVA RL meetup on partially observable RL with privileged information, 2025
- Talk at the 2024 INFORMS Optimization Society Conference (IOS 2024) on partially observable multi-agent RL, Houston, Texas, 2024
- Contributed talk at the TSRML workshop of NeurIPS 2022 on adversarial policies in competitive games, 2022
- Talk at RLChina on unifying diversity in open-ended learning for zero-sum games, China, 2021
Awards
- Outstanding Paper Award, Trustworthy and Socially Responsible Machine Learning (TSRML) workshop, NeurIPS 2022
- Dean's Fellowship, UMD, 2021
- National Scholarship, China, 2018 & 2019
Service
- Reviewer for NeurIPS 2024-2025, ICLR 2025-2026, ICML 2025, AISTATS 2025-2026, UAI 2024-2025, AAMAS 2025-2026, CDC 2025, L4DC 2026
- Graduate lecturer at summer AI camps for K12 students (2023, 2024), teaching multi-agent RL.