Pengcheng Wang

Do research trivial in hindsight, elegant in insight.


2505 Hearst Ave

Berkeley, CA 94720

I am a Ph.D. student in the Mechanical Systems Control Lab at UC Berkeley, advised by Prof. Masayoshi Tomizuka (Member of the National Academy of Engineering). Prior to that, I received dual bachelor’s degrees in Mechanics and Aerospace from Tsinghua University. During my undergraduate studies, I worked with Prof. Shengbo Eben Li.

I work on transferable and scalable reinforcement learning for robotics. Specifically, I care about how RL agents can make effective use of scaled-up model size, data, and training, and how they can generalize efficiently across different embodiments, tasks, and dynamics.

Outside of research, I love playing large-scale strategy games such as Stellaris, Civilization VI, and Victoria 3. I’m also passionate about singing and served as one of the tenor leads in the Choir of Tsinghua University. Check out my favorite performance!

Email: wangpc [AT] berkeley.edu

Feel free to contact me for research collaborations or just to chat!


news

Apr 30, 2026 DADP, REAR, and Mind Your Entropy have been accepted to ICML 2026!
Apr 27, 2026 DiscreteRTC just came out. Check out this trivial but cute idea!
Jan 25, 2026 MVP has been accepted to ICLR 2026 as an Oral! TD-M(PC)² has been accepted to L4DC 2026!

selected publications

  1. arXiv 2026
    DiscreteRTC: Discrete Diffusion Policies are Natural Asynchronous Executors
    Pengcheng Wang, Kaiwen Hong, Chensheng Peng, and 4 more authors
    arXiv preprint
  2. ICML 2026
    DADP: Domain Adaptive Diffusion Policy
    Pengcheng Wang, Qinghang Liu, Haotian Lin, and 4 more authors
    ICML 2026
  3. ICLR 2025
    Residual-MPPI: Online Policy Customization for Continuous Control
    Pengcheng Wang, Chenran Li, Catherine Weaver, and 4 more authors
    ICLR 2025
  4. ICRA 2025
    Residual Policy Gradient: A Reward View of KL-regularized Objective
    Pengcheng Wang, Xinghao Zhu, Yuxin Chen, and 3 more authors
    ICRA Safe-VLM Workshop (Spotlight)
  5. L4DC 2026
    TD-M(PC)²: Improving Temporal Difference MPC Through Policy Constraint
    Haotian Lin, Pengcheng Wang, Jeff Schneider, and 1 more author
    L4DC 2026