Keming Wu

吴科明

Ph.D. Student, Tsinghua University

Beijing, China
Email: wukeming0608@gmail.com wukm25@mails.tsinghua.edu.cn

       


About

I am a Ph.D. student at the School of Software at Tsinghua University. I was a research intern at Visual Computing Group, Microsoft Research Asia from September, 2024 to April, 2025. Currently, my research interest include topics on deep generative models and their applications in Computer Vision and Language Models.

I’m currently actively seeking for Research Assistant, or internship positions related to any of the above topics. I’m also open to any possible discussions or collaborate opportunities. please feel free to contact me for further discussion and potential collaboration!

News

  • 2026.01 One paper is released: DeepResearchEval.
  • 2025.11 Two papers are released: OpenMMReasoner, and the other LongVT.
  • 2025.10 One paper is released: Focusing on generative image evaluation.
  • 2025.09 Two papers are released: one focusing on image editing reward model, and the other on generative video evaluation.
  • 2025.08 One paper got accepted by ACM MM 2025 Brave New Ideas Track (Oral).
  • 2025.06 One paper about layout to image generation got accepted by ICCV 2025 (First Author).
  • 2025.05 One paper about multi-layer image generation is released.
  • 2025.02 One paper got accepted by CVPR 2025.
  • 2024.10 One paper about information fusion got accepted by IEEE Transactions on Systems, Man and Cybernetics: Systems (CCF-B journal, First Author).
  • 2024.07 One paper got accepted by ACM MM 2024 (My first CCF-A conference paper, First Author. Congratulations!).
  • 2024.01 One paper got accepted by Information Sciences (CCF-B journal, First Author).

Selected Publications

(* equal contribution)

Multi-modality AIGC & Evaluation

Multi-modality Understanding & Reasoning

Open-Source Projects

LMMs-Engine GitHub repo stars
A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.
Contributor
Project Link: https://github.com/EvolvingLMMs-Lab/lmms-engine
LMMs-Eval GitHub repo stars
Accelerating the development of large multimodal models (LMMs) with lmms-eval. We support most text, image, video and audio tasks..
Contributor
Project Link: https://github.com/EvolvingLMMs-Lab/lmms-eval

Other Publications

(* equal contribution)
  • DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation [PDF] [Code]

    Yibo Wang, Lei Wang, Yue Deng, Keming Wu, Yao Xiao, Huanjin Yao, Liwei Kang, Hai Ye, Yongcheng Jing, Lidong Bing

    Technical Report

  • A Rigorous Benchmark with Multidimensional Evaluation for Deep Research Agents: From Answers to Reports [PDF] [WebPage]

    Yang Yao, Yixu Wang, Yuxuan Zhang, Yi Lu, Tianle Gu, Lingyu Li, Dingyi Zhao, Keming Wu, Haozhe Wang, Ping Nie, Yan Teng, Yingchun Wang

    Technical Report

  • VideoScore2: Think before You Score in Generative Video Evaluation [PDF] [WebPage] [Code]

    Xuan He, Dongfu Jiang, Ping Nie, Minghao Liu, Zhengxuan Jiang, Mingyi Su, Wentao Ma, Junru Lin, Chun Ye, Yi Lu, Keming Wu, Benjamin Schneider, Quy Duc Do, Zhuofeng Li, Yiming Jia + 9 more authors

    Technical Report

  • Physics-Informed Representation Alignment for Sparse Radio-Map Reconstruction [PDF]

    Haozhe Jia, Wenshuo Chen, Zhihui Huang, Lei Wang, Hongru Xiao, Nanqian Jia, Keming Wu, Songning Lai, Bowen Tian, Yutao Yue

    ACM MM 2025 Brave New Ideas Track(Oral)

  • Free-T2M: Frequency Enhanced Text-to-Motion Diffusion Model With Consistency Loss [PDF] [Code]

    Wenshuo Chen, Haozhe Jia, Songning Lai, Keming Wu, Hongru Xiao, Lijie Hu, Yutao Yue

    Technical Report

  • RSC-SNN: Exploring the Trade-off Between Adversarial Robustness and Accuracy in Spiking Neural Networks via Randomized Smoothing Coding [PDF] [Code]

    Keming Wu*, Man Yao*, Yuhong Chou, Xuerui Qiu, Rui Yang, Bo Xu, Guoqi Li

    ACM MM 2024

  • A Fractal-based Complex Belief Entropy for Uncertainty Measure in Complex Evidence Theory [PDF]

    Keming Wu, Fuyuan Xiao, Yi Zhang

    IEEE Transactions on Systems, Man and Cybernetics: Systems 2024

  • A Novel Quantum Belief Entropy for Uncertainty Measure in Complex Evidence Theory [PDF]

    Keming Wu, Fuyuan Xiao

    Information Sciences 2024

Honors & Awards

  • National Scholarship (Three times)

Industrial Experience

Microsoft Research Asia
Visual Computing Group

Research Intern, supervised by Senior Researcher Yuhui Yuan and Principal Research Manager Dong Chen.
Sep. 2024 - Apr. 2025
Topic: Controllable Image Generation & Multi-Layer Image Generation

Education & Visiting

University of Waterloo
TIGER Lab

Research Intern, supervised by Prof. Wenhu Chen.
Apr. 2025 - Current
Topic: Image Editing & Multi-Modal AIGC Evaluation

Tsinghua University
School of Software

Ph.D. student of Software Engineering.
Aug. 2025 - Current
Topic: Multi-modality AIGC & Reasoning

Institute of Automation, Chinese Academy of Sciences
Brain-inspired computing lab

Research Intern, supervised by Prof. Guoqi Li.
Apr. 2023 - Jan. 2024
Topic: Brain-inspired Computing & Spiking Neural Network

Professional Activities

  • Journal Reviewer
  • Conference Reviewer
    • CVPR 2026, ICLR 2026


© Keming Wu | Last updated: 2026/01/16