Qiang (Jony) ZHANG jonyzhang2023

🧑‍🔬 About Me

Chair of the Research Committee & Chief Researcher at the China National Innovation Center of Embodied AI Robotics
Deeply collaborating with Prof. Shaoqing Ren at USTC

🤖 Currently leading research on Humanoid Robots — building the TianGong Humanoid Robot Platform
🏛️ Previously contributed to "Tianhe 天河" Supercomputing projects and worked at DJI
🎓 Welcome to visit and apply to our collaborative program at USTC
📚 Check out my Google Scholar for recent publications
🤝 Open to collaborations in Humanoid Robots, Embodied AI, RL, Vision Perception, LLM, Control & Planning

🤖 Featured Humanoid Projects

_{China Universal Humanoid Robot Platform}

_{PNDbotics Adam}

_{Fourier GR1}

🗞️ News & Highlights

🏅 2025-08: We won the 100m championship, 400m 2nd & 3rd, 1500m 2nd, 4×100m 2nd, material organization championship, and material handling 2nd at WHR 2025 — the first World Robotics Conference!

🏃‍♂️ 2025-04-19: The Tiangong humanoid robot made history — it successfully completed a half-marathon!

🔽 Click to expand full news timeline

Date	Event
2025-11-29	Our work SPO adopted as baseline RL algorithm by PI0.6
2025-07-29	Released Humanoid Occupancy — Generalized Multimodal Perception Module
2025-07-09	Released TienKung Marathon Control Framework
2025-05-24	Published an article in People's Daily
2025-05-08	Featured on the "Innovation China" TV program
2025-04-24	Co-hosted embodied intelligent robots seminar with Peking University
2025-04-10	Interviewed by "Innovation China" column of China Association for Science and Technology
2025-03-29	Invited talk at CEAI 2025
2025-03-06	Joined the CNR Finance Jin Ding Think Tank
2025-01-08	Participated in Beijing City's long-term robot planning and technical roadmap
2024-12-30	Invited by National Health Commission as expert reviewer for AI medical projects
2024-11-25	Interviewed by Global Times on AI for football & robot marathons
2024-11-16	Keynote on "Embodied Intelligence of Humanoid Robots" at AGIROS Conference, Chinese Academy of Sciences
2024-11-09	Intel China Academic Talent Program keynote on "Research on Embodied AI of Humanoid Robots"
2024-11-05	Attended BAAI "ZhiYuan Forum — Embodiment and World Model Summit"
2024-09-30	Talk at Peking University on Multimodal Perception & Large Model Decision-Making for Humanoid Robots
2024-07-29	Interviewed by Mango (Hunan) TV on "Embodied AI and Humanoid Robots Tian Gong"
2024-07-05	Invited by China Internet Research Institute to draft embodied intelligence white paper
2024-07-01	Invited talk at XMech, Zhejiang University
2024-05-29	Interviewed by Beijing Association for Science and Technology
2024-05-09	Featured on CCTV — TianGong continuous iteration

📝 Selected Publications

I'm currently exploring Embodied AI, RL, Vision Perception, LLM, Control & Planning in Robotics.

🔬 Pre-prints

🔽 Click to expand pre-prints

Title	Link
MeshMimic: Geometry-Aware Humanoid Motion Learning through 3D Scene Reconstruction	🔗
HAIC: Humanoid Agile Object Interaction Control via Dynamics-Aware World Model	🔗
RoboStriker: Hierarchical Decision-Making for Autonomous Humanoid Boxing	🔗
MVISTA-4D: View-Consistent 4D World Model with Test-Time Action Inference	🔗
DPL: Depth-only Perceptive Humanoid Locomotion via Realistic Depth Synthesis	🔗
PoseDiff: A Unified Diffusion Model Bridging Robot Pose Estimation and Video-to-Action Control	🔗
EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation	🔗
HumanoidVerse: A Versatile Humanoid for Vision-Language Guided Multi-Object Rearrangement	🔗
LOVON: Legged Open-Vocabulary Object Navigator	🔗
Survival Games: Human-LLM Strategic Showdowns under Severe Resource Scarcity	🔗
Occupancy World Model for Robots	🔗
RoboOcc: Enhancing the Geometric and Semantic Scene Understanding for Robots	🔗
The Meta-Representation Hypothesis	🔗
EmbodiedVSR: Dynamic Scene Graph-Guided Chain-of-Thought Reasoning for Visual Spatial Tasks	🔗
HumanoidPano: Hybrid Spherical Panoramic-LiDAR Cross-Modal Perception for Humanoid Robots	🔗
NeuGPT: Unified multi-modal Neural GPT	🔗
Recursive Cleaning for Large-scale Protein Data via Multimodal Learning	🔗
Query-based Semantic Gaussian Field for Scene Representation in RL	🔗
Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline RL	🔗
MAD: Multi-Alignment MEG-to-Text Decoding	🔗
Manipulation Facing Threats: Evaluating Physical Vulnerabilities in End-to-End VLA Models	🔗
E2H: A Two-Stage Non-Invasive Neural Signal Driven Humanoid Robotic Whole-Body Control	🔗
Typography Leads Semantic Diversifying: Amplifying Adversarial Transferability across MLLMs	🔗
A Dual-Agent Adversarial Framework for Robust Generalization in Deep RL	🔗

📄 Peer-Reviewed Publications

🔽 Click to expand publications

Venue	Title	Link
IROS 2024	Whole-body Humanoid Robot Locomotion with Human Reference	🔗
CVPR 2026	SwiftVLA: Unlocking Spatiotemporal Dynamics for Lightweight VLA Models	🔗
ICLR 2026	Compose Your Policies! Improving Diffusion/Flow Robot Policies via Test-time Composition	🔗
ICLR 2026	ArtVIP: Articulated Digital Assets for Robot Learning	🔗
ICRA 2026	Physics-informed Diffusion Mamba Transformer for Real-world Driving	📬
ICRA 2026	TopoNav: Topological Graphs as a Key Enabler for Advanced Object Navigation	🔗
ICRA 2026	Learning Structural Latent Points for Efficient Visual Representations in Robotic Manipulation	📬
AAAI 2026	What You See is What You Reach: Spatial Navigation with High-Level Human Instructions	🔗
ICASSP 2026	NeuSpeech: Decode Neural Signal as Speech	🔗
ICML 2025	Simple Policy Optimization	🔗
RSS 2025	RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation	🔗
IEEE TVCG	DEGS: Deformable Event-based 3D Gaussian Splatting	🔗
CoRL 2025	Omni-Perception: Omnidirectional Collision Avoidance for Legged Locomotion	🔗
ICCV 2025	What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?	🔗
ICCV 2025	Learning Null Geodesics for Gravitational Lensing Rendering in General Relativity	🔗
ACMMM 2025	Transfer Attack for Bad and Good: Adversarial Transferability across MLLMs	🔗
IROS 2025	Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models	🔗
IROS 2025	Distillation-PPO: Two-Stage RL Framework for Humanoid Perceptive Locomotion	🔗
ACL 2025	MapNav: Novel Memory Representation via Annotated Semantic Maps for VLN	🔗
CVPR 2025	Uncovering Vision Modality Threats in Image-to-Image Tasks	🔗
ICRA 2025	Multi-Floor Zero-Shot Object Navigation Policy	🔗
ICASSP 2025	Fully Spiking Neural Network for Legged Robots	🔗
ICASSP 2025	Event Masked Autoencoder: Point-wise Action Recognition	🔗
ICME 2025	ES-Parkour: Advanced Robot Parkour with Bio-inspired Event Camera & SNN	🔗
IJCAI 2025 🏆	Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Models	🔗
PM2CE@IROS 2025	Humanoid Occupancy: Generalized Multimodal Occupancy Perception for Humanoid Robots	🔗
H2R@CoRL 2025	UniTracker: Universal Whole-Body Motion Tracker for Humanoid Robots	🔗
Sim2Real@Humanoids 2025	LiPS: Large-Scale Humanoid Robot RL with Parallel-Series Structures	🔗
Sim2Real@Humanoids 2025	Trinity: A Modular Humanoid Robot AI System	🔗
GenModels@ICLR 2025	Modality-Composable Diffusion Policy via Distribution-level Composition	🔗
NeurIPS 2024	DEL: Discrete Element Learner for Learning 3D Dynamics from 2D Observations	🔗
NeurIPS 2024	Spiking Neural Network as Adaptive Event Stream Slicer	🔗
IEEE TAI	Spiking Diffusion Models	🔗
IROS 2024	Reinforcement Learning with Generalizable Gaussian Splatting	🔗
IROS 2024	TriHelper: Zero-Shot Object Navigation with Dynamic Assistance	🔗
ICRA 2024	Prompting Multi-Modal Tokens for End-to-End Autonomous Driving with LLMs	🔗
ICRA 2024	Prompt, Plan, Perform: LLM-based Humanoid Control via Quantized Imitation Learning	🔗
WACV 2024	Spiking Denoising Diffusion Probabilistic Models	🔗
ICCV 2023	Masked Spiking Transformer	🔗
CoRL 2022	RoboTube: Learning Household Manipulation from Human Videos	🔗

💡 I have hidden some previous work — feel free to chat! Currently preparing my personal website.

💻 Tech Stack

Python	C++	C	JavaScript	R	Bash	MATLAB
PyTorch	TensorFlow	OpenCV	Docker	Linux	Ubuntu	RedHat
GitHub	Git	GitLab	AWS	CMake	Jenkins	RPi
VSCode	PyCharm	Vim	Sublime	HTML5	PS	AI

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qiang (Jony) ZHANG jonyzhang2023

Achievements

Achievements

Block or report jonyzhang2023