  • Hong Kong University of Science and Technology

jonyzhang2023/README.md


πŸ§‘β€πŸ”¬ About Me

Chair of the Research Committee & Chief Researcher at the China National Innovation Center of Embodied AI Robotics
Collaborating closely with Prof. Shaoqing Ren at USTC

  • 🤖 Currently leading research on Humanoid Robots, building the TianGong Humanoid Robot Platform
  • 🏛️ Previously contributed to "Tianhe 天河" Supercomputing projects and worked at DJI
  • 🎓 You are welcome to visit and apply to our collaborative program at USTC
  • 📚 Check out my Google Scholar for recent publications
  • 🤝 Open to collaborations in Humanoid Robots, Embodied AI, RL, Vision Perception, LLM, Control & Planning

🤖 Featured Humanoid Projects

  • TianGong: China Universal Humanoid Robot Platform
  • Adam: PNDbotics Adam
  • GR1: Fourier GR1

πŸ—žοΈ News & Highlights

πŸ… 2025-08: We won the 100m championship, 400m 2nd & 3rd, 1500m 2nd, 4Γ—100m 2nd, material organization championship, and material handling 2nd at WHR 2025 β€” the first World Robotics Conference!

πŸƒβ€β™‚οΈ 2025-04-19: The Tiangong humanoid robot made history β€” it successfully completed a half-marathon!

Full news timeline

| Date | Event |
| --- | --- |
| 2025-11-29 | Our work SPO was adopted as a baseline RL algorithm by PI0.6 |
| 2025-07-29 | Released Humanoid Occupancy: Generalized Multimodal Perception Module |
| 2025-07-09 | Released TienKung Marathon Control Framework |
| 2025-05-24 | Published an article in People's Daily |
| 2025-05-08 | Featured on the "Innovation China" TV program |
| 2025-04-24 | Co-hosted an embodied intelligent robots seminar with Peking University |
| 2025-04-10 | Interviewed by the "Innovation China" column of the China Association for Science and Technology |
| 2025-03-29 | Invited talk at CEAI 2025 |
| 2025-03-06 | Joined the CNR Finance Jin Ding Think Tank |
| 2025-01-08 | Participated in Beijing City's long-term robot planning and technical roadmap |
| 2024-12-30 | Invited by the National Health Commission as an expert reviewer for AI medical projects |
| 2024-11-25 | Interviewed by Global Times on AI for football & robot marathons |
| 2024-11-16 | Keynote on "Embodied Intelligence of Humanoid Robots" at AGIROS Conference, Chinese Academy of Sciences |
| 2024-11-09 | Intel China Academic Talent Program keynote on "Research on Embodied AI of Humanoid Robots" |
| 2024-11-05 | Attended BAAI "ZhiYuan Forum: Embodiment and World Model Summit" |
| 2024-09-30 | Talk at Peking University on Multimodal Perception & Large Model Decision-Making for Humanoid Robots |
| 2024-07-29 | Interviewed by Mango (Hunan) TV on "Embodied AI and Humanoid Robots Tian Gong" |
| 2024-07-05 | Invited by the China Internet Research Institute to draft an embodied intelligence white paper |
| 2024-07-01 | Invited talk at XMech, Zhejiang University |
| 2024-05-29 | Interviewed by Beijing Association for Science and Technology |
| 2024-05-09 | Featured on CCTV: TianGong continuous iteration |

πŸ“ Selected Publications

I'm currently exploring Embodied AI, RL, Vision Perception, LLM, Control & Planning in Robotics.

🔬 Pre-prints

| Title | Link |
| --- | --- |
| MeshMimic: Geometry-Aware Humanoid Motion Learning through 3D Scene Reconstruction | 🔗 |
| HAIC: Humanoid Agile Object Interaction Control via Dynamics-Aware World Model | 🔗 |
| RoboStriker: Hierarchical Decision-Making for Autonomous Humanoid Boxing | 🔗 |
| MVISTA-4D: View-Consistent 4D World Model with Test-Time Action Inference | 🔗 |
| DPL: Depth-only Perceptive Humanoid Locomotion via Realistic Depth Synthesis | 🔗 |
| PoseDiff: A Unified Diffusion Model Bridging Robot Pose Estimation and Video-to-Action Control | 🔗 |
| EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation | 🔗 |
| HumanoidVerse: A Versatile Humanoid for Vision-Language Guided Multi-Object Rearrangement | 🔗 |
| LOVON: Legged Open-Vocabulary Object Navigator | 🔗 |
| Survival Games: Human-LLM Strategic Showdowns under Severe Resource Scarcity | 🔗 |
| Occupancy World Model for Robots | 🔗 |
| RoboOcc: Enhancing the Geometric and Semantic Scene Understanding for Robots | 🔗 |
| The Meta-Representation Hypothesis | 🔗 |
| EmbodiedVSR: Dynamic Scene Graph-Guided Chain-of-Thought Reasoning for Visual Spatial Tasks | 🔗 |
| HumanoidPano: Hybrid Spherical Panoramic-LiDAR Cross-Modal Perception for Humanoid Robots | 🔗 |
| NeuGPT: Unified multi-modal Neural GPT | 🔗 |
| Recursive Cleaning for Large-scale Protein Data via Multimodal Learning | 🔗 |
| Query-based Semantic Gaussian Field for Scene Representation in RL | 🔗 |
| Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline RL | 🔗 |
| MAD: Multi-Alignment MEG-to-Text Decoding | 🔗 |
| Manipulation Facing Threats: Evaluating Physical Vulnerabilities in End-to-End VLA Models | 🔗 |
| E2H: A Two-Stage Non-Invasive Neural Signal Driven Humanoid Robotic Whole-Body Control | 🔗 |
| Typography Leads Semantic Diversifying: Amplifying Adversarial Transferability across MLLMs | 🔗 |
| A Dual-Agent Adversarial Framework for Robust Generalization in Deep RL | 🔗 |

📄 Peer-Reviewed Publications

| Venue | Title | Link |
| --- | --- | --- |
| IROS 2024 | Whole-body Humanoid Robot Locomotion with Human Reference | 🔗 |
| CVPR 2026 | SwiftVLA: Unlocking Spatiotemporal Dynamics for Lightweight VLA Models | 🔗 |
| ICLR 2026 | Compose Your Policies! Improving Diffusion/Flow Robot Policies via Test-time Composition | 🔗 |
| ICLR 2026 | ArtVIP: Articulated Digital Assets for Robot Learning | 🔗 |
| ICRA 2026 | Physics-informed Diffusion Mamba Transformer for Real-world Driving | 📬 |
| ICRA 2026 | TopoNav: Topological Graphs as a Key Enabler for Advanced Object Navigation | 🔗 |
| ICRA 2026 | Learning Structural Latent Points for Efficient Visual Representations in Robotic Manipulation | 📬 |
| AAAI 2026 | What You See is What You Reach: Spatial Navigation with High-Level Human Instructions | 🔗 |
| ICASSP 2026 | NeuSpeech: Decode Neural Signal as Speech | 🔗 |
| ICML 2025 | Simple Policy Optimization | 🔗 |
| RSS 2025 | RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation | 🔗 |
| IEEE TVCG | DEGS: Deformable Event-based 3D Gaussian Splatting | 🔗 |
| CoRL 2025 | Omni-Perception: Omnidirectional Collision Avoidance for Legged Locomotion | 🔗 |
| ICCV 2025 | What Makes for Text to 360-degree Panorama Generation with Stable Diffusion? | 🔗 |
| ICCV 2025 | Learning Null Geodesics for Gravitational Lensing Rendering in General Relativity | 🔗 |
| ACMMM 2025 | Transfer Attack for Bad and Good: Adversarial Transferability across MLLMs | 🔗 |
| IROS 2025 | Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models | 🔗 |
| IROS 2025 | Distillation-PPO: Two-Stage RL Framework for Humanoid Perceptive Locomotion | 🔗 |
| ACL 2025 | MapNav: Novel Memory Representation via Annotated Semantic Maps for VLN | 🔗 |
| CVPR 2025 | Uncovering Vision Modality Threats in Image-to-Image Tasks | 🔗 |
| ICRA 2025 | Multi-Floor Zero-Shot Object Navigation Policy | 🔗 |
| ICASSP 2025 | Fully Spiking Neural Network for Legged Robots | 🔗 |
| ICASSP 2025 | Event Masked Autoencoder: Point-wise Action Recognition | 🔗 |
| ICME 2025 | ES-Parkour: Advanced Robot Parkour with Bio-inspired Event Camera & SNN | 🔗 |
| IJCAI 2025 🏆 | Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Models | 🔗 |
| PM2CE@IROS 2025 | Humanoid Occupancy: Generalized Multimodal Occupancy Perception for Humanoid Robots | 🔗 |
| H2R@CoRL 2025 | UniTracker: Universal Whole-Body Motion Tracker for Humanoid Robots | 🔗 |
| Sim2Real@Humanoids 2025 | LiPS: Large-Scale Humanoid Robot RL with Parallel-Series Structures | 🔗 |
| Sim2Real@Humanoids 2025 | Trinity: A Modular Humanoid Robot AI System | 🔗 |
| GenModels@ICLR 2025 | Modality-Composable Diffusion Policy via Distribution-level Composition | 🔗 |
| NeurIPS 2024 | DEL: Discrete Element Learner for Learning 3D Dynamics from 2D Observations | 🔗 |
| NeurIPS 2024 | Spiking Neural Network as Adaptive Event Stream Slicer | 🔗 |
| IEEE TAI | Spiking Diffusion Models | 🔗 |
| IROS 2024 | Reinforcement Learning with Generalizable Gaussian Splatting | 🔗 |
| IROS 2024 | TriHelper: Zero-Shot Object Navigation with Dynamic Assistance | 🔗 |
| ICRA 2024 | Prompting Multi-Modal Tokens for End-to-End Autonomous Driving with LLMs | 🔗 |
| ICRA 2024 | Prompt, Plan, Perform: LLM-based Humanoid Control via Quantized Imitation Learning | 🔗 |
| WACV 2024 | Spiking Denoising Diffusion Probabilistic Models | 🔗 |
| ICCV 2023 | Masked Spiking Transformer | 🔗 |
| CoRL 2022 | RoboTube: Learning Household Manipulation from Human Videos | 🔗 |

💡 I have hidden some previous work; feel free to chat! I am currently preparing my personal website.


💻 Tech Stack

Python, C++, C, JavaScript, R, Bash, MATLAB, PyTorch, TensorFlow, OpenCV, Docker, Linux, Ubuntu, RedHat, GitHub, Git, GitLab, AWS, CMake, Jenkins, RPi, VSCode, PyCharm, Vim, Sublime, HTML5, PS, AI

Pinned

  1. awesome-humanoid-learning (Public)

    Humanoid Robots Resources

    872 stars, 38 forks

  2. awesome-embodied-vla-va-vln (Public)

    A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

    2.7k stars, 118 forks

  3. TienKung-Lab (Public)

    Forked from Open-X-Humanoid/TienKung-Lab

    Tien Kung-Lab: Direct IsaacLab Workflow for Legged Robots

    Python

  4. Humanoid-Occupancy (Public)

    Forked from Open-X-Humanoid/Humanoid-Occupancy

    1 star, 1 fork