Pro3D: Roadside Monocular 3D Detection Prompted by 2D Detection

Yechi Ma¹,² · Yanan Li² · Wei Hua²,¹ · Shu Kong³,⁴,*

¹Zhejiang University ²Zhejiang Lab ³University of Macau ⁴Institute of Collaborative Innovation

Pro3D is a vision-based roadside monocular 3D object detector that establishes a new state of the art. On the DAIR-V2X-I benchmark, Pro3D outperforms BEVSpread by 6.4% (vehicle), 9.8% (cyclist), and 9.3% (pedestrian).

🚀 News

[2025/11/25] : arXiv paper released.
[2025/11/22] : Pro3D is accepted to WACV 2026.

📝 Catalog

🔍 Quick Start: Interactive Demo (Recommended First Step)

Before running the full pipeline, we strongly recommend trying the pre-configured demo notebook:

📚 ▶️ demo-pro3d-infer-vis.ipynb
This interactive notebook provides:

End-to-end inference pipeline visualization
Sample detection results with 3D bounding boxes
Core feature demonstrations
Environment validation checks
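
As a rough illustration of what an environment validation check can look like, here is a minimal sketch. It is not the notebook's actual code: the function name `check_environment`, the default package list (`torch`), and the minimum Python version are all assumptions chosen for illustration. Consult requirements.txt for the real dependency list.

```python
import importlib.util
import sys


def check_environment(required=("torch",), min_python=(3, 8)):
    """Return a dict reporting which prerequisites appear to be satisfied.

    `required` and `min_python` are illustrative defaults (assumptions),
    not values taken from the Pro3D repository.
    """
    report = {"python_ok": sys.version_info >= min_python}

    # find_spec returns None when a top-level package cannot be located.
    for name in required:
        report[name] = importlib.util.find_spec(name) is not None

    # Only query CUDA if PyTorch is actually installed.
    if report.get("torch"):
        import torch

        report["cuda_available"] = torch.cuda.is_available()

    return report


if __name__ == "__main__":
    for key, ok in check_environment().items():
        print(f"{key}: {'OK' if ok else 'MISSING'}")
```

Running the script prints one line per check, which makes it easy to spot a missing package or an unavailable GPU before opening the notebook.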

⚠️ Note: The complete production codebase is currently undergoing active development. While the demo reflects current capabilities, the full implementation will receive significant architectural improvements and expanded functionality in upcoming releases.

📑 Table of Contents

Getting Started
Acknowledgments

🛠️ Getting Started

1. Prerequisites

Installation Guide (GPU environment setup)
Dataset Preparation (DAIR-V2X-I/Rope3D conversion)

2. Core Workflow

Generate the scene priors

python scripts/gen_scene_prior.py

Train Pro3D

python [EXP_PATH] --gpus 8 -b 32

Evaluate Pro3D

python [EXP_PATH] --ckpt_path [CKPT_PATH] --gpus 1 -e
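
The train and eval commands above differ only in their flags, so they can be assembled from one small helper. The sketch below is hypothetical: `build_command` and its parameter names are not part of the Pro3D codebase, and reading `-b` as the batch size and `-e` as the evaluation switch is an assumption based on the flag names. `[EXP_PATH]` and `[CKPT_PATH]` remain placeholders to be filled in by the user.

```python
from typing import List, Optional


def build_command(
    exp_path: str,
    gpus: int,
    batch_size: Optional[int] = None,
    ckpt_path: Optional[str] = None,
    evaluate: bool = False,
) -> List[str]:
    """Assemble the argument list for a Pro3D training or evaluation run.

    Hypothetical helper: only the flags shown in this README
    (--gpus, -b, --ckpt_path, -e) are emitted.
    """
    cmd = ["python", exp_path, "--gpus", str(gpus)]
    if batch_size is not None:
        cmd += ["-b", str(batch_size)]  # assumed to be the batch size
    if ckpt_path is not None:
        cmd += ["--ckpt_path", ckpt_path]
    if evaluate:
        cmd.append("-e")  # assumed to switch the script to evaluation
    return cmd


# Placeholders are kept as-is; substitute real paths before running.
train_cmd = build_command("[EXP_PATH]", gpus=8, batch_size=32)
eval_cmd = build_command("[EXP_PATH]", gpus=1, ckpt_path="[CKPT_PATH]", evaluate=True)

if __name__ == "__main__":
    print(" ".join(train_cmd))
    print(" ".join(eval_cmd))
    # To launch a run, pass the list to subprocess.run(cmd, check=True).
```

Building the command as a list rather than a single string avoids shell-quoting problems when paths contain spaces.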

🙏 Acknowledgments

This project builds on the following repositories:

Project	Purpose	Link
BEVSpread	Voxel pooling innovation	GitHub
BEVHeight	Height-aware feature learning	GitHub
BEVDepth	Reliable depth estimation	GitHub
DAIR-V2X	Real-world roadside dataset	GitHub
Rope3D	Challenging 3D detection dataset	GitHub

Development Status: The codebase is actively evolving. Major architecture improvements and additional features will be released in subsequent versions. Current implementations reflect our validated research baseline.

📚 Citation

If you use Pro3D in your research, please cite our work:

@inproceedings{ma2025pro3d,
  title={Roadside Monocular 3D Detection Prompted by 2D Detection}, 
  author={Yechi Ma and Yanan Li and Wei Hua and Shu Kong},
  booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
  year={2026}
}

📚 References

Performance comparisons are against BEVSpread (CVPR 2024) on the DAIR-V2X-I benchmark.
Each project listed in the Acknowledgments has its own citation requirements.

Repository: cc50121/Pro3D
