Preprint [Under Review]
RobotArena ∞: Scalable Robot Benchmarking via Real-to-Sim Translation
A scalable benchmark for real-world-trained robot policies, converting video demonstrations into simulated environments with automated vision-language scoring.