Tianye Ding1*,
Yiming Xie1*,
Yiqing Liang2*,
Moitreya Chatterjee3,
Pedro Miraldo3,
Huaizu Jiang1
1 Northeastern University, 2 Independent Researcher, 3 Mitsubishi Electric Research Laboratories
* Equal Contribution
- [2025-12-15] arXiv preprint released.
- Release framework codebase
- Release inference code
- Add data preparation instructions
- Release evaluation code
- Add Viser integration
- Release loop-closure demo
We propose LASER, a training-free framework that converts an offline reconstruction model into a streaming system by aligning predictions across consecutive temporal windows. We observe that a simple similarity-transformation (Sim(3)) alignment fails due to layer depth misalignment: monocular scale ambiguity causes the relative depth scales of different scene layers to vary inconsistently between windows. To address this, we introduce layer-wise scale alignment, which segments depth predictions into discrete layers, computes per-layer scale factors, and propagates them across both adjacent windows and timestamps.
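To make the core idea concrete, below is a minimal NumPy sketch of per-layer scale alignment between the depth predictions of two windows for a shared frame. It is illustrative only: the function name, the quantile-based layering, and the median scale estimator are our assumptions rather than the released implementation, and the propagation of scales across windows and timestamps is omitted.

```python
# Minimal sketch of layer-wise scale alignment (illustrative only; names and
# design choices below are assumptions, not the LASER codebase).
import numpy as np

def layerwise_scale_align(depth_prev, depth_curr, num_layers=4):
    """Align depth_curr to depth_prev on an overlapping frame by estimating
    one scale factor per depth layer instead of a single global Sim(3) scale.

    depth_prev, depth_curr: (H, W) depth maps predicted for the same frame by
    the previous and current temporal windows.
    """
    # 1. Segment the previous window's depth into discrete layers (here by quantiles).
    edges = np.quantile(depth_prev, np.linspace(0.0, 1.0, num_layers + 1))
    layer_ids = np.clip(np.searchsorted(edges, depth_prev, side="right") - 1,
                        0, num_layers - 1)

    # 2. Estimate one robust (median) scale factor per layer from overlapping pixels.
    scales = np.ones(num_layers)
    for k in range(num_layers):
        mask = layer_ids == k
        if mask.any():
            scales[k] = np.median(depth_prev[mask] / np.maximum(depth_curr[mask], 1e-6))

    # 3. Apply the per-layer scales to the current window's prediction.
    aligned = depth_curr * scales[layer_ids]
    return aligned, scales
```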
# 1. Clone the repository
git clone --recursive git@github.com:neu-vi/LASER.git
cd LASER
# 2. Create environment
conda create -n laser -y python=3.11
conda activate laser
# 3. Install dependencies
pip install -r requirements.txt
# 4. Compile cython modules
python setup.py build_ext --inplace
# 5. Install Viser
pip install -e viser

(Optional) Download checkpoints needed for loop-closure inference:
bash ./scripts/download_weights.sh

To run the inference code, you can use the following command:
export PYTHONPATH="./":$PYTHONPATH
python demo.py \
--data_path DATA_PATH \
--output_path "./viser_results" \
--cache_path "./cache" \
--sample_interval SAMPLE_INTERVAL \
--window_size WINDOW_SIZE \
--overlap OVERLAP \
--depth_refine
# example inference script
python demo.py \
--data_path "examples/titanic" \
--output_path "./viser_results" \
--cache_path "./cache" \
--sample_interval 1 \
--window_size 30 \
--overlap 10 \
--depth_refine

The results will be saved in the viser_results/SEQ_NAME directory for future visualization.
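The --sample_interval, --window_size, and --overlap arguments control how the input frames are grouped into overlapping temporal windows before alignment. A minimal sketch of one plausible windowing scheme (illustrative only; the released code may chunk frames differently):

```python
# Illustrative only: one plausible way to form overlapping temporal windows.
def make_windows(num_frames, sample_interval=1, window_size=30, overlap=10):
    frames = list(range(0, num_frames, sample_interval))  # subsample the input stream
    stride = window_size - overlap                        # new frames added per window
    windows = []
    for start in range(0, max(len(frames) - overlap, 1), stride):
        windows.append(frames[start:start + window_size])
    return windows

# With 100 frames and the example settings above, windows start at frames 0, 20, 40, 60, 80.
print(make_windows(100))
```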
To visualize the interactive 4D results, you can use the following command:
python viser/visualizer_monst3r.py --data viser_results/SEQ_NAME
# example visualization script
python viser/visualizer_monst3r.py --data viser_results/titanic

Please refer to MonST3R for dataset setup details.
Put all datasets in data/.
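For reference, an illustrative layout after preparation (the exact subfolder names follow the MonST3R preparation scripts and may differ):

```
data/
├── sintel/
├── bonn/
├── kitti/
├── scannet/
└── tum/
```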
Sintel
export PYTHONPATH="./":$PYTHONPATH
CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 eval_launch.py \
--mode=eval_pose \
--model=streaming_pi3 \
--eval_dataset=sintel \
--output_dir="outputs/video_depth/sintel_depth" \
--full_seq \
--no_crop
CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 depth_metric.py \
--eval_dataset=sintel \
--result_dir="outputs/video_depth/sintel_depth" \
--output_dir="outputs/video_depth"Bonn
export PYTHONPATH="./":$PYTHONPATH
CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 eval_launch.py \
--mode=eval_pose \
--model=streaming_pi3 \
--eval_dataset=bonn \
--output_dir="outputs/video_depth/bonn_depth" \
--no_crop
CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 depth_metric.py \
--eval_dataset=bonn \
--result_dir="outputs/video_depth/bonn_depth" \
--output_dir="outputs/video_depth"KITTI
export PYTHONPATH="./":$PYTHONPATH
CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 eval_launch.py \
--mode=eval_pose \
--model=streaming_pi3 \
--eval_dataset=kitti \
--output_dir="outputs/video_depth/kitti_depth" \
--no_crop \
--flow_loss_weight 0 \
--translation_weight 1e-3
CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 depth_metric.py \
--eval_dataset=kitti \
--result_dir="outputs/video_depth/kitti_depth" \
--output_dir="outputs/video_depth"Sintel
export PYTHONPATH="./":$PYTHONPATH
CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 eval_launch.py \
--mode=eval_pose \
--model=streaming_pi3 \
--eval_dataset=sintel \
--output_dir="outputs/cam_pose/sintel_pose"ScanNet
export PYTHONPATH="./":$PYTHONPATH
CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 eval_launch.py \
--mode=eval_pose \
--model=streaming_pi3 \
--eval_dataset=scannet \
--output_dir="outputs/cam_pose/scannet_pose"TUM
export PYTHONPATH="./":$PYTHONPATH
CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node=1 --master_port=12345 eval_launch.py \
--mode=eval_pose \
--model=streaming_pi3 \
--eval_dataset=tum \
--output_dir="outputs/cam_pose/tum_pose"If you find this repository useful in your research, please consider giving a star ⭐ and a citation
@article{ding2025laser,
title={LASER: Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction},
author={Ding, Tianye and Xie, Yiming and Liang, Yiqing and Chatterjee, Moitreya and Miraldo, Pedro and Jiang, Huaizu},
year={2025}
}

We would like to thank the authors of the following excellent open-source projects: VGGT, π3, MonST3R, CUT3R, VGGT-Long, and many other inspiring works in the community.