MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views

This is the code implementation for MonoSelfRecon. https://openaccess.thecvf.com/content/CVPR2024W/3DMV/papers/Li_MonoSelfRecon_Purely_Self-Supervised_Explicit_Generalizable_3D_Reconstruction_of_Indoor_Scenes_CVPRW_2024_paper.pdf or arxiv https://arxiv.org/pdf/2404.06753

Prepare Environments

Please follow NeuralRecon https://github.com/zju3dv/NeuralRecon?tab=readme-ov-file#installation or the official github https://github.com/mit-han-lab/torchsparse.git to install torchsparse.

Prepare Dataset

We use ScanNet standard training and testing split, as Atlas https://github.com/magicleap/Atlas?tab=readme-ov-file#scannet and NeuralRecon https://github.com/zju3dv/NeuralRecon?tab=readme-ov-file#data-preperation-for-scannet, and we use the same dataset structure. Please refer to them to prepare for ScanNet dataset.

Train

Train with two GPUs

CUDA_VISIBLE_DEVICES=0,1 python -m torch.distributed.launch --nproc_per_node=2 main.py --cfg ./config/train.yaml

To switch between the training of pure self-supervision and weak supervision, set line 6 and 7 in `.models/selfrecon_sdf_mpinerf.py'.

To train from scratch, we use the same strategy as NeuralRecon by first warming up without GRU for 10 epochs, set MODEL.FUSION.FUSION_ON=False, MODEL.FUSION.FULL=False. After 10 epochs, set MODEL.FUSION.FUSION_ON=False, MODEL.FUSION.FULL=False to jointly optimize GRU.

You can also download our pretrained checkpoints from https://drive.google.com/drive/folders/1MG91GdkYn1Almryek6ehd5kgeHJDviOp?usp=drive_link. ckpt_fragloss_att is the purely self-supervision, ckpt_fragloss_att_gru_semi is the weak supervision. Put both the two folders under the project root directory.

Evaluation

This step will generate mesh files for every scene and save at ./results. You can use our pretrained checkpoints or the one trained by yourself.

python main.py --cfg ./config/test.yaml

Or you can skip this step and download the mesh files we generated for all scenes from https://drive.google.com/drive/folders/1MG91GdkYn1Almryek6ehd5kgeHJDviOp?usp=drive_link and directly evaluate it. Unzip scene_scannet_allfrag_ckpt_fragloss_att_fusion_eval_28.zip and scene_scannet_allfrag_ckpt_fragloss_att_gru_semi_fusion_eval_20.zip, put them under ./result.

To evaluate the purely self--supervised training, use the "key_names" for "ckpt_fragloss_att" at line 8-12 of ./tools/visualize_metrics.py, and use

python tools/evaluation.py --model ./results/scene_scannet_allfrag_ckpt_fragloss_att_fusion_eval_28 --n_proc 16

To evaluate the weakly-supervised training, use the "key_names" for "ckpt_fragloss_att_gru_semi" at line 8-12 of ./tools/visualize_metrics.py, and use

python tools/evaluation.py --model ./results/scene_scannet_allfrag_ckpt_fragloss_att_gru_semi_fusion_eval_20 --n_proc 16

Citation

@InProceedings{MonoSelfRecon,
    author    = {Li, Runfa and Mahbub, Upal and Bhaskaran, Vasudev and Nguyen, Truong},
    title     = {MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
    month     = {June},
    year      = {2024},
    pages     = {656-666}
}

Acknowledgments

Thanks for the contribution from "NeuralRecon" - https://github.com/zju3dv/NeuralRecon and "Atlas" - https://github.com/magicleap/Atlas.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
__pycache__		__pycache__
config		config
datasets		datasets
models		models
ops		ops
tools		tools
LICENSE		LICENSE
README.md		README.md
environment.yaml		environment.yaml
google5dbed6cc71f8325c.html		google5dbed6cc71f8325c.html
main.py		main.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views

Prepare Environments

Prepare Dataset

Train

Evaluation

Citation

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views

Prepare Environments

Prepare Dataset

Train

Evaluation

Citation

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages