Skip to content

gokucs/causalstruct

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 

Repository files navigation

Causal Reasoning Elicits Controllable 3D Scene Generation

Project Page

Authors: Shen Chen1, Ruiyu Zhao2, Zongkai Wu3, Jenq-Neng Hwang4, Serge Belongie5, Lei Li4,5*

1Zhejiang University  2East China University of Science and Technology  3skai worldwide   4University of Washington   5University of Copenhagen

*Corresponding author.

This project introduces CausalStruct, a novel framework that integrates causal reasoning into 3D scene generation to enhance logical coherence, physical plausibility, and adaptability.

CausalStruct

Abstract

Existing 3D scene generation methods often struggle to model the complex logical dependencies and physical constraints between objects, limiting their ability to adapt to dynamic and realistic environments. We propose CausalStruct, a novel framework that embeds causal reasoning into 3D scene generation. Utilizing large language models (LLMs), we construct causal graphs where nodes represent objects and attributes, while edges encode causal dependencies and physical constraints. CausalStruct iteratively refines the scene layout by enforcing causal order to determine the placement order of objects and applies causal intervention to adjust the spatial configuration according to physics-driven constraints, ensuring consistency with textual descriptions and real-world dynamics. The refined scene causal graph informs subsequent optimization steps, employing a Proportional-Integral-Derivative (PID) controller to iteratively tune object scales and positions. Our method uses text or images to guide object placement and layout in 3D scenes, with 3D Gaussian Splatting and Score Distillation Sampling improving shape accuracy and rendering stability. Extensive experiments show that CausalStruct generates 3D scenes with enhanced logical coherence, realistic spatial interactions, and robust adaptability.


Project Structure

.
├── README.md          # Project description file
├── code/              # Code files
│   ├── CLIP_evaluate.py    # Evaluation script
│   ├── MLLM_evaluate.py    # Evaluation script


Demo

Comparison

Our method demonstrates superior spatial consistency and causal alignment compared to existing methods. Below are qualitative results:

Qualitative Reconstruction Results

Scene Editing

Our method supports flexible and controllable scene editing via text descriptions. Objects can be added, removed, or repositioned based on causal relationships, ensuring physical plausibility and semantic coherence.

Scene Editing

Knowledge Distillation

We conduct ablation studies to evaluate the impact of causal reasoning. The results show that causal reasoning enhances scene coherence.

Ablation Study of causal reasoning and adaptability.


Thank you for your interest in this project! I hope my research provides valuable insights and inspiration. For any inquiries, please feel free to contact me.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages