Causal Reasoning Elicits Controllable 3D Scene Generation

Authors: Shen Chen¹, Ruiyu Zhao², Zongkai Wu³, Jenq-Neng Hwang⁴, Serge Belongie⁵, Lei Li^4,⁵^*

¹Zhejiang University ²East China University of Science and Technology ³skai worldwide ⁴University of Washington ⁵University of Copenhagen

^*Corresponding author.

This project introduces CausalStruct, a novel framework that integrates causal reasoning into 3D scene generation to enhance logical coherence, physical plausibility, and adaptability.

Abstract

Existing 3D scene generation methods often struggle to model the complex logical dependencies and physical constraints between objects, limiting their ability to adapt to dynamic and realistic environments. We propose CausalStruct, a novel framework that embeds causal reasoning into 3D scene generation. Utilizing large language models (LLMs), we construct causal graphs where nodes represent objects and attributes, while edges encode causal dependencies and physical constraints. CausalStruct iteratively refines the scene layout by enforcing causal order to determine the placement order of objects and applies causal intervention to adjust the spatial configuration according to physics-driven constraints, ensuring consistency with textual descriptions and real-world dynamics. The refined scene causal graph informs subsequent optimization steps, employing a Proportional-Integral-Derivative (PID) controller to iteratively tune object scales and positions. Our method uses text or images to guide object placement and layout in 3D scenes, with 3D Gaussian Splatting and Score Distillation Sampling improving shape accuracy and rendering stability. Extensive experiments show that CausalStruct generates 3D scenes with enhanced logical coherence, realistic spatial interactions, and robust adaptability.

Project Structure

.
├── README.md          # Project description file
├── code/              # Code files
│   ├── CLIP_evaluate.py    # Evaluation script
│   ├── MLLM_evaluate.py    # Evaluation script

Demo

Comparison

Our method demonstrates superior spatial consistency and causal alignment compared to existing methods. Below are qualitative results:

Scene Editing

Our method supports flexible and controllable scene editing via text descriptions. Objects can be added, removed, or repositioned based on causal relationships, ensuring physical plausibility and semantic coherence.

Knowledge Distillation

We conduct ablation studies to evaluate the impact of causal reasoning. The results show that causal reasoning enhances scene coherence.

Thank you for your interest in this project! I hope my research provides valuable insights and inspiration. For any inquiries, please feel free to contact me.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
code		code
figure		figure
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Causal Reasoning Elicits Controllable 3D Scene Generation

Abstract

Project Structure

Demo

Comparison

Scene Editing

Knowledge Distillation

About

Uh oh!

Releases

Packages

Languages

gokucs/causalstruct

Folders and files

Latest commit

History

Repository files navigation

Causal Reasoning Elicits Controllable 3D Scene Generation

Abstract

Project Structure

Demo

Comparison

Scene Editing

Knowledge Distillation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages