Cognitive MRI of AI Conversations: Research Compendium

Applying complex network analysis to ChatGPT conversation archives to reveal knowledge organization, community structure, and temporal evolution patterns.

Authors: Alexander Towell and John Matta — Southern Illinois University Edwardsville

Overview

This repository transforms sequential AI conversation logs into semantic similarity networks, revealing latent cognitive structure in AI-assisted knowledge exploration. We analyze 1,908 ChatGPT conversations spanning December 2022 to April 2025, constructing networks that expose knowledge communities, bridge conversations, and temporal evolution patterns invisible in linear logs.

Papers

Paper	Venue	Status
Temporal Evolution of Cognitive Knowledge Networks in AI-Assisted Conversations	PLOS Complex Systems	Submitted
Cognitive MRI of AI Conversations: Analyzing AI Interactions through Semantic Embedding Networks	Complex Networks 2025 (Springer)	Published

Repository Structure

├── comp-net-2025-journal/           # Journal extension (PLOS Complex Systems)
│   ├── paper/PLOS/                  # Submission-ready paper, figures, refs
│   └── code/                        # Temporal analysis scripts
├── comp-net-2025-camera-ready/      # Published conference paper (Springer)
│   ├── paper/                       # Camera-ready paper
│   └── slides/                      # Conference presentation (Beamer)
├── data/                            # Reproducibility data for both papers
│   ├── embeddings/                  # 1,908 conversation embeddings (2:1 ratio)
│   ├── network/                     # Primary edge list (601 nodes, 1,718 edges)
│   ├── temporal/                    # Journal paper: monthly network snapshots
│   ├── ablation/                    # Conference paper: 63-config parameter study
│   └── conversations/               # Placeholder (sanitization in progress)
└── code/                            # Embedding + network construction pipeline
    ├── cli.py                       # Main CLI (embeddings, edges, export)
    ├── networks.py                  # Network statistics & metrics
    ├── embedding/                   # LLM & TF-IDF embedding models
    ├── graph/                       # Edge generation, GPU acceleration
    └── run_ablation_study.py        # 63-config ablation study

Reproducing Results

All derived data needed to reproduce every figure and table in both papers is included in data/. See data/README.md for full documentation.

To regenerate all journal paper figures from the curated data:

pip install -r code/requirements.txt
bash data/reproduce.sh

This runs the temporal analysis pipeline (~42 seconds) using the embeddings and edge list in data/, producing all figures in comp-net-2025-journal/paper/figures/temporal/.

Building the Pipeline from Scratch

To regenerate embeddings and networks from raw conversations (requires Ollama with nomic-embed-text):

pip install -r code/requirements.txt

cd code
python cli.py node-embeddings --input-dir <conversations-dir> \
    --method role-aggregate --embedding-method llm --output-dir <output-dir>
python cli.py edges-gpu --input-dir <output-dir> --output-file edges.json
python cli.py cut-off --input-file edges.json --output-file filtered.json --cutoff 0.9
python cli.py export --nodes-dir <output-dir> --edges-file filtered.json --format gexf -o graph.gexf

The pipeline code is also available as a standalone package at chatgpt-complex-net (DOI: 10.5281/zenodo.15314235).

Citation

If you use this work, please cite the journal paper:

@article{towell2026temporal,
  author  = {Towell, Alexander and Matta, John},
  title   = {Temporal Evolution of Cognitive Knowledge Networks in AI-Assisted Conversations},
  journal = {PLOS Complex Systems},
  year    = {2026},
  note    = {Submitted}
}

See CITATION.cff for machine-readable citation metadata.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 99 Commits
agentic		agentic
code		code
comp-net-2025-camera-ready		comp-net-2025-camera-ready
comp-net-2025-journal		comp-net-2025-journal
compsac-2026		compsac-2026
data		data
docs/plans		docs/plans
.gitignore		.gitignore
.zenodo.json		.zenodo.json
CITATION.cff		CITATION.cff
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
springer-cognitive-mri.pdf		springer-cognitive-mri.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cognitive MRI of AI Conversations: Research Compendium

Overview

Papers

Repository Structure

Reproducing Results

Building the Pipeline from Scratch

Citation

License

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Cognitive MRI of AI Conversations: Research Compendium

Overview

Papers

Repository Structure

Reproducing Results

Building the Pipeline from Scratch

Citation

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages