GitHub - Bxzfrm/PRISM

PRISM: Prosody-Integrated Multi-Agent Reasoning Framework for Empathetic Spoken Dialogue

Overview

Empathetic spoken dialogue systems require not only semantically appropriate responses but also emotionally aligned prosodic expression. Existing cascade pipelines often discard rich acoustic cues during speech-to-text conversion, while end-to-end speech models lack interpretable control over emotion and knowledge integration.

PRISM addresses these limitations through a multi-agent framework that decouples speech perception, response generation, and speech synthesis into coordinated components. The framework introduces a prosody-to-language translation mechanism to stabilize large language model reasoning and supports on-demand invocation of external knowledge tools for empathetic dialogue generation.
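
To make the prosody-to-language idea concrete, the following is a minimal sketch of how utterance-level acoustic features could be turned into a plain-text description and placed in front of the transcript before it reaches the language model. It is illustrative only and is not taken from the PRISM code: the `ProsodyFeatures` fields, the threshold values, the prompt layout, and the function names are all assumptions.

```python
from dataclasses import dataclass


@dataclass
class ProsodyFeatures:
    """Hypothetical utterance-level acoustic summary (not PRISM's actual schema)."""
    mean_pitch_hz: float
    energy_db: float
    speaking_rate_sps: float  # syllables per second


def _bucket(value: float, low: float, high: float) -> str:
    """Map a numeric value onto a coarse verbal level."""
    if value < low:
        return "low"
    if value > high:
        return "high"
    return "moderate"


def prosody_to_text(features: ProsodyFeatures) -> str:
    """Translate numeric prosody into a sentence an LLM can read.

    The thresholds below are illustrative assumptions, not tuned values.
    """
    pitch = _bucket(features.mean_pitch_hz, 140.0, 220.0)
    energy = _bucket(features.energy_db, -30.0, -15.0)
    rate = _bucket(features.speaking_rate_sps, 3.0, 5.5)
    return (
        f"The speaker uses {pitch} pitch, {energy} energy, "
        f"and a {rate} speaking rate."
    )


def build_prompt(transcript: str, features: ProsodyFeatures) -> str:
    """Combine the prosody description and the transcript into one prompt."""
    return (
        f"[Prosody] {prosody_to_text(features)}\n"
        f"[Transcript] {transcript}\n"
        "Respond empathetically."
    )
```

Expressing prosody as text rather than raw numbers keeps the acoustic cues that a cascaded pipeline would otherwise drop, while leaving the input to the language model in a form it can read and that a developer can inspect.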

Framework

Installation

Clone the repository

git clone https://github.com/Bxzfrm/PRISM.git
cd PRISM

Create environment

conda create -n prism python=3.10
conda activate prism

Install dependencies

pip install -r requirements.txt

Dataset

Experiments are conducted on public empathetic dialogue datasets.

Please download the datasets from their official sources before training and evaluation:

TOOL-ED: https://github.com/caohy123/EKTC
AvaMERG: https://huggingface.co/datasets/ZhangHanXD/AvaMERG
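
As a convenience, the two downloads could be scripted roughly as follows. This is a sketch under stated assumptions rather than part of PRISM: the `data/` directory layout and the function names are hypothetical, it requires `git` on the PATH and the `huggingface_hub` package (which may not be listed in requirements.txt), and it makes no assumption about where the TOOL-ED files sit inside the EKTC repository.

```python
import subprocess
from pathlib import Path

# URLs and repo id are taken from the list above.
TOOL_ED_REPO = "https://github.com/caohy123/EKTC"
AVAMERG_REPO_ID = "ZhangHanXD/AvaMERG"


def dataset_dirs(root: str = "data") -> dict:
    """Return the (hypothetical) local directories for each dataset."""
    base = Path(root)
    return {"TOOL-ED": base / "EKTC", "AvaMERG": base / "AvaMERG"}


def download_all(root: str = "data") -> dict:
    """Clone the EKTC repository and fetch the AvaMERG dataset snapshot."""
    # Imported here so the module loads even if huggingface_hub is missing.
    from huggingface_hub import snapshot_download

    dirs = dataset_dirs(root)
    Path(root).mkdir(parents=True, exist_ok=True)

    # Skip the clone if the target directory already exists.
    if not dirs["TOOL-ED"].exists():
        subprocess.run(
            ["git", "clone", TOOL_ED_REPO, str(dirs["TOOL-ED"])],
            check=True,
        )

    # repo_type="dataset" is required for Hugging Face dataset repositories.
    snapshot_download(
        repo_id=AVAMERG_REPO_ID,
        repo_type="dataset",
        local_dir=str(dirs["AvaMERG"]),
    )
    return dirs
```

Calling `download_all()` from the repository root would place both datasets under `data/`; adjust the paths to whatever the training and evaluation scripts expect.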

Speech Synthesis Model

For speech synthesis, we employ StyleTTS2 as the backbone TTS model.

StyleTTS2 can be obtained from:

https://github.com/yl4579/styletts2

