This repository is supplemental to the paper 'Context is Key: Aligning Large Language Models with Human Moral Judgments through Retrieval-Augmented Generation', presented at FLAIRS-38 and published in Florida Online Journals.
The project introduces an AI agent that evaluates interpersonal conflicts by using Retrieval-Augmented Generation (RAG) to:
- Collect similar conflicts from a dataset
- Use these conflicts as context to refine the LLM's judgment
- Provide adaptable moral evaluations without costly fine-tuning
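The retrieve-then-judge loop above can be sketched in a few lines. This is a minimal illustration, not the repository's implementation: it uses a toy bag-of-words similarity in place of a real embedding store, and a stand-in `stub_llm` that votes by retrieved precedent instead of calling GPT-4o. All function and field names here are hypothetical.

```python
from collections import Counter
import math

def embed(text):
    # Toy bag-of-words vector; the real system would use a learned embedding model.
    return Counter(text.lower().split())

def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, corpus, k=2):
    # Rank stored AITA conflicts by similarity to the new conflict; keep top k.
    q = embed(query)
    ranked = sorted(corpus, key=lambda doc: cosine(q, embed(doc["post"])), reverse=True)
    return ranked[:k]

def judge(conflict, corpus, llm):
    # Build a prompt from retrieved precedents, then ask the LLM for a verdict.
    precedents = retrieve(conflict, corpus)
    context = "\n".join(f"Past case: {d['post']} -> {d['verdict']}" for d in precedents)
    prompt = f"{context}\nNew conflict: {conflict}\nVerdict?"
    return llm(prompt)

def stub_llm(prompt):
    # Stand-in for GPT-4o: returns the majority verdict among retrieved cases.
    votes = [line.rsplit("-> ", 1)[1] for line in prompt.splitlines() if "-> " in line]
    return max(set(votes), key=votes.count) if votes else "NTA"

corpus = [
    {"post": "refused to lend my car to my brother", "verdict": "NTA"},
    {"post": "yelled at my roommate over dishes", "verdict": "YTA"},
    {"post": "would not lend money to a friend", "verdict": "NTA"},
]
print(judge("refused to lend my laptop to a friend", corpus, stub_llm))  # -> NTA
```

In the paper's setup, the verdict is then fed back with the retrieved evidence for further refinement rather than returned after a single pass.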
A dataset containing the top 50,000 submissions to the r/AmITheAsshole (r/AITA) subreddit from 2018-2022 was created, including the top ten comments for each post.
Using OpenAI's GPT-4o as the base LLM, two agents were developed:
- Base: doesn't use RAG to refine its responses.
- RAG: uses RAG to retrieve AITA conflicts to use as evidence to iteratively refine its response.
The RAG agent demonstrated clear improvements over the Base agent: accuracy increased from 77% to 84%, and the Matthews correlation coefficient (MCC) improved from 0.357 to 0.469. Additionally, the generation of toxic responses was practically eliminated.
These findings demonstrate that integrating LLMs into RAG frameworks effectively improves alignment with human moral judgments while mitigating harmful language.