ReFuGe: Feature Generation for Prediction Tasks on Relational Databases with LLM Agents

This is the official implementation of ReFuGe (Feature Generation for Prediction Tasks on Relational Databases with LLM Agents).

Accepted at ACM WWW 2026.

🛠️ Requirements

ReFuGe requires the following core dependencies:

RelBench library (required) — used for loading RDB benchmark datasets, schemas, and task definitions.

Install RelBench via:

pip install relbench

For details, please refer to https://github.com/snap-stanford/relbench
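As a rough sketch of what loading a benchmark involves, the snippet below wraps RelBench's `get_dataset` and `get_task` functions. The helper names `load_task` and `describe` are hypothetical and not part of ReFuGe or RelBench, and the scripts in this repository may load their data differently.

```python
def load_task(dataset_name: str, task_name: str):
    """Return (dataset, task) for a RelBench dataset/task pair.

    Hypothetical helper for illustration; not part of ReFuGe or RelBench.
    """
    # Imported lazily so the helper can be defined before relbench is installed.
    from relbench.datasets import get_dataset
    from relbench.tasks import get_task

    # download=True fetches the data the first time it is requested.
    dataset = get_dataset(dataset_name, download=True)
    task = get_task(dataset_name, task_name, download=True)
    return dataset, task


def describe(dataset_name: str, task_name: str) -> None:
    """Print the table names of the database and the size of the training split."""
    dataset, task = load_task(dataset_name, task_name)
    db = dataset.get_db()                # relational database (schema and tables)
    train = task.get_table("train")      # training table defined by the task
    print("tables:", sorted(db.table_dict))
    print("training rows:", len(train.df))
```

For example, `describe("rel-amazon", "user-churn")` would download the Amazon dataset on first use and print its tables. The pairing of `rel-amazon` and `user-churn` with amazonchurn.py is an assumption made for illustration only.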

Claude API — ReFuGe uses Claude models as the backbone LLM agents (schema selection, feature generation, and feature filtering). Make sure to set your API key:

export ANTHROPIC_API_KEY="YOUR_API_KEY"
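Before starting a long experiment, it can help to confirm that the key is actually visible to Python. The following is a minimal sketch of such a check; the function name `get_api_key` is hypothetical and not part of ReFuGe.

```python
import os
from typing import Mapping, Optional


def get_api_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return the Anthropic API key from the environment, or raise if it is missing.

    Hypothetical helper for illustration; not part of ReFuGe.
    """
    env = os.environ if env is None else env
    key = env.get("ANTHROPIC_API_KEY", "").strip()
    # Reject an unset variable as well as the placeholder value from the README.
    if not key or key == "YOUR_API_KEY":
        raise RuntimeError(
            "ANTHROPIC_API_KEY is not set; export it before running an experiment."
        )
    return key
```

Calling `get_api_key()` at the top of a script fails fast with a readable message instead of an authentication error partway through a run.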

🚀 How to run

Each dataset–task pair has its own Python script. To run an experiment, execute the corresponding file (e.g., amazonchurn.py):

python {dataset}{task}.py
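The naming pattern above can also be driven from Python, for instance to run several experiments in sequence. This is a minimal sketch that assumes the scripts sit in the repository root and take no command-line arguments; `script_name` and `run_experiment` are hypothetical helpers, not part of ReFuGe.

```python
import subprocess
import sys
from pathlib import Path


def script_name(dataset: str, task: str) -> str:
    """Build the script file name following the {dataset}{task}.py pattern."""
    return f"{dataset}{task}.py"


def run_experiment(dataset: str, task: str, repo_dir: str = ".") -> int:
    """Run one experiment script and return its exit code.

    Hypothetical helper for illustration; not part of ReFuGe.
    """
    script = Path(repo_dir) / script_name(dataset, task)
    if not script.is_file():
        raise FileNotFoundError(f"No script found at {script}")
    # Use the current interpreter and run from the repository directory.
    completed = subprocess.run([sys.executable, script.name], cwd=repo_dir)
    return completed.returncode
```

For example, `run_experiment("amazon", "churn")` would execute amazonchurn.py from the current directory.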

