Skip to content

Harvard-AI-and-Robotics-Lab/MedConclusion

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

MedConclusion: A Benchmark for Biomedical Conclusion Generation from Structured Abstracts

Datasets

The dataset is available on Hugging Face in two versions:

Evaluation Scripts

This repository contains the scripts used for generating and evaluating the conclusions:

  • generate.py: Script to generate conclusions from abstract inputs using various LLMs via API.
  • evaluate.py: Script to execute evaluation metrics (Rule-based scores like ROUGE/BLEU, Perplexity, and LLM-as-a-judge).

Citation

If you find this work useful, please cite:

@article{li2026medconclusion,
  title={MedConclusion: A Benchmark for Biomedical Conclusion Generation from Structured Abstracts},
  author={Li, Weiyue and Qian, Ruizhi and Li, Yi and Li, Yongce and Long, Yunfan and Cai, Jiahui and Luo, Yan and Wang, Mengyu},
  journal={arXiv preprint arXiv:2604.06505},
  year={2026}
}

About

[arXiv 2026] MedConclusion: A Benchmark for Biomedical Conclusion Generation from Structured Abstracts

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages