Skip to content

shabnam-b/ELQA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 

Repository files navigation

ELQA: A Corpus of Metalinguistic Questions and Answers about English

This repository provides data and codes for our ACL 2023 paper:

ELQA: A Corpus of Metalinguistic Questions and Answers about English
Shabnam Behzad, Keisuke Sakaguchi, Nathan Schneider, Amir Zeldes

Make sure you have git LFS installed before cloning this repository.

git lfs install
git lfs clone https://github.com/shabnam-b/ELQA.git

Original data was collected from here, Publication date: 2021-12-06, Contributor: Stack Exchange Community. Please adhere to Stack Exchange guidelines if using the data. License: https://stackoverflow.com/help/licensing

To get train/dev/test splits for the QA task, run:

python QA_splits.py

You will find output tsv files in "data" directory. The project was tested using Python 3.8.

To train T5 models, please refer to T5x GitHub page.

** Human evaluation scores will be available soon! **

Citation

If you find this work useful for your research, please cite our paper:

@inproceedings{behzad-etal-2023-elqa,
    title = "{ELQA}: A Corpus of Metalinguistic Questions and Answers about {E}nglish",
    author = "Behzad, Shabnam  and
      Sakaguchi, Keisuke  and
      Schneider, Nathan  and
      Zeldes, Amir",
    booktitle = "Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = jul,
    year = "2023",
    address = "Toronto, Canada",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.acl-long.113",
    pages = "2031--2047",
}

About

Data Repository for ELQA: A Corpus of Metalinguistic Questions and Answers about English. Paper Link: https://arxiv.org/abs/2205.00395

Resources

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages