PyLangAcq: Language Acquisition Research in Python

PyPI version Conda version Supported Python versions

PyLangAcq is a Python library for language acquisition research.

  • Reading and writing the CHAT data format used by TalkBank and CHILDES datasets

  • Intuitive Python data structures for flexible data access and manipulation

  • Standard developmental measures readily available: Mean length of utterance (MLU), type-token ratio (TTR), and Index of Productive Syntax (IPSyn)

  • Direct support and powerful extensions possible for CHAT-formatted conversational datasets more generally

Since v0.20.0 (February 2026), PyLangAcq depends on Rustling, a library for efficiently handling CHAT data and other computational linguistics tasks.

Download and Install

Using pip:

pip install --upgrade pylangacq

Using conda:

conda install -c conda-forge pylangacq

Ready for more? Check out the Quickstart page.

How to Cite

Lee, Jackson L., Ross Burkholder, Gallagher B. Flinn, and Emily R. Coppess. 2016. Working with CHAT transcripts in Python. Technical report TR-2016-02, Department of Computer Science, University of Chicago.

@TechReport{lee-et-al-pylangacq:2016,
   Title       = {Working with CHAT transcripts in Python},
   Author      = {Lee, Jackson L. and Burkholder, Ross and Flinn, Gallagher B. and Coppess, Emily R.},
   Institution = {Department of Computer Science, University of Chicago},
   Year        = {2016},
   Number      = {TR-2016-02},
}

License

MIT License

Table of Contents