Two decades of statistical language modeling: where do we go from here?

doi:10.1109/5.880083

Two decades of statistical language modeling: where do we go from here?

Rosenfeld, R.

Statistical language models estimate the distribution of various natural language phenomena for the purpose of speech recognition and other language technologies. Since the first significant model was proposed in 1980, many attempts have been made to improve the state of the art. We review them, point to a few promising directions, and argue for a Bayesian approach to integration of linguistic theories with data.

Publication:

IEEE Proceedings

Pub Date:

August 2000

DOI:

10.1109/5.880083

Bibcode:

2000IEEEP..88.1270R

Keywords:

Natural languages;
Speech recognition;
Bayesian methods;
Probability distribution;
Information retrieval;
Training data;
Associate members;
Paper technology;
Routing;
Optical character recognition software

ADS

Two decades of statistical language modeling: where do we go from here?

Abstract