LING 539: Statistical Natural Language Processing

This course introduces the key concepts underlying statistical natural language processing. Students will learn a variety of techniques for the computational modeling of natural language, including n-gram models, smoothing, hidden Markov models, Bayesian inference, expectation maximization, the Viterbi algorithm, the inside-outside algorithm for probabilistic context-free grammars, and higher-order language models. Graduate-level requirements include assignments of greater scope than undergraduate assignments. In addition to being more in-depth, graduate assignments are typically longer and require additional readings.

Data Types: Numerical, Categorical, Text

Methods: Descriptive statistics, Frequentist inference, Bayesian inference, Machine learning, Data management

Programming Languages and Tools: Python, Unix, Git

Course Credits: 3