LING 583: Text retrieval and web search
Contributions of the ethnography of communication, language variation studies, and conversation/discourse analysis to the interdisciplinary development of sociolinguistics.
Contributions of the ethnography of communication, language variation studies, and conversation/discourse analysis to the interdisciplinary development of sociolinguistics.
This course introduces the key concepts underlying statistical natural language processing. Students will learn a variety of techniques for the computational modeling of natural language, including: n-gram models, smoothing, Hidden Markov models, Bayesian Inference, Expectation Maximization, Viterbi, Inside-Outside Algorithm for Probabilistic Context-Free Grammars, and higher-order language models. Graduate-level requirements include assignments of greater scope than undergraduate assignments. In addition to being more in-depth, graduate assignments are typically longer and additional readings are required.
This intermediate-level course is a continuation of LING 529 and covers a combination of theoretical and applied topics such as (but not limited to) unsupervised learning (clustering), decision trees, and the basics of information retrieval.
History and structure of the Arabic language in its various forms. Graduate-level requirements include a research paper on any phonological, morphological, or syntactic structure of any variety of Arabic.
The methods and tools of Artificial Intelligence used to provide systems with the ability to autonomously problem solve and reason with uncertain information. Topics include: problem solving (search spaces, uninformed and informed search, games, constraint satisfaction), principles of knowledge representation and reasoning (propositional and first-order logic, logical inference, planning), and representing and reasoning with uncertainty (Bayesian networks, probabilistic inference, decision theory). Graduate-level requirements include additional reading of supplementary material, more rigorous tests and homework assignments, and a more sophisticated course project. Sophisticated application and technique.
Machine learning describes the development of algorithms which can modify their internal parameters (i.e., "learn") to recognize patterns and make decisions based on example data. These examples can be provided by a human, or they can be gathered automatically as part of the learning algorithm itself. This course will introduce the fundamentals of machine learning, will describe how to implement several practical methods for pattern recognition, feature selection, clustering, and decision making for reward maximization, and will provide a foundation for the development of new machine learning algorithms.
Bayesian modeling and inference is a powerful modern approach to representing the statistics of the world, reasoning about the world in the face of uncertainty, and learning about it from data. It cleanly separates the notions of representation, reasoning, and learning. It provides a principled framework for combining multiple source of information such as prior knowledge about the world with evidence about a particular case in observed data. This course will provide a solid introduction to the methodology and associated techniques, and show how they are applied in diverse domains ranging from computer vision to molecular biology to astronomy. Graduate-level requirements include different exams requiring greater depth of understanding of topics, and will be assigned questions based on graduate-student specific assignments topics.