Prereq: LIN 221; LIN 301 or CS 115 (or permission of instructor).
A linguistic corpus is a collection of language samples chosen to model language use of a specific speech community and to provide primary materials for linguistic investigation. Modern digital corpora harness the quantitative power of computers for data-rich analysis in all areas of linguistic study. This course surveys the key principles of corpus linguistics and the criteria used in assembling linguistic corpora. It discusses the application of corpus-based investigations across linguistic research domains, and engages students in hands-on linguistic research using various types of corpora.
A linguistic corpus is a collection of language samples chosen to model language use of a specific speech community and to provide primary materials for linguistic investigation. Modern digital corpora harness the quantitative power of computers for data-rich analysis in all areas of linguistic study. This course surveys the key principles of corpus linguistics and the criteria used in assembling linguistic corpora. It discusses the application of corpus-based investigations across linguistic research domains, and engages students in hands-on linguistic research using various types of corpora.