Skip to main content
UMass Amherst Libraries

Linguistics

WHAT IS A LINGUISTIC CORPUS?

A linguistic corpus is a collection of language and linguistic data which has been collected for the purpose of analysis. A corpus is useful in many ways; it enables researchers to formulate hypotheses about the workings of language as well as provides statistics and metrics to reinforce theories and research.

Corpus linguistics refers to a field of study that analyzes naturally-occurring language structure and use through the collection of samples of spoken or written language.

RESOURCES