Corpus


 

In law, a corpus (Latin: "body") is a collection of documents and sources. See Corpus Juris Civilis.

Related Topics:
Law - Latin - Corpus Juris Civilis

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

In linguistics, a corpus (plural corpora) is a large and structured set of texts (now usually electronically stored and processed). A corpus may contain texts in a single language (monolingual corpus) or text data in multiple languages (multilingual corpus). Multilingual corpora that have been specially formatted for side-by-side comparison are called aligned parallel corpora.
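
As a rough illustration of side-by-side alignment, the sketch below pairs sentences from two languages by position. The sentences and the helper name align are made up for this example, and real aligned corpora usually carry more structure (identifiers, one-to-many links) than a simple list of pairs.

```python
# Toy sentence-aligned parallel corpus (illustrative data only).
english = ["Good morning.", "Thank you."]
french = ["Bonjour.", "Merci."]


def align(source, target):
    """Pair sentences by position; assumes both sides are already
    segmented so that sentence i on one side translates sentence i
    on the other (an assumption of this sketch)."""
    if len(source) != len(target):
        raise ValueError("both sides must have the same number of sentences")
    return list(zip(source, target))


parallel = align(english, french)
for en, fr in parallel:
    print(f"{en}\t{fr}")
```

Each pair can then be read or processed side by side, which is the point of the aligned format.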

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

To make corpora more useful for linguistic research, they are often subjected to a process known as annotation.

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

An example of annotating a corpus is part-of-speech tagging, or POS-tagging, in which information about each word's part of speech (verb, noun, adjective, etc.) is added to the corpus in the form of tags. Another example is marking the lemma (base form) of each word that has been inflected or otherwise modified from its root form.
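
A minimal sketch of what such annotation might look like in practice follows. The tag names, the tiny hand-written LEXICON, and the annotate helper are all assumptions made for this illustration; actual corpora use established tagsets and trained taggers rather than a lookup table.

```python
from typing import NamedTuple


class Token(NamedTuple):
    form: str   # the word as it appears in the text
    pos: str    # part-of-speech tag
    lemma: str  # base (dictionary) form


# Hypothetical hand-written lexicon, for illustration only.
LEXICON = {
    "the": ("DET", "the"),
    "cats": ("NOUN", "cat"),
    "were": ("VERB", "be"),
    "sleeping": ("VERB", "sleep"),
}


def annotate(sentence: str) -> list[Token]:
    """Attach a POS tag and a lemma to each whitespace-separated word.
    Unknown words get the placeholder tag "X" and keep their own form
    as the lemma."""
    tokens = []
    for word in sentence.lower().split():
        pos, lemma = LEXICON.get(word, ("X", word))
        tokens.append(Token(word, pos, lemma))
    return tokens


annotated = annotate("The cats were sleeping")
for tok in annotated:
    print(f"{tok.form}/{tok.pos}/{tok.lemma}")
```

Storing the tag and lemma alongside each word lets a researcher search, for example, for every form of "sleep" or every noun, regardless of inflection.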

Related Topics:
Part-of-speech tagging - Lemma

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

Corpora are the main knowledge base in corpus linguistics.

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

In biology, corpus refers to the main body/mass/part of an organ or other anatomical structure, distinguished from the head or tail.

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

In Christianity, the corpus refers to the body of Christ and is often used as a technical term for the figure that hangs on a crucifix.

Related Topics:
Christianity - Christ - Crucifix

~ ~ ~ ~ ~ ~ ~ ~ ~ ~