British National Corpus

The British National Corpus (or just BNC) is a 100-million-word text corpus of samples of written and spoken English from a wide range of sources. It was compiled as a general corpus (text collection) in the field of corpus linguistics. The corpus covers British English of the late twentieth century from a wide variety of genres with the intention that it be a representative sample of spoken and written British English of that time.

The 10-million-word spoken corpus has two parts: a demographic part, containing transcriptions of spontaneous natural conversations recorded by members of the public, and a context-governed part, containing transcriptions of recordings made at specific types of meeting and event. All the original recordings transcribed for inclusion in the BNC have been deposited at the National Sound Archive of the British Library.

The corpus is marked up following the recommendations of the Text Encoding Initiative and includes full linguistic annotation and contextual information. The most recent edition, from March 2007, is distributed in XML format along with the XAIRA software. It is freely available under a licence and is very widely distributed.
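
As a rough illustration of what word-level annotation in an XML-encoded corpus allows, the following Python sketch reads a small hand-written fragment and tallies its part-of-speech labels. The fragment is invented for this example, and the element and attribute names (`s`, `w`, `c`, `c5`, `hw`, `pos`) are assumptions about the markup rather than a statement of the corpus schema; the function names `count_pos` and `headwords` are likewise hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hand-written sample sentence, NOT taken from the corpus.
# Element and attribute names are assumed for illustration only.
SAMPLE = (
    '<s n="1">'
    '<w c5="AT0" hw="the" pos="ART">The </w>'
    '<w c5="NN1" hw="corpus" pos="SUBST">corpus </w>'
    '<w c5="VBZ" hw="be" pos="VERB">is </w>'
    '<w c5="AJ0" hw="large" pos="ADJ">large</w>'
    '<c c5="PUN">.</c>'
    '</s>'
)


def count_pos(xml_text):
    """Count the value of the assumed 'pos' attribute over all <w> elements."""
    root = ET.fromstring(xml_text)
    return Counter(w.get("pos") for w in root.iter("w"))


def headwords(xml_text):
    """Return the assumed 'hw' (headword) attribute of each <w> element, in order."""
    root = ET.fromstring(xml_text)
    return [w.get("hw") for w in root.iter("w")]


if __name__ == "__main__":
    print(headwords(SAMPLE))
    print(count_pos(SAMPLE))
```

The same pattern would extend to a whole file by passing its contents to the functions, provided the actual element names are checked against the distribution's own documentation first.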

See also

* Corpus of Contemporary American English (COCA): 360 million words, 1990-2007; freely available online.
* American National Corpus
* Oxford English Corpus

External links

* [http://www.natcorp.ox.ac.uk British National Corpus website]
* [http://corpus.byu.edu/bnc Free BNC interface]


Wikimedia Foundation. 2010.
