A General Service List of English Words

A General Service List of English Words

The General Service List is a list of roughly 2000 words published by Michael West in 1953. The words were selected to represent the most frequent words of English and were taken from a corpus of written English. The target audience was English language learners and ESL teachers. To maximize the utility of the list, some frequent words that overlapped broadly in meaning with words already on the list were omitted. In the original publication the relative frequencies of various senses of the words were also included.

The list is important because a person who knows all the words on the list and their related families would understand approximately 90-95 percent of colloquial speech and 80-85 percent of common written texts. The list consists only of headwords, which means that the word "be" is high on the list, but assumes that the person is fluent in all forms of the word, e.g. am, is, are, was, were, being and been.

Researchers have expressed doubts about the adequacy of the GSL because of its age and the relatively low coverage provided by the words not in the first 1000 words of the list (Engels, 1968). Engels was, in particular, critical of the limited vocabulary chosen by West 1953, and while he concurred that the first 1000 words of the GSL were good selections based on their high frequency and wide range, he was of the opinion that that the words beyond the first 1000 of the GSL could not be considered general service words because the range and frequency of these words were too low to be included in the list. Recent research by Billuroğlu and Neufeld (2005) confirmed that the General Service List was in need of minor revision, but the headwords in the list still provide approximately 80% text coverage in written English. The research showed that the GSL contains a small number of archaic terms, such as shilling, while excluding words that have gained currency since the first half of the twentieth century, e.g. plastic, television, battery, okay, victim, drug, etc.

The GSL evolved over several decades before West’s publication in 1953. Contrary to popular belief, the GSL is not a list based solely on frequency, but includes groups of words on a semantic basis (Nation & Waring, 2004; Dickins). Today there is no version of the GSL in print; it only exists in virtual form via the Internet. Various versions float around the Internet, and attempts have been made to improve it (Bauman, 1995). However, for practical purposes, one of its most accessible formats exists in the [http://lextutor.ca Compleat Lexical Tutor] web site created by Tom Cobb, where it can be viewed, downloaded or used for vocabulary profiling of texts. The headwords and family members in this version of the GSL conform to Bauer and Nation, level 6.

The [http://www.lextutor.ca/vp/eng/ Classic Vocabulary Profiler] , produces output in coloured form—blue for K1 (the first 1,000 words of the GSL), green for K2 (the second 1,000 words of the GSL), yellow for Academic word list, and red for words that are not in any of the lists.

For other types of reading different lists may be required. For example when reading texts in the academic genre, a person may wish to consult the Academic word list. For students studying English as a foreign language, a person may wish to consult the [http://www.editthis.info/thebnl/Main_Page BNL2709] and use the [http://lextutor.ca/vp/bnl BNL profiler] at the Compleat Lexical Tutor web site.

External links

The actual list, along with a more detailed discussion is available [http://jbauman.com/aboutgsl.html here]

Heatley, Nation, and Coxhead have made their RANGE and FREQUENCY programs available for [http://www.vuw.ac.nz/lals/staff/Paul_Nation download] and include the GSL and the AWL as TXT files for vocabulary profiling.

An extended version of the General Service List in Excel format produced by Dickins (http://www.languages.salford.ac.uk/staff/dickins.php) can be found on [http://www.languages.salford.ac.uk/staff/dickins/GSLlist2.xls] .

This provides in a manipulable format all the information in the General Service List, plus a semantic-field categorisation of each entry using the categories in the Longman Lexicon of Contemporary English(LLCE) by Tom McArthur (Longman, 1981). This latter allows entries to be ordered or extracted according to semantic categories. Details of the LLCE categories can be found on the same webpage. The full categories for the extended version of the General Service List are as follows:

  1. Line numbering (order in which entries appear in the printed version of the General Service List);
  2. Headword as given in the General Service List;
  3. Lemmatized headword, i.e. standard dictionary-type headword;
  4. McArthur category;
  5. Word-class;
  6. Word count 1, as given in the General Service List;
  7. Word count 2, 'raw' word count without additional information given in the General Service List;
  8. Percentage scores for occurrences of words in a particular sense in the General Service List;
  9. Word-in-sense frequency;
  10. Source of information (given throughout as GenSerList);
  11. Meaning (as given in General Service List).

References

Bauer, L. and Nation, I.S.P. (1993). Word families, International Journal of Lexicography 6, 3 (1993) 1-27.

Billuroğlu, A. & Neufeld, S. (2005). The Bare Necessities in Lexis: a new perspective on vocabulary profiling. Retrieved September 2007 from http://lextutor.ca/vp/BNL_Rationale.doc

Dickins, J. (n.d.). Extended Version of Rank Frequency List: Spoken English, retrieved 3 December 2007 from http://www.languages.salford.ac.uk/staff/dickins.php.

Engels, L.K. (1968). The fallacy of word counts. IRAL 6: 213-231.

Heatley, A., Nation, I.S.P. and Coxhead, A. 2002. RANGE and FREQUENCY programs. http://www.vuw.ac.nz/lals/staff/Paul_Nation

Nation, P., & Waring, R. (2004). Vocabulary size, text coverage and word lists. Retrieved September 2007 from http://www.wordhacker.com


Wikimedia Foundation. 2010.

Игры ⚽ Нужно сделать НИР?

Look at other dictionaries:

  • List of English words of Russian origin — Including English, contain words most likely borrowed from the Russian language. Not all of the words are truly fluent Russian or Slavic origin. Some of them co exist in other Slavic languages and it is difficult to decide whether they made… …   Wikipedia

  • List of English words of Persian origin — As Indo European languages, English and Persian have many words of common Proto Indo European origin, and many of these cognate words often have similar forms. Examples of these include: English (Mother) and Persian (Madar), English (Father) and… …   Wikipedia

  • List of English words with disputed usage — Some English words are often used in ways that are contentious between writers on usage and prescriptive commentators. The contentious usages are especially common in spoken English. While in some circles the usages below may make the speaker… …   Wikipedia

  • List of commonly misused English words — This is a list of English words which are commonly misused. It is meant to include only words whose misuse is deprecated by most usage writers, editors, and other professional linguists of Standard English. It is possible that some of the… …   Wikipedia

  • List of British words not widely used in the United States — Differences between American and British English American English …   Wikipedia

  • English words first attested in Chaucer — Contents 1 Etymology 2 List 2.1 Canterbury Tales General Prologue …   Wikipedia

  • List of American words not widely used in the United Kingdom — This is a list of American words not widely used in the United Kingdom.* Words with specific American meanings that have different meanings in British English and/or additional meanings common to both dialects (e.g. pants , crib ) are to be found …   Wikipedia

  • Most common words in English — The list below of most common words in English cannot be definitive. It is based on an analysis of the Oxford English Corpus of over a billion words, and represents one study done by Oxford Online, associated with the Oxford English… …   Wikipedia

  • Swadesh list — A Swadesh list is one of several lists of vocabulary with basic meanings, developed by Morris Swadesh in the 1940 50s, which is used in lexicostatistics (quantitative language relatedness assessment) and glottochronology (language divergence… …   Wikipedia

  • List of gairaigo and wasei-eigo terms — This is a selected list of gairaigo, Japanese words originating or based on foreign language (generally Western) terms, including wasei eigo (Japanese pseudo Anglicisms). Many derive from Portuguese, due to Portugal s early role in Japanese… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”