Xaira

Xaira

Xaira is an XML Aware Indexing and Retrieval Architecture developed at Oxford University. It is based on SARA, an SGML-aware text-searching system originally developed for searching the British National Corpus. Xaira has been redeveloped as a generic XML system for constructing query-systems for any kind of XML data, in particular for use with TEI. The current Windows implementation is intended for non-specialist users. A more sophisticated and open-source version is currently under development. This version supports cross-platform working using standards such as XML-RPC and SOAP.

External links

* [http://www.oucs.ox.ac.uk/rts/xaira/ Home page]
** [http://www.oucs.ox.ac.uk/rts/xaira/Doc/ Preliminary documentation]
** [http://xaira.sourceforge.net/ Sourceforge site]
* [http://drh2004.ncl.ac.uk/abstract.php?abstract=218 A talk on Xaira]
* [http://www.natcorp.ox.ac.uk/ The British National Corpus]
** [http://www.natcorp.ox.ac.uk/corpus/babyinfo.html The BNC baby demo CD-ROM] also [http://www.uib.no/mailman/public/corpora/2004-October/000069.html] , [http://listserv.brown.edu/archives/cgi-bin/wa?A2=ind0410&L=tei-l&F=&S=&P=4323] and [http://lists.village.virginia.edu/lists_archive/Humanist/v18/0296.html]

ee also

* Corpus linguistics
*Lemmatization


Wikimedia Foundation. 2010.

Игры ⚽ Нужно сделать НИР?

Look at other dictionaries:

  • Corpus linguistics — is the study of language as expressed in samples (corpora) or real world text. This method represents a digestive approach to deriving a set of abstract rules by which a natural language is governed or else relates to another language. Originally …   Wikipedia

  • Sara — Contents 1 Ethnic groups and languages 2 Legislation 3 Media and entertainment …   Wikipedia

  • British National Corpus — The British National Corpus (or just BNC) is a 100 million word text corpus of samples of written and spoken English from a wide range of sources. It was compiled as a general corpus (text collection) in the field of corpus linguistics. The… …   Wikipedia

  • Manuel del Pópulo Vicente García — (January 21, 1775 June 10, 1832) was a noted Spanish opera singer, composer and singing teacher.García was born in Seville, Spain. In 1808 he went to Paris with a reputation already gained as a tenor at Madrid and Cadiz. By 1808, when he appeared …   Wikipedia

  • Beta Code — Le Beta Code est un système de représentation des caractères et de la ponctuation de plusieurs langues anciennes (dont notamment le grec ancien), à l’aide des caractères ASCII. Son but n’est pas tant de décrire une norme de romanisation que de… …   Wikipédia en Français

  • Alpujarra Granadina — Artículo principal: La Alpujarra Alpujarra Granadina Comarca de España …   Wikipedia Español

  • Beta Code — Saltar a navegación, búsqueda El Beta Code (en inglés literalmente ‘código beta’) es un método de representar, usando solo caracteres ASCII, las letras y formatos presentes en textos escritos en griego antiguo (y otros lenguajes arcaicos). Su… …   Wikipedia Español

  • Granada musulmana — La ciudad de Granada, como entidad urbana, se remonta al siglo XI, cuando se produce el abandono de Medina Elvira, capital de la Cora de Elvira, como consecuencia de la desaparición del Califato de Córdoba y de la fundación del reino zirí de… …   Wikipedia Español

  • Manuel del Pópulo Vicente García — Manuel García en el papel de Otello …   Wikipedia Español

  • Vicente Antonio García de la Huerta — Para otros usos de este término, véase Vicente García. Vicente Antonio García de la Huerta (* Zafra, provincia de Badajoz, España, 9 de marzo de 1734 † Madrid, España, 12 de marzo de 1787) fue un poeta y dramaturgo español, hermano del sacerdote… …   Wikipedia Español

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”