Asia Online

Asia Online is a Thailand-based company undertaking what it calls the world's largest literacy project: translating vast quantities of the world's English-language knowledge into Asian languages. It does this using statistical machine translation (SMT) technologies developed and enhanced in Thailand with a specific focus on Asian languages.

It was founded in 2006 by the University of Edinburgh's Philipp Koehn; Gregory Binger, a leading technologist and IT/IP lawyer; and former Gartner senior analysts Bob Hayward and Dion Wiggins.

Asia Online's statistically based translation software is an instance of a recent advance in automated translation. Earlier machine translation technology relied on collections of linguistic rules to analyze the source sentence and then map its syntactic and semantic structure into the target language. Asia Online instead uses statistical techniques from cryptography, applying machine learning algorithms that automatically acquire statistical models from existing parallel collections of human translations.
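As a rough illustration of how a statistical model can be acquired from parallel translations, the sketch below estimates word translation probabilities with the textbook IBM Model 1 expectation-maximization procedure. It is not Asia Online's system; the function name `train_ibm_model1` and the three-sentence toy corpus are assumptions made for this example only.

```python
from collections import defaultdict

def train_ibm_model1(pairs, iterations=10):
    """Estimate t(target_word | source_word) from sentence pairs using EM.

    `pairs` is a list of (source_tokens, target_tokens) tuples.
    Returns a dict keyed by (target_word, source_word).
    """
    src_vocab = {s for src, _ in pairs for s in src}
    # Start from a uniform guess; EM only needs the relative values.
    t = defaultdict(lambda: 1.0 / len(src_vocab))
    for _ in range(iterations):
        count = defaultdict(float)   # expected co-translation counts
        total = defaultdict(float)   # expected counts per source word
        # E-step: share each target word among the source words of its sentence.
        for src, tgt in pairs:
            for w in tgt:
                norm = sum(t[(w, s)] for s in src)
                for s in src:
                    frac = t[(w, s)] / norm
                    count[(w, s)] += frac
                    total[s] += frac
        # M-step: re-estimate probabilities from the expected counts.
        for (w, s), c in count.items():
            t[(w, s)] = c / total[s]
    return t

# Toy parallel corpus (illustrative only).
corpus = [
    ("the house".split(), "das haus".split()),
    ("the book".split(), "das buch".split()),
    ("a book".split(), "ein buch".split()),
]
model = train_ibm_model1(corpus)
```

No dictionary or grammar rule is supplied here: the preference for pairing "house" with "haus" rather than "das" emerges only from which words co-occur across the aligned sentences.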

Until early 2008, Google, Microsoft and Language Weaver had publicly available SMT systems. Asia Online claims that the existing SMT processes and techniques are flawed and says it has worked to resolve these issues. It claims three key differences from traditional SMT approaches:
* Clean data - The traditional approach leveraged content found on the web in corporate sites, news articles and other similar sources where the same content was available in multiple languages. The quality of the data was very low. Asia Online has focussed machine and human resources in this area to ensure that the data is as clean and as accurate as possible. Data is sourced from high quality translations provided by book publishers and translation companies and is aligned at the segment level (usually sentences) and converted into a consistent format in order to be processed by the learning software. This step includes:
** Extracting segments from files and documents if they are not in a TMX format.
** Aligning segments (if necessary) once they have been extracted. While this is automated by machines, humans are also used to validate the accuracy.
** Converting data to a base UTF-8 encoding for training the SMT system.
** Extracting small subsets from the data to guide training.
** Reviewing, cleaning and analyzing the data to ensure optimal training impact.

* Multiple Domains - Extensive efforts have been put into a system that allows for training in many domains. This is done by extending a base set of information with multiple additional learning sources.

* Real Time Corrections

* Languages Available - Asia Online currently has 203 language pairs available in a baseline form and several with domain data. These systems are currently used to build customized translation systems for corporate and language service provider (LSP) customers who add their bilingual parallel corpus to the existing data to create higher quality translation systems. These available languages include English, French, Italian, German, Spanish, Portuguese, Dutch, Swedish, Danish, Greek, Finnish, Thai, Simplified Chinese and Hindi.

Asia Online is also building SMT systems for English to Indonesian, Malay, Vietnamese, Tagalog, Traditional Chinese, Japanese and Korean.

See also

Google Translate

External links

* Company Homepage

Wikimedia Foundation. 2010.
