- Languageware
LanguageWare is a
natural language processing (NLP) technology, developed byIBM , that allows applications to "understand" natural language text. It comprises a light-weight set of Java libraries which provide a range ofNLP functions; language identification, text segmentation and normalization, entity and relationship extraction, and semantic analysis and disambiguation.The LanguageWare analysis engine is based upon a
Finite State Machine . It manages to be very fast, while still maintaining a reasonably small dictionary size, because it uses a variety of node layout types [Troussov 2003] . This approach works very well because of the statistical properties of natural language dictionaries [Troussov 2003a] .The behaviour of the system is driven through a set of lexico-semantic resources which describe the characteristics of the language and domain. A default set of resources come as part of LanguageWare and these describe the native language characteristics, such as morphology, and the basic vocabulary for the language. Supplemental resources are available which capture additional vocabularies, terminologies, rules and grammars, which may be generic to the language or specific to one or more domains.
A set of Eclipse-based customization tooling, LanguageWare Resource Workbench, is provided to allow any type of domain knowledge to be compiled into these resources and thereby incorporated into the analysis process.
LanguageWare can be deployed as a set of
UIMA -compliant annotators, Eclipse plug-ins orWeb Services .See also
*
UIMA
*Linguistics
*Semantics
*Semantic Web
*Web services
*Service-oriented architecture
*Formal language
*Finite state machine Interesting links
* [http://www.alphaworks.ibm.com/tech/lrw IBM LanguageWare Resource Workbench on alphaWorks]
* [http://www.alphaworks.ibm.com/tech/galaxy IBM LanguageWare Miner for Multidimensional Socio-Semantic Networks on alphaWorks]
* [http://incubator.apache.org/uima/ UIMA Homepage at the Apache Software Foundation]
* [http://sourceforge.net/projects/uima-framework UIMA Framework on SourceForge]
* [http://omnifind.ibm.yahoo.net/ IBM OmniFind Yahoo! Edition (FREE enterprise search engine)]
* [http://smile.deri.ie/ Semantic Information Systems and Language Engineering Group]
* [http://www.semanticdesktop.org/xwiki/bin/view/Main/ SemanticDesktop.org]References
Citations
*wikicite|id=idTroussov2003|reference=http://www.springerlink.com/content/dl6l5qq3wq0be630/ A. Troussov, B. O'Donovan, S. Koskenniemi, and N. Glushnev, "Per-Node Optimization of Finite-State Mechanisms for Natural Language Processing," Computational Linguistics and Intelligent Text Processing, Lecture Notes in Computer Science N 2588, Springer-Verlag, pp.223-227, (2003)
*wikicite|id=idTroussov2003a|reference=http://www.iol.ie/~bodonovan/pubs/RA-NLP-03.pdf A. Troussov and B. O’Donovan, "Statistics of Morphological Finite-State Transition Networks Obey the Power Law," Proceedings of the International Conference RANLP -- 2003 (Recent Advances in Natural Language Processing) 10-12 September 2003, Borovets, Bulgaria|
Additional Related Papers
* [http://www.dfki.uni-kl.de/~grimnes/2006/12/telco/IBM_OntologicalNetworkMiner_v1_2.pdf Alexander Troussov, Mikhail Sogrin, "IBM LanguageWare Ontological Network Miner"]
* [http://www.friemel.com/asna/papers/ASNA2007_Paper_Kinsella.pdf Sheila Kinsella, Andreas Harth, Alexander Troussov, Mikhail Sogrin, John Judge, Conor Hayes, John G. Breslin, "Navigating and Annotating Semantically-Enabled Networks of People and Associated Objects"]
* [http://www.scai.fraunhofer.de/fileadmin/download/vortraege/tms_06/IBM_Nevidomsky.pdf Alex Nevidomsky, "UIMA Framework and Knowledge Discovery at IBM", Presentation at 4th Text Mining Symposium, Fraunhofer SCAI, 2006]
* [http://www.fzi.de/KCMS/kcms_file.php?action=link&id=676 Mikhail Kotelnikov, Alexander Polonsky, Malte Kiesel, Max Völkel, Heiko Haller, Mikhail Sogrin, Pär Lannerö, Brian Davis, "Interactive Semantic Wikis"]
* [http://www.kde.org/kdeslides/fosdem2007/2007-02-vandenoever-kde_semantic.pdf Sebastian Trüg, Jos van den Oever, Stéphane Laurière, "The Social Semantic Desktop: Nepomuk"]
* [http://www.cs.tcd.ie/~slawless/papers/ah2006.pdf Séamus Lawless, Vincent Wade, "Dynamic Content Discovery, Harvesting and Delivery"]
* [http://www.cs.tcd.ie/~slawless/papers/ewb2007.pdf Séamus Lawless, "Open Corpus Learning Content; Harvesting Knowledge to provide Equitable Access to Education for All"]
Wikimedia Foundation. 2010.