- Shogun (toolbox)
Infobox Software
name = Shogun
caption =
collapsible =
author = Gunnar Raetsch
Soeren Sonnenburg
developer =
released =
latest release version = 0.6.4
latest release date =August 15 ,2008
latest preview version =
latest preview date =
frequently updated =
programming language =C++
operating system =Linux ,Mac OS X
platform =
size =
language =
status =
genre =Machine learning
license =GNU General Public License v3
website = http://www.shogun-toolbox.org/Shogun is an
Free software ,open source toolbox written inC++ . It offers numerous algorithms and data structures formachine learning problems.Shogun is licensed under the terms of the
GNU General Public License version 3 or later.Description
The focus of "Shogun" is on kernel machines such as
support vector machine s for regression and classification problems. "Shogun" also offers a full implementation ofHidden Markov model s.The core software itself is written in C++ and offers interfaces forMATLAB , Octave, Python and R."Shogun" has been under active development since 1999. Today there is a vibrant user community all over the world using "Shogun" as a base for research and education, and contributing to the core package.Supported algorithms
Currently "Shogun" supports the following algorithms:
* Kernel Ridge Regression
* Hidden Markov Models
*Support vector machine
* K-Nearest Neighbors
*Linear discriminant analysis
* Kernel Perceptrons.Many different kernels are implemented, ranging from kernels for numerical data (such as gaussian or linear kernels) to kernels on special data (such as strings over certain alphabets). The currently implemented kernels for numeric data include:
* linear
* gaussian
* polynomial
* sigmoid kernelsThe supported kernels for special data include:
* Spectrum
* Weighted Degree
* Weighted Degree with ShiftsThe latter group of kernels allows processing of arbitrary sequences over fixed alphabets such as
DNA sequence s as well as whole e-mail texts.Special features
As "Shogun" was developed with
bioinformatics applications in mind it is capable of processing huge datasets consisting of up to 10 million samples. "Shogun" supports the use of pre-calculated kernels. It is also possible to use a combined kernel i.e. a kernel consisting of a linear combination of arbitrary kernels over different domains. The coefficients or weights of the linear combination can be learned as well. For this purpose "Shogun" offers a "multiple kernel learning" functionality.References
* C.Cortes und V.N. Vapnik. "Support-vector networks"
Machine Learning , 20(3):273--297, 1995.
* S.Sonnenburg, G.Rätsch, C.Schäfer und B.Schölkopf:, "Large Scale Multiple Kernel Learning"Journal of Machine Learning Research ,7:1531-1565, July 2006, K.Bennett and E.P.-Hernandez Editors.
* T.Joachims. "Making large-scale SVM learning practical" In B.Schölkopf, C.J.C. Burges, and A.J. Smola, editors, "Advances in Kernel Methods - Support Vector Learning", pages 169--184, Cambridge, MA, 1999. MIT Press.
* C.-C. Chang and C.-J. Lin, "LIBSVM : a library for support vector machines", 2001.External links
* [http://www.shogun-toolbox.org/ Shogun toolbox homepage]
* [http://freshmeat.net/projects/shogun/ SHOGUN] on Freshmeat
Wikimedia Foundation. 2010.