- Backpropagation
Backpropagation, or propagation of error, is a common method of teaching
artificial neural network s how to perform a given task. It was first described byPaul Werbos in1974 , but it wasn't until1986 , through the work ofDavid E. Rumelhart ,Geoffrey E. Hinton andRonald J. Williams , that it gained recognition, and it led to a “renaissance” in the field of artificial neural network research.It is a
supervised learning method, and is an implementation of theDelta rule . It requires a teacher that knows, or can calculate, the desired output for any given input. It is most useful forfeed-forward networks (networks that have no feedback, or simply, that have no connections that loop). The term is an abbreviation for "backwards propagation of errors". Backpropagation requires that theactivation function used by theartificial neuron s (or "nodes") isdifferentiable .ummary
Summary of the technique:
# Present a training sample to the neural network.
# Compare the network's output to the desired output from that sample. Calculate the error in each output neuron.
# For each neuron, calculate what the output should have been, and a "scaling factor", how much lower or higher the output must be adjusted to match the desired output. This is the local error.
# Adjust the weights of each neuron to lower the local error.
# Assign "blame" for the local error to neurons at the previous level, giving greater responsibility to neurons connected by stronger weights.
# Repeat from step 3 on the neurons at the previous level, using each one's "blame" as its error..Algorithm
Actual algorithm for a 3-layer network (only one hidden layer):
Initialize the weights in the network (often randomly) Do For each example e in the training set O = neural-net-output(network, e) ; forward pass T = teacher output for e Calculate error (T - O) at the output units Compute delta_wi for all weights from hidden layer to output layer ; backward pass Compute delta_wi for all weights from input layer to hidden layer ; backward pass continued Update the weights in the network Until all examples classified correctly or stopping criterion satisfied Return the network
As the
algorithm 's name implies, the errors (and therefore the learning) propagate backwards from the output nodes to the inner nodes. So technically speaking, backpropagation is used to calculate the gradient of the error of the network with respect to the network's modifiable weights. This gradient is almost always then used in a simplestochastic gradient descent algorithm to find weights that minimize the error. Often the term "backpropagation" is used in a more general sense, to refer to the entire procedure encompassing both the calculation of the gradient and its use in stochastic gradient descent. Backpropagation usually allows quick convergence on satisfactory local minima for error in the kind of networks to which it is suited.It is important to note that backpropagation networks are necessarily multilayer perceptrons (usually with one input, one hidden, and one output layer). In order for the hidden layer to serve any useful function, multilayer networks must have non-linear activation functions for the multiple layers: a multilayer network using only linear activiation functions is equivalent to some single layer, linear network. Non-linear activation functions that are commonly used include the
logistic function , the softmax function, and thegaussian function .The backpropagation algorithm for calculating a gradient has been rediscovered a number of times, and is a special case of a more general technique called
automatic differentiation in the reverse accumulation mode.It is also closely related to the
Gauss–Newton algorithm , and is also part of continuing research inneural backpropagation .External links
* Chapter 7 [http://page.mi.fu-berlin.de/rojas/neural/chapter/K7.pdf The backpropagation algorithm] of [http://page.mi.fu-berlin.de/rojas/neural/index.html.html "Neural Networks - A Systematic Introduction"] by
Raúl Rojas (ISBN 978-3540605058)
* [http://neurondotnet.freehostia.com NeuronDotNet - A modular implementation of artificial neural networks in C# along with sample applications]
* [http://www.codeproject.com/KB/recipes/BP.aspx Implementation of BackPropagation in C++]
* [http://www.codeproject.com/KB/cs/BackPropagationNeuralNet.aspx Implementation of BackPropagation in C#]
* [http://ai4r.rubyforge.org/neuralNetworks.html Implementation of BackPropagation in Ruby]
* [http://www.tek271.com/articles/neuralNet/IntoToNeuralNets.html Quick explanation of the backpropagation algorithm]
* [http://galaxy.agh.edu.pl/~vlsi/AI/backp_t_en/backprop.html Graphical explanation of the backpropagation algorithm]
* [http://www.speech.sri.com/people/anand/771/html/node37.html Concise explanation of the backpropagation algorithm using math notation]
* [http://fbim.fh-regensburg.de/~saj39122/jfroehl/diplom/e-13-text.html Detailed numeric demonstration of how backpropagation algorithms work]
* [http://en.wikiversity.org/wiki/Learning_and_Neural_Networks Backpropagation neural network tutorial at the Wikiversity]
Wikimedia Foundation. 2010.