Artificial neuron

An artificial neuron is a mathematical function conceived as a crude model, or abstraction of biological neurons. Artificial neurons are the constitutive units in an artificial neural network. Depending on the specific model used, it can receive different names, such as semi-linear unit, Nv neuron, binary neuron, linear threshold function or McCulloch–Pitts (MCP) neuron . The artificial neuron receives one or more inputs (representing the one or more dendrites) and sums them to produce an output (representing a biological neuron's axon). Usually the sums of each node are weighted, and the sum is passed through a non-linear function known as an activation function or transfer function. The transfer functions usually have a sigmoid shape, but they may also take the form of other non-linear functions, piecewise linear functions, or step functions. They are also often monotonically increasing, continuous, differentiable and bounded.

The artificial neuron transfer function should not be confused with a linear system's transfer function.

1 Basic structure
2 History
3 Types of transfer functions
4 Pseudocode algorithm
5 Spreadsheet example
6 Limitations
7 See also
8 References
9 Further reading
10 External links

Basic structure

For a given artificial neuron, let there be m + 1 inputs with signals x₀ through x_m and weights w₀ through w_m. Usually, the x₀ input is assigned the value +1, which makes it a bias input with w_k0 = b_k. This leaves only m actual inputs to the neuron: from x₁ to x_m.

The output of kth neuron is:

$y_k = \varphi \left( \sum_{j=0}^m w_{kj} x_j \right)$

Where $φ$ (phi) is the transfer function.

Artificial neuron.png

The output is analogous to the axon of a biological neuron, and its value propagates to input of the next layer, through a synapse. It may also exit the system, possibly as part of an output vector.

It has no learning process as such. It cannot decide its own, here weights are calculated and accordingly the threshold value is calculated.

History

The first artificial neuron was the Threshold Logic Unit (TLU) first proposed by Warren McCulloch and Walter Pitts in 1943. As a transfer function, it employed a threshold, equivalent to using the Heaviside step function. Initially, only a simple model was considered, with binary inputs and outputs, some restrictions on the possible weights, and a more flexible threshold value. Since the beginning it was already noticed that any boolean function could be implemented by networks of such devices, what is easily seen from the fact that one can implement the AND and OR functions, and use them in the disjunctive or the conjunctive normal form.

Researchers also soon realized that cyclic networks, with feedbacks through neurons, could define dynamical systems with memory, but most of the research concentrated (and still does) on strictly feed-forward networks because of the smaller difficulty they present.

One important and pioneering artificial neural network that used the linear threshold function was the perceptron, developed by Frank Rosenblatt. This model already considered more flexible weight values in the neurons, and was used in machines with adaptive capabilities. The representation of the threshold values as a bias term was introduced by Widrow in 1960^{[citation needed]}.

In the late 1980s, when research on neural networks regained strength, neurons with more continuous shapes started to be considered. The possibility of differentiating the activation function allows the direct use of the gradient descent and other optimization algorithms for the adjustment of the weights. Neural networks also started to be used as a general function approximation model.

Types of transfer functions

The transfer function of a neuron is chosen to have a number of properties which either enhance or simplify the network containing the neuron. Crucially, for instance, any multilayer perceptron using a linear transfer function has an equivalent single-layer network; a non-linear function is therefore necessary to gain the advantages of a multi-layer network.

Below, u refers in all cases to the weighted sum of all the inputs to the neuron, i.e. for n inputs,

$u = \sum_{i = 1}^n w_{i} x_{i}$

where w is a vector of synaptic weights and x is a vector of inputs.

Step function

The output y of this transfer function is binary, depending on whether the input meets a specified threshold, θ. The "signal" is sent, i.e. the output is set to one, if the activation meets the threshold.

$y = \left\{ \begin{matrix} 1 & \mbox{if }u \ge \theta \\ 0 & \mbox{if }u < \theta \end{matrix} \right.$

This function is used in perceptrons and often shows up in many other models. It performs a division of the space of inputs by a hyperplane. It is specially useful in the last layer of a network intended to perform binary classification of the inputs. It can be approximated from other sigmoidal functions by assigning large values to the weights.

Linear combination

In this case, the output unit is simply the weighted sum of its inputs plus a bias term. A number of such linear neurons perform a linear transformation of the input vector. This is usually more useful in the first layers of a network. A number of analysis tools exist based on linear models, such as harmonic analysis, and they can all be used in neural networks with this linear neuron. The bias term allows us to make affine transformations to the data.

See: Linear transformation, Harmonic analysis, Linear filter, Wavelet, Principal component analysis, Independent component analysis, Deconvolution.

Sigmoid

A fairly simple non-linear function, a Sigmoid function such as the logistic function also has an easily calculated derivative, which can be important when calculating the weight updates in the network. It thus makes the network more easily manipulable mathematically, and was attractive to early computer scientists who needed to minimize the computational load of their simulations. It is commonly seen in multilayer perceptrons using a backpropagation algorithm.

See: Sigmoid function

Pseudocode algorithm

The following is a simple pseudocode implementation of a single TLU which takes boolean inputs (true or false), and returns a single boolean output when activated. An object-oriented model is used. No method of training is defined, since several exist. If a purely functional model were used, the class TLU below would be replaced with a function TLU with input parameters threshold, weights, and inputs that returned a boolean value.

 class TLU defined as:
  data member threshold : number
  data member weights : list of numbers of size X
  function member fire( inputs : list of booleans of size X ) : boolean defined as:
   variable T : number
   T ← 0
   for each i in 1 to X :
    if inputs(i) is true :
     T ← T + weights(i)
    end if
   end for each
   if T > threshold :
    return true
   else:
    return false
   end if
  end function
 end class

Spreadsheet example

		Input			Initial		Output						Final
Threshold	Learning Rate	Sensor values		Desired output	Weights		Calculated		Sum	Network	Error	Correction	Weights
TH	LR	X1	X2	Z	w1	w2	C1	C2	S	N	E	R	W1	W2
							X1 x w1	X2 x w2	C1+C2	IF(S>TH,1,0)	Z-N	LR x E	R+w1	R+w2
0.5	0.2	0	0	0	0.1	0.3	0	0	0	0	0	0	0.1	0.3
0.5	0.2	0	1	1	0.1	0.3	1	0.3	0.3	0	1	0.2	0.3	0.5
0.5	0.2	1	0	1	0.3	0.5	0.3	0	0.3	0	1	0.2	0.5	0.7
0.5	0.2	1	1	1	0.5	0.7	0.5	0.7	1.2	1	0	0	0.5	0.7
0.5	0.2	0	0	0	0.5	0.7	0	0	0	0	0	0	0.5	0.7
0.5	0.2	0	1	1	0.5	0.7	0	0.7	0.7	1	0	0	0.5	0.7
0.5	0.2	1	0	1	0.5	0.7	0.5	0	0.5	0	1	0.2	0.7	0.9
0.5	0.2	1	1	1	0.7	0.9	0.7	0.9	1.6	1	0	0	0.7	0.9
0.5	0.2	0	0	0	0.7	0.9	0	0	0	0	0	0	0.7	0.9
0.5	0.2	0	1	1	0.7	0.9	0	0.9	0.9	1	0	0	0.7	0.9
0.5	0.2	1	0	1	0.7	0.9	0.7	0	0.7	1	0	0	0.7	0.9
0.5	0.2	1	1	1	0.7	0.9	0.7	0.9	1.6	1	0	0	0.7	0.9

Supervised neural network training for an OR gate.

Note: Initial weight equals final weight of previous iteration.

Limitations

Artificial neurons of simple types, such as the McCulloch–Pitts model, are sometimes characterized as "caricature models", in that they are intended to reflect one or more neurophysiological observations, but without regard to realism.^[1]

References

^ F. C. Hoppensteadt and E. M. Izhikevich (1997). Weakly connected neural networks. Springer. p. 4. ISBN 9780387949482.

External links

[2] A good general overview

Categories:

Neural networks
American inventions

Wikimedia Foundation. 2010.

Игры ⚽ Нужна курсовая?

Look at other dictionaries:

Neuron (disambiguation) — Neuron is one of the primary cell types in the nervous system. Neuron may also refer to: Artificial neuron is the basic unit in an artificial neural network Neuron (synthesizer) is an electronic musical instrument The Dassault nEUROn is a planned … Wikipedia
Artificial consciousness — (AC), also known as machine consciousness (MC) or synthetic consciousness, is a field related to artificial intelligence and cognitive robotics whose aim is to define that which would have to be synthesized were consciousness to be found in an… … Wikipedia
artificial intelligence — the capacity of a computer to perform operations analogous to learning and decision making in humans, as by an expert system, a program for CAD or CAM, or a program for the perception and recognition of shapes in computer vision systems. Abbr.:… … Universalium
Artificial Imagination — (AIm), also called Synthetic Imagination or machine imagination is defined as artificial simulation of human imagination by general or special purpose computers or artificial neural networks. The term artificial imagination is also used to… … Wikipedia
Artificial neural network — An artificial neural network (ANN), usually called neural network (NN), is a mathematical model or computational model that is inspired by the structure and/or functional aspects of biological neural networks. A neural network consists of an… … Wikipedia
Artificial Intelligence System — For the computer science and machine intelligence articles see Artificial Intelligence. Artificial Intelligence System is a distributed computing project being undertaken by Intelligence Realm, Inc. with the long term goal of simulating the human … Wikipedia
Neuron (sintetizador) — El Hartmann Neuron fue un instrumento musical electrónico diseñado y construido por el diseñador industrial Axel Hartmann, de la compañía alemana Hartmann Music, en el período comprendido entre 2001 y 2005. El sintetizador utilizaba una Red… … Wikipedia Español
Artificial Neural Network — Réseau de neurones Pour les articles homonymes, voir Réseau. Vue simplifiée d un réseau artificiel de neurones Un réseau de neurones artificiel est un modèle de c … Wikipédia en Français
Biological neuron model — A biological neuron model (also known as spiking neuron model) is a mathematical description of the properties of nerve cells, or neurons, that is designed to accurately describe and predict biological processes. This is in contrast to the… … Wikipedia
Hartmann Neuron — Neuron Manufactured by Hartmann Dates 2003 Price USD 5000, at introduction Technical specifications Polyphony 16 48 voices Timbrality 4 16 … Wikipedia

Academic Dictionaries and Encyclopedias

Artificial neuron

Contents

Basic structure

History

Types of transfer functions

Step function

Linear combination

Sigmoid

Pseudocode algorithm

Spreadsheet example

Limitations

See also

References

Further reading

External links

Look at other dictionaries:

Share the article and excerpts

Academic Dictionaries and Encyclopedias

Wikipedia

Artificial neuron

Contents

Basic structure

History

Types of transfer functions

Step function

Linear combination

Sigmoid

Pseudocode algorithm

Spreadsheet example

Limitations

See also

References

Further reading

External links

Look at other dictionaries:

Share the article and excerpts

Direct link