- Holland's schema theorem
Holland's schema theorem is widely taken to be the foundation for explanations of the power of
genetic algorithms .A schema is a template that identifies a
subset of strings with similarities at certain string positions. Schemata are a special case ofcylinder set s; and so form atopological space .Description
For example, consider binary strings of length 6. The schema 1**0*1 describes the set of all strings of length 6 with 1's at positions 1 and 6 and a 0 at position 4. The * is a
wildcard symbol, which means that positions 2, 3 and 5 can have a value of either 1 or 0. The "order of a schema" is defined as the number of fixed positions in the template, while the "defining length" is the distance between the first and last specific positions. The order of 1**0*1 is 3 and its defining length is 5. The "fitness of a schema" is the average fitness of all strings matching the schema. The fitness of a string is a measure of the value of the encoded problem solution, as computed by a problem-specific evaluation function. With the genetic operators as defined above, the schema theorem states that short, low-order, schemata with above-average fitness increase exponentially in successive generations. Expressed as an equation::
Here is the number of strings belonging to schema at generation , is the "observed" fitness of schema and is the "observed" average fitness at generation . The probability of disruption is the probability that crossover or mutation will destroy the schema . It can be expressed as
:
where is the number of fixed positions, is the length of the code, is the probability of mutation and is the probability of crossover. So a schema with a shorter defining length is less likely to be disrupted.
An often misunderstood point is why the Schema Theorem is an "inequality" rather than an equality. The answer is in fact simple: the Theorem neglects the small, yet non-zero probability, that a string belonging to the schema h will be created "from scratch" by mutation of a single string (or recombination of two strings) that did "not" belong to h in the previous generation.References
* Holland, "Adaptation in Natural and Artificial Systems", The MIT Press; Reprint edition 1992 (originally published in 1975).
Wikimedia Foundation. 2010.