Evolvability (computer science)

Evolvability (computer science)

The concept of evolvability is increasingly studied in evolutionary computation as well as in biology. The term Evolvability is also used for a recent framework of computational learning introduced by Leslie Valiant in his paper of the same name and described below. The aim of this theory is to model biological evolution and categorize which types of mechanisms are evolvable. Evolution is an extension of PAC learning and learning from statistical queries.

General Framework

Let F_n, and R_n, be collections of functions on n, variables. Given an "ideal function" f in F_n, the goal is to find by local search a "representation" r in R_n that closely approximates f,. This closeness is measured by the "performance" operatorname{Perf}(f,r) of r, with respect to f,.

As is the case in the biological world, there is a difference between genotype and phenotype. In general, there can be multiple representations (genotypes) that correspond to the same function (phenotype). That is, for some r,r' in R_n, with r eq r',, still r(x) = r'(x), for all x in X_n. However, this need not be the case. The goal then, is to find a representation that closely matches the phenotype of the ideal function, and the spirit of the local search is to allow only small changes in the genotype. Let the "neighborhood" N(r), of a representation r, be the set of possible mutations of r,.

For simplicity, consider Boolean functions on X_n = {-1,1}^n,, and let D_n, be a probability distribution on X_n,. Define the performance in terms of this. Specifically,: operatorname{Perf}(f,r) = sum_{x in X_n} f(x) r(x) D_n(x). Note that operatorname{Perf}(f,r) = operatorname{Prob}(f(x)=r(x)) - operatorname{Prob}(f(x) eq r(x)). In general, for non-Boolean functions, the performance will not correspond directly to the probability that the functions agree, although it will have some relationship.

Throughout an organism's life, it will only experience a limited number of environments, so its performance cannot be determined exactly. The "empirical performance" is defined by operatorname{Perf}_s(f,r) = frac{1}{s} sum_{x in S} f(x)r(x), where S, is a multiset of s, independent selections from X_n, according to D_n,. If s, is large enough, evidently operatorname{Perf}_s(f,r) will be close to the actual performance operatorname{Perf}(f,r).

Given an ideal function f in F_n, initial representation r in R_n, "sample size" s,, and "tolerance" t,, the "mutator" operatorname{Mut}(f,r,s,t) is a random variable defined as follows. Each r' in N(r) is classified as beneficial, neutral, or deleterious, depending on its empirical performance. Specifically,
* r', is a beneficial mutation if operatorname{Perf}_s(f,r') - operatorname{Perf}_s(f,r) geq t;
* r', is a neutral mutation if -t < operatorname{Perf}_s(f,r') - operatorname{Perf}_s(f,r) < t;
* r', is a deleterious mutation if operatorname{Perf}_s(f,r') - operatorname{Perf}_s(f,r) leq -t.

If there are any beneficial mutations, then operatorname{Mut}(f,r,s,t) is equal to one of these at random. If there are no beneficial mutations, then operatorname{Mut}(f,r,s,t) is equal to a random neutral mutation. In light of the similarity to biology, r, itself is required to be available as a mutation, so there will always be at least one neutral mutation.

The intention of this definition is that at each stage of evolution, all possible mutations of the current genome are tested in the environment. Out of the ones who thrive, or at least survive, one is chosen to be the candidate for the next stage. Given r_0 in R_n, we define the sequence r_0,r_1,r_2,ldots by r_{i+1} = operatorname{Mut}(f,r_i,s,t). Thus r_g, is a random variable representing what r_0, has evolved to after g, "generations".

Let F, be a class of functions, R, be a class of representations, and D, a class of distributions on X,. We say that F, is "evolvable by R, over D," if there exists polynomials p(cdot,cdot), s(cdot,cdot), t(cdot,cdot), and g(cdot,cdot) such that for all n, and all epsilon > 0,, for all ideal functions f in F_n and representations r_0 in R_n, with probability at least 1 - epsilon,,: operatorname{Perf}(f,r_{g(n,1/epsilon)}) geq 1-epsilon, where the sizes of neighborhoods N(r), for r in R_n, are at most p(n,1/epsilon),, the sample size is s(n,1/epsilon),, the tolerance is t(1/n,epsilon),, and the generation size is g(n,1/epsilon),.

F, is "evolvable over D," if it is evolvable by some R, over D,.

F, is "evolvable" if it is evolvable over all distributions D,.

Results

The class of conjunctions and the class of disjunctions are evolvable over the uniform distribution for short conjunctions and disjunctions, respectively.

The class of parity functions (which evaluate to the parity of the number of true literals in a given subset of literals) are not evolvable, even for the uniform distribution.

Evolvability implies PAC learnability.

References

# L.G. Valiant. "Evolvability". ECCC, TR06-120, 2006. [http://eccc.hpi-web.de/eccc-reports/2006/TR06-120/index.html]


Wikimedia Foundation. 2010.

Игры ⚽ Нужно сделать НИР?

Look at other dictionaries:

  • Evolvability — is a concept within the Darwinian understanding of biological evolution. Darwin s theory of evolution by natural selection requires that plants, animals, and other organisms be able to produce offspring that are sometimes better adapted to the… …   Wikipedia

  • Normalized Systems — is a theory to design and engineer information systems exhibiting proven evolvability. Originally established at the University of Antwerp, at the department Management Information Systems of the faculty Applied Economics, it aims at re creating… …   Wikipedia

  • Evolution — This article is about evolution in biology. For other uses, see Evolution (disambiguation). For a generally accessible and less technical introduction to the topic, see Introduction to evolution. Part of a series on …   Wikipedia

  • Evidence of common descent — The wide range of evidence of common descent of living things strongly indicates the occurrence of evolution and provides a wealth of information on the natural processes by which the variety of life on Earth developed, supporting the modern… …   Wikipedia

  • Degeneracy (biology) — Within biological systems, degeneracy refers to circumstances where structurally dissimilar components/modules/pathways can perform similar functions (i.e. are effectively interchangeable) under certain conditions, but perform distinct functions… …   Wikipedia

  • Peter J. Bentley — Infobox Scientist name = Peter J. Bentley image width = caption = birth date = birth date and age|1972|5|16 birth place = Colchester,UK death date = death place = residence = citizenship = nationality = British ethnicity = field = Computer… …   Wikipedia

  • Super-recursive algorithm — In computer science and computability theory, super recursive algorithms are algorithms that are more powerful, that is, compute more, than Turing machines. The term was introduced by Mark Burgin, whose book Super recursive algorithms develops… …   Wikipedia

  • Charles Ofria — Dr. Charles A. Ofria is currently an Associate Professor in the Department of Computer Science and Engineering, the director of the Digital Evolution (DEvo) Lab at Michigan State University and a co founder of the BEACON Center for the Study of… …   Wikipedia

  • Self-replication — is any process by which a thing might make a copy of itself. Biological cells, given suitable environments, reproduce by cell division. During cell division, DNA is replicated and can be transmitted to offspring during reproduction. Biological… …   Wikipedia

  • Current research in evolutionary biology — covers diverse topics, as should be expected given the centrality of evolution to understanding biology. Modern evolutionary biology incorporates ideas from diverse areas of science, such as molecular genetics and even computer science. First,… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”