Gene regulatory network

Gene regulatory network

A gene regulatory network (also called a "GRN" or "genetic regulatory network") is a collection of DNA segments in a cell which interact with each other (indirectly through their RNA and protein expression products) and with other substances in the cell, thereby governing the rates at which genes in the network are transcribed into mRNA. In general, each mRNA molecule goes on to make a specific protein (or set of proteins). In some cases this protein will be structural, and will accumulate at the cell-wall or within the cell to give it particular structural properties. In other cases the protein will be an enzyme; a micro-machine that catalyses a certain reaction, such as the breakdown of a food source or toxin. Some proteins though serve only to activate other genes, and these are the transcription factors that are the main players in regulatory networks or cascades. By binding to the promoter region at the start of other genes they turn them on, initiating the production of another protein, and so on. Some transcription factors are inhibitory.

In single-celled organisms regulatory networks respond to the external environment, optimising the cell at a given time for survival in this environment. Thus a yeast cell, finding itself in a sugar solution, will turn on genes to make enzymes that process the sugar to alcohol [] . This process, which we associate with wine-making, is how the yeast cell makes its living, gaining energy to multiply, which under normal circumstances would enhance its survival prospects.

In multicellular animals the same principle has been put in the service of gene cascades that control body-shape [] . Each time a cell divides, two cells result which, although they contain the same genome in full, can differ in which genes are turned on and making proteins. Sometimes a 'self-sustaining feedback loop' ensures that a cell maintains its identity and passes it on. Less understood is the mechanism of epigenetics by which chromatin modification may provide cellular memory by blocking or allowing transcription. A major feature of multicellular animals is the use of morphogen gradients, which in effect provide a positioning system that tells a cell where in the body it is, and hence what sort of cell to become. A gene that is turned on in one cell may make a product that leaves the cell and diffuses through adjacent cells, entering them and turning on genes only when it is present above a certain threshold level. These cells are thus induced into a new fate, and may even generate other morphogens that signal back to the original cell. Over longer distances morphogens may use the active process of signal transduction Such signalling controls embryogenesis, the building of a body plan from scratch through a series of sequential steps. They also control maintain adult bodies through feedback processes, and the loss of such feedback because of a mutation can be responsible for the cell proliferation that is seen in cancer. In parallel with this process of building structure, the gene cascade turns on genes that make structural proteins that give each cell the physical properties it needs.


At one level, biological cells can be thought of as "partially-mixed bags" of biological chemicals -- in the discussion of gene regulatory networks, these chemicals are mostly the mRNAs and proteins that arise from gene expression. These mRNA and proteins interact with each other with various degrees of specificity. Some diffuse around the cell. Others are bound to cell membranes, interacting with molecules in the environment. Still others pass through cell membranes and mediate long range signals to other cells in a multi-cellular organism. These molecules and their interactions comprise a "gene regulatory network". A typical gene regulatory network looks something like this:

The nodes of this network are proteins, their corresponding mRNAs, and protein/protein complexes. Nodes that are depicted as lying along vertical lines are associated with the cell/environment interfaces, while the others are free-floating and diffusible. Implied are genes, the DNA sequences which are transcribed into the mRNAs that translate into proteins. Edges between nodes represent individual molecular reactions, the protein/protein and protein/mRNA interactions through which the products of one gene affect those of another, though the lack of experimentally obtained information often implies that some reactions are not modeled at such a fine level of detail. These interactions can be inductive (the arrowheads), with an increase in the concentration of one leading to an increase in the other, or inhibitory (the filled circles), with an increase in one leading to a decrease in the other. A series of edges indicates a chain of such dependences, with cycles corresponding to feedback loops. The network structure is an abstraction of the system's chemical dynamics, describing the manifold ways in which one substance affects all the others to which it is connected. In practice, such GRNs are inferred from the biological literature on a given system and represent a distillation of the collective knowledge about a set of related biochemical reactions.

Genes can be viewed as nodes in the network, with input being proteins such as transcription factors, and outputs being the level of gene expression. The node itself can also be viewed as a function which can be obtained by combining basic functions upon the inputs (in the Boolean network described below these are Boolean functions, typically AND, OR, and NOT). These functions have been interpreted as performing a kind of information processing within the cell, which determines cellular behavior. The basic drivers within cells are concentrations of some proteins, which determine both spatial (location within the cell or tissue) and temporal (cell cycle or developmental stage) coordinates of the cell, as a kind of "cellular memory". The gene networks are only beginning to be understood, and it is a next step for biology to attempt to deduce the functions for each gene "node", to help understand the behavior of the system in increasing levels of complexity, from gene to signaling pathway, cell or tissue level (see systems biology).

Mathematical models of GRNs have been developed to capture the behavior of the system being modeled, and in some cases generate predictions corresponding with experimental observations. In some other cases, models have proven to make accurate novel predictions, which can be tested experimentally, thus suggesting new approaches to explore in an experiment that sometimes wouldn't be considered in the design of the protocol of an experimental laboratory. The most common modeling technique involves the use of coupled ordinary differential equations (ODEs). Several other promising modeling techniques have been used, including Boolean networks, Petri nets, Bayesian networks, graphical Gaussian models, Stochastic, and Process Calculi. Conversely, techniques have been proposed for generating models of GRNs that best explain a set of time series observations.


Coupled ODEs

It is common to model such a network with a set of coupled ordinary differential equations (ODEs) or stochastic ODEs, describing the reaction kinetics of the constituent parts. Suppose that our regulatory network has N nodes, and let S_1(t),S_2(t), ldots, S_N(t) represent the concentrations of the N corresponding substances at time t . Then the temporal evolution of the system can be described approximately by

: frac{dS_j}{dt} = f_j left ( S_1,S_2, ldots, S_N ight)

where the functions f_j express the dependence of S_j on the concentrations of other substances present in the cell. The functions f_j are ultimately derived from basic principles of chemical kinetics or simple expressions derived from these e.g. Michaelis-Menten enzymatic kinetics. Hence, the functional forms of the f_j are usually chosen as low-order polynomials or Hill functions that serve as an ansatz for the real molecular dynamics. Such models are then studied using the mathematics of nonlinear dynamics. System-specific information, like reaction rate constants and sensitivities, are encoded as constant parameters.

By solving for the fixed point of the system:: frac{dS_j}{dt} = 0 for all j, one obtains (possibly several) concentration profiles of proteins and mRNAs that are theoretically sustainable (though not necessarily stable). Steady states of kinetic equations thus correspond to potential cell types, and oscillatory solutions to the above equation to naturally cyclic cell types. Mathematical stability of these attractors can usually be characterized by the sign of higher derivatives at critical points, and then correspond to biochemical stability of the concentration profile. Critical points and bifurcations in the equations correspond to critical cell states in which small state or parameter perturbations could switch the system between one of several stable differentiation fates. Trajectories correspond to the unfolding of biological pathways and transients of the equations to short-term biological events. For a more mathematical discussion, see the articles on nonlinearity, dynamical systems, bifurcation theory, and chaos theory.

Boolean network

The following example illustrates how a Boolean network can model a GRN together with its gene products (the outputs) and the substances from the environment that affect it (the inputs). Stuart Kauffman was amongst the first biologists to use the metaphor of Boolean networks to model genetic regulatory networkscite book | title=The origins of Order| last=Kauffman| first=Stuart| | date=1993] .

#Each gene, each input, and each output is represented by a node in a directed graph in which there is an arrow from one node to another if and only if there is a causal link between the two nodes.
#Each node in the graph can be in one of two states: on or off.
#For a gene, "on" corresponds to the gene being expressed; for inputs and outputs, "on" corresponds to the substance being present.
#Time is viewed as proceeding in discrete steps. At each step, the new state of a node is a Boolean function of the prior states of the nodes with arrows pointing towards it.

The validity of the model can be tested by comparing simulation results with time series observations.

Continuous networks

Continuous network models of GRNs are an extension of the boolean networks described above. Nodes still represent genes and connections between them regulatory influences on gene expression. Genes in biological systems display a continuous range of activity levels and it has been argued that using a continuous representation captures several properties of gene regulatory networks not present in the Boolean model [Vohradsky, J. (2001). Neural model of the genetic network. The Journalof Biological Chemistry, 276, 36168–36173.] . Formally most of these approaches are similar to an artificial neural network, as inputs to a node are summed up and the result serves as input to a sigmoid function, e.g. [Geard, N. and Wiles, J. A Gene Network Model for Developing Cell Lineages. In Artificial Life, 11 (3): 249-268, 2005.] , but proteins do often control gene expression in a synergistic, i.e. non-linear, way [Schilstra, M. J. and Bolouri, H. The Logic of Gene Regulation.,] . However there is now a continuous network model [Knabe, J. F., Nehaniv, C. L., Schilstra, M. J. and Quick, T. Evolving Biological Clocks using Genetic Regulatory Networks. In In Artificial Life X: Proceedings of the Tenth International Conference on the Simulation and Synthesis of Living Systems, pages 15-21, MIT Press, 2006.] that allows grouping of inputs to a node thus realizing another level of regulation. This model is formally closer to a higher order recurrent neural network. The same model has also been used to mimic the evolution of cellular differentiation [Knabe, J. F., Nehaniv, C. L. and Schilstra, M. J. Evolutionary Robustness of Differentiation in Genetic Regulatory Networks. In Proceedings of the 7th German Workshop on Artificial Life 2006 (GWAL-7), pages 75-84, Akademische Verlagsgesellschaft Aka, Berlin, 2006.] and even multicellular morphogenesis [Knabe, J. F., Schilstra, M. J. and Nehaniv, C. L. Evolution and Morphogenesis of Differentiated Multicellular Organisms: Autonomously Generated Diffusion Gradients for Positional Information. In Artificial Life XI: Proceedings of the Eleventh International Conference on the Simulation and Synthesis of Living Systems, MIT Press, 2008.] .

tochastic gene networks

Recent experimental results [Elowitz, M.B., Levine, A.J., Siggia, E.D., and Swain, P.S. 2002. Stochastic gene expression in a single cell. Science 297: 1183-1186] [Blake, W.J., Kaern, M., Cantor, C.R., and Collins, J.J. 2003. [ Noise in eukaryotic gene expression.] Nature 422: 633-637] have demonstrated that gene expression is a stochastic process. Thus, many authors are now using the stochastic formalism, after the first work by [Arkin, A. and McAdams, H.H. 1998. Stochastic kinetic analysis of developmental pathway bifurcation in phage lambda-infected Escherichia coli cells. Genetics 149: 1633-1648.] . Works on single gene expression [Raser, J.M., and O'Shea, E.K., (2005) Noise in gene expression: origins, consequences, and control, Science, 309, 2010-2013] and small synthetic genetic networks [Elowitz, M. B., and Leibler, S., (2000) A synthetic oscillatory network of transcriptional regulators., Nature, 403, 335-338] [Gardner, T. S., Cantor, C. R., and Collins., J. J., (2000) Construction of a genetic toggle switch in Escherichia coli., Nature, 403, 339-342] , such as the genetic toggle switch of Tim Gardner and Jim Collins, provided additional experimental data on the phenotypic variability and the stochastic nature of gene expression. The first versions of stochastic models of gene expression involved only instantaneous reactions and were driven by the Gillespie algorithm [Gillespie, D.T., A general method for numerically simulating the stochastic time evolution of coupled chemical reactions, 1976, J. Comput. Phys., 22, 403-434.] .

Since some processes, such as gene transcription, involve many reactions and could not be correctly modeled as an instantaneous reaction in a single step, it was proposed to model these reactions as single step multiple delayed reactions in order to account for the time it takes for the entire process to be complete [Roussel, M.R., and Zhu, R., Validation of an algorithm for delay stochastic simulation of transcription and translation in prokaryotic gene expression, 2006, Phys. Biol. 3, 274-284] .

From here, a set of reactions were proposed [Ribeiro, Andre S., Zhu, R., Kauffman, S.A. (2006). "A General Modeling Strategy for Gene Regulatory Networks with Stochastic Dynamics", Journal of ComputationalBiology, 13(9), 1630-1639.] that allow generating GRNs. These are then simulated using a modified version of the Gillespie algorithm, that can simulate multiple time delayed reactions (chemical reactions where each of the products is provided a time delay that determines when will it be released in the system as a "finished product").

For example, basic transcription of a gene can be represented by the following single-step reaction (RNAP is the RNA polymerase, RBS is the RNA ribosome binding site, and Pro_i is the promoter region of gene i):

: RNAP+ Pro_i overset{k_{i,baslongrightarrow Pro_i( au _{i}^{1}) + RBS_i( au_{i}^{1})+ RNAP( au _{i}^{2})

A recent work proposed a simulator (SGNSim, "Stochastic Gene Networks Simulator") [ Andre S. Ribeiro and Jason Lloyd-Price, (2007) "SGN Sim, a Stochastic Genetic Networks Simulator", Bioinformatics, 23(6):777-779. doi:10.1093/bioinformatics/btm004., doi:10.1093/bioinformatics/btm004.] , that can model GRNs where transcription and translation are modeled as multiple time delayed events and its dynamics is driven by a stochastic simulation algorithm (SSA) able to deal with multiple time delayed events. The time delays can be drawn from several distributions and the reaction rates from complexfunctions or from physical parameters. SGNSim can generate ensembles of GRNs within a set of user-defined parameters, such as topology. It can also be used to model specific GRNs and systems of chemical reactions. Genetic perturbations such as gene deletions, gene over-expression, insertions, frame shift mutations can also be modeled as well.

The GRN is created from a graph with the desired topology, imposing in-degree and out-degree distributions. Gene promoter activities are affected by other genes expression products that act as inputs, in the form of monomers or combined into multimers and set as direct or indirect. Next, each direct input is assigned to an operator site and different transcription factors can be allowed, or not, to compete for the same operator site, while indirect inputs are given a target. Finally, a function is assigned to each gene, defining the gene's response to a combination of transcription factors (promoter state). The transfer functions (that is, how genes respond to a combination of inputs) can be assigned to each combination of promoter states as desired.

In other recent work, multiscale models of gene regulatory networks have been developed that focus on synthetic biology applications. Simulations have been used that model all biomolecular interactions in transcription, translation, regulation, and induction of gene regulatory networks, guiding the design of synthetic systems. [ Y. N. Kaznessis, (2007) "Models for Synthetic Biology", BMC Systems Biology, 2007, 1:47 doi:10.1186/1752-0509-1-47 [] . ]

See also

*Systems biology
*Body plan


*James M. Bower, Hamid Bolouri (editors), (2001) "Computational Modeling of Genetic and Biochemical Networks" Computational Molecular Biology Series, MIT Press, ISBN 0-262-02481-0

* L. Franke, H. van Bakel, L. Fokkens, E. de Jong, M. Egmont-Petersen, C. Wijmenga, (2006) Reconstruction of a probabilistic human gene network, with an application for prioritizing positional candidate genes, Amer. J. of Human Genetics, 78(6), 1011-25. [ Human gene network] , [ Prioritizer software application] .

*S. A. Kauffman, "Metabolic stability and epigenesis in randomly constructed genetic nets", J. Theoret. Biol (1969) 22, 434–467

External links

* [ Gene Regulatory Networks] - Short introduction
* [ BIB: Yeast Biological Interaction Browser]
* [ Graphical Gaussian models for genome data] - Inference of gene association networks with GGMs
* [ A bibliography on learning causal networks of gene interactions] - regularly updated, contains hundreds of links to papers from bioinformatics, statistics, machine learning.
* BIOREL is a web-based resource for quantitative estimation of the gene network bias in relation to available database information about gene activity/function/properties/associations/interactio.
* [ Evolving Biological Clocks using Genetic Regulatory Networks] - Information page with model source code and Java applet.
* [ Engineered Gene Networks]
* [ Tutorial: Genetic Algorithms and their Application to the Artificial Evolution of Genetic Regulatory Networks]

Wikimedia Foundation. 2010.

Игры ⚽ Поможем сделать НИР

Look at other dictionaries:

  • Gene expression — For vocabulary, see Glossary of gene expression terms. For a non technical introduction to the topic, see Introduction to genetics. Genes are expressed by being transcribed into RNA, and this transcript may then be translated into protein. Gene… …   Wikipedia

  • Regulatory sequence — A regulatory sequence (also called a regulatory region or a regulatory area ) is a segment of DNA where regulatory proteins such as transcription factors bind preferentially. These regulatory proteins bind to short stretches of DNA called… …   Wikipedia

  • Network motif — Network motifs are connectivity patterns (sub graphs) that occur much more often than they do in random networks. Most networks studied in biology, ecology and other fields have been found to show a small set of network motifs; surprisingly, in… …   Wikipedia

  • Regulation of gene expression — Gene modulation redirects here. For information on therapeutic regulation of gene expression, see therapeutic gene modulation. For vocabulary, see Glossary of gene expression terms Diagram showing at which stages in the DNA mRNA protein pathway… …   Wikipedia

  • List of network theory topics — Network theory is an area of applied mathematics. This page is a list of network theory topics. See also List of graph theory topics. Contents 1 Network theorems 2 Network properties 3 Network theory applications …   Wikipedia

  • Network theory — For network theory of the regulation of the adaptive immune system see Immune network theory For the sociological theory, see Social network Network theory is an area of computer science and network science and part of graph theory. It has… …   Wikipedia

  • Network diagram — A network diagram is a general type of diagram, which represents some kind of network. A network in general is an interconnected group or system, or a fabric or structure of fibrous elements attached to each other at regular intervals.A network… …   Wikipedia

  • Cis-regulatory module — (CRM) is a stretch of DNA, usually 100 1000 DNA base pairs in length,[1] where a number of transcription factors can bind and regulate expression of nearby genes. One cis regulatory element can regulate several genes,[2] and conversely, one gene… …   Wikipedia

  • Cis-regulatory element — A cis regulatory element or cis element is a region of DNA or RNA that regulates the expression of genes located on that same molecule of DNA (often a chromosome). This term is constructed from the Latin word cis, which means on the same side as …   Wikipedia

  • Gene therapy — using an Adenovirus vector. A new gene is inserted into an adenovirull. If the treatment is successful, the new gene will make a functional protein. Gene therapy is the insertion, alteration, or removal of genes within an individual s cells and… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”