(The Origins of Order, Chapter 8). Many alternative pathways are possible to produce this
autocatalytic network, but only one of them is required.
This model of the origin of life suggests an answer to the question of why free living cells have an
apparent minimal complexity. Mycoplasma genitalium, the smallest known genome that
constitutes a cell, contains approximately 470 genes, a large number for the simplest organism
(note 6). Based on the above model, we can hypothesize that a lesser number of genes would
lack the complexity to create a self-staining network. Other theories of the origin of life, such as
the replicating RNA theory, provide no explanation for this minimal complexity.
4. Much of the order in organisms is due to generic properties that emerge from a
network of gene products.
Each cell coordinates the activities of approximately 20,000 genes and their products (Nature
2004;431:931). Not only does the cell coordinate complex activities such as mitosis, but these
activities occur without any oversight. The molecules interact spontaneously with each other, and
lead to the creation of new cells and new living organisms with the same properties. If we are to
get a deeper understanding of diseases whose secrets have defied decades of determined
research, we may need to understand the general principles underlying these processes.
The traditional view is that the sole source of order in organisms is natural selection as described
by Darwin (On the Origin of Species, 1859); the patterns of a pine cone or the artistry of mitosis are
due to “chance caught on the wing,” a description of natural selection by Jacques Monod (Le
Hasard et la nécessité (Chance and Necessity, 1970). An alternative view, advanced by Kauffman
and others, is that order is an expected emergent property of molecular networks, based on
structural properties of networks that are not dependent on details on the particular molecules
(The Origins of Order: Self-Organization and Selection in Evolution, 1993). Genes, RNA and
proteins form a complex parallel processing network in which each molecule is connected to
some other molecule, and switches each other off and on. During ontogeny, gene products turn
each other on and off to generate precursor cells. Once cells become differentiated, some of this
switching continues, but the cells are relatively stable.
Consider a hypothetical gene product A, which could be considered active or not, based on its
state of phosphorylation. Based on inputs from kinases or other gene products, protein A may
change its properties in the next instant of time. Any change to gene product A may affect other
gene products it is connected to in the network, and for which it acts as an input. If we consider
all the gene products in a cell, we can consider all of their properties at a particular moment in
time as the state of the cell. In the next instant, each gene product may change its properties
based on relevant inputs, leading to a new state. The path that a network traverses over time,
based on changes in the state of each gene product at each moment in time, is called its state
space or state cycle.
Theoretically, a cell with 20,000 types of gene products, and numerous copies of each, could
have an almost infinite length to its state space, and never return to its original state. For
example, a network of N gene products, with only one copy and two possible properties for each
gene product, would theoretically have a state cycle of length 2
N
. For N = 20,000 gene products,
this is approximately 10
6000
. However, a state cycle this large does not happen, due to three
network properties that induce order.
The first network property that induces order is the surprising finding that if each gene product is
regulated by at most 2 inputs, which is relatively common, the median length of the state cycle is
only the square root of the number of gene products, or 141 if N is 20,000 (J Theor Biol
1969;22:437, The Origins of Order, p. 479). This network property creates inherent stability, even in
networks with large numbers of gene products, as the cell network is localized to a very small