Gene and genon concept: coding versus regulation. A conceptual and information-theoretic analysis of genetic storage and expression in the light of modern molecular biology.

Institut Jacques Monod, CNRS and Univ. Paris 7, 2, place Jussieu, 75251, Paris-Cedex 5, France.
Theory in Biosciences (Impact Factor: 1.08). 11/2007; 126(2-3):65-113. DOI: 10.1007/s12064-007-0012-x
Source: PubMed

ABSTRACT We analyse here the definition of the gene in order to distinguish, on the basis of modern insight in molecular biology, what the gene is coding for, namely a specific polypeptide, and how its expression is realized and controlled. Before the coding role of the DNA was discovered, a gene was identified with a specific phenotypic trait, from Mendel through Morgan up to Benzer. Subsequently, however, molecular biologists ventured to define a gene at the level of the DNA sequence in terms of coding. As is becoming ever more evident, the relations between information stored at DNA level and functional products are very intricate, and the regulatory aspects are as important and essential as the information coding for products. This approach led, thus, to a conceptual hybrid that confused coding, regulation and functional aspects. In this essay, we develop a definition of the gene that once again starts from the functional aspect. A cellular function can be represented by a polypeptide or an RNA. In the case of the polypeptide, its biochemical identity is determined by the mRNA prior to translation, and that is where we locate the gene. The steps from specific, but possibly separated sequence fragments at DNA level to that final mRNA then can be analysed in terms of regulation. For that purpose, we coin the new term "genon". In that manner, we can clearly separate product and regulative information while keeping the fundamental relation between coding and function without the need to introduce a conceptual hybrid. In mRNA, the program regulating the expression of a gene is superimposed onto and added to the coding sequence in cis - we call it the genon. The complementary external control of a given mRNA by trans-acting factors is incorporated in its transgenon. A consequence of this definition is that, in eukaryotes, the gene is, in most cases, not yet present at DNA level. Rather, it is assembled by RNA processing, including differential splicing, from various pieces, as steered by the genon. It emerges finally as an uninterrupted nucleic acid sequence at mRNA level just prior to translation, in faithful correspondence with the amino acid sequence to be produced as a polypeptide. After translation, the genon has fulfilled its role and expires. The distinction between the protein coding information as materialised in the final polypeptide and the processing information represented by the genon allows us to set up a new information theoretic scheme. The standard sequence information determined by the genetic code expresses the relation between coding sequence and product. Backward analysis asks from which coding region in the DNA a given polypeptide originates. The (more interesting) forward analysis asks in how many polypeptides of how many different types a given DNA segment is expressed. This concerns the control of the expression process for which we have introduced the genon concept. Thus, the information theoretic analysis can capture the complementary aspects of coding and regulation, of gene and genon.


Available from: Jürgen Jost, May 29, 2015
  • [Show abstract] [Hide abstract]
    ABSTRACT: The aim of this paper is to investigate in a systematic and comparative way previous results of independent studies on the treatment of genes and gene function in high school textbooks from six different countries. We analyze how the conceptual variation within the scientific domain of Genetics regarding gene function models and gene concepts is transformed via the didactic transposition into school science textbooks. The results indicate that a common textbook discourse on genes and their function exist in textbooks from the different countries. The structure of science as represented by conceptual variation and the use of multiple models was present in all the textbooks. However, the existence of conceptual variation and multiple models is implicit in these textbooks, i.e., the phenomenon of conceptual variation and multiple models are not addressed explicitly, nor its consequences and, thus, it ends up introducing conceptual incoherence about the gene concept and its function within the textbooks. We conclude that within the found textbook-discourse ontological aspects of the academic disciplines of genetics and molecular biology were retained, but without their epistemological underpinnings; these are lost in the didactic transposition. These results are of interest since students might have problems reconstructing the correct scientific understanding from the transformed school science knowledge as depicted within the high school textbooks. Implications for textbook writing as well as teaching are discussed in the paper.
    Science & Education 02/2014; 32(2):381-416. DOI:10.1007/s11191-012-9499-8 · 0.72 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: This article explores the relativistic principle that there is no privileged scale of causality in biology to clarify the relationships between genomes and phenotypes. The idea that genetic causes are primary views the genome as a program. Initially, that view was vindicated by the discovery of mutations and knockouts that have large and specific effects on the phenotype. But we now know that these form the minority of cases. Many changes at the genome level are buffered by robust networks of interactions in cells, tissues and organs. The 'differential' view of genetics therefore fails because it is too restrictive. An 'integral' view, using reverse engineering from systems biological models to quantify contributions to function, can solve this problem. The article concludes by showing that far from breaking the supervenience principle, downward causation requires that it should be obeyed.
    Progress in Biophysics and Molecular Biology 10/2012; 111(2-3). DOI:10.1016/j.pbiomolbio.2012.09.004 · 3.38 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: The intimate relation between biology and cognition can be formally examined through statistical models constrained by the asymptotic limit theorems of communication theory, augmented by methods from statistical mechanics and nonequilibrium thermodynamics. Cognition, often involving submodules that act as information sources, is ubiquitous across the living state. Less metabolic free energy is consumed by permitting crosstalk between biological information sources than by isolating them, leading to evolutionary exaptations that assemble shifting, tunable cognitive arrays at multiple scales, and levels of organization to meet dynamic patterns of threat and opportunity. Cognition is thus necessary for life, but it is not sufficient: An organism represents a highly patterned outcome of path-dependent, blind, variation, selection, interaction, and chance extinction in the context of an adequate flow of free energy and an environment fit for development. Complex, interacting cognitive processes within an organism both record and instantiate those evolutionary and developmental trajectories.
    Cognitive Processing 06/2013; 15(1). DOI:10.1007/s10339-013-0573-1 · 1.57 Impact Factor