A proposal for a DNA-based computer code

International Invention Journal of Biochemistry and Bioinformatics 10/2013; 1(1):1-4.


The use of DNA has become an attractive method for storing information in the future biocomputers due to its capacity to store a large amount of information while requiring little physical volume. In the last decade, the order of nucleotides (nt) has been considered as the best method to store a large amount of data. However, proposals for this method have weaknesses. I present a new coding system for DNA-based computing that uses 4 nt per symbol. This code is based on the conversion of all 256 computer symbols' ASCII numbers into base-4 numbers and on assigning nucleotides ATCG to 0123 respectively. This encoding has: uniformity, due to all symbols coded with 4 nt; consistency, due to a biunivocal relationship between the symbols and tetraplets; homogeneity, because similar symbols share the same first nt; intuitiveness in locating reading frames; and error resistance, due to shorter sequences, homogeneity on the first nt, and almost none nt repetition longer than two. This coding system will provide a more efficient method to implement DNA-based information storage, which will thus help to design upcoming biocomputers.

Download full-text


Available from: Alfonso Jimenez-Sanchez