The e-Dimensionality of Genetic Information
preprintposted on 2021-07-17, 21:11 authored by Subhash KakSubhash Kak
This paper provides an explanation for why the assignment of codons to amino acids, which range from 1 to 6, is non-uniform. Since mathematical coding theory demands a near uniform assignment, the answer to this question is important to understand deeper aspects of the structure of the genetic code. Our analysis points to 20 different covering regions in an e-dimensional information space, which is equal to the number of amino acids. It is also shown that the assignment of the codons to the amino acids is fractal-like that is well modeled by the Zipf distribution. It is remarkable that the Zipf distribution that holds for the letter frequencies of words in a language also applies to the rank order of triplets the code for amino acids.