Experimental Data and Results

 
 
 
 
 
 

(Khorana's paper on " Polynucleotide synthesis and the genetic code", 1965)


                                   "To make a nucleic acid of completely defined structure and analyze
                                    the protein specified by it, then you would have a direct correlation of the sequences
                                                                        of the two types of macromolecules"
                                                                                                                        Khorana
 

Deciphering the Code

   How was the triplet nucleotide code discovered?

          It was known that there were 20 naturally occuring amino acids.
          If it was a 1- or 2- letter code then
                   a) a one letter code will result in 4 amino acids (A, C, G, T)
                   b) a two letter code will result in 16 amino acids (combinations of A, C, G, and T)
          Therefore to produce 20 amino acids a minimum of 3 letters is required. Hence a 3-letter code.

Experimental Asumptions

   1) Three letter-code for the known 20 amino acids
   2) 64 possible triplets coding for varoius amino acids
   3) mRNA coded using triplets
   4) Nonoverlapping property of the code
         - the nucleotide sequence for example ACGACGACG...
           is read as ACG ACG ... and not as ACG CGA GAC... (overlapping)
   5) Some sort of stop and start codon

Experimental Data and Results

Experiments with RNA homopolymers* (Nirenberg 1961):

Previous to Khorana's work on the genetic code, Nirenberg laid the foundation for deciphering the genetic code. He performed experiments using repeating sequences of the same nucleotide of mRNA and comparing it to the amino acid produced.

He found the following:

     Poly U -----  UUUUUUUUU...   produced poly-phenylalanine, so   UUUcodes for phe (phenyalanine)
    Poly A -----  AAAAAAAAA....  produced poly- lysine, so              AAA  codes for lys (lysine)
    Poly C -----  CCCCCCCC...    produced poly proline, so             CCC codes for pro (proline)
    Poly G -----  GGGGGGGG...    produced poly- glycine, so           GGG codes for gly (glycine)

Experiments with RNA di-, tri-, and tetra-nucleotide polymers (Khorana 1965):

Data:

   An example of di-, tri-, and tetra-nucleotide polymers (Fig.1).

   Khorana's experiment was based on different combinations of repeating nucleotide sequences of
   mRNA which he placed in a cell-free environment.

   Repeating sequences of two, three and four nucleotides combinations of DNA used in the experiment
   (Fig.2).

   These were then used to produce double stranded DNA molecules by combining them together, and
   transcribing the DNA molecules using RNA polymerase to produce single stranded poly mRNA
   molecules like poly AC mRNA (messenger RNA molecule with repeating AC sequence) (Refer to
   procedure).

   The results obtained in translating these polynucleotide mRNA as follows:
         a) Dinucleotide (Fig.3)
         b) Trinucleotide(Fig. 4)
         c) Tetranucleotide (Fig 5)

Codon Assignments:

   How were the nucleotide triplets assigned to each amino acid?

   There were two ways in which this was done:

   1) Using Logical Deductions:

        The different nucleotide sequences with the corresponding amino acids were compared and the
         triplet code for the amino acids was determined.

   Example:
      (Note: * denotes initiation sites)

        i) Comparison of :

    Poly AC (poly-dinucleotide)

         AC + AC+ AC + AC... ---> ACACACAC... ---> ACA-CAC-ACA-CAC...    mixture of poly-thr and
                                                                                   or CAC-ACA-CAC-ACA...    his

     Poly CAA(poly-trinucleotide)

          CAA + CAA + CAA... ---> C*A*A*CAACAACAA... ---> CAA-CAA-CAA...   mixture of
                                                                                              or AAC-AAC-AAC... poly-thr, gln and asn
                                                                                                     or ACA-ACA-ACA...

              Since ACA and thr occur in both experiments, then ACA makes thr (theronine).
              So CAC makes his (histidine). Therefore either AAC or CAA makes either
              gln (glutamine) or asn (asparagine). There was not enough data present in the
              original paper to further deduce which code belonged to which amino acid.

        ii) Comparison of:

    Poly UUC (poly-trinucleotide)

    UUC + UUC + UUC... ---> U*U*C*UUC... ---> UUC-UUC...      mixture of poly-phe, ser and leu
                                                                                  or UCU-UCU...
                                                                                  or CUU-CUU...

   Poly UUAC (poly-tetranucleotide)

   UUAC + UUAC... ---> U**U*A*C*UUAC.. ---> UUA-CUU-ACU-UAC...   mixture of poly-leu, leu,thr
                                                                                 or UAC-UUA-CUU-ACU...   and tyr
                                                                                 or ACU-UAC-UUA-CUU...
                                                                                 or CUU-AUC-UAC-UUA...

              CUU and leu are common to both experiments, hence CUU makes leu.
              Compare poly UUC to poly UC:

   Poly UC (poly-dinucleotide)

   UC + UC + UC... ---> UCUCUC...  ---> UCU-CUC...                mixture of poly-leu and ser
                                                                       or CUC-UCU...

              UCU and ser are common to each and hence UCU makes ser (serine).
              Therefore CUC makes leu (leucine) and  UUC makes phe (phenylalanine).

        iii) Comparison of the last two poly-tetranucleotides (Fig.5) indicates the formation of a
            mixture of di- and tri-peptides, indicating that there were nonsense triplets in the sequences.
            By using a similar sort of deduction as above UAA and UAG are seen as stop codes.

         Since leucine from the above two examples is coded by two triplets, CUC and CUU, Khorana
         concluded about the idea of degeneracy, where more than one triplet may code for a single
         amino acid. The idea of the wobble position and degeneracy was observed by Crick in 1966,
         when he developed the wobble hypothesis. Whether Khorana knew about the wobble position
         is questionable.

   2) Binding Assay

        Beacause more than one triplert may code for any single amino acid and a large number of
        polypepetide syntheses is required to use the method mentioned above, it is easier to match
        the possible triplets with the amino acids using an experimental method.

        Khorana used a technique developed by Nirenberg and Leder (see procedure).

        For example a poly-AAG sequence which produced a mixture of poly-lysine, glutamate, and
        arginine can use the binding technique (binding of aminoacyl-tRNA to ribosome in the presence
        of specific trinucleotides). .(Fig.6)

Conclusions:

   Codon assignments led to:
        (i) the assignment of 51 out of the 64  possible triplets to amino acids (Fig.7)

        (ii) the formation of a genetic code table (Fig 8)