aa2nt

Convert an amino acid sequence to a nucleotide sequence

Syntax

SeqNT = aa2nt(SeqAA, 'PropertyName', PropertyValue...)
aa2nt(..., 'GeneticCode', GeneticCodeValue)
aa2nt(..., 'Alphabet', AlphabetValue)

Arguments

SeqAA          Amino acid sequence. Enter a character string or a vector of integers that represent amino acids. Examples: 'ARN' or [1 2 3].
GeneticCode    Property to select a genetic code. Enter a code number or code name from the table Genetic Code below. If you use a code name, you can truncate it to its first two characters.
Alphabet       Property to select a nucleotide alphabet. Enter either 'DNA' or 'RNA'. The default value is 'DNA', which uses the symbols A, C, T, G. The value 'RNA' uses the symbols A, C, U, G.

Genetic Code

Code Number    Code Name
1              Standard
2              Vertebrate Mitochondrial
3              Yeast Mitochondrial
4              Mold, Protozoan, Coelenterate Mitochondrial, and Mycoplasma/Spiroplasma
5              Invertebrate Mitochondrial
6              Ciliate, Dasycladacean, and Hexamita Nuclear
9              Echinoderm Mitochondrial
10             Euplotid Nuclear
11             Bacterial and Plant Plastid
12             Alternative Yeast Nuclear
13             Ascidian Mitochondrial
14             Flatworm Mitochondrial
15             Blepharisma Nuclear
16             Chlorophycean Mitochondrial
21             Trematode Mitochondrial
22             Scenedesmus Obliquus Mitochondrial
23             Thraustochytrium Mitochondrial

Description

SeqNT = aa2nt(SeqAA, 'PropertyName', PropertyValue...) converts an amino acid sequence (SeqAA) to a nucleotide sequence (SeqNT) using the standard genetic code. In general, the mapping from an amino acid to a nucleotide codon is not one-to-one. For an amino acid with more than one possible codon, this function randomly selects one of the codons that correspond to that amino acid.

For the ambiguous characters B and Z, one of the amino acids corresponding to the letter is selected randomly, and then a codon sequence is selected randomly. For the ambiguous character X, a codon sequence is selected randomly from all possibilities.
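Because codons are selected randomly, two calls with the same input can return different nucleotide sequences, yet each should translate back to the same amino acids. The following is a minimal sketch of that behavior. It assumes the Bioinformatics Toolbox function nt2aa (listed under See Also) with its default standard genetic code; the variable names are illustrative only.

```matlab
% Back-translate the same amino acid sequence twice.
seqAA  = 'MATLAP';              % no ambiguous characters in this input
seqNT1 = aa2nt(seqAA);
seqNT2 = aa2nt(seqAA);

% The two results may differ, because each codon is chosen randomly,
% but both contain three nucleotides per amino acid.
lengthsMatch = (numel(seqNT1) == 3*numel(seqAA)) && ...
               (numel(seqNT2) == 3*numel(seqAA));

% Translating either result with the standard genetic code
% recovers the original amino acid sequence.
roundTrip1 = strcmp(nt2aa(seqNT1), seqAA);
roundTrip2 = strcmp(nt2aa(seqNT2), seqAA);
```

The round trip is only expected to hold for unambiguous input. For B, Z, or X, the function commits to one concrete amino acid, so translating back returns that amino acid rather than the ambiguous letter.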

aa2nt(..., 'GeneticCode', GeneticCodeValue) selects a genetic code (GeneticCodeValue) to use when converting an amino acid sequence (SeqAA) to a nucleotide sequence (SeqNT).
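As a hedged illustration of the two ways to specify a genetic code, the sketch below selects the Vertebrate Mitochondrial code once by number and once by name, then translates back with nt2aa using the same code. It assumes that nt2aa accepts the same 'GeneticCode' property; the variable names are illustrative only.

```matlab
seqAA = 'MATLAP';

% Select the genetic code by number or by name (see the Genetic Code table).
ntByNumber = aa2nt(seqAA, 'GeneticCode', 2);
ntByName   = aa2nt(seqAA, 'GeneticCode', 'Vertebrate Mitochondrial');

% The nucleotide sequences may differ because codons are chosen randomly,
% but translating with the same genetic code recovers the input.
backByNumber = nt2aa(ntByNumber, 'GeneticCode', 2);
backByName   = nt2aa(ntByName,   'GeneticCode', 2);
sameAsInput  = strcmp(backByNumber, seqAA) && strcmp(backByName, seqAA);
```

Translating back with a different genetic code than the one used for aa2nt is not guaranteed to recover the original sequence, because some codons are assigned to different amino acids in different codes.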

aa2nt(..., 'Alphabet', AlphabetValue) selects a nucleotide alphabet (AlphabetValue).
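The effect of the 'Alphabet' property can be checked with a short sketch such as the one below, which uses only base MATLAB functions (ismember, strrep) next to aa2nt. The variable names are illustrative only.

```matlab
seqAA  = 'MATLAP';
seqDNA = aa2nt(seqAA);                      % default alphabet is 'DNA'
seqRNA = aa2nt(seqAA, 'Alphabet', 'RNA');

% DNA output uses A, C, G, T; RNA output uses A, C, G, U.
onlyDNASymbols = all(ismember(seqDNA, 'ACGT'));
onlyRNASymbols = all(ismember(seqRNA, 'ACGU'));

% Replacing U with T converts the RNA result to DNA symbols.
seqRNAasDNA = strrep(seqRNA, 'U', 'T');
```

Note that seqRNAasDNA need not equal seqDNA, because the two calls select their codons independently.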

Standard Genetic Code

Amino Acid                                      Codons
Alanine (A)                                     GCT, GCC, GCA, GCG
Arginine (R)                                    CGT, CGC, CGA, CGG, AGA, AGG
Asparagine (N)                                  AAT, AAC
Aspartic acid (Aspartate, D)                    GAT, GAC
Cysteine (C)                                    TGT, TGC
Glutamine (Q)                                   CAA, CAG
Glutamic acid (Glutamate, E)                    GAA, GAG
Glycine (G)                                     GGT, GGC, GGA, GGG
Histidine (H)                                   CAT, CAC
Isoleucine (I)                                  ATT, ATC, ATA
Leucine (L)                                     TTA, TTG, CTT, CTC, CTA, CTG
Lysine (K)                                      AAA, AAG
Methionine (M)                                  ATG
Phenylalanine (F)                               TTT, TTC
Proline (P)                                     CCT, CCC, CCA, CCG
Serine (S)                                      TCT, TCC, TCA, TCG, AGT, AGC
Threonine (T)                                   ACT, ACC, ACA, ACG
Tryptophan (W)                                  TGG
Tyrosine (Y)                                    TAT, TAC
Valine (V)                                      GTT, GTC, GTA, GTG
Aspartic acid or Asparagine (B)                 Random codon from D and N
Glutamic acid or Glutamine (Z)                  Random codon from E and Q
Unknown or any amino acid (X)                   Random codon
Translation stop (*)                            TAA, TAG, TGA
Gap of indeterminate length (-)                 ---
Any character or any symbol not in table (?)    ???

Examples

Convert an amino acid sequence to a nucleotide sequence using the standard genetic code.

aa2nt('MATLAB')

Warning: The sequence contains ambiguous characters.
ans =
ATGGCAACCCTGGCGAAT

Use the Vertebrate Mitochondrial genetic code.

aa2nt('MATLAP', 'GeneticCode', 2)

ans =
ATGGCAACTCTAGCGCCT

Use the Echinoderm Mitochondrial genetic code and the RNA alphabet.

aa2nt('MATLAB','GeneticCode','ec','Alphabet','RNA')

Warning: The sequence contains ambiguous characters.
ans =
AUGGCUACAUUGGCUGAU

Convert a sequence that contains the ambiguous amino acid character B.

aa2nt('abcd')

Warning: The sequence contains ambiguous characters.
ans =
GCCAACTGCGAC

See Also

Bioinformatics Toolbox functions aminolookup, baselookup, geneticcode, nt2aa, revgeneticcode


© 1994-2005 The MathWorks, Inc.