Convert protein sequences to DNA using codon usage tables.
Paste one or more FASTA protein sequences (max 20,000,000 characters).
Enter the codon table in GCG format. The default is E. coli codon usage.
Paste one or more protein sequences in FASTA format. Use standard one-letter amino acid codes (ACDEFGHIKLMNPQRSTVWY). The tool accepts sequences up to 20 million characters and automatically removes invalid characters.
The default table uses E. coli codon preferences in GCG format. For other organisms, paste a custom codon usage table. The table specifies which DNA codons are preferred for each amino acid, affecting the output DNA sequence.
Results include three versions: Non-degenerate (most likely codons), Degenerate (showing alternative codons with IUPAC codes like R=A/G, Y=C/T), and a graphical representation. The non-degenerate version is suitable for gene synthesis.
Use reverse translation for designing PCR primers, planning gene synthesis, or predicting possible DNA sequences. For heterologous expression, use codon tables from the host organism to optimize expression levels and avoid rare codons.
Reverse translation converts a protein sequence back into a DNA sequence. Since the genetic code is degenerate (multiple codons code for the same amino acid), reverse translation must make choices about which codon to use. This tool uses codon usage statistics to select the most likely codons.
This tool uses E. coli codon usage preferences. For best results with other organisms, you can provide custom codon usage tables. The tool selects the most frequently used codon for each amino acid.