The genome is the complete set of genes in a cell, while the proteome refers to the full range of proteins a cell can produce. The genetic code for these proteins is stored in DNA, a long molecule located in the nucleus of eukaryotic cells. The main features of the genetic code include:
Each amino acid is coded by a sequence of three bases, called a codon.
As discussed in the DNA, genes and chromosomes subtopic, the code is degenerate, non-overlapping and universal across almost all organisms.
There are three stop codons that signal the end of the polypeptide chain.
To use the DNA code for protein synthesis, the cell utilises ribonucleic acid (RNA), which comes in three main forms: mRNA (messenger RNA), tRNA (transfer RNA), and rRNA (ribosomal RNA).
mRNA:
A long strand that forms a single helix.
Complementary to a DNA segment (usually a gene) and acts as a template for polypeptide synthesis.
Leaves the nucleus through nuclear pores to associate with ribosomes.
mRNA can be easily broken down after use.
tRNA:
A single-stranded RNA folded into a cloverleaf shape.
One end of the tRNA molecule has a site for an amino acid to attach.
The opposite end has an anticodon, which pairs with the complementary codon on the mRNA strand.
Each tRNA type has a unique anticodon and binds to a specific amino acid.
rRNA:
Combines with ribosomal proteins to form ribosomes, the sites of protein synthesis.
Transcription is the process of producing pre-mRNA using DNA as a template. It occurs in the nucleus and involves the following steps:
RNA polymerase binds to a DNA region known as the promoter, near the start codon.
RNA polymerase unwinds and breaks hydrogen bonds between complementary DNA strands, exposing the unpaired bases.
Free RNA nucleotides in the nucleus bind with the exposed template strand of DNA.
RNA polymerase moves along the DNA, forming strong phosphodiester bonds between adjacent RNA nucleotides.
The DNA rewinds behind the RNA strand as RNA polymerase continues to elongate the RNA molecule.
When RNA polymerase reaches a stop codon, it detaches, and the synthesis of pre-mRNA is complete.
Within pre-mRNA, only certain sequences, called exons, code for polypeptides, while non-coding sequences, known as introns, do not. Introns are removed in a process called splicing, leaving only exons in the mature mRNA. This splicing allows exons to be recombined in various ways, enabling one gene to code for multiple proteins.
Translation is the process of producing a polypeptide chain based on the sequence of codons in mRNA. This process occurs in the cytoplasm on ribosomes, either free-floating or attached to the rough endoplasmic reticulum (ER). The steps are as follows:
The mRNA exits the nucleus through nuclear pores and attaches to a ribosome at the start codon.
The ribosome binds to the start codon on the mRNA, and the complementary anticodon of a tRNA molecule carrying the specific amino acid binds to it.
Another tRNA with a matching anticodon binds to the next codon on the mRNA, bringing the next amino acid.
Enzymes, with the help of ATP, link the amino acids on adjacent tRNA molecules, forming a peptide bond.
The ribosome moves along the mRNA strand, releasing the first tRNA and allowing a new tRNA to bind.
Translation continues until the ribosome reaches a stop codon. At this point, the ribosome, mRNA, and tRNA molecules detach, leaving a complete polypeptide chain.
A protein may consist of one or more polypeptide chains. After translation, these chains undergo post-translational modifications:
Folding: The polypeptide folds into a secondary structure such as an alpha-helix or beta-sheet.
Assembly: Different polypeptide chains, along with any non-protein groups (e.g., lipids or carbohydrates), combine to form a quaternary structure.
This post-translational modification determines the final shape and function of the protein.
Anticodon: a sequence of three adjacent nucleotides on a molecule of transfer RNA that is complementary to a particular codon on a messenger RNA molecule.
Codon: a sequence of three adjacent nucleotides in mRNA that codes for one amino acid.
Degenerate code: more than one codon can code for the same amino acid.
mRNA: RNA molecule produced during transcription that is complementary to a gene and carries the sequence out of the nucleus to a ribosome.
Non-overlapping: stating that codons on a gene are read in order, one immediately after the other.
Promoter: Sequence of DNA prior to a gene to which transcription factors (and so RNA polymerase) can bind.
Ribose: pentose sugar found in RNA.
RNA polymerase: enzyme required for transcription that is responsible for breaking hydrogen bonds in a DNA molecule and simultaneously forming phosphoester bonds in an mRNA molecule.
Splicing: the modification of a polynucleotide through breaking and forming phosphoester bonds e.g. during the formation of mature mRNA from pre-mRNA.
Transcription: formation of messenger RNA molecules from the DNA that makes up a particular gene. It is the first stage of protein synthesis.
Translation: formation of a polypeptide from mRNA using tRNA.
tRNA: RNA molecule shaped like a clover-leaf with an attached amino acid that is specific to the anticodon on the opposite side of the molecule.
Universal code: referencing that the codons in DNA represent the same amino acids in (almost) all organisms.
Uracil: nucleotide base in RNA. Pairs with adenine.