Chemical Structure
The Chemical Structure of nucleic acids – built from nucleotides linked by phosphodiester bonds into a directional sugar-phosphate backbone with projecting bases – dictates their function. DNA’s stable, antiparallel double helix with specific A-T/G-C pairing is ideal for information storage. RNA’s single-stranded nature, ribose sugar, and U base allow it to adopt diverse structures for varied roles in gene expression and regulation. Grasping this structure is key to understanding how genetic information is stored, accessed, and analyzed in the clinical laboratory
The chemical structure of nucleic acids ties together sugars and bases. Think of it like building with LEGOs: * The sugars (deoxyribose/ribose) and phosphates are the standard connecting bricks that form the long chains * The bases (A, T, G, C, U) are the special feature bricks that stick out and carry the unique information
The Monomer: Nucleotides
The fundamental repeating unit of both DNA and RNA is the nucleotide. Each nucleotide consists of three covalently linked components:
- A Pentose Sugar: Deoxyribose (in DNA) or Ribose (in RNA). As we discussed, this forms the central scaffold
- A Nitrogenous Base: A purine (A or G) or a pyrimidine (C, T, or U). This is attached to the 1’ carbon of the sugar via an N-glycosidic bond. This base holds the genetic code
- One or More Phosphate Groups: Usually attached to the 5’ carbon of the sugar. In the nucleic acid chain, there’s typically one phosphate linking sugars together. Free nucleotides (like ATP) can have one, two, or three phosphate groups (mono-, di-, tri-phosphates)
(Remember: Base + Sugar = Nucleoside. Base + Sugar + Phosphate(s) = Nucleotide)
The Backbone: Linking Nucleotides with Phosphodiester Bonds
This is the key to forming the long polymer chains of DNA and RNA. Nucleotides are linked together by phosphodiester bonds. Here’s how it works:
- The phosphate group attached to the 5’ carbon of one nucleotide forms a covalent bond with the hydroxyl (-OH) group on the 3’ carbon of the previous nucleotide’s sugar
- This reaction releases a water molecule (pyrophosphate, actually, which then hydrolyzes)
- The result is a sugar-phosphate-sugar-phosphate chain, forming the backbone of the nucleic acid strand
- The nitrogenous bases project outwards from this backbone
- Crucially, the phosphate groups in the backbone carry a negative charge at physiological pH. This makes DNA and RNA acidic molecules (hence “nucleic acid”) and is vital for many lab techniques (like electrophoresis!)
Directionality: The 5’ and 3’ Ends
The formation of the phosphodiester bond creates an inherent directionality in the nucleic acid strand:
- One end of the strand will have a free phosphate group attached to the 5’ carbon of the terminal sugar. This is called the 5’ end (five-prime end)
- The other end of the strand will have a free hydroxyl (-OH) group on the 3’ carbon of the terminal sugar. This is called the 3’ end (three-prime end)
- By convention, nucleic acid sequences are always written and read from 5’ to 3’. This directionality is critical for processes like DNA replication and transcription, as enzymes involved (polymerases) typically synthesize new strands in the 5’ to 3’ direction
DNA Structure: The Double Helix
DNA usually exists as a double helix, famously described by Watson and Crick. Key features:
- Two Strands: Two separate polynucleotide chains (sugar-phosphate backbones) are wound around each other
- Antiparallel: The two strands run in opposite directions. If one strand runs 5’ to 3’ top-to-bottom, the complementary strand runs 3’ to 5’ top-to-bottom. (Think of two lanes of a highway going in opposite directions)
-
Complementary Base Pairing: The strands are held together by specific hydrogen bonds between the bases projecting inwards:
- Adenine (A) pairs with Thymine (T) via two hydrogen bonds
- Guanine (G) pairs with Cytosine (C) via three hydrogen bonds
- This A=T and G≡C specificity ensures that the sequence of one strand dictates the sequence of the other
- Helical Shape: The structure twists into a right-handed helix (most commonly the B-form DNA)
- Major and Minor Grooves: The twisting creates two grooves on the surface of the helix. The major groove is wider, and the minor groove is narrower. These grooves are important because they expose the edges of the base pairs, allowing proteins (like transcription factors) to recognize specific DNA sequences without unwinding the helix
- Stability: The covalent phosphodiester bonds form a strong, stable backbone. The numerous hydrogen bonds between base pairs, although individually weak, collectively provide significant stability to the double helix. Hydrophobic interactions (base stacking) also contribute
RNA Structure: More Versatile
RNA differs structurally from DNA in several key ways:
- Sugar: Contains ribose instead of deoxyribose (with that 2’-OH group)
- Base: Contains Uracil (U) instead of Thymine (T). U pairs with A
- Single-Stranded: RNA is typically single-stranded, although it can fold back on itself
-
Secondary/Tertiary Structures: Because it’s usually single-stranded, RNA can fold into complex three-dimensional shapes by forming base pairs (A=U, G≡C, and sometimes non-canonical pairs like G=U) between different regions of the same molecule. This results in structures like:
- Hairpin loops: A stem region where the strand is base-paired with itself, capped by a loop of unpaired bases
- Internal loops and bulges: Regions of unpaired bases within a double-helical section
- Pseudoknots: More complex folding involving loop regions base-pairing with sequences outside the loop
- Functional Diversity: These complex structures allow RNA to perform diverse functions beyond just carrying genetic information (like mRNA). Examples include the structural and catalytic roles of ribosomal RNA (rRNA), the adapter function of transfer RNA (tRNA), and the regulatory functions of microRNAs (miRNAs) and long non-coding RNAs (lncRNAs). Some RNAs even act as enzymes (ribozymes)
Clinical Laboratory Relevance
Understanding the chemical structure is fundamental to virtually everything we do in the molecular lab:
- DNA/RNA Extraction: Methods exploit the chemical properties, like the negative charge (binding to silica matrices or anion exchange resins) and differences in stability (using specific buffers to protect RNA from degradation by its reactive 2’-OH)
- PCR Amplification: Relies on the double helix structure, strand separation (denaturation by heat), primer annealing (based on complementarity and hydrogen bonding), and enzymatic synthesis (following the 5’-3’ directionality)
- Hybridization Techniques (FISH, Southern/Northern blots, Microarrays): Depend on the denaturation of double strands and the specific re-annealing (hybridization) of labeled probes to complementary target sequences based on A-T/U and G-C pairing
- Sequencing: Determines the precise order of bases along the backbone, revealing the genetic code. Techniques often rely on enzymatic synthesis (Sanger) or detecting base incorporation during synthesis (NGS)
- Electrophoresis: The uniform negative charge of the sugar-phosphate backbone allows DNA and RNA fragments to migrate through a gel matrix towards the positive electrode, separating primarily based on size
- Restriction Enzyme Digestion: These enzymes recognize specific short DNA sequences (determined by the bases) and cut the phosphodiester backbone at precise locations
- Drug Design: Many drugs target nucleic acid structure or enzymes that interact with them (e.g., intercalating agents that slip between base pairs, or drugs that inhibit polymerases)
Key Terms
- Nucleotide: The monomer unit of nucleic acids, consisting of a nitrogenous base, a pentose sugar (deoxyribose or ribose), and one or more phosphate groups
- Phosphodiester Bond: The covalent bond linking adjacent nucleotides in a nucleic acid strand. It connects the 5’ phosphate group of one nucleotide to the 3’ hydroxyl group of the previous nucleotide’s sugar
- Sugar-Phosphate Backbone: The repeating chain of alternating sugar and phosphate units formed by phosphodiester bonds in a nucleic acid strand. The bases project from this backbone
- N-Glycosidic Bond: The covalent bond linking the nitrogenous base to the 1’ carbon of the pentose sugar
- Directionality (5’ to 3’): The intrinsic orientation of a nucleic acid strand, defined by the phosphodiester linkages, resulting in distinct 5’ (phosphate) and 3’ (hydroxyl) ends. Sequences are read 5’ to 3’
- 5’ End: The end of a nucleic acid strand terminating with a phosphate group attached to the 5’ carbon of the terminal sugar
- 3’ End: The end of a nucleic acid strand terminating with a hydroxyl group attached to the 3’ carbon of the terminal sugar. Site of nucleotide addition during synthesis
- Double Helix: The typical secondary structure of DNA, consisting of two antiparallel polynucleotide strands wound around a central axis, held together by complementary base pairing
- Antiparallel: The orientation of the two strands in a DNA double helix, where one strand runs 5’ to 3’ and the complementary strand runs 3’ to 5’
- Complementary Base Pairing: Specific hydrogen bonding between A and T (or U) and between G and C in nucleic acids
- Hydrogen Bond: Weak electrostatic attraction holding complementary base pairs together. Two bonds between A-T/U, three bonds between G-C
- Major Groove / Minor Groove: The two distinct grooves winding along the surface of the DNA double helix, formed by the arrangement of the sugar-phosphate backbones. Sites for protein-DNA interactions
- Secondary Structure (RNA): Local folding patterns of an RNA molecule, such as hairpins, loops, and bulges, formed by intramolecular base pairing
- Tertiary Structure (RNA): The overall three-dimensional shape of an RNA molecule, resulting from interactions between secondary structural elements