Key Points: Deoxyribonucleic Acid (DNA)
- Miescher (1869) isolated a substance from white blood cell nuclei (pus from bandages) and called it nuclein — the first discovery of nucleic acid.
- Nuclein properties — High phosphorus content + acidic nature → renamed nucleic acid.
- Two types of nucleic acids: DNA and RNA.
- Early belief — Scientists thought proteins were the genetic material (large, complex, varied). DNA was wrongly considered simple and unimportant.
- 1928–1952 — Over 25 years, three key experiments proved that DNA (not protein) is the genetic material.
- Role of DNA — It is stable, can replicate accurately, and passes traits to the next generation — making it the true genetic material.
Key Points: Molecular Structure of DNA
1. DNA structure was first studied by Rosalind Franklin (1953); later explained by Watson and Crick, who proposed the double helix model (Nobel Prize, 1962).
2. DNA is a macromolecule made of two complementary strands twisted into a double helix.
3. Each strand is made up of nucleotides, which include phosphate, sugar (pentose), and a nitrogenous base.
4. There are four nitrogenous bases:
- Adenine (A) pairs with Thymine (T) (2 hydrogen bonds)
- Guanine (G) pairs with Cytosine (C) (3 hydrogen bonds)
5. The two strands form a ladder-like structure, with bases as rungs and sugar-phosphate as the backbone.
Key Points: Griffith’s Experiment
- Frederick Griffith (1928), a British medical officer, experimented on Streptococcus pneumoniae to find a cure for pneumonia.
- Two strains were used - S-strain (virulent, smooth, pathogenic, encapsulated) and R-strain (non-virulent, rough, non-pathogenic, non-capsulated).
- Four experiments - R-strain injected: the mouse survived. S-strain injected: mouse died. Heat-killed S-strain injected: the mouse survived. Live R-strain + Heat-killed S-strain injected: the mouse died.
- In the 4th experiment, live S-strain bacteria were recovered from the dead mouse's blood, even though only heat-killed S-strain was used alongside the R-strain.
- R-strain bacteria picked up something from the heat-killed S-strain, transformed into a virulent S-strain, and synthesised a smooth polysaccharide coat.
- Griffith called the unknown substance responsible for this change the "transforming principle", believed to be some form of genetic material.
- Griffith proved genetic material can transfer between bacteria, causing a permanent, heritable change, but did not identify what the transforming principle was (later proven to be DNA).
Key Points: Avery, McCarty and MacLeod’s Experiment
- In 1944, Oswald T. Avery, Colin M. MacLeod, and Maclyn McCarty proved that DNA is the genetic material (transforming principle).
- They used cell-free extracts of heat-killed S strain bacteria and mixed them with harmless R strain bacteria.
- Only DNA was able to transform the R strain into the virulent S strain, showing its role in heredity.
- Treatment with proteases and RNases did not stop transformation, proving that protein and RNA are not genetic material.
- Treatment with DNase stopped transformation, confirming that DNA is responsible for it.
- This experiment provided strong evidence that DNA is the hereditary material, though final confirmation came later from the Hershey and Chase experiment.
Key Points: The Hershey-Chase Experiment
- In 1952, Alfred Hershey and Martha Chase proved that DNA is the genetic material using bacteriophages and E. coli bacteria.
- They used radioactive isotopes: ³²P to label DNA and ³⁵S to label proteins.
- Viruses grown in ³²P medium had radioactive DNA, while those grown in ³⁵S medium had radioactive protein.
- These labelled viruses were allowed to infect E. coli, and then blending and centrifugation were done to separate viral coats.
- Only bacteria infected with ³²P-labelled viruses became radioactive, showing that DNA entered the bacterial cells.
- Bacteria infected with ³⁵S-labelled viruses were not radioactive, proving proteins did not enter the cells; hence, DNA is the genetic material.
Key Points: Structure of Eukaryotic Chromosome (Packaging of DNA)
- DNA is extremely long (about 2.2 m in humans) and needs to be highly compacted to fit inside very small cells.
- In prokaryotes, DNA is present in a nucleoid without a true nucleus and is organised into loops and supercoils with the help of HU proteins, DNA gyrase, and topoisomerase enzymes.
- In eukaryotes, DNA is associated with positively charged histone proteins rich in lysine and arginine, which help in packaging.
- DNA wraps around a histone octamer made of H2A, H2B, H3, and H4 to form nucleosomes, which are the basic repeating units of chromatin.
- Nucleosomes further coil into higher-order structures like 10 nm fibres, 30 nm solenoids, and looped domains with the help of histone H1 and non-histone chromosomal proteins, ultimately forming chromosomes.
- Chromatin exists in two forms: euchromatin, which is loosely packed and transcriptionally active, and heterochromatin, which is densely packed and transcriptionally inactive.
Key Points: DNA Replication
- DNA replication is the process of synthesis of DNA from parental DNA, occurring in the S-phase of interphase and follows a semi-conservative mode.
- It is semi-conservative, meaning each daughter DNA has one parental strand and one newly synthesized strand, as proved by Meselson and Stahl experiment (E. coli).
- Replication begins at a specific site called the origin (Ori), where nucleotides are activated, and DNA unwinds using the enzyme helicase, forming a replication fork.
- Single-strand binding proteins (SSB) stabilise separated strands, and RNA primers initiate synthesis of new strands.
- DNA polymerase synthesizes new strands in the 5’ → 3’ direction, continuously on the leading strand and discontinuously on the lagging strand.
- On the lagging strand, short DNA segments called Okazaki fragments are formed and later joined by DNA ligase.
- RNA primers are removed and replaced by DNA, and finally, two identical daughter DNA molecules are formed.
Key Points: Semi-Conservative Replication
- Semiconservative replication means each new DNA molecule has one old (parental) strand and one newly synthesised strand.
- This was proved by Meselson and Stahl in 1958 using E. coli and density gradient centrifugation.
- E. coli were first grown in ¹⁵N (heavy nitrogen), so the DNA became heavy.
- When transferred to ¹⁴N (light nitrogen), after first generation a hybrid (¹⁴N–¹⁵N) DNA band was formed.
- After the second generation, two bands appeared (hybrid and light DNA), confirming semiconservative replication.
Key Points: Transcription
- Transcription is the process in which genetic information from one strand of DNA (template strand) is copied into RNA with the help of RNA polymerase.
- It occurs in the nucleoid in prokaryotes and in the nucleus in eukaryotes; mRNA then moves to the cytoplasm for translation.
- A transcription unit has three parts: promoter (start site), structural gene, and terminator (stop site).
- The process occurs in three stages: initiation (RNA polymerase binds promoter), elongation (RNA chain is formed), and termination (RNA polymerase detaches).
- Only one DNA strand acts as a template (3′→5′), while the other is the coding strand (5′→3′).
- In eukaryotes, primary RNA (hnRNA) is processed by capping, tailing, and splicing to form mature mRNA.
Key Points: Protein Synthesis
- Protein synthesis is the process by which cells produce proteins, which act as structural components, enzymes, and hormones.
- It involves two main steps: transcription (DNA → RNA) and translation (RNA → protein).
- In transcription, genetic information from DNA is copied into mRNA, where uracil (U) replaces thymine (T).
- Central dogma, proposed by Francis Crick (1958), states that information flows from DNA → RNA → protein.
- In retroviruses, reverse transcription occurs (RNA → DNA), explained by Temin and Baltimore (1970) using RNA-dependent DNA polymerase.
Key Points: tRNA – the Adapter Molecule
- tRNA (transfer RNA) is called the adapter molecule because it links mRNA codons to their corresponding amino acids during the process of translation.
- The structure of tRNA resembles a cloverleaf in 2D (with loops and arms) and a hairpin/L-shape in 3D. It has two important sites — the anticodon loop (for codon recognition on mRNA) and the amino acid attachment site at the 3' end (acceptor end).
- tRNA has the following main structural parts: Acceptor arm (3' end, carries amino acid), TΨC arm/loop, DHU arm/loop, Anticodon arm with anticodon loop (recognises mRNA codon), and a Variable loop.
- The anticodon present on tRNA is complementary to the codon on mRNA. This ensures the correct amino acid is added during protein synthesis.
- The 3' end of tRNA always ends with the sequence CCA, where the amino acid attaches. The 5' end starts with G. Hydrogen bonds hold the structure together.
Key Points: Characteristics of the Genetic Code
- The genetic code is a triplet code, where three nucleotides (codons) specify one amino acid.
- It has polarity and is always read in 5′ → 3′ direction.
- The genetic code is non-overlapping and commaless, meaning codons are read continuously without gaps.
- It is non-ambiguous, so one codon codes for only one specific amino acid.
- The genetic code shows degeneracy, where one amino acid can be coded by more than one codon.
- It is universal, meaning the same codon specifies the same amino acid in most organisms.
- AUG is the start codon (codes for methionine), while UAA, UAG, and UGA are stop codons that terminate protein synthesis.
Key Points: Mutation and Genetic Code
- A mutation is a sudden heritable change in the genetic material (DNA).
- A point mutation involves a change in a single base pair; for example, sickle cell anaemia.
- A frame-shift mutation occurs due to the insertion or deletion of bases.
- Frame-shift mutations change the reading frame of the genetic code, affecting the entire protein sequence.
Key Points: Regulation of Gene Expression
- Gene regulation = switching genes ON or OFF based on the cell's requirement and stage of development.
- In eukaryotes, regulation occurs at 4 levels: Transcriptional (primary transcript), Processing (splicing), Transport (mRNA from nucleus to cytoplasm), and Translational.
- In prokaryotes, control of transcriptional initiation rate is the main site for gene expression control.
- E. coli produces β-galactosidase to break lactose → galactose + glucose. If lactose is absent, the enzyme is not produced, proving the environment regulates gene expression.
- Enzymes synthesized based on substrate availability are called inducible enzymes. The process = induction; the triggering molecule = inducer. This is a positive control.
- Feedback repression = when the end product (e.g., amino acid) is already available, genes for its production are switched OFF. This is a negative control.
Key Points: Operon Concept
- The operon model was given by F. Jacob & J. Monod (1961). It controls gene expression in prokaryotes and is based on the lac operon in E. coli.
- An operon has 4 parts — Regulator (makes repressor), Promoter (RNA Polymerase binds here), Operator (controls structural genes), and Structural genes (lac-z, lac-y, lac-a — encode enzymes for lactose digestion).
- The inducer is allolactose. It binds to the repressor, inactivates it, and allows transcription to occur.
- Without lactose (Operon OFF): Repressor binds to the operator → blocks RNA polymerase → no enzymes produced.
- With lactose (Operon ON): Lactose is converted to allolactose → inactivates repressor → operator is free → RNA polymerase transcribes → β-galactosidase, Permease, Transacetylase are produced → lactose broken down into galactose + glucose.
- The lac operon is an inducible operon — normally OFF, switched ON only when lactose is present.
- Important bonds — Hydrogen bond: links nitrogen bases; Glycosidic bond: base to sugar; Phosphoester bond: phosphate to sugar; Phosphodiester bond: links nucleotides; Peptide bond: links amino acids.
Key Points: The Lac Operon
- The lac operon is a gene regulatory system in E. coli that controls lactose metabolism by regulating transcription of structural genes. It includes the regulator gene (lacI), operator, promoter, and structural genes (z, y, a).
- The lacI gene produces a repressor protein that binds to the operator in the absence of lactose, preventing RNA polymerase from initiating transcription.
- In the absence of lactose, the operon remains “off” because the repressor blocks RNA polymerase, so structural genes are not expressed.
- In the presence of lactose, a small amount enters the cell via permease and is converted into allolactose by β-galactosidase.
- Allolactose acts as an inducer by binding to the repressor and changing its shape, inactivating it so it cannot bind the operator.
- Once the repressor is inactivated, RNA polymerase binds to the promoter and transcribes the structural genes, producing enzymes for lactose metabolism.
- This leads to the synthesis of β-galactosidase, permease, and transacetylase, enabling efficient utilisation of lactose.
Key Points: Human Genome Project
- The Human Genome Project (HGP) was an international mega project launched in 1990 and completed in 2003 to sequence the entire human genome.
- It was coordinated mainly by the US Department of Energy and the National Institutes of Health (NIH), with participation from about 18 countries.
- The main aim was to identify all human genes, determine their locations, and sequence the complete human DNA (about 3.2 billion base pairs).
- The human genome contains approximately 20,000–25,000 genes, and less than 2% of DNA codes for proteins, while most consists of non-coding and repetitive sequences.
- The project used advanced techniques like automated DNA sequencing, cloning using BAC and YAC vectors, and genome mapping approaches.
- HGP revealed that about 99.9% of human DNA is identical, with variations such as SNPs and CNVs responsible for individual differences.
- The project has applications in disease diagnosis, gene therapy, genetic counselling, and understanding human evolution, along with ethical concerns related to genetic data use.
Key Points: DNA Fingerprinting
- DNA fingerprinting is a technique used to identify individuals based on unique patterns in their DNA, mainly using VNTRs (Variable Number Tandem Repeats), also called minisatellites.
- VNTRs are short repetitive DNA sequences that show high variation among individuals, making each DNA profile unique (except identical twins).
- The technique was developed by Alec Jeffreys and even a very small amount of DNA can be used for analysis.
- The process involves DNA extraction from samples like blood, hair, semen, or tissue, followed by PCR amplification if needed and restriction enzyme digestion.
- DNA fragments are separated using gel electrophoresis, transferred to a membrane by Southern blotting, and hybridised with specific VNTR probes.
- The hybridised DNA is visualized using autoradiography, producing a unique banding pattern for each individual.
- DNA fingerprinting is widely used in forensic science, paternity testing, criminal investigations, identification, genetic diversity studies, and disease diagnosis.
Key Points: Transcription Unit
| Component |
Location |
Function |
| Promoter |
At the 5′ end of the structural gene |
Provides binding site for RNA polymerase and initiates transcription |
| Structural Gene |
Between promoter and terminator |
Contains genetic information to be transcribed |
| Template Strand |
DNA strand with 3′ → 5′ polarity |
Serves as template for RNA synthesis |
| Coding Strand |
DNA strand with 5′ → 3′ polarity |
Does not code directly; used as reference strand |
| Terminator |
At the 3′ end of the coding strand |
Signals the end of transcription |
Key Points: Transcription Unit and the Gene
- A transcription unit is a segment of DNA consisting of a promoter (start site), a structural gene, and a terminator (end site).
- The template strand (3′→5′) is used for RNA synthesis, while the coding strand (5′→3′) has the same sequence as mRNA (except U replaces T).
- RNA polymerase binds to the promoter, synthesises RNA in 5′→3′ direction, and stops at the terminator.
- A gene is a DNA sequence that codes for RNA or protein; a cistron is a unit coding for a polypeptide.
- Monocistronic mRNA (in eukaryotes) has one gene per transcript, while polycistronic mRNA (in bacteria) has multiple genes in one transcript.
- In eukaryotes, three RNA polymerases are present: RNA polymerase I (rRNA), RNA polymerase II (mRNA/hnRNA), and RNA polymerase III (tRNA and snRNA).