Genetics

Genetics

In 2023, 23andMe reported that over 14 million people had spit into a tube, mailed it off, and paid roughly $199 to learn what their DNA "says about them." Most got back a colorful report telling them they're 43% Northern European, probably don't like cilantro, and have slightly elevated risk for macular degeneration. Some cried. Some called their parents with awkward questions. And nearly all of them walked away understanding less about genetics than they thought they did before opening the email.

Here's what those reports won't tell you: your DNA is not a fortune teller. It's a recipe book with 20,000 entries, and the kitchen it operates in — your environment, your habits, sheer chance — determines which dishes actually get made. The distance between "you carry a variant associated with" and "you will develop" is enormous, and it's exactly that distance where real genetics lives. Not in the marketing. Not in the panic or the relief. In the biology.

So forget the consumer packaging for a minute. What does your genetic code actually do, how did we figure that out, and why does it matter for the medicine you'll receive, the food you'll eat, and the children you might have? That's the territory we're covering — from a monk counting peas in the 1850s to oncologists selecting cancer drugs based on a tumor's specific mutations in 2025.

A Monk, Some Peas, and the Birth of a Science

Before Gregor Mendel, heredity was basically folklore. People knew children resembled their parents. Breeders knew you could sometimes get a fast, strong foal by crossing a fast horse with a strong one. But the mechanism? Total mystery. The dominant theory was "blending inheritance" — offspring as a smooth mixture of both parents, like stirring cream into coffee. Sounds reasonable until you realize that if traits just blended, all variation would wash out within a few generations. Every horse would be medium.

Mendel saw something different. Working in a monastery garden in Brno (modern-day Czech Republic) between 1856 and 1863, he cross-pollinated roughly 28,000 pea plants and tracked seven traits across generations. The results defied blending. When he crossed a purple-flowered plant with a white-flowered plant, the first generation was entirely purple. No lavender. But when he crossed those purple offspring with each other, white flowers reappeared in the next generation — at a ratio stunningly close to 3:1.

Key Insight

Mendel's 3:1 ratio revealed that traits don't blend — they're carried by discrete "factors" (what we now call genes) that can hide for a generation and reappear unchanged. This was the single most important observation in the history of genetics.

From these numbers, Mendel deduced two principles that still anchor genetics today. His Law of Segregation states that every organism carries two copies of each hereditary factor, and those copies separate during the formation of sex cells, so each egg or sperm carries only one. His Law of Independent Assortment says that genes for different traits sort independently of each other — whether you inherit the tall gene doesn't determine whether you also inherit the purple-flower gene, assuming those genes sit on different chromosomes.

The tragedy? Almost nobody noticed. Mendel published in 1866 in a regional journal that gathered dust for 34 years. He died in 1884 without knowing he'd founded an entire science. Three European botanists independently rediscovered his work in 1900 and realized the monastery gardener had been right all along.

Tracing Traits Through Your Family Tree

You can see Mendelian genetics playing out at every family reunion. The toddler with red hair when both parents are brunette. The kid who's the only one in the family who can roll their tongue. Your grandmother's cleft chin showing up in your nephew but skipping everyone in between. These aren't random. They're patterns, and once you understand dominant and recessive alleles, the patterns snap into focus.

Take a concrete example. The gene for earwax type — yes, there's a gene for that — comes in two versions. The allele for wet, sticky earwax (W) is dominant over the allele for dry, flaky earwax (d). If you inherit one W from your mother and one d from your father, you have wet earwax. You need two copies of d to have dry earwax. Most people of European and African descent carry at least one W allele. In East Asian populations, the dd genotype is far more common, sometimes exceeding 90%.

Parent 1: Wd (wet earwax, carrier of dry)
Parent 2: Wd (wet earwax, carrier of dry)
Offspring: WW (25%) | Wd (50%) | dd (25%)

That flowchart is a Punnett square in disguise. Two carrier parents, each with one dominant and one recessive allele, produce offspring in a predictable ratio: roughly one in four will express the recessive trait. This is the exact pattern Mendel observed with his peas — and it's the same math a genetic counselor uses when telling prospective parents about carrier risk for conditions like cystic fibrosis or sickle cell anemia.

But families also demonstrate where Mendel's clean ratios break down. Skin color isn't controlled by a single gene — it involves at least a dozen genes working together, producing a continuous spectrum rather than a 3:1 split. Height is influenced by hundreds of genetic variants plus nutrition, sleep, and childhood illness. These polygenic traits are the norm in human biology, not the exception. Mendel's pea traits were unusual precisely because they were so simple.

The Molecule That Runs the Show

Mendel knew nothing about DNA. He didn't need to. But understanding what genes physically are — what they're made of, how they store information, how they copy themselves — transforms genetics from a pattern-matching exercise into a molecular science.

DNA (deoxyribonucleic acid) is a polymer of four types of nucleotides, each containing a sugar (deoxyribose), a phosphate group, and one of four nitrogenous bases: adenine (A), thymine (T), cytosine (C), and guanine (G). Two strands wind around each other in the famous double helix, held together by hydrogen bonds between complementary bases — A pairs with T, C pairs with G. The sequence of those bases is the genetic code.

3.2 Billion — Base pairs in a single copy of the human genome, packed into 46 chromosomes inside nearly every cell of your body

If you unwound all the DNA from a single human cell and laid it end to end, it would stretch about two meters. Yet it's crammed into a nucleus roughly six micrometers across. That's the equivalent of stuffing 40 kilometers of thread into a tennis ball. The cell manages this through an extraordinary packing system involving proteins called histones, around which DNA wraps like thread around tiny spools, forming structures called nucleosomes that coil and fold into the dense bundles we see as chromosomes during cell division.

When a cell needs to divide, it first copies its entire genome through DNA replication. The enzyme helicase unzips the double helix. DNA polymerase then moves along each exposed strand, adding complementary nucleotides one by one at a rate of roughly 1,000 bases per second in humans. The error rate is astonishingly low — about one mistake per billion nucleotides copied, thanks to built-in proofreading. But "one per billion" across 3.2 billion bases still means a handful of new mutations every time a cell divides.

How Watson and Crick (and Franklin) cracked the structure

The double helix wasn't discovered in a single eureka moment. Rosalind Franklin, working at King's College London, produced the critical X-ray crystallography image — Photo 51 — in 1952 that revealed DNA's helical structure and key dimensions. James Watson and Francis Crick, at the Cavendish Laboratory in Cambridge, used Franklin's data (shown to Watson by Maurice Wilkins without her explicit consent) to build their famous model in 1953. Watson, Crick, and Wilkins received the Nobel Prize in 1962. Franklin had died of ovarian cancer in 1958 at age 37 and was not eligible for posthumous nomination. The scientific community has increasingly recognized her contribution as foundational and indispensable.

From Gene to Protein: How Your DNA Actually Does Things

A gene sitting silently in your chromosomes isn't doing much. Genes matter because they encode proteins — and proteins do almost everything in your body. Hemoglobin carries oxygen. Collagen holds your skin together. Insulin regulates blood sugar. Enzymes catalyze thousands of chemical reactions per second. The path from DNA sequence to functional protein is called gene expression, and it happens in two major steps.

Transcription comes first. RNA polymerase binds to a promoter region on the DNA, unwinds a short stretch, and reads one strand as a template to build a complementary molecule of messenger RNA (mRNA). Think of mRNA as a photocopy of the gene — it can leave the nucleus while the original stays protected. In eukaryotic cells, the raw transcript gets processed: non-coding sections called introns are snipped out, and the remaining exons are spliced together into mature mRNA.

DNA (gene)
Transcription
mRNA
Translation
Protein

Translation happens at ribosomes, either floating in the cytoplasm or attached to the endoplasmic reticulum. The ribosome reads the mRNA three bases at a time — each triplet is a codon, and each codon specifies one amino acid. Transfer RNA (tRNA) molecules act as translators, each carrying one amino acid and bearing an anticodon that matches a specific mRNA codon. As the ribosome slides along the mRNA, it links amino acids together into a growing polypeptide chain that folds into a functional protein.

Here's what makes this system extraordinary: every cell in your body carries the same genome, but a liver cell expresses a completely different set of genes than a neuron or a skin cell. This selective activation is gene regulation, and it's what allows a single fertilized egg to develop into an organism with over 200 distinct cell types. Transcription factors, enhancers, silencers, and the physical accessibility of DNA (determined partly by how tightly it's wound around histones) all control which genes get transcribed and when.

The Epigenetic Layer: When the Environment Edits the Editor

Your DNA sequence is fixed at conception. Barring mutations, the A's, T's, C's, and G's you were born with are the ones you'll die with. But epigenetics — a word that literally means "above genetics" — adds a layer of control that can change how genes behave without changing the sequence itself.

The two best-studied epigenetic mechanisms are DNA methylation and histone modification. Methylation involves attaching a small chemical tag (a methyl group) to cytosine bases, typically in CG-rich regions called CpG islands near gene promoters. Heavy methylation generally silences a gene. Histone modification works by adding or removing chemical groups on the histone proteins that DNA wraps around, making the chromatin structure either more open (gene on) or more compact (gene off).

Real-World Scenario

Consider identical twins. They share 100% of their DNA sequence. Yet by middle age, one twin might develop type 2 diabetes while the other doesn't. One might show early signs of rheumatoid arthritis. Studies of monozygotic twins have found that their epigenetic profiles diverge significantly over time, especially if they lived in different environments, ate different diets, or experienced different levels of stress. Same recipe book, different meals — because the bookmarks and sticky notes changed.

What makes epigenetics genuinely startling is that some of these marks appear to be heritable. Research on the Dutch Hunger Winter — a famine in the Netherlands during 1944-1945 — found that children conceived during the famine had higher rates of obesity and cardiovascular disease decades later. More remarkably, some effects appeared in their grandchildren. The famine changed gene expression patterns that persisted across generations, not through DNA mutations but through epigenetic tags passed along during development.

This blurs the old nature-versus-nurture divide. Your genes set the range of possibilities. Your environment, through epigenetic mechanisms, determines where within that range you actually land. A person with genetic variants associated with high intelligence who grows up malnourished and without access to education won't reach the same cognitive outcomes as that same genome in a well-nourished, stimulating environment. The genome is not destiny. It's a set of conditional instructions.

When Genes Go Wrong: Genetic Disorders

A single misplaced letter in 3.2 billion can kill you. That's not hyperbole. Sickle cell anemia results from one nucleotide substitution in the gene for beta-globin, a component of hemoglobin. Instead of glutamic acid at position six, the mutant protein has valine. That one amino acid change causes hemoglobin molecules to stick together under low-oxygen conditions, deforming red blood cells into rigid, crescent shapes that clog capillaries, destroy tissue, and cause episodes of excruciating pain.

Genetic disorders fall into several categories, each with different inheritance patterns and consequences.

Single-Gene (Mendelian) Disorders

Autosomal dominant: One mutant copy is enough. Huntington's disease — caused by an expanded CAG repeat in the HTT gene — strikes even heterozygous carriers, typically in their 30s or 40s. The parent with the mutation has a 50% chance of passing it to each child.

Autosomal recessive: Two mutant copies required. Cystic fibrosis (CFTR gene), PKU (PAH gene), Tay-Sachs (HEXA gene). Carriers are typically unaffected. Two carriers have a 25% chance per pregnancy of an affected child.

X-linked recessive: Gene on the X chromosome. Hemophilia A, Duchenne muscular dystrophy, red-green color blindness. Males (XY) are affected with just one copy. Females (XX) are usually carriers unless both X chromosomes carry the variant.

Complex & Chromosomal Disorders

Polygenic/multifactorial: Heart disease, diabetes, schizophrenia, most cancers. Dozens or hundreds of genes contribute small effects, interacting with environmental factors. No single gene is "the cause." Risk is probabilistic, not deterministic.

Chromosomal abnormalities: Down syndrome (trisomy 21 — three copies of chromosome 21), Turner syndrome (single X in females), Klinefelter syndrome (XXY in males). These involve entire chromosomes or large segments being duplicated, deleted, or rearranged.

Mitochondrial: Inherited exclusively from the mother, since mitochondria in the egg are the source. Leber hereditary optic neuropathy and MELAS syndrome are examples. These affect energy production in cells.

The sickle cell story has a twist that perfectly illustrates evolutionary genetics. In regions where malaria is endemic — sub-Saharan Africa, parts of India, the Mediterranean — being a carrier of one sickle cell allele (heterozygous) provides significant resistance to the malaria parasite Plasmodium falciparum. Two normal copies leave you vulnerable to malaria. Two sickle copies give you sickle cell disease. One of each is the sweet spot. This is heterozygote advantage, and it explains why a seemingly harmful allele persists at high frequency in certain populations. Natural selection isn't optimizing for perfection. It's optimizing for survival in a specific environment.

Beyond Mendel: The Complications That Make Genetics Real

Mendel's laws work beautifully for traits controlled by single genes with clearly dominant and recessive alleles. The problem is that most traits aren't that tidy.

Incomplete dominance produces a blended phenotype in heterozygotes. Cross a red snapdragon (RR) with a white one (WW), and the offspring are pink (RW). Neither allele fully masks the other. This looks superficially like blending inheritance, but the key difference is that crossing two pink flowers still yields red, pink, and white offspring in a 1:2:1 ratio. The original alleles haven't blended at all — they're still discrete units.

Codominance is different again. Both alleles express fully and simultaneously. The textbook example is the ABO blood group system. If you inherit an IA allele from one parent and an IB from the other, your red blood cells display both A and B surface antigens. Your blood type is AB, not some intermediate. The i allele (type O) is recessive to both. This system matters clinically: give type A blood to a type B patient and the immune response can be fatal.

Family Example

A father with blood type A (genotype IAi) and a mother with blood type B (genotype IBi) can produce children with any of the four blood types: A, B, AB, or O. This single gene system creates enough variety that blood type was once used in paternity disputes — it can exclude a man as the father but can't confirm he is one.

Sex-linked inheritance adds another layer. Genes on the X chromosome follow different patterns in males and females because males have only one X. Red-green color blindness affects about 8% of men but fewer than 0.5% of women. A woman needs two copies of the recessive allele (one on each X) to be color-blind; a man needs only one. This is why color blindness often appears to "skip" generations — a carrier mother passes the allele to her sons, who express it, while her daughters typically become carriers themselves.

And then there's epistasis, where one gene masks or modifies the expression of another gene entirely. Labrador retriever coat color is a classic case. One gene determines whether pigment is brown or black. A second gene determines whether pigment gets deposited in the fur at all. A dog can carry alleles for black pigment, but if it's homozygous recessive at the deposition gene, it's yellow regardless. Two genes, interacting, producing an outcome neither would produce alone.

Reading the Genome: From Sanger to Sequencing in a Day

The Human Genome Project took 13 years (1990–2003), involved 20 institutions across six countries, and cost approximately $2.7 billion. It produced the first essentially complete sequence of the human genome. At the time, that was a staggering achievement.

Cost to sequence a human genome in 2003$2.7 Billion
Cost in 2015$1,500
Cost in 2025~$200

That cost plummeted faster than Moore's Law. Next-generation sequencing technologies — Illumina's sequencing-by-synthesis, Pacific Biosciences' long-read platforms, Oxford Nanopore's pocket-sized MinION devices — have made whole-genome sequencing accessible to individual research labs and, increasingly, to clinical settings. A genome that took years now takes hours.

The data is staggering. The 1000 Genomes Project catalogued genetic variation across 2,504 individuals from 26 populations. The UK Biobank holds genetic and health data on 500,000 participants. The All of Us Research Program in the United States aims for one million. These massive datasets let researchers run genome-wide association studies (GWAS), scanning millions of genetic variants across thousands of people to find correlations with diseases, drug responses, and physical traits.

But here's the sobering reality that consumer DNA tests gloss over: for most common conditions, individual genetic variants contribute tiny effects. A GWAS might find 200 variants associated with height, each adding or subtracting a millimeter or two. A variant that raises your risk of type 2 diabetes by 1.1-fold isn't meaningfully predictive on its own. The challenge of modern genomics isn't reading the code — it's understanding what the code means when thousands of variants interact with each other and with everything you eat, breathe, and do.

Personalized Medicine: Your Genome in the Doctor's Office

Despite those caveats, genetics is already transforming clinical medicine in ways that would have seemed fictional 20 years ago.

Pharmacogenomics is perhaps the most immediately practical application. The enzyme CYP2D6, encoded by a single gene with over 100 known variants, metabolizes roughly 25% of all prescription drugs. Some people carry variants that make them "ultra-rapid metabolizers" — codeine converts to morphine so fast it can cause respiratory depression. Others are "poor metabolizers" who get almost no pain relief from the same dose. Before pharmacogenomic testing, doctors prescribed based on averages. Now, a cheek swab can guide dosing for antidepressants, blood thinners, pain medications, and certain cancer drugs.

Real-World Scenario

In 2024, the FDA's Table of Pharmacogenomic Biomarkers listed over 450 drug-gene interactions on approved drug labels. A breast cancer patient whose tumor tests positive for HER2 overexpression (driven by amplification of the ERBB2 gene) receives trastuzumab (Herceptin), which specifically targets HER2-positive cells. A patient whose tumor lacks that amplification gets a different treatment entirely. The drug didn't change. The diagnosis did — because we can now read what the tumor's genome is doing.

Genetic testing has expanded beyond rare diseases. Newborn screening panels in most U.S. states now test for over 30 conditions using a heel prick blood sample — catching diseases like phenylketonuria (PKU) early enough that a modified diet prevents intellectual disability entirely. Carrier screening lets prospective parents know if they both carry recessive alleles for conditions like Tay-Sachs or cystic fibrosis. Prenatal testing can detect chromosomal abnormalities. And predictive testing for adult-onset conditions — the BRCA1 and BRCA2 genes being the most famous example — allows people to make proactive health decisions.

The BRCA story illustrates both the power and the complexity. Women carrying certain BRCA1 mutations face a lifetime breast cancer risk of 55–72%, compared to about 13% in the general population. Angelina Jolie's 2013 op-ed about her preventive double mastectomy after testing positive put genetic testing on the front page. But BRCA mutations account for only about 5–10% of all breast cancers. Most breast cancer has no single genetic cause. A negative BRCA test doesn't mean you're safe. A positive one doesn't mean cancer is inevitable. Context, always, is everything.

CRISPR and the Frontier of Gene Editing

If sequencing lets us read the genome, CRISPR-Cas9 lets us rewrite it. Discovered as part of the bacterial immune system — bacteria use it to chop up invading viral DNA — it was adapted for gene editing by Jennifer Doudna and Emmanuelle Charpentier, who shared the 2020 Nobel Prize in Chemistry for the work.

The system is elegant in its simplicity. A short guide RNA is designed to match a target DNA sequence. The Cas9 enzyme, carrying this guide, scans the genome until it finds the match, then cuts both strands of the DNA at that exact spot. The cell's natural repair machinery then fixes the break — and researchers can exploit that repair process to delete a gene, correct a mutation, or insert new DNA.

2012
CRISPR-Cas9 adapted for gene editing

Doudna and Charpentier demonstrate programmable DNA cutting, publishing the landmark paper in Science.

2015
First human embryo editing (China)

Chinese researchers edit non-viable human embryos, sparking global ethical debate about germline modification.

2018
He Jiankui's "CRISPR babies"

A Chinese scientist announces twin girls born with edited CCR5 genes (intended HIV resistance). International condemnation follows. He receives a three-year prison sentence.

2023
Casgevy approved for sickle cell disease

The UK's MHRA approves the first CRISPR-based therapy, Casgevy (exagamglogene autotemcel), for sickle cell disease and transfusion-dependent beta-thalassemia. FDA approval follows weeks later.

2025
Expanding therapeutic pipeline

Clinical trials underway for CRISPR treatments targeting hereditary blindness (Leber congenital amaurosis), high cholesterol (PCSK9 gene editing), and certain cancers via modified T cells.

Casgevy's approval in December 2023 was a watershed moment. Patients with sickle cell disease undergo extraction of their own bone marrow stem cells, which are edited with CRISPR to reactivate fetal hemoglobin — a form of hemoglobin that doesn't sickle. The modified cells are then infused back. Early results showed patients going years without vaso-occlusive crises, the agonizing pain episodes that define the disease. The cost, however, sits at roughly $2.2 million per patient, raising urgent questions about who gets access.

And that's just the therapeutic side. CRISPR is being used in agriculture to develop disease-resistant crops without introducing foreign DNA (sidestepping some GMO regulations), in environmental science to engineer gene drives that could suppress malaria-carrying mosquito populations, and in basic research to systematically knock out genes and study what happens. The tool is roughly a decade old. The implications will unfold over the next century.

The Genetics of You: Polygenic Scores, Ancestry, and What DNA Can't Tell

Back to that 23andMe report. What's it actually measuring?

Consumer DNA tests typically genotype around 600,000 to 700,000 specific positions (called single nucleotide polymorphisms, or SNPs) scattered across the genome. That's a tiny fraction of 3.2 billion base pairs, but SNPs at known locations can serve as markers for broader genomic regions. By comparing your SNP profile to reference populations, the company estimates your genetic ancestry. By checking specific SNPs associated with traits or health conditions, it generates risk reports.

The ancestry estimates are approximations, not birth certificates. They depend heavily on the reference populations in the company's database. If your heritage includes populations that are underrepresented in the reference panel — which includes most non-European groups — the estimates become less precise. "43% Northern European" really means "43% of your tested SNPs match patterns most common in our Northern European reference samples." As reference databases grow and diversify, these estimates shift — which is why your ancestry results can literally change between years without your DNA changing at all.

Critical Caveat

Health risk reports from consumer DNA tests are not clinical diagnoses. They assess a handful of variants out of thousands that may influence a condition. A "low risk" result for Alzheimer's does not mean you won't develop it. A "high risk" result for type 2 diabetes doesn't mean you will. These reports measure statistical associations in study populations, not certainties in your individual biology. Always discuss concerning results with a genetic counselor or physician.

Polygenic risk scores represent an attempt to aggregate the tiny effects of many variants into a single number. A score might combine information from hundreds of thousands of SNPs to estimate someone's genetic predisposition to heart disease, schizophrenia, or educational attainment. These scores explain meaningful amounts of variation at the population level but are still poor predictors for individuals. A person in the top 5% of a polygenic risk score for coronary artery disease has roughly triple the average risk — significant, but that still means most people in that top 5% won't have a heart attack.

The ethics get thorny fast. Polygenic scores for "educational attainment" have been published. They're correlated with socioeconomic status. They differ across racial groups in ways that reflect historical inequities in the study populations, not biological truths about intelligence. The potential for misuse — by insurers, employers, or governments — is why the Genetic Information Nondiscrimination Act (GINA) exists in the U.S., though it notably doesn't cover life insurance, disability insurance, or long-term care insurance.

Where Genetics Goes From Here

The trajectory is unmistakable: genetic information is becoming cheaper, faster, and more integrated into everyday decisions. Newborns in pilot programs are receiving whole-genome sequencing at birth. Cancer patients routinely have their tumors genetically profiled to guide treatment. Agricultural companies use genomic selection to breed crops that can tolerate drought, resist disease, and produce higher yields on less land.

The takeaway: Genetics is no longer a field confined to research laboratories. It shapes which medications your doctor prescribes, which crops survive changing climates, which diseases get caught before symptoms appear, and how you understand your own family history. The molecule that Mendel never knew existed now influences insurance debates, criminal forensics, food policy, and reproductive choices. Understanding genetics isn't optional for navigating the 21st century — it's foundational.

But every new capability arrives with new questions. Should parents be able to select embryos based on polygenic scores? Should gene drives be released into wild mosquito populations? Should employers have access to genetic information? Should we edit the human germline — making changes that pass to all future generations — even to cure devastating diseases? These aren't hypothetical scenarios. They're active policy debates in legislatures, ethics boards, and courtrooms around the world right now.

The science will keep advancing regardless of how fast the policy catches up. Base editing, prime editing, and epigenome editing are already refining what CRISPR started, offering more precise modifications with fewer off-target effects. Long-read sequencing is revealing structural variants that short-read technology missed for years. Single-cell genomics lets researchers study gene expression in individual cells rather than bulk tissue, revealing cellular diversity we didn't know existed.

Mendel counted peas. We sequence genomes on a chip the size of a USB drive. But the fundamental question hasn't changed in 160 years: how does information pass from one generation to the next, and what does that information do? The answers just keep getting more detailed, more powerful, and more consequential. What you do with that knowledge — as a patient, a voter, a parent, or simply a person living inside a genome — is the part that Mendel couldn't predict. That part is up to you.