Microevolution (adaptation and other changes within a species) is commonly observed, but the more striking consequences of evolution usually take place on timescales far too long to observe from start to finish. However, there are some well-established cases where macroevolution can be observed in real time. Some definitions here will be useful:
Biological species concept ~ a species is any group who is reproductively isolated from other such groups, due to e.g. behavioural isolation, genetic incompatibility or failure to produce viable offspring. This is the most common species concept for studying extant life, but is undefined for asexual organisms (prokaryotes), so another concept is required.
Phylogenetic species concept ~ a species is the smallest monophyletic grouping when performing comparative genomic analysis on a population. This is much more suited for prokaryotes, defining species via genetic similarity.
Speciation ~ formation of more than one species from a population of one species, where species is defined suitably using one of the species concepts (like the above).
Macroevolution ~ variations in heritable traits in populations with multiple species over time. Speciation marks the start of macroevolution.
Although microevolution is useful for conceptualising how Darwinian evolution works (e.g. adaptation: heritable changes with natural selection), it is somewhat trivial and is not contested by critics of evolutionary theory. Therefore, we will list here only examples of macroevolution that have been observed in real time. Most of these are in the wild, with a few lab-based studies included too.
If macroevolution can be observed, and we know of no means by which the mechanisms of neo-Darwinian evolution (mutation/selection/drift/gene flow) can stop, and we have consilient evidence indicating continuation of the process back through time, and there is no reason to believe any contrary explanations (e.g. intelligent design/creationism), then the methodologically naturalistic, parsimonious, evidence-driven conclusion follows.
Direct observations are not the best evidence of evolution as a whole. Direct observation is just one line of inquiry: the other lines serve to justify and corroborate the extrapolation of those observations through deep time, synthesising the theory of evolution as we know it.
Reptiles are known for usually giving birth via egg-laying (oviparity), but there is evidence that some snakes and lizards (order Squamata) transitioned to giving live birth (viviparity) independently and recently. A ’transitional form’ between these two modes is ’lecithotrophic viviparity’, where the egg and yolk is retained and held wholly within the mother.
While observing a population of Zootoca vivipara in the Alps, reproductive isolation was found between these two subgroups, and attempts at producing hybrids in the lab led to embryonic malformations. Sometimes, the viviparous group would even give birth to two live young and one egg within the same litter of three. The oviparous group is now confined to the range spanning northern Spain and southern France (the Pyrenees), while the viviparous lizards extend across most of Europe.
Sources: here (paper), here (paper) and here (video).
The apple maggot fly (Rhagoletis pomonella) usually feeds on the berries of hawthorn trees, and is named after apples only because eastern American/Canadian apple growers in 1864 found its maggots feeding on their trees. Since then, the apple-eating and berry-eating groups have become more distinct.
This is a case of ‘sympatric speciation’: the geographic range of the apple group (north-eastern America) is contained within that of the berry group (temperate biomes globally). There is a barrier between the groups because 1) the two trees flower at different times of the year (apples in summer, hawthorns in autumn/fall) so flies must reproduce asynchronously, and 2) each group only lays its eggs on their respective fruit.
They were named due to people being bit by them while hiding in the underground tunnels of London’s tube train network during the Blitz of World War 2. It’s recently been shown that they did not first evolve there. It turns out that the ancestral species, Culex pipiens, lived above ground, while the new species, C. p. f. molestus, evolved in the Middle East ~2000 years ago, adapted to warm and dark below-ground city environments, of which the sealed tunnels of the 1860s London Underground was one.
The new species breeds all-year-round, is cold intolerant and bites rats, mice and humans, while the prior species hibernates in winter. This is a case of ‘allopatric speciation’ (geographic isolation) by ‘disruptive selection’, a rarer type of natural selection where an intermediate trait is selected against while extreme traits are favoured, leading to rapid separation into a bimodal distribution of the two lifecycles. Cross-breeding the two forms in the lab led to infertile eggs, implying reproductive isolation.
‘Colonialism’ (simple clumping/aggregation of single-celled organisms) is well-known, and does not count as multicellularity. But if the cells become obligately multicellular (lifecycle uses clonal division by mitosis and remain together, and splitting them apart kills the organism), the groundwork for de novo multicellularity is laid. This was observed in the lab by introducing a population of green algae (Chlamydomonas reinhardtii, a protist) to cultures of another predatory protist, over a period of 1 year (~750 generations). The strong selective pressure to defend against predation led to obligate multicellularity in the algae. This process, featuring increasingly large clusters of cells, is well-reflected in the extant clade Archaeplastida, which includes green algae (single cell protist), a variety of other colonial protists and plants (complex multicellular).
Another common trait of extant multicellular life is differentiated tissue formation due to cell specialisation. This too has been observed, and represents the formation of complex genetic control systems (by negative feedback loops) as studied by evolutionary developmental biology. Volvox is a good example, being within clade Archaeplastida (above) and having two cell types - one for sexual reproduction, one for phototaxis. Genetics also finds that the famous ‘Yamanaka factors’ for cell differentiation (as well as many other key innovations like cell-to-cell signaling, adhesion and the innate immune system) in animals inherit from those in choanoflagellates (the closest-related protists to animals and our likely last unicellular ancestors). So, both protist-to-plant and protist-to-animal transitions look pretty reasonable on this alone.
Sources: here, here (papers), here for cell specialisation, here (video) and here (long video).
This is a textbook example of bird microevolution from Darwin’s 1830s voyage of the Galápagos islands, but studies from the 1980s onwards have identified speciation in the ‘Big Bird lineage’ on Daphne Major island. Regional droughts which reduce seed dispersal to the islands, such as those that occurred in 1977 and 2004, as well as arrival of competitors, were found to be drivers of selection for beak stiffness. The new lineage of finches reproduces only with its own.
A ‘ring species’ is a rare and aesthetically-pleasing display of speciation wherein a population living outside a circular barrier (e.g. the sands surrounding a lagoon) sequentially mutates and migrates around the circle, so that when they meet up again on the other side, they cannot interbreed. One of the most well-known cases of this is the salamander Ensatina eschscholtzii, which spread around the edge of a dry uninhabitable valley in California. A total of seven subspecies of these salamanders developed around the circle, two of which cannot interbreed with each other. Actually, this case is not a ’true’ ring species, as the diversification process was more complex than simply continuously spreading around the circle, but it still does represent an example of complete speciation.
This process took millions of years, so it wasn’t directly observed, but the studies showing interbreeding capability of neighbouring subspecies despite isolation between two were done in the present, so it’s pretty conclusive as to what happened.
This is another ring species, and one that is closer to a true ring species than the Californian salamanders (though still not a perfect ring species - it seems there are no simple true cases!). These birds, Phylloscopus trochiloides, inhabit the closed boundary of the Tibetan Plateau, of which two reproductively isolated forms co-exist in central Siberia. Genetic studies find some degree of selection against interbreeding, contributing to the speciation process. This happened over about a million years, so we’re using the phylogenetic species concept here.
Tragopogon miscellus are ‘allopolyploid’ plants (multiple sets of chromosomes, some from another species) that formed repeatedly during the past 80 years following the introduction of three diploid species from Europe to the US. This new species has become established in the wild and reproduces on its own. The crossbreeding process that we have used to make new fruits and crops more generally exploits polyploidy (e.g. cultivated strawberries) to enhance susceptibility to selection for desired traits.
In the lab, a change in the expression patterns (controlled by upstream genes) of two regulatory genes led to alligators developing feathers on their skin instead of scales. These occur via the ‘Sonic hedgehog’ (Shh) pathway, one of the many developmental cascades activated by homeotic genes. The phenotypes observed in these experiments closely resembled those of the unusual filamentous appendages found in the fossils of some feathered dinosaurs, as if transitional.
A similar thing has been done to turn the chickens’ scales on their feet into feathers, this time with only one change to the Shh pathway, showing how birds are indeed dinosaurs and descend within Sauropsida.
There is excessive evidence that the organelles like mitochondria and chloroplasts (and more recently discovered, the nitroplast) found within extant eukaryotes were originally free-living prokaryotes, which became incorporated (endosymbiosis), but no such thing had been observed, until now. The bacterial order Legionellales are responsible for Legionnaire’s disease and live in water, but are uniquely able to survive and reproduce even after being ’eaten’ by some amoebae before returning to free-living conditions.
In the lab, it was found that some strains of wild amoeboid protists in clade Rhizaria, class Thecofilosea, were transmitting fully-incorporated Legionellales vertically by cell division. Extracellular transmission of bacteria was not observed, indicating mutualistic endosymbiosis, and genetic studies confirmed divergence of the endosymbiont via a shrinkage of its genome (as expected) and gene translocation to the protist’s nuclear DNA.
The migratory bird Sylvia atricapilla typically flies either south-westerly towards Spain or south-easterly into Asia as winter approaches in Europe, but the rise of birdwatching as a hobby in the UK in the 1960s led to a new food source in Britain that the westerly-flying birds could migrate to. This change is known to be genetic in basis. Those that instead migrated to the British Isles in winter returned home 10 days earlier (due to the shorter distance to central Europe) than those that went towards Spain, and therefore would mate only with themselves (sympatric speciation). The UK-migrating group now has rounder wings and narrower, longer beaks, over just ~30 generations, and although genetic differentiation has not yet reached the point of preventing interbreeding entirely, these birds are quite clearly well on their way to speciation.
Genetic similarity between organisms is indicative of evolutionary relatedness, since mutation accumulate within lineages and are passed on to offspring. Studying the genomes of extant life therefore informs evolutionary history.
Pseudogenes
29.49% of the human genome is made up of pseudogenes, most (but not all) of which are non-functional (either not transcribed or transcribed levels too low for functionality).
GULO (L-gulonolactone oxidase): GULO is mostly conserved across the animal kingdom, with a similar gene GLDH appearing in other eukaryotes. It encodes for the enzyme synthesising vitamin C (ascorbic acid) from L-gulono-γ-lactone (in turn from glucose). However, in haplorhines (tarsiers, monkeys, apes: including humans), GULO has been lost, so we have to source our own vitamin C from our diet (or die from scurvy). GULO has also been lost independently in the bat genus Pteropus and guinea pigs (Cavia porcellus).
If a retrovirus infects a germline cell (usually a sperm cell progenitor e.g. spermatocyte), then the viral genome will be inserted inside the germline DNA. When the sperm cell multiplies and fertilises an egg, the viral genome can be passed into the offspring. As long as the virus remains in its dormant state, it will not cause any problems and may become permanently fixed in the genome due to genetic drift. ERV sequences become quickly methylated on inheritance and have their LTRs mutated so cannot jump around the genome any more like retrotransposons, becoming fixed in position before speciation occurs. The viral genome is then said to be ’endogenous’ and will appear in all subsequent descendants of the first infected individual.
We can look for traces of these ’endogeneous retroviruses’ (ERVs) in modern genomes. ERVs can be identified by the ’long terminal repeats’ (LTRs) at either end of the genome, and the gag, pol (contains the reverse transcriptase, integrase and protease) and env genes for the viral proteins. Since ERVs insert themselves mostly randomly into the genome, if ERVs are found in extant species with exactly the same positions and identities, it can be safely assumed to be inherited from a common ancestor as the chance of a coincidental separate identical insertion is negligible. Most (at least 90%: source) ERVs are non-functional, so the common creationist argument of “common design” loses its validity for ERVs. Some ERVs can be exapted for use by the organism at a later time as a de novo gene, as new promoters form readily by random mutation alone.
For example, one type of ERV that is found in both humans and chimpanzees is called HERV-W. For this particular ERV, there are 211 of them in humans, 208 of them in chimps, of which 205 of are found in identical locations of both genomes (source). This tells us that the human-chimp common ancestor had the 205 HERV-W insertions that we both have, and then a few more were acquired more recently after the split. The env gene of HERV-W remains transcribable, and produces a protein called syncytin-1, which has been co-opted for use in the formation of the placenta, but can also act as an immunogen in multiple sclerosis.
To drive the point home, we can estimate explicitly the probability that two genomes would just so happen to have the observed 205 identical insertions, when one of them has 211 and the other has 208, if the ERVs are inserted randomly without any common ancestry (the null hypothesis). We assume a total of possible insertion sites in both genomes (‘hotspots’). The problem can be stated as listing as the enumerated possible ERV insertion sites. We then take, uniformly, randomly, independently and without replacement, a ‘human’ subset of insertions of size , and a ‘chimp’ subset of insertions of size . We want to find the probability that the intersection of and has exactly elements (shared insertions).
If the text is hard to read, please switch to light mode on the website! Diagram of the counting problem. We randomly place 211 X’s in the top set of boxes and 208 X’s in the bottom set of boxes, independently (max one X per box). How many X’s will we observe in the same place? These represent the common ERV insertions.
We can solve for the required probability with combinatorics:
since there are ways of choosing the intersection, ways of choosing the rest of the chimp elements, and ways of choosing the rest of the human elements (source). Using the binomial coefficient identity (source), we find it corresponds to the probability mass function (PMF) of a hypergeometric distribution:
I believe this model is more accurate than the calculations performed by some other sources (e.g. Stated Clearly uses a binomial distribution, source here, video here), although it doesn’t affect the conclusion we make.
Now, we sum over this probability between and to get our -value. The answer is so tiny that normal calculators simply return zero. Using the more powerful WolframAlpha for the computation instead, we find that the answer is:
As intuited, the odds of getting the observed HERV-W distribution in humans and chimps without common ancestry is ridiculously tiny: about 1 in . There are about atoms in the observable universe, so this is about the same chance of randomly picking the same atom in the universe 14 times in a row! And this is just for one type of ERV in one pair of species - many other types of ERVs are known in many different species (mostly mammals), and they can be used to reconstruct phylogenies in the same way as any other section of the genome.
The full genomes of Homo sapiens, Homo neanderthalensis and Denisovans are available, as well as all extant primates, which helps us reconstruct evolutionary relationships and study the origins of individual genes.
Human-specific mutations affecting brains and intelligence:
ARHGAP11: the basal form, ARHGAP11A, encodes the protein RhoGAP with nuclear localisation, found in all extant non-human mammals. A partial duplication ~5 MYA seen in Homo sapiens, Neanderthals and Denisovans led to them additionally acquiring ARHGAP11B, which shows mitochondrial localisation instead. It promotes basal progenitor cells (BP cells) and increases the neocortex size significantly. Sources: here, here and here.
TKTL1 (transketolase-like 1): modern Homo sapiens has an arginine point mutation (K261R) while Neanderthals, Denisovans, archaic Homo sapiens and other extant primates have the lysine form. The human gene promotes production of basal radial glial cells (bRG cells, neural stem cells), significantly increasing upper-layer cortical neuron production and the size of the brain’s gyri (ridges) in the frontal lobe. Source: here and here (video)
NOTCH2NL: NOTCH genes prolong proliferation of neuronal progenitor cells and expand cortical neurogenesis. Many of these genes are duplicated in Homo sapiens, Neanderthals and Denisovans to varying degrees. Source: here
SRGAP2: Partially duplicated to SRGAP2B 3.4 MYA, followed by two larger duplications at 2.4 MYA and 1 MYA. Source: here and here.
FOXP2: Linked to development of speech and language skills. Source: here.
TBC1D3: another human-specific gene contributing to the frontal cortex. Source: here
Human-specific mutations affecting muscles and biomechanics:
PPARGC1A and MHY7: promotes a higher proportion of slow-twitch muscle fibres rather than fast-twitch.
MYH16: changes the musculature of the jaw. Source: here.
HACSN1: a developmental enhancer leading to limb and digit specialisations. Source: here
Examples from recent human evolution (<300 kYA): source
ADH1B (alcohol dehydrogenase): the SNP Arg48His is more common in East Asians due to rice domestication, and reduces the risk of alcoholism. Another SNP Arg370Cys occurs in Africa which reduces alcohol dependence.
PDE10A: leads to enlarged spleens in the Bajau people. The spleen is a reservoir of oxygenated red blood cells, allowing them to hold their breath for longer (hypoxia tolerance) while freediving. Source: here.
NOS3 (nitric oxide synthase) and others for high-altitude adaptation: in three distinct populations (Tibetans, Andeans and Ethiopians), multiple different mutations in a variety of genes lead to hypoxia tolerance, allowing for their survival at high altitudes.
Sickle cell trait: in regions of Africa where malaria is prominent, carrying one copy of the recessive sickle cell anaemia allele confers resistance to the Plasmodium parasite. While there are associations of sickle cell trait to other medical conditions, many people with the trait remain healthy, making it net beneficial in malaria-endemic regions. Source: here.
White skin colour: in northern Europeans, the SLC24A5 gene has an SNP Ala111Thr that leads to decreases melanin expression and hence lighter skin pigmentation, which is beneficial for vitamin D synthesis in the low-sunlight high-latitude regions.
Interesting neutral mutations in recent human evolution
In some of these cases, no harmful effect is observed despite what is typically thought of as a ’loss of function’ (e.g. gene deletion). These result in variation in the population, and may serve as a substrate for future selection, or simply be neutral. Additionally, what is neutral in a current environment may become beneficial in a future environment.
Blue eyes: leads to blue eyes instead of brown, due to a mutation in OCA2. It has been shown that all blue-eyed people today share a common ancestor living around 6-10 kYA (a perfectly resolved founder event). This is presumed to be a neutral mutation, with the possibility of sexual selection. Source: here
Retention of the median artery into adulthood: normally considered an embryonic structure that regresses around the 8th week of gestation, but it has been found to be retained with increasing frequency in recent times. Source: here.
Palmaris longusmuscle: a small muscle in the forearm that is absent in about 10-15% of the population, with no loss of overall grip strength or hand function, despite a small reduction in pinch strength in the 4th and 5th fingers.
ABCC11: the T/T allele carried by nearly all Koreans and many other East Asians is non-functional, preventing its expression. This leads to dry flaky earwax and significantly reduced body odour, even after sweating and exercise. Source: here
Human lactose tolerance
In lactose intolerant people (~65% of humans worldwide), the ability to digest lactose is lost during adolescence. The lactase enzyme is required to metabolise lactose into glucose and galactose. Without lactase in the small intestine, lactose remains available for the bacteria in the large intestine which ferment it, leading to fatty acid and gas production, causing symptoms of lactose intolerance.
The LCT gene codes for lactase, and has a low-affinity promoter. The MCM6 gene, found upstream on chromosome 2, codes for a subunit of helicase (an unrelated protein used in DNA replication), and an intron of MCM6 contains an enhancer for LCT. Transcription factors that bind to the LCT promoter include HNF1-α, GATA and CDX-2, while Oct1 binds to the LCT enhancer.
In mammals, most metabolic genes except lactase are expressed at low levels early in development as nutrients are provided primarily by breast milk, but during adolescence, as these other genes are promoted, low-affinity promoters like LCT are outcompeted, sharply reducing LCT expression. In lactase persistence, point mutations to the LCT enhancer result in an increased affinity for the LCT promoter, allowing it to remain competitive for transcription throughout life, allowing lifelong lactase synthesis. So, this is not a loss of regulation or function, as routinely claimed by ID advocates. Some mutations also reduce the age-related DNA methylation of the enhancer. Lactase persistence has evolved independently with several SNPs (single nucleotide polymorphisms) under strong positive selection in the past 10,000 years of human history, primarily in societies that had dairy farming and pastoralist agriculture.
The complete genomes of many extant species have been sequenced. Projects in bioinformatics aim to describe and compare the contents and functions of these genomes.
The Human Genome Project (2003) found that there are about 22,300 protein-coding genes in humans (similar to other mammals) and contains a large number of repetitive sequences (low copy repeats).
The Chimpanzee Genome Project (2013) revealed that humans and chimpanzees share 99% sequence similarity in protein-coding regions, and 96% sequence similarity across the full genomes, both higher than humans and any other animal.
Whole-genome comparisons of different pairs of animals. Sex chromosomes were omitted, slightly increasing similarity in the primates. Telomere-to-telomere assemblies were used (non-scaffolded). Analysis performed by Gutsick Gibbon, randomly sampling short sequences (1000 x 300 bp/chromosome) with gapped aligment using BLAST, documented here, here and here.
The genetic similarity correlates precisely with the expected phylogeny of the apes.
The ENCODE Project (2021) found that, in humans, about 75-80% of the genome is transcribed into RNAs. However, most of this transcription is either 1) spurious (too low-level to have meaningful biochemical functionality), or 2) forming transposons (LTRs, ERVs, SINEs, LINEs), which simply ‘jump’ around the genome with no useful function.
Among human non-coding DNA, 5% is for gene-related regulatory sequences (promoters, enhancers…). 20% is for introns in genes, most of which serve no function. The rest is completely non-functional (sometimes called ‘junk DNA’, although historically this term was incorrectly applied to all non-coding DNA).
The proportion of non-coding DNA across life tends to increase with genome size, as more complex organisms require more control over their gene expression and accumulate more ‘junk DNA’.
E. Coli citrate metabolism in the LTEE
The Lenski long-term evolution experiment (LTEE) is a famous study that’s been ongoing since 1988, following 12 initially-identical but separate lines of E. coli bacteria over 80,000+ generations thus far. There are no external selective pressures in the LTEE, so the experiment is about what the bacteria could do on their own. Among the outcomes include de novo gene birth from non-coding DNA and near-complete speciation into two variants with differing colony size, but most importantly, one line evolved the ability to eat citrate (Cit) in aerobic conditions, a trait universally absent in wild-type E. coli. This led to an immediate rise in population density.
Contrary to the claims of top intelligent design (ID) proponents (e.g. Dr Michael Behe), this is not merely due to the loss of regulation of CitT (the relevant gene) expression, which would constitute a loss of function. In fact, the CitT gene was in an operon controlled by an anaerobically-active promoter, and underwent gene duplication, and the duplicate was inserted downstream of an aerobically-active promoter. This is therefore a gain of functionality. However, this duplication conferred a negligible (~1%) fitness advantage in the experiment, and at least two other mutations (in an intron of the dctA gene after, and in the gltA gene before) were shown to be necessary to obtain fully-functional citrate metabolism. This therefore meets the criteria for an “irreducibly complex” trait - and it’s one that emerged under experimental conditions normally adverse to innovation (stasis - promotes stabilising selection)!
In an amusing attempt to refute this, ID advocate Scott Minnich (works at Discovery Institute, a politically-motivated creationist organisation) reproduced the experiment in 2016 with a new colony of wild-type E. coli and found the same Cit+ trait emerge! And this time, much faster than in the LTEE, via the same pathway, featuring CitT and dctA. The abstract of their paper ends rather desperately: “We conclude that the rarity of the LTEE mutant was an artifact of the experimental conditions and not a unique evolutionary event. No new genetic information (novel gene function) evolved.” - despite us having disproven that already.
The human immunodeficiency virus (HIV) is a retrovirus that infects human immune cells expressing the CD4 surface protein, such as helper T-cells and macrophages. Once inside cells, HIV-1’s Nef and Vpu proteins work independently to reduce the expression of CD4, which prevents ‘superinfection’ (two viruses infecting the same cell) and decreases the chance of an immune response. The slow death of helper T-cells leads to a weakened immune system.
If HIV infects a different immune cell, such as a macrophage, the virus’ escape is hampered by the high expression of a cell protein called tetherin. This limits the virulence of HIV in macrophages. However, some strains of HIV-1 have evolved ways to antagonise tetherin using their Vpu and Nef proteins, giving them a second function in addition to retaining their CD4-degrading activity in helper T-cells:
In HIV-1 group M, tetherin antagonism occurred with 4 concurrent point mutations in Vpu.
In HIV-1 group N, weak tetherin antagonism occurred with 4 different point mutations in Vpu, but it led to loss of CD4-degrading activity.
In HIV-1 group O, this occured with just 1 point mutation in Nef (C169S).
So, the same trait evolved two ways (in groups M and O), one of which (group M) was supposedly ‘irreducibly complex’: it was a beneficial trait that required sequential mutations in already functional proteins. Group M now dominates worldwide HIV cases while group O resides mainly in sub-Saharan Africa and group N is very rare.
HIV also simultaneously demonstrates observed ‘macroevolution’ (to the extent that it can be defined for viruses, which are not life). HIV has a zoonotic (animal) origin, as it came from chimpazees’ endemic SIV (simian immunodeficiency virus) strain. SIV is rampant among non-human primates, but each species has evolved to tolerate its own strain. It became human transmissible as HIV in the early 1900s due to mutations that allowed it to bind our CD4 receptors, which differ slightly between humans and other apes, and is far more virulent in humans.
The flagellum is the flagship allegedly irreducibly complex structure, cited ad nauseum by ID advocates. Since it is the one that has been talked about the most, it has also attracted a lot of attention from real scientists who have promptly disarmed it. In one experiment, the master regulator for flagellum synthesis (FleQ) was knocked out, leaving all of the other flagellar genes intact. But under selective pressure for motility, it was found that another transcription factor that regulates nitrogen uptake from the same protein family (NtrC) was able to ‘substitute’ for FleQ, essentially by becoming hyperexpressed, so there’s so much NtrC in the cell that some of it binds to the FleQ-regulated genes and activates them too.
This is an incredibly reliable two-step process, after 24-48 hours we get a mutation in one of the genes upstream of NtrC that leads to higher expression and activation, then within 96 hours of the start we see a second mutation, normally within NtrC itself, that helps fine-tune the expression.
A striking visualisation of antibiotic resistance is a video of an experiment done by Harvard Medical School, where they created four zones with antibiotic concentrations increasing by a factor of 10 each time. The bacteria would spread outwards up to the boundaries. Most would go extinct, but a few mutants from some populations would survive and move into the next zone. When all zones had been reached, the colonies traced out the pattern of their phylogenetic tree.
The end of the experiment, showing how the colonies connect together to show their relatedness. Video: here. Paper: here.
Nylon-eating bacteria
Nylon is a synthetic insoluble semi-crystalline polymer of 6-aminohexanoic acid, invented in 1935 and used in a variety of consumer products. In 1965, Japanese researcher Takashi Fukumura found that 11 bacterial strains in the wastewater of the Toyo Rayon 6-polyamide factory (today Toray) in Nagoya were able to grow on ε-caprolactam, the cyclic amide precursor to nylon 6. One more species, Corynebacterium aurantiacum, also was found to be able to metabolise linear and cyclic 6-aminohexanoate oligomers. Another group of researchers 4 years later found in the waste water of the same factory a strain from the phylum Pseudomonas also able to metabolise 6-aminohexanoate oligomers.
One of the ’nylonase’ enzymes involved was produced by a frame-shift mutation that completely scrambled existing genetic code data, with the new gene still having a novel, albeit weak, catalytic capacity.
In 1974 then we have Hirosuke Okada with his research on Flavobacterium. He found that Flavobacterium was able to metabolize ε-caprolactam, 6-aminohexanoate and cyclic aminohexanoate-dimer as well as the linear di- bis hexamers of 6-aminohexanoate.
Living in cold environments poses a serious challenge to poikilothermic (not thermally regulated) life, as the water in cells may freeze, halting all metabolic processes and killing the organism. Antifreeze proteins have evolved as a solution: when an ice crystal nucleates, the protein’s ice-binding domain attaches to the surface of the crystal, arresting its growth. The ice-binding domain is a regular arrangement of polar hydrophobic amino acids with a separation very close to the lattice constant of ice, creating an ideal fit for hydrogen bonding and Van der Waals’ forces at the interface.
The β-helix motif found in these proteins is common enough in natural secondary structures that it has convergently evolved many times: ice restructing proteins making use of it are known in the winter flounder (Pseudopleuronectes americanus), the spruce budworm (Choristoneura fumiferana), the mealworm beetle (Tenebrio molitor), the snow flea (Hypogastrura harveyi), some sea ice-living diatoms (Fragilariopsis cylindrus) and even some plants like winter rye (Secale cereale). The spruce budworm antifreeze can inhibit freezing as low as -30 °C, below even the supercooling limit of liquid water, but some others only work down to about -5 °C, allowing only marginal additional exploratory capacity in cold environments. These antifreeze proteins have different amino acid composition but all perform the same function. Some protein sequences resemble C-type lectins or sialic acid synthase.
The cytochrome c oxidase (COX) enzyme is a ubiquitous component of the electron transport chain for respiration, found in bacteria, archaea and the mitochrondria of eukaryotes. Since COX is universally conserved, there is a strong correlation between the number of amino acid substitutions in the COX enzyme and the time since the divergence of the species, an example of the ‘molecular clock’.
NOTE: this section only shows non-human fossils. For fossils of primates and human evolution, see the Bioanthropology section.
Fossils are remnants of long-dead life and provide a tangible record of the distant past. We can compare fossilised structures and estimate fossil age using radiometric dating of nearby ash layers to help piece together evolutionary lineages, which can be cross-checked against more precise genetic studies. Taken together, they serve as signposts of how lineages changed over time.
Horse evolution. Tree diagram source: here (Bruce J. MacFadden, Fossil Horses - Evidence for Evolution. Science 307, 1728-1730 (2005). doi:10.1126/science.1105458).
Two specimens of Archaeopteryx, a classic ‘dinosaur to bird’ transitional fossil. Left: Berlin specimen (1874). Right: London specimen (1861), known by Darwin.
Ambulocetus can be described as a ‘walking whale’, as some land mammals (Artiodactyla) became aquatic within the past 60 MYA. Cetacean evolution is especially well recorded.
A new species of transitional ‘dinosaur to bird’ fossil, Jinguofortis perplexus, with anatomy labelled in situ and to scale. It is from clade Pygostylia, dated to the early Cretaceous, 127 million years ago. Source: here.
Animals did not appear in the Cambrian explosion. Phyla belonging to the animal kingdom are known from the Ediacaran period, before the Cambrian. The Cambrian explosion was a relatively rapid radiation of these phyla. The Ediacaran biota is known from lagerstätten (exceptionally well-preseved soft tissue) and ichnofossils (trace fossils), and are among the earliest known macroscopic life.
Infamous paleontological frauds commonly cited by critics of the theory of evolution to suggest that all or most of the fossil record cannot be trusted. All cases were quickly exposed by the scientific community, not its critics, and some were accepted due to systemic ideological dogmas of the time, such as ethnonationalism.
GEOLOGY
Stratigraphy
The idea that rocks are deposited in layers (strata: older below, younger above) has been known since Steno in the 17th century (the ’law of superposition’). It is therefore usually the case that fossils found in deeper layers are older than those found above, serving as a rough guide to their age (a qualitative, relative dating method). However, other geologic processes like erosion, folding and faulting can disrupt this order occasionally, so more reliable references are needed.
Fossil species that are used to distinguish one layer from another are called index fossils, which occur for a limited interval of time. Usually index fossils are fossil organisms that are common, easily identified, and found across a large area. When a fossil is found, the nearest volcanic ash layers above and below it can be radiometrically dated, allowing us to bound the age of the fossil (tephrochronology).
In 1997, a study surveyed the existing fossil record for 384 different clades all across the animal kingdom, and cross-referenced them with their claimed evolutionary relationships. Using three different statistical metrics (Spearman’s rank coefficient, and two others dedicated to quantifying the presence of ‘ghost lineages’), it was found that all three falsify the null hypothesis (if the fossil record does not reflect the major patterns of evolution, there would be no evidence for congruence between the two sets of data in our random sample of cladograms).
Trees grow at a rate of approximately 1 ring in their trunk per year. By sampling the rings of multiple trees in a given region, and matching the thicknesses of each ring to the others, we can estimate the age of trees. This also allows for identification of missing or additional rings in a given tree, which are indicative of ecological disturbances (e.g. wildfires, insect outbreaks…).
Dendrochronology can be used to date wooden artifacts in archaeology from about 10000 years ago to present, since the last ring can be matched to the year it was cut down and used. The carbonised wood in charcoal can be both carbon dated and dendrochronologically dated, allowing us to cross-reference the two methods. We can also infer the climate conditions over its lifetime (paleoclimatology), inferring the past temperature, precipitation and cloud cover from the ring data. Climate data can also be cross-referenced against other sources (e.g. ice cores, sediment cores, historical records, meteorological data…). Examining the trace mineral content of the rings (e.g. carbon-12/13 ratio) provides further data on atmospheric conditions (stable isotope dendrochronology).
Varve chronology
Sedimentary layers in lakes and oceans.
Paleomagnetic dating
As per the Vine-Matthews-Morley hypothesis, the rates of tectonic plate spreading at mid-oceanic ridges can be measured by recording the magnetisation of the sediment, which tracks the orientation of the geomagnetic field at its formation. This allows dating by measuring continental drift, and can calibrate magnetic analyses elsewhere by correlating with geomagnetic field reversals.
Oklo natural nuclear reactor
Radiometric Dating with Uranium series
Radiometric Dating with Argon
Many volcanic rocks naturally contain the isotope potassium-40 (K), which decays ~10% of the time to the stable isotope argon-40 (Ar) via electron capture followed by gamma decay with a half-life of 1.25 billion years. When these rocks first form from molten lava, any argon is expelled to the atmopshere on solidification, and the K begins to decay from its initial concentration at a predictable rate, forming trapped Ar in the rock.
Mount Vesuvius: Pliny the Younger was a Roman eyewitness to the Mount Vesuvius eruption, which he recorded as occuring in the early afternoon of 24th August, 76 AD, destroying Pompeii, Herculaneum and other Roman cities. In 1997, a piece of volcanic tephra from the region was subject to Ar/Ar (argon) dating, yielding an age of 1925 94 years: only 7 years older than the actual age of 1918 years. A second sample, sanidine phenocrysts in pumice, was taken in 2004 using the same method, which yielded an age of 1925 66 years, which is the exact calendar year. Additionally, in 2003, the same sample was dating using the U-Th/He dating method, giving an age of 1866 ± 243 years, which is very precise considering this method is usually used for dating much older rocks.
Mount St Helens dating, using isochron dating
Radiocarbon dating of recent fossils and artifacts
The radioisotope carbon-14 is continuously formed in the upper atmosphere by cosmic rays, which can be absorbed by plant matter as CO (at a slightly lower C abundance due to slower gas diffusion of CO) and incorporated into the plant’s tissues (e.g. glucose, cellulose). When the plant dies, carbon exchange stops, and the remaining carbon-14 decays with a half-life of 5,700 years. The ratio of C to C in a sample of organic matter can therefore be used to estimate the time of death of the organism, up to about 50,000 years ago due to resolution limits.
The Teide volcano in located in Tenerife (the Canary Islands). Stratigraphy found an age younger than 2000 years, while paleomagnetic dating found an age of 500 - 900 years. Historical records give an age of at least 500 years (European settlement began in 1494): Christopher Columbus reported seeing “a great fire in the Orotava Valley” as he sailed past Tenerife on his first voyage to the New World in 1492, interpreted to have been the Teide eruption. C dating gave a precise range of eruption between 1470 - 1490 AD (510 - 530 years ago). K/Ar dating gave a range of 800 300 years ago. These ranges and accounts all corroborate with each other.
Han van Meegeren was a Dutch painter during World War 2 and orchestrated a sophisticated art forgery. While his art skills were considered mediocre, he was able to create several convincing fakes of 17th century Vermeer paintings, some of which were sold to the Nazis in exchange for Nazi loot, including his piece The Supper at Emmaus. Van Meegeren confessed in 1946 and was found guilty with thorough chemical and X-ray analyses of the paintings. In 1967, a study using the lesser-known Pb radiometric dating technique was used to analyse the white lead (lead oxide) used in the paint. The smelting process to obtain lead removes much of the radium, which decays to Pb, so the method studied the ratio of Po (as a surrogate for Pb) to Ra. It was shown that the paint could not have been made more than a few decades prior (the 1930s), rather than the 300 years ago if it were genuine. Another study using gas chromatography in 1977 confirmed this finding.
Cheddar man is a well-preserved skeleton of a Mesolithic (middle stone age) Homo sapiens found in the UK. DNA analysis found that he was likely a hunter-gatherer with bright blue-green eyes, slightly curly hair and black skin, with no lactase persistence. He probably arrived there via Doggerland, a low-lying region of Europe spanning between modern-day Britain, France and Germany, which sank under rising sea levels around 10-7 kYA, as the last glacial period ended. His Y-chromosomal haplogroup was I2a2, and 10% of British ancestry can be linked to Cheddar Man. Cheddar man was radiocarbon dated on two occasions to 8540-7990 BC and 8470-8230 BC, i.e. about 10,000 years ago.
Map of Western Europe around the time of Cheddar Man, with Y-chromosomal haplogroups labelled. The original skull and his facial reconstruction based on genetic data is shown.
Ötzi the Iceman is a copper-age naturally frozen mummy radiocarbon dated to about 3,200 BC, found in the Alps.
The Dead Sea Scrolls were found in 1947 in caves near the Dead Sea, and contained the earliest known records of the books of the Bible, written in Hebrew, Greek and Aramaic. Radiocarbon dating of the different scrolls gave dates from between 400 BC and 400 AD, which were mostly within 100 years of the dates estimated by analysis of the writing style (paleography).
The Shroud of Turin is a linen cloth with a distinctive imprinting that resembles the traditional face of Jesus Christ, which is said to have appeared on the cloth shortly after his crucifixion. However, spectroscopic analysis in 1978 suggested the imprint was painted on using a red ochre and vermilion pigment commonly used in medieval art. Additionally, in 1988, three independent radiocarbon dating laboratories all dated the cloth to between 1260-1390 AD, matching its first known appearance in church history in France.
Thermoluminescence dating
Electron spin resonance dating
Amino acid racemisation dating
Oxygen isotope ratio cycle (δ18O/δ16O)
Ice core paleoclimate data
BIOGEOGRAPHY
Ecological succession
This is fun one to bring up in the context of ‘creationism vs evolution’, as it refers to the macroscopic and very well-accepted process of ‘primary succession’. This describes the sequence that follows formation of a new region of land (well-studied in physical geography) as life moves in for the first time. The resulting ecosystems that form (in the ‘climax community’) are highly interdependent, such that removing one would collapse the whole food web, which is a defining feature of irreducible complexity. Yet, we watch it happen all the time - and this is something that must have happened regardless of whether creation or evolution is true!
Main structures of central nervous system (including optic tectum) and inner ear (vestibular system and cochlear duct) across birds (extinct and extant) and closely related non-avian dinosaurs. The highlighted specimen, MPM-334-1, is a recently-discovered basal bird cranium.
Comparative anatomy of the endosseous labyrinth and cochlear duct across selected birds (extinct and extant) and closely related non-avian dinosaurs.
Aphids have an endosymbiotic relationship with Buchnera Aphidicola bacteria. The bacteria break down nutrients that aphids need to survive but can only live within specialised ‘bacteriocyte’ structures in Aphids. But some aphids later in their evolution dropped B. Aphidicola and now have a yeast-like symbiont (YLS) that performs similar functions. These aphids still have the bacteriocytes but the YLS is located both inside and outside of them. So, these aphids have the same specialised structures as their cousins to host the bacteria but their symbiote is a fungus that doesn’t need those structures.
An example of higher-degree endosymbiosis is the Darwin termite (Mastotermes darwiniensis). The termite relies on a protist Mixotricha paradoxa to process the wood. The protists further rely on other bacteria living on its surface (each look like a thin hair that wiggles; symbiotic signalling in exchange for food). Within the protist, there is another endosymbiont spherical bacterium that digests cellulose in wood, releasing acetate for the protist. These bacteria have removed the need for M. paradoxa to have mitochondria, which have degraded into simpler organelles (hydrogenosomes and mitosomes).
Sources: here and Chapter 38 of The Ancestor’s Tale by Richard Dawkins.
Evolution of eyesight
As one of the most impressively complex sensory organs, the eye’s evolution has been especially well studied. Eyes in some form have evolved independently over 50 times, with vastly different structures and functions suited to each lineage.
Different types of eyes
Retinal phototrophy: the retinal molecule is used by Haloarchaea in bacteriorhodopsin proteins to capture light energy via chemiosmosis. This is the key light sensor molecule needed for eyes, without any of the complex structures that evolved later.
Eyespots: a simple light-sensitive organelle found in euglenids (unicellular photosynthetic protists), containing photoreceptor proteins. Without any nerve cells, the signal cascade on light detection results in flagellar movement, enabling phototaxis.
Pit eye: one of the types of eyes found in invertebrates. A pit with photosensitive cells inside allows for some vague directionality of light detection.
Pinhole camera eye: Found in Nautilus. More directional sensitivity, by nearly closing the pit, allowing light to enter only through a small aperture.
Lens formation: Evolved 8 times. An inhomogeneous lens formed of crystalline proteins continuously bends light for focussing of light onto the photosensitive layer, giving a clearer image. It also corrects for spherical aberration.
Multiple lenses: Found in *Pontella. A triplet lens (two in females) with the outer elements outside the eye. The first surface is parabolic and corrects for spherical aberration of the other 5 surfaces. The retina has only 6 receptors.
Telescoping lens: Found in Copilia. Two lenses work like a telescope with a point-like retina and only a 3° field of view. There is a horizontal scanning eye movements at <5 Hz, while the bottom apparatus (eyepiece + retina) moves in image plane of the ‘objective’. The prey (plankton) moves vertically, giving a second dimension of scanning.
Corneal refraction in land animals: to correct for the air-water interface and spherical aberration. In humans, 2/3 of the optical power is in the cornea rather than the lens.
Reflective concave mirror (argentea) in the scallop.
Tapetum lucidum: a reflective layer behind the retina in many nocturnal and crepuscular mammals (cats, dogs, deer), aiding in night vision. A variety of structures and molecules are used.
Compound eyes in insects and crustaceans.
Nanostructured cornea anti-reflection surfaces for quarter-wave matching in moths.
Binocular/stereoscopic vision in vertebrates.
Trichromatic vision in primates.
Physical constraints in the evolution of the eye
Solar radiation is the main source of high-exergy free energy in the open biosphere, and its exploitation is therefore strongly selected for, such as in photosynthesis, photocatalysis and eyesight. At the molecular level, interaction with light requires a molecule that can absorb photons at the appropriate energy (wavelength), typically found in highly conjugated organic molecules. For wavelengths in the visible spectrum (most intense at Earth’s surface), these molecules include e.g. chlorophyll, retinal, 7-dehydrocholesterol, bacteriorhodopsin and phototropin.
With eyesight, there is the additional task of extracting information from the radiation, providing both thermodynamic and information-theoretic constraints on the evolution of the eye.
Radiation entropy maximisation: solar radiation is the main source of high-exergy free energy in the open biosphere, and its exploitation is therefore strongly selected for (e.g. photosynthesis). Solar radiation contains both energy and entropy, with slightly different Wien peaks for the two. The eye evolved to maximise the information extracted from the radiation, and the eye’s spectral sensitivity is optimised to the maximum entropy peak instead of the energy peak. Sources: here and here.
Utility-based coding: updates the opponent process theory explaining how S, M, L cone signals are encoded in the optic nerve and visual pathway. The new theory describes the optimal encoding of spectral information given competing selective pressure to extract high-acuity spatial information. Source: here
Principal components of reflectance spectra: the encoding of S, M and L into three channels can be explained by the observation that the three channels are the first three PCs in a PCA of the reflectance spectra of natural materials and scenes, encoding 98% of the total variance. This allows minimal information loss with the fewest number of channels, a consequence of the fact that the eye evolved to extract information from the environment. Source: here
Evolution of photosynthesis
Photosynthesis is another form of phototrophy which uses chlorophyll as the light-absorbing molecule. Like the eye, it is extremely complex today, but started out from simpler systems based on the same fundamental principles.
Photosystems in unicellular life
The two main parts of photosynthesis are Photosystems I and II:
PSI: an electron transport chain using ferredoxin to generate NADPH.
PSII: a water-splitting complex generating protons, which can be used for chemiosmosis in ATP synthase. Likely to have evolved first due to its simpler structure and immediate utility in generating ATP.
The ATP produced can then be used in a metabolic cycle to fix carbon dioxide into useful organic compounds.
The bacterial kingdom Bacillati contains a range of photosynthetic bacteria:
Phylum Cyanobacteria: contains both Photosystem I and II, using the Calvin cycle.
Phylum Bacillota: only uses Photosystem I, without any associated synthetic cycle.
Phylum Chloroflexota, order Chloroflexales, only uses Photosystem II, using the 3-hydroxypropionate bi-cycle.
The bacterial kingdom Pseudomonadati also contains a similar variety:
Phylum Chlorobiota (green sulfur bacteria): contains Photosystem I, using the reverse Krebs cycle.
Phylum Pseudomonadota (purple bacteria): contains Photosystem II, using the Calvin cycle.
Cyanobacteria became incorporated into eukaryotes as chloroplasts via endosymbiosis, allowing plants and algae to make use of photosynthesis, with both photosystems I and II.
Thermodynamics of photosynthesis
The biosphere, and life itself (such as a cell) is an open system, and one in a highly non-equilibrium state, using free energy to maximally generate entropy in the surroundings while maintaining a low-entropy internal state.
Intro-level explainer on the thermodynamics of life: here.
The concept of exergy is very helpful in understanding the thermodynamics of photosynthesis. The exergy content of an energy source is the maximum useful work that can be extracted reversibly from it. Although a plant radiates away as much energy as it receives (otherwise it would heat up!), the exergy of the incoming solar radiation is much higher than that of the outgoing thermal radiation, due to the Sun’s high blackbody spectrum temperature giving it a low entropy. The net exergy flux into the plant is what allows its internal processes to do useful work, which is to perform the endergonic, entropy-reducing reactions in photosynthesis, building up complex organic molecules from simple inorganic ones.
The transpiration stream of the plant provides the high-entropy output in the form of water vapour, increasing the entropy of the surroundings significantly, allowing plants to harvest the external energy of sunlight to create internal order. All life indirectly enjoys this benefit, since plants act as producers, providing energy (via metabolism) for organisms higher up the food chains.
Basic thermodynamic analysis of photosynthesis: here
How is it that every cell in your body has the same DNA, yet different body parts can have completely different functions? How does your body know where to put everything? Development from an embryo is a tightly-regulated process, with the goal of controlling what genes get expressed where and when. There is a close relationship between evolutionary diversity and developmental diversity, and so we can study one to learn about the other.
Humans possess tails in early development because they descend from a common ancestor which had tails. The tail serves no purpose and normally regresses by week 8 of a pregnancy. It is therefore a vestigial trait.
POPULATION GENETICS
Phylogenetic reconstruction
Comparative genomics studies the similarities and differences between the genomes of different species. This can be used to reconstruct phylogenetic trees, which show how closely related different species are. The more similar two genomes are, the more closely related the two species are likely to be.
A test of the validity of this reconstruction can be done using a known phylogeny. In 1992, a study was done on an artificially mutated strain of the virus bacteriophage T7, whose genome was sequenced repeatedly as it reproduced in bacteria. The experiment was stopped after 9 different viral strains had emerged, and only their genomes were used in 5 different phylogenetic reconstruction algorithms. All 5 algorithms produced the same, correct known tree, out of the 135,135 possible tree structures, with slight variation in the time to branching, showing that the algorithms are valid and can be used to reconstruct phylogenies from extant genome data more generally.
Analyses of strain-level bacterial diversity within hominid gut microbiomes revealed that clades of Bacteroidaceae and Bifidobacteriaceae have been maintained exclusively within host lineages across hundreds of thousands of host generations. Divergence times of these cospeciating gut bacteria are congruent with those of hominids, indicating that nuclear, mitochondrial, and gut bacterial genomes diversified in concert during hominid evolution. This study identifies human gut bacteria descended from ancient symbionts that speciated simultaneously with humans and the African apes.
Human evolution is an especially well-studied topic. We are primates and great apes, and there is an abundance of fossils to tell us how our lineages developed over time. Genus Homo arose from the prior genus Australopithecus about 2.5 million years ago, and following a period filled with numerous species of Homo, our species Homo sapiens emerged about 300,000 years ago. Our close relationship with chimpanzees, gorillas and bonobos make them great for studying behaviour, too.
Cranial fossils of some hominin skulls close to the human lineage, showing the steady variation in morphology from past to present.
Preserved footprints found in an ash bed in Tanzania, indicating bipedality in genus Australopithecus.
A nearly-complete specimen of the extinct hominin Australopithecus africanus, with many of its joints still in full articulation.
A specimen of Australopithecus afarensis, found to display numerous traits associated with bipedalism, in particular the pelvis and the angled femur.
A list of many of the known extinct primate species, some of which are close to the human lineage.
Vitiligo is an autoimmune disease that removes pigmentation from skin. A gorilla’s fingers look strikingly similar to a human’s when the black pigment is removed.
1 in 6 chimpanzees have white sclera like humans do. It’s thought that the trait evolved to help us discern where our tribe-mates are looking, aiding in communication and coordination.
Bipedalism in Hominins
Morphology and biomechanics are linked by causal morphogenesis (Wolff’s law). The suite of characteristics indicative of bipedalism, which originated in late Miocene hominids, is:
Anterior foramen magnum*: skull rests on the top of the spine.
Arched foot: three arches (medial, lateral, transverse) in the feet act as shock absorbers in walking.
* strongest indicators, since these biomechanically prevent quadrupedalism.
Taxonomy of Australopithecus and Homo habilis
Our genus, Homo, is thought to have emerged from one of the coexisting species of the genus Australopithecus. Based on the available fossil record, this transition appears to be quite subtle: the brain case sizes overlap, both can use stone tools, both have similar dentition (teeth), and biomechanics studies indicate both were mostly bipedal (though A. afarensis only had two arches plus a less-curved third arch in the foot, suggesting habitual bipedality, while H. habilis had three fully formed arches, suggesting full obligate bipedality). This high degree of similarity has even led to some paleoanthropologists to suggest that H. habilis should in fact be Australopithecus habilis. Although this has not been formally adopted, the challenge of a clear-cut taxonomic classification demonstrates the highly transitional nature of these species.
Another example of this comes (ironically) from creationists, who are ideologically required to divide the hominin fossil record into two allegedly mutually exclusive groups: the ‘ape’ kind and the ‘human’ kind. However, six famous hominin cranium fossils of H. habilis and early H. erectus (KNM-ER 1813, Java man, Peking man, KNM-ER 1470, KNM-ER 3733 and Turkana Boy) were all classified by seven different creationists completely differently, precisely as expected of a ’transitional’ species without a true clear divide.
Autapomorphies of Homo neanderthalensis include retreating cheekbones (zygomatics), the occipital bun, large nasal aperture, enhanced prognathism, enhanced brow ridges (supraorbital torus), platycephalic skull, angled squamosal suture, retromolar gap and an elliptical foramen magnum.
Significant hominin fossils, ichnofossils and artifacts
Ardi: partial skeleton of Ardipithecus ramidus, 4.4 MYA, discovered in the Afar rift valley (Ethiopia).
Little Foot (StW 573): near-complete specimen, Australopithecus africanus, 3.67 MYA. Found in the ‘Cradle of Humankind’ in South Africa, an area home to many other early hominins.
Burtele foot: partial foot, 3.4 MYA, with a divergent hallux. Tentatively assigned to Ardipithecus ramidus, implying contemporaneity with Homo, Australopithecus, Paranthropus and Kenyanthropus.
Taung Child: skull of a 3-year-old Australopithecus africanus, 3.3 MYA.
Dikika Child (Selam): skull of a 3-year-old Australopithecus afarensis, 3.3 MYA.
Lucy (AL 288-1): partial skeleton of Australopithecus afarensis, 3.2 MYA.
Ledi Geraru mandible (LD 350-1): jaw assigned to basal genus Homo, 2.78 MYA.
Mrs Ples: complete skull of Australopthecus africanus, 2.35 MYA.
Dmanisi skulls: a set of skulls of Homo erectus, 1.81 MYA.
Turkana Boy: near-complete young Homo erectus, 1.55 MYA.
Peking Man: a Chinese specimen of Homo erectus, 500 kYA.
East Asian archaics: a collection of late Homo erectus, archaic Homo sapiens and Denisovan specimens from China and surroundings, e.g. Dali man, Xiahe mandible, Harbin skull/Dragon man…
Denny: teeth from a 13-year old female containing DNA, Neanderthal-Denisovan hybrid, 90 kYA.
Laetoli footprints: two trackways, attributed to Australopithecus, 3.6 MYA. The indentations suggest a fully in-line hallux.
Lomekwi stone tools: oldest stone tools, attributed to Australopithecus or Kenyanthropus.
APPLICATIONS OF EVOLUTION
This section focusses on practical applications of the evolutionary theory, primarily in engineering, medicine and agriculture. It does not include applications of explaining aspects of biology itself, which are numerous. (“Nothing in biology makes sense except in the light of evolution” - Theodosius Dobzhansky). The utility of evolution serves as a ‘proof of concept’ that the theory aligns with reality.
Protein Engineering
We can develop entirely new enzymes using ‘directed evolution’ of proteins with a variety of uses. By artificially cloning the gene for an enzyme and introducing mutations, screening for activity and stability, we ‘artificially select’ more optimal enzymes for our use case. This is a well-established lab procedure that puts evolution into practice.
Glucose biosensors: when type 1 diabetics need to monitor their blood glucose levels to time insulin injections, they use a glucose biosensor, which typically works by measuring the rate of reaction of a glucose-binding enzyme. The wild-type enzymes found naturally are usually not stable enough for reliable operation in a biosensor (due to activity reduction on immobilisation, stability to temperature variations and cofactor dependence), so new enzymes are needed, which come from directed evolution. These are the most common type of commericalised glucose biosensors in use today. Although enzyme-free biosensors have been researched, they have not yet been commercialised due to lacking the chemoselectivity that enzymes excel at, so evolution-backed biosensors remain the state-of-the-art in type 1 diabetes management as of 2025.
Lactate biosensors: another type of electrochemical biosensor making use of directed evolution. Lactate biosensors are used by high-performance athletes to monitor lactate levels in the blood and sweat, indicative of fatigue, and require directed evolution for thermostability. They are also used in hospitals at triage to test for septic shock.
Artificial metalloenzymes: new enzymes can be designed to catalyse completely new chemical reactions unseen in biology. In one study (figure below), a new enzyme was evolved containing an abiotic di-rhodium cofactor, which catalyses the cyclopropanation of styrene and diazo compounds with enantioselectivity. Industrial applications include the synthesis of pharmaceutical (e.g. sulfonamide antibiotics), improvement of enzymatic biofuel cells, carbonic anhydrase-based carbon capture technology and bioremediation by degradation of toxins (PAHs, PCBs, organophosphates…).
The procedure for the combinatorial mutagenesis of enzymes. Each cycle is one ‘generation’. The error-prone PCR provides the mutations, while the activity and enantioselectivity assays provide the artificial selection. The metal cofactor is inserted via reaction with a specially encoded amino acid in the enzyme. Source: here.
Bio-detergents: subtilisin is a protease used in laundry detergents, which was engineered to be more stable at high temperatures and alkaline pH. Its cofactor dependence on calcium ions was also removed, as calcium would be chelated by EDTA in the wash.
High-fructose corn syrup: glucose isomerase is an enzyme used in the production of high-fructose corn syrup (HFCS), which is sweeter than regular corn syrup. It was engineered to be more stable at high temperatures and to have a higher affinity for glucose. This allows for more efficient conversion of glucose to fructose, resulting in a sweeter product.
Genetically engineered bacteria: E. coli can be engineered to produce new useful products, such as carotenoids. They can also produce bioplastics, which are biodegradable, although issues of scale-up and durability remain.
Animal Model Selection
The animals used in lab studies for medicines are chosen based on evolutionary relatedness. They use rats for most in vivo studies since they’re one of the closest non-primate animal orders to us (order Rodentia). Rabbits are in another very close order (order Lagomorpha), and we are all mammals. For neurological studies, primates are sometimes used, as their brain structure is closer to ours, although animal welfare laws and ethics regulations mean these studies are only done for behavioural studies and occasionally for neurosurgery (e.g. neural prosthetic implants).
Without evolution, we’d be stabbing in the dark as to whether a particular animal would serve as a good model for our in vivo testing, which would mean significantly fewer successful drugs and therapies passing the animal testing phase.
Importantly, medicine is complicated, and evolution is not the only factor in determining suitability. In some diseases where there are relevant genotypic differences, mouse models are not effective representatives of humans, most notably cancer. Most cancer treatments animal-tested in mice fail in humans (more so than other drugs). In mice, telomeres are longer than in humans due to stronger telomerase activity, which reduces the contribution of cellular senescence to age-related cancers, making them less relevant for human studies. Telomerase repression is an evolutionary control measure for high body-mass mammals that compensates between longevity and cancer risk.
Another example is the fact that heart xenotransplants predominantly use pigs as donors rather than chimpanzees, despite the latter’s closed relatedness. Chimps carry viruses that are deadly to humans and are prohibitively impractical and unethical to keep in captivity for organ donation. Chimp hearts are also too small for human patients, as we are endurance-adapted with large cardiac output while chimps are not. Chimps in general are rarely useful models for humans despite their genetic similarity, due to a variety of specific differences. Pigs happen to have the right sized hearts and are far easier to domesticate. Genetic engineering of the pigs is required to remove the proteins that would trigger an immune rejection in humans: all xenotransplants without such genetic modification have failed.
Protein Folding Prediction
Protein structure prediction is famously hard task, and has only recently become feasible with powerful machine learning models like AlphaFold, trained on structures painstakingly obtained manually via cryo-electron microscopy and X-ray crystallography. AlphaFold uses a transformer-based ML architecture (the same structure as used in LLMs like ChatGPT) called the EvoFormer, which combines protein sequence data with data on sequence identity conservation across evolutionary lineages, which essentially provides information on which amino acid residues are crucial to the 3D structure and which are less constrained.
It’s hard to understate how revolutionary solving protein folding has been: it’s already been used to develop lots of new medicines by predicting protein-substrate interactions, and the newest model AlphaFold 3 can handle protein-DNA interactions too. AlphaFold 3 has recently been used to predict the consequences of how a virus will mutate during a pandemic which could help develop more robust vaccines. That’s using evolution to fight evolution!
Universal Flu Vaccine
Influenza has been around since antiquity, but the most famous outbreak was the Spanish Flu pandemic (1918-1919, 1/3 world infected, 50 million deaths). Influenza A has 8 segments of ssRNA in their capsid, with two glycoproteins (envelope spike proteins): neuraminidase (N) and hemagglutinin (H). There are several isotypes of the H and N proteins, denoted H1, H2…, N1, N2… . Only H1-3 and N1-2 are human transmissible (at present!). The swine flu epidemic of 2009 was due to reassortment (genetic mixing) of the H1N1 bird flu, the H1N1 swine flu and the H3N2 seasonal human flu. The virus’ fitness is a balance of H and N expression, as too much of either will limit the lifetime of the virus. Anti-influenza drugs (e.g. Tamiflu) inhibit N activity, but must be taken early after infection.
Due to antigenic drift (spike protein point mutations allowing evasion of antibodies), flu is a seasonal virus. The DNA polymerase within influenza A is error-prone (lacks proofreading) and therefore mutates rapidly, maintaining its virulence by the high viral loads. Influenza usually has a low mortality rate, and most of its deaths are due to secondary bacterial infections (e.g. pneumonia) which occur when the lung tissue is compromised. If two different influenza virus subtypes infect the same cell at the same time, their individual ssRNA segments may form a ‘hybrid’ virus during capsid assembly (antigenic shift), which is how influenza can jump between species.
The PB1 protein in influenza A steals the protective 5’ caps from host nuclear mRNAs and uses them on its own RNA, both increasing its virulence and dampening the host’s self-protein synthesis and immune response. An alternate reading frame intrinsically disordered protein, PB1-F2, also contributes to virulence by inducing apoptosis in infected cells. A single point mutation, Asn66Ser, in PB1-F2 is associated with signfiicantly stronger pathogenicity and a more aggressive influenza infection.
A bird flu (H5N1) epidemic occurred in 1997. Since 2020, there has been an ongoing pandemic of H5N1, due to reassortments from a variety of flu strains increasing virulence among domestic poultry, and the virus is primarily responsible for the US chicken egg shortage in 2025. There have been a small number of human cases, though human-human transmission has not been observed as of 2025. The neuraminidase spike protein has been mutating over time, with longer alpha-helix stalk structure associated with increased transmissibility across mammalian species, and the rapid mutation rate of the H5N1 spike proteins make vaccine development challenging. A potential universal vaccine approach instead targets the nucleoproteins inside the capsid e.g. PB1, which are less tolerant to mutations, and can be recognised in infected cell MHC class I molecules by T cells. Universal bird flu vaccines are currently under development in case of a future human-transmissible bird flu pandemic.
Explanation of bird flu and the universal vaccine: here (video)
Development programme for universal flu vaccines (April 2025): here
Cancer Research
The evolutionary concept of virulence extends to cancer cells, with additional complications due to multi-level selection: cancer breaks the altruistic nature of multicellularity, placing them in direct competition with the host. Cancer’s trajectory is only explainable via evolutionary dynamics. It behaves like a pathogen where intra-host competition is maximised and inter-host competition is absent.
In her 2020 book Rebel Cell: Cancer, Evolution and the Science of Life, biomedical scientist Dr Kat Arney (PhD from Cambridge) writes about how not recognising the evolutionary processes involved in cancer has held the field of cancer therapeutics back for decades, and we’re only now just catching up, breaking the stagnation in progress.
Chemotherapy or radiotherapy are ‘brute force’ approaches that can kill as many healthy cells as cancer cells. Additionally, the ‘gene targeted’ approaches developed in the 2000s are ineffective as the cancer cells often mutate to resist the drug, and the cancer returns at a later date, so that further treatments of the same drug simply select for the resistant cancer cells. An adaptive therapy strategy is needed, modulating doses to force the cancer cells to compete with each other, rather than growing unimpeded. The tumor microenvironment behaves as an ecological niche which can be manipulated. This being incorporated into modern cancer therapies. Some of the treatments that can be/have been augmented with evolutionary principles to yield improved prognoses include:
Mitotic checkpoint gene transfection, via mRNA vaccines: intended to control the cell cycle of cancer cells.
Oncolytic virus therapy: immunotherapy, triggering an immune response. Effective in immune-rich tissues.
Dendritic cell vaccines: another immunotherapy approach. Personalised medicine is required to moderate the immune response to avoid autoimmunity.
Combination chemodrug-loaded hyaluronan hydrogels: cancer cells are less likely to evolve mutations to multiple bio-orthogonal drugs when delivered at once, reducing the risk of resistance. Using hyaluronan to target CD44 receptors increases specificity to leukemia stem cells. However, toxicity is also additive in combination drugs.
More info:
How evolution explains cancer: here (video #3 in playlist)
Basin modelling is a technique widespread in the global multi-trillion-dollar oil and gas industry, which synthesises geological, petrological and paleontological data to predict the locations of oil and gas reserves within the Earth’s crust. Common targets include oil shales from the Cambrian, Ordovician, Devonian, and Jurassic (as source rocks), as well as tight oil reservoirs found in Devonian, Carboniferous, and Cretaceous formations. It makes extensive use of radiometric dating and stratigraphy (e.g. foraminifera (protist) biostratigraphy) to date the sedimentary layers and model the thermal history of the hydrocarbon-bearing rocks.
In oil and gas, predictions mean profits, and errors mean tremendous financial losses! The success of this industry (at the expense of the climate, unfortunately…) would not be possible without the validity of the underlying theory.
There exists only one oil prospecting company in the world that refuses to use old-earth evolutionary models in their work: they are Zion Oil and Gas Corporation (ZNOG), founded by Christian fundamentalists who believe that Israel would yield oil reserves on theological grounds. Zion Oil has failed to find any ’economically recoverable’ oil reserves in over 20 years of trying, operates incurring annual losses of several tens of millions of USD and are practically bankrupt as of 2025, staying afloat only by selling shares to gullible investors.
Evolutionary Algorithms
Evolutionary (genetic) algorithms are rarely the best solution to a problem since they are stochastic and highly generalised: they are essentially biased parallelised random searches, and this goes some way to explaining why biological evolution is so slow and messy. However, with domain-specific knowledge and a well-designed objective function and genetic encoding scheme, they can be very powerful, outcompeting conventional algorithms in some cases, illustrating the power of the underlying Darwinian principles.
Topology optimisation - Evolutionary Structural Optimisation (ESO) and ‘Generative Design’ in Autodesk Fusion 360. Examples here.
Diffusion models are a type of generative AI used to create images or videos, and have been mathematically shown to be equivalent to an evolutionary algorithm.
Artificial selection is nothing more than natural selection but with humans deciding what is ‘most fit’, widely used in agriculture and animal husbandry. We can guide the evolutionary process towards exhibiting desired traits using selective breeding.
Plants like the wild mustard (Rhamphospermum arvense) and wild cabbage (Brassica oleracea) have been selectively bred in different ways to produce a variety of modern crops. Despite the variability in form, all are within the same species and descended from the same plant.
All of modern fruit and vegetable produce is a result of selective breeding: man-made fruits and vegetables don’t exist in the wild. We have bred plants for about 10,000 years, making them bigger, sweeter, less bitter, less toxic (less cyanide in seeds), seedless, etc. Wild fruits are smaller, not sweet, hard or inedible and full of seeds.
Domestication of dogs and cats
Domestication of livestock
Gene drive for mosquitoes to eliminate malaria
MISCELLANEOUS
These are not necessarily evidence for evolution, but are tangentially related and generally support the theory.
Scientific Consensus
The overwhelming majority of scientists accept evolution as the best explanation for the diversity of life on Earth. It is not an ‘argument from authority’ fallacy to cite the scientific community for a scientific argument, as the consensus is based on the evidence acquired through the scientific method, not on opinions or beliefs of individuals with power.
According to Pew Research Center, as of 2019, 98% of scientists accept evolution, whether religious or not. The 2% who do not are almost exclusively religious fundamentalists outside of the fields of biology, and are therefore both uninformed and biassed. There are practically zero scientists who reject evolution on scientific grounds.
Most US adults, religious or not, accept evolution, and virtually all scientists accept evolution. Source: Pew Research Center, 2019.
The concept of intelligent design (ID) was conceived by the Discovery Institute (DI), an evangelical Christian ’think tank’, as an attempt to circumvent the 1987 Edwards v. Aguillard ruling prohibiting the teaching of creationism in public schools due to violation of church-state separation. ID was also intended to be more appealing to the general public, a necessary part of the DI’s “Wedge Strategy”, whose long-term goal is to effectively work towards installing theocracy in the US, as outlined in their leaked Wedge Document in 2001. Once this document was exposed, the DI published The Wedge Document: So What? in an attempt at damage control, where they both denied and doubled down on some their intentions.
As part of this effort to promote ID, the DI put out a petition called “Dissent from Darwinism”, which asked a wide variety of experts including scientists, doctors and engineers whether they agreed with the following statement:
“I am skeptical of claims for the ability of random mutation and natural selection to account for the complexity of life”.
This petition was intentionally and deceptively worded, since a reasonable scientist may well agree that mutation and selection are not sufficient for evolution, since there are in fact other mechanisms of evolution (e.g. genetic drift, gene flow, sexual selection…). Additionally, by including non-biologists such as engineers and doctors, the DI deliberately inflated the number of qualified signatories, and in fact the vast majority of the signatories were not biologists, totalling about 1,000 individuals in total.
The National Center for Science Education (NCSE) put out a counter-petition called “Project Steve”, which asked scientists (not engineers or doctors) a far more precise statement to the contrary, and only signatories named “Steve” (or some close variation thereof) were counted. About 1,500 Steves signed this petition, outnumbering the original.
The teaching of ID in US public schools was challenged in the 2005 Kitzmiller v. Dover case, which ruled that ID is not science, but creationism, and therefore cannot be taught. Overwhelming evidence was brought against the ID proponents, including several refutations to talking points which are still parroted to this day by their followers. After the trial, it was noticed that the DI’s creation science textbook had simply copy-pasted the word “design proponents” in place of “creationists” in the text, with one edition of the book featuring the telling typo, “cdesign proponentsists”.
The judge at Kitzmiller v. Dover ruled in favour of maintaining the separation of church and state, identifying ID as creationism-aligned. Source: NCSE.
In the run-up to the 2024 US presidential election, the DI quietly became a ‘coalition partner’ for Project 2025. Once the public became aware of Project 2025 and its draconian plans, the DI seemingly requested its logo be removed from the Project 2025 website to cover its tracks, but this was caught and the DI remains a coalition partner as of 2025.
Project 2025 Coalition Partners webpage in June 2024, featuring Discovery Institute. It was removed from the website shortly after. Screenshot posted on Twitter/X by Jackson Wheat.
Creationism is not science, and intelligent design (ID) is merely creationism with a science-coloured coat of paint. Neither of these ideas make any testable falsifiable predictions, except for those which have already been tested and falsified.
“Creation scientists” are degree-holding scientists who intend to use science to find evidence for their religious convictions. Despite being qualified (sometimes), they have a reputation for being exceptionally dishonest, untrustworthy, and most importantly, highly incompetent in their work promoting creationism. Examples include:
Jeffrey Tomkins, who made numerous basic arithmetic and methodological errors in attempting to show that humans and chimpanzees share much less DNA than the conventional figure of >95%.
Günter Bechly worked at the DI and allegedly ended his life in a murder-suicide car crash, killing one other person. The DI made no mention of these allegations in their post reminiscing on his career. Sources: here and here.
“Mendel’s Accountant”, a computer program written by John Sanford, intended to show that mutation would always lead to ‘genetic entropy’ (loss of genetic information and degradation). The code was found to be heavily skewed to favour this conclusion, by biassing the effect of harmful mutations, among several other flaws.
“Biblical Radiocarbon”, a website running a program that aims to ‘recalibrate’ conventional radiocarbon dates into a young-earth timeline. The program’s code was found to be of exceptionally poor quality with numerous bugs.
Salvador (Sal) Cordova, a creation scientist who gave a presentation at a conventional evolution conference. The presentation was highly unprofessional and he made no attempt to communicate any of his points clearly. Dr Dan Stern Cardinale and Dr Zach Hancock reviewed the presentation here.
Nathaniel Jeanson is a Harvard PhD who has readily admitted that he only acquired his prestigious degree in order to promote creationism with authority. He has been caught recycling arguments without addressing the counters many times.
It is often difficult to discern whether these “creation scientists” are incompetent or intentionally dishonest - the former seems less likely in the case of the more qualified individuals.
Some Relevant Quotes
Nothing in biology makes sense except in the light of evolution.
— Theodosius Dobzhansky, Evolutionary Biologist
As a natural historian according to the principles of science, up to the present time I have been not been able to discover any character by which man can be distinguished from the ape; for there are somewhere apes which are less hairy than man, erect in position, going just like him on two feet, and recalling the human species by the use they make of their hands and feet, to such an extent, that the less educated travellers have given them out as a kind of man.
I demand of you, and of the whole world, that you show me a generic character — one that is according to generally accepted principles of classification, by which to distinguish between Man and Ape. I myself most assuredly know of none…. But, if I had called man an ape, or vice versa, I should have fallen under the ban of all the ecclesiastics. It may be that as a naturalist I ought to have done so.
— Carl Linnaeus, pre-Darwinian Botanist and Taxonomist
I write about biology from the point of view of a physicist. Some physicists are arrogant and some are humble. I prefer to be humble. Arrogant physicists say that biology needs better concepts; since physicists are good at concepts, our job is to tell biologists how to think. Humble physicists say that biology needs better hardware; since physicists are good at hardware, our job is to invent new tools for biologists to use. With the exception of Max Delbruck and Francis Crick and a few other pioneers in the heroic age of molecular biology, physicists who tried to teach biologists how to think have failed dismally.
— Freeman Dyson, Theoretical Physicist, as cited by Dr David Tong in his Cambridge Mathematical Biology lecture notes.
It doesn’t matter how beautiful your theory is, it doesn’t matter how smart you are. If it doesn’t agree with experiment, it’s wrong.
— Richard Feynman, Theoretical Physicist
All models are wrong, but some are useful.
— George Box, Statistician
I believe God did intend, in giving us intelligence, to give us the opportunity to investigate and appreciate the wonders of His creation. He is not threatened by our scientific adventures.
— Francis Collins, Director of the Human Genome Project and the NIH