The human genome is represented by a sequence of 3 billion As, Cs, Gs, and Ts. With such large numbers, sequencing the entire genome of a complex organism isn't just a challenge in biochemistry. It's a logistical nightmare, which can only be solved with clever algorithms.
"It's a match!" cries the CSI. At first glance it might seem that if the police have matched a suspect's DNA to evidence from the crime scene, then the case is closed. But some statistical thinking is required to understand exactly what a match is, and importantly, how juries should assess this as part of the evidence in a trial.
David Spiegelhalter explains that waiting for an infinite number of monkeys to produce the complete works of Shakespeare is not just a probabilistic certainty, it also gives us an insight into how long we can expect to wait for a rare event to happen.