plus.maths.org


    Understanding life with topology

    Topological data analysis and its uses in the life sciences
    by
    Marianne Freiberger
    1 October, 2025

    Brief summary

    This article gives a basic introduction to topological data analysis, a method with roots in pure mathematics that seeks to understand data by analysing its shape.

    "In mathematics you often build from the ground up. You have blocks and you build up your building using these blocks. But in biology we can't see all the [blocks]. We are still trying to figure out what [they] are and asking whether maths can help identify [them]."

    The answer to this question, posed by mathematician Heather Harrington, appears to be "yes". A technique called topological data analysis (TDA), which has its origins in pure mathematics, has seen some interesting successes over the last few years. It featured extensively at a recent event called Topological advances in the life sciences, which was organised by the Newton Gateway to Mathematics and at which Harrington was a speaker. TDA was also the topic of Harrington's special lecture at the European Congress of Mathematics last year.

    Bring on topology

    One problem that faces life scientists, such as biologists, is that many of the objects they are interested in are not only hard to see, they are also hard to compare. No two tumours look alike. Proteins have complex structures and are also dynamic. People's brains, and the processes that happen inside them, look different from person to person.

    You can listen to an interview with Heather Harrington talking about TDA in our podcast.

    To see if two objects are of a similar type, what's needed is a method that can capture important features of their shapes without getting sidetracked by irrelevant details. And since many data sets that arise in biology (for example those coming from genomic data) aren't images in the traditional sense, but live in high-dimensional spaces we can't even visualise, that method should not rely on humans simply looking at something.

    Topology, traditionally an area of pure mathematics, practically cries out to be used in this context.  In topology two shapes are considered the same if one can be morphed into the other without tearing or cutting. The ring on your finger is considered to be the same shape as the tired rubber band lying on your desk. What characterises both is that they form a loop, in other words, they surround a hole. In topology loops and holes play an important role in defining shapes.

    A toy example

    For a toy example of how TDA works, imagine 20 points sitting on a circle, equally spaced. Now draw a small disc of radius $r$ around each point and gradually increase $r$ so the discs become bigger. If the distance between the points is sufficiently small compared to the circle they are sitting on, then the discs will merge to form a loop. This happens when $r$ grows beyond half the straight-line distance between neighbouring points. Increase $r$ further and the discs will merge to form a single blob. This happens when $r$ grows beyond the radius of the large circle the points are on.

    Example involving circles
    Circles drawn around 20 points in the plane. If the radius r is less than r0, the circles are small enough to not overlap (left). Once the radius exceeds r0, but is smaller than r1, the circles overlap and together form a ring-like structure (middle). Once the radius is larger than r1 the circles join up in the centre of this ring-like structure (right). What you see now is a single blob without a hole.
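The two thresholds in the toy example can be worked out explicitly. The short calculation below assumes the large circle has radius $R$ (a symbol introduced here for illustration; the article does not name it). Neighbouring points are separated by an angle of $2\pi/20$, so the straight-line distance $d$ between them is a chord of the circle:

$$
d = 2R\sin\left(\frac{\pi}{20}\right) \approx 0.313\,R,
\qquad
r_0 = \frac{d}{2} = R\sin\left(\frac{\pi}{20}\right) \approx 0.156\,R,
\qquad
r_1 = R.
$$

The value of $r_1$ comes from the centre of the large circle, which is the last point inside the ring to be covered and lies at distance $R$ from every one of the 20 points. Under these assumptions the loop exists for $0.156\,R < r < R$, a range of length about $0.844\,R$.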

    To keep track of the changing picture that emerges as the discs expand, topological data analysts use what they call a bar code. The bar code corresponding to our toy example is shown below. Here $r_0$ is the radius at which the discs first join into a loop and $r_1$ is the radius at which the hole in the middle closes up. For $r < r_0$ there are 20 red lines, indicating that there are 20 connected components without holes. For $r_0 < r < r_1$ there is one green line, indicating that there is one connected component with one hole (the colours red and green differentiate between no hole and one hole). For $r > r_1$ there is one red line, indicating that there is one connected component without a hole. The length of the interval from $r_0$ to $r_1$ indicates how long the connected component with one hole persists.

    Example of a bar code in TDA
    The bar code captures this information.
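The red part of the bar code, which counts connected components, can be reproduced in a few lines of Python. This is a minimal sketch for illustration, not the software topological data analysts actually use. It assumes the large circle has radius 1, and the function names (`circle_points`, `component_deaths`, `components_at`) are made up for this example. It relies on the fact that two discs of radius $r$ touch once $r$ reaches half the distance between their centres.

```python
import math
from itertools import combinations

def circle_points(n=20, radius=1.0):
    """n equally spaced points on a circle of the given radius."""
    return [(radius * math.cos(2 * math.pi * k / n),
             radius * math.sin(2 * math.pi * k / n)) for k in range(n)]

def component_deaths(points):
    """Radii at which two separate components merge as the discs grow.

    Pairs of points are processed from closest to farthest (Kruskal's
    algorithm with a union-find structure). A merge is recorded whenever
    a pair joins two components that were separate until then.
    """
    parent = list(range(len(points)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    pairs = sorted(combinations(range(len(points)), 2),
                   key=lambda ij: math.dist(points[ij[0]], points[ij[1]]))
    deaths = []
    for i, j in pairs:
        root_i, root_j = find(i), find(j)
        if root_i != root_j:
            parent[root_i] = root_j
            # The two discs touch when r is half the distance between centres.
            deaths.append(math.dist(points[i], points[j]) / 2)
    return deaths

def components_at(r, deaths, n):
    """Number of connected components of the union of discs of radius r."""
    return n - sum(1 for d in deaths if d <= r)

points = circle_points()
deaths = component_deaths(points)
r0 = math.sin(math.pi / 20)  # half the neighbour distance on a unit circle

print(components_at(0.1, deaths, len(points)))  # 20
print(components_at(0.5, deaths, len(points)))  # 1
```

In this sketch 19 merges are recorded, all at $r_0$: one of the 20 components never disappears, it simply becomes the single component that remains. The code does not detect the hole, so it only accounts for the number of components, not for the change of colour in the bar code.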

    Crucially, the bar code would look quite similar if our points were arranged, not in a perfect circle, but in a deformed ring. The bar code captures the fact that the data are arranged in a ring, without bothering about precise geometrical details. The length of the single green line in the bar code indicates how long the loop persists as discs expand. The bar code is a sort of fingerprint of the topological shape of the data.
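The claim about deformed rings can be tried out numerically. The sketch below is a rough illustration under several assumptions, not a persistent homology implementation: it places 20 points on a unit circle or on a ring whose radius wobbles with the angle (the `wobble` parameter and all function names are hypothetical), samples the plane on a grid, and counts holes as uncovered regions that cannot be reached from the edge of the grid. Regions smaller than `min_cells` grid cells are treated as discretisation noise, which is a choice made here for robustness.

```python
import math
from collections import deque

def ring_points(n=20, wobble=0.0):
    """n points on a ring; wobble=0 gives a perfect unit circle."""
    points = []
    for k in range(n):
        angle = 2 * math.pi * k / n
        rho = 1.0 + wobble * math.sin(3 * angle)  # radius varies with angle
        points.append((rho * math.cos(angle), rho * math.sin(angle)))
    return points

def distance_field(points, half_width=2.0, cells=121):
    """Distance from every grid cell centre to its nearest data point."""
    step = 2 * half_width / (cells - 1)
    coords = [-half_width + i * step for i in range(cells)]
    return [[min(math.dist((x, y), p) for p in points) for x in coords]
            for y in coords]

def count_holes(field, r, min_cells=3):
    """Count bounded uncovered regions of the union of discs of radius r."""
    n = len(field)
    # A cell is covered if its nearest data point is within r. Covered
    # cells start out marked as visited, so the flood fill never enters them.
    seen = [[dist <= r for dist in row] for row in field]

    def flood(i, j):
        """Mark the uncovered region containing (i, j) and return its size."""
        seen[i][j] = True
        queue, size = deque([(i, j)]), 0
        while queue:
            a, b = queue.popleft()
            size += 1
            for da in (-1, 0, 1):
                for db in (-1, 0, 1):
                    c, d = a + da, b + db
                    if 0 <= c < n and 0 <= d < n and not seen[c][d]:
                        seen[c][d] = True
                        queue.append((c, d))
        return size

    # Uncovered regions that touch the grid border belong to the outside.
    for k in range(n):
        for i, j in ((0, k), (n - 1, k), (k, 0), (k, n - 1)):
            if not seen[i][j]:
                flood(i, j)
    # Anything still unvisited is enclosed by discs, so it is a hole.
    holes = 0
    for i in range(n):
        for j in range(n):
            if not seen[i][j] and flood(i, j) >= min_cells:
                holes += 1
    return holes

def loop_interval(points, radii):
    """Smallest and largest sampled radius with exactly one hole."""
    field = distance_field(points)
    with_loop = [r for r in radii if count_holes(field, r) == 1]
    return (min(with_loop), max(with_loop)) if with_loop else None

radii = [0.025 + 0.05 * k for k in range(24)]
circle_bar = loop_interval(ring_points(), radii)
deformed_bar = loop_interval(ring_points(wobble=0.25), radii)
print("perfect circle:", circle_bar)
print("deformed ring: ", deformed_bar)
```

In both cases the sketch should report a single loop that persists over a long range of radii, although the range is shorter for the deformed ring. The endpoints are only as accurate as the grid spacing and the sampled radii allow, so they should be read as estimates of where the green bar starts and ends.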

    The general idea illustrated in our simple example works with much more complex data sets as well, including those that live in high dimensions. To keep track of the features that are born, persist, and die, mathematicians use something called persistent homology, a tool which has its origins in the pure mathematical area of algebraic topology.

    TDA successes

    An early success of TDA involved breast cancer. In 2011 a team of mathematicians were able to identify a new subtype of tumour by applying TDA to genomic data. It turned out that patients with that subtype had a 100% survival rate — quite an important piece of information to have if you have this type of tumour. 

    At the Topological advances in the life sciences event, speakers explored a range of other applications of topology, not only to cancer research but also to neurology and haematology. The excitement was palpable: in the age of Big Data, a method that can classify the shape of that data in a meaningful and automated way holds much promise. And we have pure mathematics to thank for it.


    About this article

    Marianne Freiberger, Editor of Plus, attended the Topological advances in the life sciences event, organised by the Newton Gateway to Mathematics, in June 2025. The event was part of a longer research programme organised by the Isaac Newton Institute for Mathematical Sciences called Equivariant homotopy theory in context. You can see more of our content produced from this research programme here.


    This content was produced as part of our collaborations with the Isaac Newton Institute for Mathematical Sciences (INI) and the Newton Gateway to Mathematics.

    The INI is an international research centre and our neighbour here on the University of Cambridge's maths campus. The Newton Gateway is the impact initiative of the INI, which engages with users of mathematics. You can find all the content from the collaboration here.


    Plus is part of the family of activities in the Millennium Mathematics Project.
    Copyright © 1997 - 2025. University of Cambridge. All rights reserved.
