Skip to main content
Home
plus.maths.org

Secondary menu

  • My list
  • About Plus
  • Sponsors
  • Subscribe
  • Contact Us
  • Log in
  • Main navigation

  • Home
  • Articles
  • Collections
  • Podcasts
  • Maths in a minute
  • Puzzles
  • Videos
  • Topics and tags
  • For

    • cat icon
      Curiosity
    • newspaper icon
      Media
    • graduation icon
      Education
    • briefcase icon
      Policy

    Popular topics and tags

    Shapes

    • Geometry
    • Vectors and matrices
    • Topology
    • Networks and graph theory
    • Fractals

    Numbers

    • Number theory
    • Arithmetic
    • Prime numbers
    • Fermat's last theorem
    • Cryptography

    Computing and information

    • Quantum computing
    • Complexity
    • Information theory
    • Artificial intelligence and machine learning
    • Algorithm

    Data and probability

    • Statistics
    • Probability and uncertainty
    • Randomness

    Abstract structures

    • Symmetry
    • Algebra and group theory
    • Vectors and matrices

    Physics

    • Fluid dynamics
    • Quantum physics
    • General relativity, gravity and black holes
    • Entropy and thermodynamics
    • String theory and quantum gravity

    Arts, humanities and sport

    • History and philosophy of mathematics
    • Art and Music
    • Language
    • Sport

    Logic, proof and strategy

    • Logic
    • Proof
    • Game theory

    Calculus and analysis

    • Differential equations
    • Calculus

    Towards applications

    • Mathematical modelling
    • Dynamical systems and Chaos

    Applications

    • Medicine and health
    • Epidemiology
    • Biology
    • Economics and finance
    • Engineering and architecture
    • Weather forecasting
    • Climate change

    Understanding of mathematics

    • Public understanding of mathematics
    • Education

    Get your maths quickly

    • Maths in a minute

    Main menu

  • Home
  • Articles
  • Collections
  • Podcasts
  • Maths in a minute
  • Puzzles
  • Videos
  • Topics and tags
  • Audiences

    • cat icon
      Curiosity
    • newspaper icon
      Media
    • graduation icon
      Education
    • briefcase icon
      Policy

    Secondary menu

  • My list
  • About Plus
  • Sponsors
  • Subscribe
  • Contact Us
  • Log in
  • Overbooking: How to avoid plane rage

    by
    Christine Currie
    13 July, 2017

    Overbooking became infamous overnight after United Airlines made a huge reputational error in dragging a customer off a flight to make way for what turned out to be a crew member. For anyone who missed this sorry spectacle, overbooking is the practice of selling more seats for a flight than exist on the plane. The ethics of overbooking were discussed at length in the days that followed the incident, but what about the maths? In this article we describe a simple, but effective model of overbooking that makes use of the binomial distribution.

    Plane

    Welcome on board?

    The vast majority of people who buy a ticket for a flight intend to use it but sometimes circumstances get in the way. Easyjet states that 2.6 million of its passengers didn't show up for their flight in 2016 (around 3.5%), so if overbooking wasn't used, many aircraft would fly with empty seats. This results in reduced efficiency and less cash for the airlines. While you might not be worried about how much profit the airlines make, it's in everyone's interests that aircraft fly at full capacity.

    Overbooking forms part of an area of mathematics called operational research, in which mathematical ideas are developed to solve optimisation problems in the "real world". These can range from business optimisation problems such as that described here, to optimal scheduling of operating theatres in hospitals, improving the efficiency of manufacturing lines, routing of vehicles, location of facilities, etc. For more information about operational research see the Learn about O.R. website.

    Formulating the problem

    Each person who buys a ticket has a probability that they will show up, which we label $p$ (so the probability they cancel is $q=1-p$). We're going to assume that these probabilities are the same for everyone and are independent of each other. Neither of these are perfect assumptions: certain classes of passenger are more likely to cancel than others, and if one person cancels, there is likely to be a higher chance that other people on the plane will also cancel (e.g. if there are major traffic problems just outside the airport many people might miss the flight). Nonetheless, the assumption makes a good first approximation. Let's assume that our aircraft has $C$ seats and the airline sells $y$ tickets, where $y \geq C$. The decision the airline has to make is how big to set $y$, that is, how many seats to overbook. But we'll first work out the probability distribution for the number of people who show up given that we have sold $y$ tickets, which we'll write as $Z(y)$.
    Plane

    Overbooking ruins holiday dreams.

    There is a probability distribution that is usually phrased in terms of repeated experiments of which each outcome can be either a success or a failure. The binomial distribution gives you the probability of observing a given number of successes within a given total of trials of the experiment. We can look at our overbooking problem in the same way. Each passenger is regarded as a trial of the experiment. If they show up, the outcome is a success and if not then it's a failure.

    If you're not familiar with the binomial distribution, take a look at this brief explanation to find out more.

    Using the binomial equations, we can write the probability $P\left[Z(y)=z\right]$ that we have $z$ people showing up for the flight given we have sold $y$ tickets as \begin{equation} P\left[Z(y)=z\right] = {{y}\choose{z}} p^z (1-p)^{y-z}.\end{equation} Here the variable $z$ can take values from $0$ up to $y$.

    The first term in the expression above is the binomial coefficient as described here.

    Using the model

    We now have a set of equations that give us the probabilities of different numbers of people showing up for the flight. Here we consider the case where we want to restrict overbooking so that passengers are turned away on only a small percentage of flights.

    For someone to be turned away on a flight, the number who show up, $Z(y)$, needs to be greater than the number of seats, $C$. We can write this in terms of the probability $P[Z(y)=z]$ defined in equation 1 above: $$P\left[Z(y)>C\right] = P\left[Z(y)=C+1\right]+ P\left[Z(y)=C+2\right]+...+P\left[Z(y)=y\right].$$ This expression comes from the fact that the probability of either one of two independent events occurring is equal to the sum of the probabilities of the individual events.

    An example

    Let's see how this works for an example. The model is simple enough to be run easily on an Excel spreadsheet and you can download a copy here. Let's assume a cancellation rate of 5\% (so $p=0.05$) and correspondingly, a show rate of 95\% (so $1-p = 0.95$). For a plane of capacity $C = 100$, we might want to keep the probability of the number showing up exceeding capacity below 5\%. This would mean fewer than 1 in 20 flights needed to turn someone away. The table and graph below shows the relevant probabilities as given by the binomial distribution.

    Number of seats soldProbability number of shows is greater than C
    1010.56%
    1023.4%
    10310.65%
    10423.08%
    10539.24%
    10656.22%
    10771.21%
    10882.67%
    10990.40%
    11095.08%
    Zipf's law for COCA

    The graph shows the probabilities of someone being turned away.

    We see that for $y = 101$ we have $P\left[Z(101) > 100\right] = 0.56\%$, for $y = 102$ we have $P\left[Z(102) > 100\right] = 3.40\%$ and for $y = 103$ we have $P\left[Z(103) > 100\right] = 10.65\%$. Therefore, the airline should sell $102$ tickets.

    Conclusion

    Hopefully this gives some insight into how an airline makes a decision about whether or not to overbook and how to decide on the number of seats to sell. More advanced methods take account of the cost of turning away a passenger. If the compensation costs are very high, not to mention the damage done to the airline's reputation, overbooking is not going to be worthwhile, but if the cost is very low the airline might be happier to put it into practice. It's also necessary to know the market well and have a good estimate of the probability $p$ of a no show. Without that, the model is likely to suggest poor values for the number of tickets.


    Abut the author

    Christine Currie is an Associate Professor in Operational Research in Mathematical Sciences at the University of Southampton. Her research includes modelling diseases as well as pricing tickets. She spends most of her spare time running around the countryside and orienteering.
    • Log in or register to post comments

    Read more about...

    operational research
    probability
    probability theory
    binomial distribution
    probability distribution

    Our Podcast: Maths on the Move

    Our Maths on the Move podcast brings you the latest news from the world of maths, plus interviews and discussions with leading mathematicians and scientists about the maths that is changing our lives.

    Apple Podcasts
    Spotify
    Podbean

    Plus delivered to you

    Keep up to date with Plus by subscribing to our newsletter or following Plus on X or Bluesky.

    University of Cambridge logo

    Plus is part of the family of activities in the Millennium Mathematics Project.
    Copyright © 1997 - 2025. University of Cambridge. All rights reserved.

    Terms