# Solution to "Yet more taxis!"

To solve this problem we can extend the technique we used in Issue No 2. To make our working easier to read, let’s define to be the number of possible colours other than blue. That is, = . For convenience, we’ll label the other colours .

Consider 100 possible cases (contingencies), in which taxis are involved, in proportion to their numbers. Each case is equally probable. We expect 85 cases to involve blue taxis, the other 15 cases being equally distributed between the other colours.

If a blue taxi were really involved, the witness will report blue with probability 80% or some other colour, say , with probability 20% divided by . If a taxi were really involved the witness will report blue or some other colour, say , with probability 20% divided by and with probability 80%.

The number of times that these outcomes are expected to occur in each case can be shown in a contingency table. The following table only contains entries for blue, and but the pattern should be obvious.

So, given that the witness reported seeing a blue taxi, we must use the row in the table corresponding to the reported blue taxi.

The probability that the taxi was blue is therefore:

This is simply the number of cases in which the witness reported a blue taxi and was right, divided by the total number of cases in which the witness reported a blue taxi.

If we let equal 1 in this expression we simply get 68/71, which is of course the same result we got for the blue/green case. Notice that as the number of taxi firms increases the probability that a blue taxi really was involved in the accident gets closer and closer to 100%.

Now let’s look at what happens if the witness reports some other colour, say . What is the probability that a taxi really was involved?

This time we use the row in the table corresponding to the reported taxi. The probability that the taxi was is therefore:

If we put equal to 1 again we get the (perhaps) surprising result of 12/29, which is just 41%. Clearly the witness's evidence is of very little value. Notice also that as the number of taxi firms increases the probability that a taxi was involved in the account *decreases* towards 37.5%. Having more firms actually weakens the evidence still further. The reason for this "paradox" is that the mistaken sightings of other taxis are swamping the small number of correct identifications.