What would you Choose – Mean or Median?
Hypothetically speaking, given a choice between a question on median or one on mean, which would you choose? (not that we are fortunate enough to have a choice but no harm in dreaming) I would certainly pick the one testing on median. Here is why:
A point gives you median – to be precise, the one which divides the increasing data set into two equal halves. You don’t care what is on the left and what is on the right of this point. An outlier will do nothing to the median. The problem with mean is that it depends on every value in the set. If you increase one element, mean changes. Outliers can drastically change the mean and hence, every element has to be kept in mind!
So with median, there is a lot less to worry about.
Let’s illustrate this with an example.
Question on Median
At a bakery, cakes are sold every day for a certain number of days. If 6 or more cakes were sold for 20% of the total number of days, is the median number of cakes sold less than 4?
Statement 1: On 75% of the days that less than 6 cakes were sold, the number of cakes sold each day was less than 4.
Statement 2: On 50% of the days that 4 or more cakes were sold, the number of cakes sold each day was 6 or more.
Solution:
Following is the number of cakes sold on any of those days:
1 2 3 4 5 6 more….
Say there were 100 days (since all figures are in terms of percentages, we can assume a number to simplify our understanding).
Information from the question stem: 6 or more cakes were sold for 20% of the days
So for 20 days, 6 or more cakes were sold.
Then for 80 days, 1/2/3/4/5 cakes were sold.
Question: Is the median number of cakes sold in one day less than 4?
We know how to get the median. When we arrange all figures in increasing order, the median will be the average of the 50th and the 51st terms. We need to know if the average of the 50th and 51st term is less than 4.
Statement 1: On 75% of the days that less than 6 cakes were sold, the number of cakes sold each day was less than 4.
No of days that less than 6 cakes were sold = 80
75% of these 80 days will be 60 days. On 60 days, less than 4 cakes were sold. So the 50th and 51st terms will be less than 4 and so will their average. Hence, the median will be less than 4.
This statement alone is sufficient.
Statement 2: On 50% of the days that 4 or more cakes were sold, the number of cakes sold each day was 6 or more.
On 20 days, 6 or more cakes were sold. This constitutes 50% of the days on which 4 or more cakes were sold. So on another 20 days, 4 or 5 cakes were sold.
Hence on the leftover 60 days, less than 4 cakes were sold. So the 50th and 51st terms will be less than 4 and so will their average.
Hence, the median will be less than 4.
This statement alone is sufficient.
Answer (D).
All we needed to worry about here were the 50th and 51st terms. The whole scene changes when we talk about mean instead of median.
Same Question on Mean
At a bakery, cakes are sold every day for a certain number of days. If 6 or more cakes were sold for 20% of the total number of days, is the average number of cakes sold less than 4?
Statement 1: On 75% of the days that less than 6 cakes were sold, the number of cakes sold each day was less than 4.
Statement 2: On 50% of the days that 4 or more cakes were sold, the number of cakes sold each day was 6 or more.
Solution:
Information from the question stem: 6 or more cakes were sold for 20% of the days
So for 20 days, 6 or more cakes were sold.
Then for 80 days, 1/2/3/4/5 cakes were sold.
Question: Is the average number of cakes sold in one day less than 4?
This question asks us about the average. That is far more complicated than the median. Every value matters when we talk about the average. We need to know the number of cakes sold on each of these 100 days to get the average.
6 or more cakes were sold on 20 days. Note that the number of cakes sold on these 20 days could be any number greater than 6 such as 20 or 50 or 120 etc. The minimum number of cakes sold on these 20 days would be 620 = 120
There is no limit to the maximum number of cakes sold.
Statement 1: On 75% of the days that less than 6 cakes were sold, the number of cakes sold each day was less than 4.
On 80 days, less than 6 cakes were sold. Of this number, 75% is 60 days. On 60 days, less than 4 cakes were sold.
So on 60 days, you have minimum 160 = 60 cakes sold and maximum 360 = 180 cakes sold.
On the leftover 20 days, 4 or 5 cakes were sold so minimum of 420 = 80 cakes and a maximum of 520 = 100 cakes.
The minimum value of the average is cakes
The maximum average could be anything.
This statement alone is not sufficient.
Statement 2: On 50% of the days that 4 or more cakes were sold, the number of cakes sold each day was 6 or more.
The 20 days when 6 or more cakes were sold make up 50% of the days when 4 or more cakes were sold. So for another 20 days, 4 or 5 cakes were sold so minimum of 420 = 80 cakes and a maximum of 520 = 100 cakes.
For 60 days, 1/2/3 cakes were sold. So on 60 days, you have minimum 160 = 60 cakes sold and maximum 360 = 180 cakes sold.
The minimum value of the average is cakes
The maximum average could be anything.
This statement alone is not sufficient.
Note that both statements give you the same information. So if they are not sufficient independently, they are not sufficient together.
Answer of this modified question would be (E).
Note that here we had to assume the minimum and maximum value for each data point to get the range of average. We couldn’t just rely on one or two data points. We need much more information to get the answer to what the mean is.