For a dataset, the quartiles are Q1=42, Q2=55, and Q3=70.
(a) Determine whether the data is positively skewed, negatively skewed, or symmetric.
(b) A student argues: "Since Q2−Q1=13 and Q3−Q2=15, the data is positively skewed because Q3−Q2>Q2−Q1." Is this reasoning correct?
(c) If the interquartile range is IQR=28, state the outlier boundaries using the 1.5×IQR rule.
[Difficulty: hard. Tests interpretation of quartile positions to identify skewness and outlier detection.]
Solution:
(a) The distances from the median are:
Q2−Q1=55−42=13
Q3−Q2=70−55=15
Since Q3−Q2>Q2−Q1, the right tail is longer than the left tail, indicating positive skew.
(b) The student's reasoning is correct in principle: positive skew means the right tail is longer. However, the student should note that this is a heuristic — formal skewness is measured by the moment coefficient n1∑(sxi−xˉ)3, not just quartile differences. The quartile-based test is a quick check, not definitive proof.
A dataset has the following coded values. The coding is y=10x−50:
∑y=45,∑y2=285,n=9
(a) Find xˉ and sx (the standard deviation of x).
(b) A student computes sy=9285−25=31.67−25=6.67 and concludes sx=sy. Explain why this is wrong.
[Difficulty: hard. Tests coded data transformations and the effect on mean and standard deviation.]
Solution:
(a)yˉ=945=5.
Since y=10x−50, we have x=10y+50:
xˉ=10yˉ+50=10(5)+50=100
For the standard deviation: sx=10sy.
sy=n∑y2−yˉ2=9285−25=31.67−25=6.67≈2.58
sx=10×2.58=25.8
(b) The student's error is concluding sx=sy. The coding y=10x−50 scales by a factor of 101 and shifts by 50. Scaling by c multiplies the standard deviation by ∣c∣, so sx=10sy, not sy. The student forgot to account for the scaling factor. Additionally, the student used 9285≈31.67 and then subtracted 25 (where 25=52), which is correct for computing sy, but then incorrectly applied the result to sx.