Statistics: A Concise Tutorial

Statistics - Boxplots

A box plot is a standardized way to display the distribution of data based on the five-number summary: minimum, first quartile, median, third quartile, and maximum.

For a uniformly distributed data set, the central rectangle of the box plot spans the first quartile to the third quartile (the interquartile range, IQR). A line inside the rectangle marks the median, and "whiskers" above and below the box mark the minimum and maximum values. Such a box plot displays the full range of variation (minimum to maximum), the likely range of variation (the IQR), and the median.

[Figure: box plot]
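As a minimal sketch of how the five-number summary behind a box plot can be computed, the Python snippet below uses only the standard library. The function name `five_number_summary` and the sample values are hypothetical, chosen purely for illustration. The quartiles use the "inclusive" interpolation method, which is one of several common conventions, so other tools may report slightly different first and third quartiles.

```python
import statistics


def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for a sample.

    Hypothetical helper for illustration. Expects at least two values.
    Quartile conventions differ between tools; this one uses the
    'inclusive' method of statistics.quantiles.
    """
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return min(values), q1, median, q3, max(values)


# Illustrative sample, not taken from the tutorial.
sample = [1, 2, 3, 4, 5, 6, 7, 8, 9]
minimum, q1, median, q3, maximum = five_number_summary(sample)
iqr = q3 - q1  # height of the central rectangle

print("min:", minimum, "Q1:", q1, "median:", median, "Q3:", q3, "max:", maximum)
print("IQR:", iqr)
```

The box is drawn from Q1 to Q3, the line inside it sits at the median, and the whiskers reach the minimum and maximum.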

Problem Statement:

Create a box plot for each of the following two datasets.

Dataset 1: 0.22, -0.87, -2.39, -1.79, 0.37, -1.54, 1.28, -0.31, -0.74, 1.72, 0.38, -0.17, -0.62, -1.10, 0.30, 0.15, 2.30, 0.19, -0.50, -0.09

Dataset 2: -5.13, -2.19, -2.43, -3.83, 0.50, -3.25, 4.32, 1.63, 5.18, -0.43, 7.11, 4.87, -3.10, -5.81, 3.76, 6.31, 2.58, 0.07, 5.76, 3.50

Solution:

Both datasets are roughly balanced around zero, so their means lie close to zero relative to their spread. The first dataset ranges from approximately -2.5 to 2.5, whereas the second ranges from approximately -6 to 7. The resulting chart is shown below:

[Figure: box plots of the two datasets]
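The ranges quoted in the solution can be checked with a short Python sketch, and the chart itself can be drawn with matplotlib if it is installed. Two assumptions are made here: the 40 values are split into two datasets of 20 in the order they are listed, and the helper name `plot_boxplots` and the output file name are hypothetical. The plotting function imports matplotlib lazily, so the summary part runs with the standard library alone.

```python
import statistics

# Assumption: the first 20 listed values form dataset 1, the last 20 dataset 2.
dataset_1 = [0.22, -0.87, -2.39, -1.79, 0.37, -1.54, 1.28, -0.31, -0.74, 1.72,
             0.38, -0.17, -0.62, -1.10, 0.30, 0.15, 2.30, 0.19, -0.50, -0.09]
dataset_2 = [-5.13, -2.19, -2.43, -3.83, 0.50, -3.25, 4.32, 1.63, 5.18, -0.43,
             7.11, 4.87, -3.10, -5.81, 3.76, 6.31, 2.58, 0.07, 5.76, 3.50]

# Print the extremes and central values that the solution text refers to.
for name, data in (("Dataset 1", dataset_1), ("Dataset 2", dataset_2)):
    print(f"{name}: min={min(data)}, max={max(data)}, "
          f"median={statistics.median(data):.3f}, "
          f"mean={statistics.mean(data):.3f}")


def plot_boxplots(path="boxplots.png"):
    """Draw both datasets side by side and save the figure (needs matplotlib)."""
    import matplotlib.pyplot as plt  # third-party, imported only when plotting

    fig, ax = plt.subplots()
    # whis=(0, 100) extends the whiskers to the minimum and maximum,
    # matching the box plot described in this tutorial.
    ax.boxplot([dataset_1, dataset_2], whis=(0, 100))
    ax.set_xticklabels(["Dataset 1", "Dataset 2"])
    ax.set_ylabel("Value")
    fig.savefig(path)
```

Calling `plot_boxplots()` writes the chart to `boxplots.png`. Without the `whis=(0, 100)` argument, matplotlib's default whiskers stop at 1.5 times the IQR from the box and more distant points are drawn as outliers.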