对于实数的 boxplot,upper whisker ends at min(Vmax,Q3+1.5×IQR), lower whisker ends at max(Q1−1.5×IQR,Vmin)
Variance
Population variance
σ2=N∑i=1N(xi−μ)2=N∑i=1Nxi2−μ2
Sample variance
s2=n−1∑i=1n(xi−xˉ)2=n−1∑i=1nxi2−nxˉ2
Coefficient of Variance (CV)
measure of relative dispersion that expresses the standard deviation as a percentage of the mean
population CV = μσ×100%, sample CV = xˉs×100%
Empirical Rule
大约 0.68 的数据落在 [μ±σ] 的区间里
大约 0.95 的数据落在 [μ±2σ] 的区间里
大约 0.997 的数据落在 [μ±3σ] 的区间里
z-Score
measures the location or position of a value relative to the mean of the distribution: it is a standardized value that indicates the number of standard deviations a value is from the mean
Population zi=σxi−μ, sample zi=sxi−xˉ
Skewness of Distribution
skewness=n1s3∑i=1n(xi−xˉ)3
Covariance
Population covariance
Cov(x,y)=σx,y=N∑i=1N(xi−μx)(yi−μy)
Sample covariance
sx,y=n∑i=1n(xi−μx)(yi−μy)
Correlation
free of units and provides both the direction and strength of a relationship