Showing posts with label skewness. Show all posts
Showing posts with label skewness. Show all posts

Tuesday, June 21, 2011

Using Mean, Standard Deviation, Skewness, and Kurtosis

Previous blogs have discussed the properties of the first four moments which can be computed from a data set. The next step is to use these easily computed statistics in everyday applications. When presented with a set of data, it is important to understand what information may be hidden in the “sea of numbers”. We know that the Mean gives us the central tendency of the data, the Standard Deviation explains the dispersion about the Mean, the Skewness represents the symmetry/asymmetry of the data, and the Kurtosis is related to the shape or peakedness characteristics. In essence, we are using these numerical quantities to explain the properties of the underlying distribution or probability density function (PDF). These statistics can be used to qualitatively perform distribution fitting for you data.

Since a set of data can have any Mean and Standard Deviation, we can use these statistics to determine the location and relative dispersion. Using the Skewness and Kurtosis, we can learn much more as shown in the table below;



Skewness

Kurtosis*
Classical
Distribution
0
1.8
Uniform PDF


Any negative number
2.4
Left-skewed Triangular PDF


0
2.4
Symmetric Triangular PDF


Any positive number
2.4
Right-skewed Triangular PDF


0
3
Normal PDF


0.63
3.26
Raleigh PDF


2
9
Exponential PDF



Using this information, you can make as-like comparisons of your data to the properties of some of the known classical distribution. While you may not be able to conclude that the data set is from a population with a particular distribution, you will be able to infer that “based on the data, the uncertainty is representative of that of a _______ distribution”.

*Note that these quantities are for Kurtosis where 3 = Normal PDF. If quantities for Excess of Kurtosis, where 0 = Normal PDF, are desired, then subtract 3 from the values shown.

Tuesday, May 31, 2011

Skewness and Kurtosis

In our previous blog entry, we discussed how a probability distribution can be described using the first four moments which are mean, variance/standard deviation, skewness, and kurtosis. Mean and standard deviation were described and now we will talk about skewness and kurtosis.

Skewness: The measurement of the asymmetry of a random variable is a dimensionless quantity called skewness (b1)^0.5, which is calculated from the second and third moments about the mean of a distribution. If (b1)^0.5 < 0 the distribution is negatively skewed (tail to left) and if (b1)^0.5 > 0 the distribution is positively skewed (tail to the right). The equations for calculating skewness from data are:


Distributions that are: (a) positively skewed with tail to the right (b1)^0.5  > 0, (b) centered (b1)^0.5  = 0, and (c) negatively skewed with tail to the left (b1)^0.5  < 0. Red lines are means.


Kurtosis: A dimensionless quantity that characterizes the peakedness of a random variable is called kurtotsis b2, which is calculated from the fourth and second moments of the distribution about the mean. If b2 >> 3, the distribution has a high peak and for b2 =1.8, the distribution becomes flat. At b2=3, the distribution is normal



(a) distribution with a high peak (b2  >> 3), (b) normal distribution (b2  = 3), (c) flat (uniform) distribution with (b2  =1.8).

In Excel, the functions SKEW and KURT can be used to calculate skewness and kurtosis, respectively. Be warned -- the Excel KURT function calculates the "Excess of Kurtosis" relative to a normal distribution.  In Excel, KURT will return 0 for a normal distribution rather than the value of 3 found in statistics textbooks.  We suggest that you add 3 to any to value returned by the KURT function when presenting data.

=KURT(A1:A10) + 3