Standard Deviation: A Measure of Dispersion

Standard Deviation

The standard deviation is a widely used concept in statistics and it tells how much variation (spread or dispersion) is in the data set. It can be defined as the positive square root of the mean (average) of the squared deviations of the values from their mean.
To calculate the standard deviation one have to follow these steps:

  1. First, find the mean of the data.
  2. Take the difference of each data point from the mean of the given data set (which is computed in step 1). Note that, the sum of these differences must be equal to zero or near to zero due to rounding of numbers.
  3. Now computed the square the differences that were obtained in step 2, It would be greater than zero, that it, I will be a positive quantity.
  4. Now add up all the squared quantities obtained in step 3. We call it the sum of squares of differences.
  5. Divide this sum of squares of differences (obtained in step 4) by the total number of observation (available in data) if we have to calculate population standard deviation ($\sigma$). If you want t to compute sample standard deviation ($S$) then divide the sum of squares of differences (obtained in step 4) by the total number of observation minus one ($n-1$) i.e. the degree of freedom. Note $n$ is the number of observations available in the data set.
  6. Find the square root (also known as under root) of the quantity obtained in step 5. The resultant quantity in this way known as the standard deviation for the given data set.

The sample standard deviation of a set of $n$ observation, $$X_1, X_2, \cdots, X_n$$ denoted by $S$ is
\begin{aligned}
\sigma &=\sqrt{\frac{\sum_{i=1}^n (X_i-\overline{X})^2}{n}}; Population\, Standard\, Deviation\\
S&=\sqrt{ \frac{\sum_{i=1}^n (X_i-\overline{X})^2}{n-1}}; Sample\, Standard\, Deviation
\end{aligned}
The standard deviation can be computed from variance too as $S= \sqrt{Variance}$.

The real meaning of the standard deviation is that for a given data set 68% of the data values will lie within the range $\overline{X} \pm \sigma$ i.e. within one standard deviation from mean or simply within one $\sigma$. Similarly, 95% of the data values will lie within the range $\overline{X} \pm 2 \sigma$ and 99% within $\overline{X} \pm 3 \sigma$.

Examples of Standard Deviation and Variance

A large value of standard deviation indicates more spread in the data set which can be interpreted as the inconsistent behaviour of the data collected. It means that the data points tend to away from the mean value. For the case of smaller standard deviation, data points tend to be close (very close) to mean indicating the consistent behaviour of data set.
The standard deviation and variance both are used to measure the risk of a particular investment in finance. The mean of 15% and standard deviation of 2% indicates that it is expected to earn a 15% return on investment and we have 68% chance that the return will actually be between 13% and 17%. Similarly, there are 95% chance that the return on the investment will yield an 11% to 19% return.

Muhammad Imdad Ullah

Currently working as Assistant Professor of Statistics in Ghazi University, Dera Ghazi Khan. Completed my Ph.D. in Statistics from the Department of Statistics, Bahauddin Zakariya University, Multan, Pakistan. l like Applied Statistics, Mathematics, and Statistical Computing. Statistical and Mathematical software used is SAS, STATA, GRETL, EVIEWS, R, SPSS, VBA in MS-Excel. Like to use type-setting LaTeX for composing Articles, thesis, etc.

You may also like...

2 Responses

  1. Ajay GUpta says:

    Hi,
    The formula shown for Population Standard Deviation and Sample Standard Deviation has a typo error i.e. the sum of the deference’s of each data point from the mean is not squared.

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

x Logo: Shield Security
This Site Is Protected By
Shield Security