Sampling Distribution of Means

Suppose, we have a population of size $N$ having mean $\mu$ and variance $\sigma^2$. We draw all possible samples of size $n$ from this population with or without replacement. Then we compute the mean of each sample and denote it by $\overline{x}$. These means are classified into a frequency table which is called frequency distribution of means and the probability distribution of means is called the sampling distribution of means.

Sampling Distribution

A sampling distribution is defined as the probability distribution of the values of a sample statistic such as mean, standard deviation, proportions, or difference between means, etc., computed from all possible samples of size $n$ from a population. Some of the important sampling distributions are:

  • Sampling Distribution of Means
  • Sampling Distribution of the Difference Between Means
  • Sampling Distribution of the Proportions
  • Sampling Distribution of the Difference between Proportions
  • Sampling Distribution of Variances

Notations of Sampling Distribution of Means

The following notations are used for sampling distribution of means:

$\mu$: Population mean
$\sigma^2$: Population Variance
$\sigma$: Population Standard Deviation
$\mu_{\overline{X}}$: Mean of the Sampling Distribution of Means
$\sigma^2_{\overline{X}}$: Variance of Sampling Distribution of Means
$\sigma_{\overline{X}}$: Standard Deviation of the Sampling Distribution of Means

Formulas for Sampling Distribution of Means

The following following formulas for the computation of means, variance, and standard deviations can be used:

\begin{align*}
\mu_{\overline{X}} &= E(\overline{X}) = \Sigma (\overline{X}P(\overline{X})\\
\sigma^2_{\overline{X}} &= E(\overline{X}^2) – [E(\overline{X})]^2\\
\text{where}\\
E(\overline{X}^2) &= \Sigma \overline{X}^2P(\overline{X})\\
\sigma_{\overline{X}} &= \sqrt{E(\overline{X}^2) – [E(\overline{X})]^2}
\end{align*}

Numerical Example: Sampling Distribution of Means

A population of $(N=5)$ has values 2, 4, 6, 8, and 10. Draw all possible samples of size 2 from this population with and without replacement. Construct the sampling distribution of sample means. Find the mean, variance, and standard deviation of the population and verify the following:

Sr. No.Sampling with ReplacementSampling without Replacement
1)$\mu_{\overline{X}} = \mu$$\mu_{\overline{X}} = \mu$
2)$\sigma^2_{\overline{X}}=\frac{\sigma^2}{n}$$\sigma^2_{\overline{X}}=\frac{\sigma^2}{n}\frac{N-n}{N-1}$
3)$\sigma_{\overline{X}} = \frac{\sigma}{\sqrt{n}}$$\sigma_{\overline{X}} = \frac{\sigma}{\sqrt{n}} \sqrt{\frac{N-n}{N-1}}$

Solution

The solution to the above example is as follows:

Sampling with Replacement (Mean, Variance, and Standard Deviation)

The number of possible samples is: $N^n = 5^2 = 25.

Samples$\overline{X}$Samples$\overline{X}$Samples$\overline{X}$
2, 224, 1078, 88
2, 436, 248, 109
2, 646, 4510, 26
2, 856, 6610, 47
2, 1066, 8710, 68
4, 236, 10810, 89
4, 448, 2510, 1010
4, 658, 46
4, 868, 67

The sampling distribution of sample means will be

$\overline{X}$Freq$P(\overline{X}$$\overline{X}P(\overline{X})$$\overline{X}^2$$\overline{X}^2P(\overline{X}$
211/252/2544/25
322/256/25918/25
433/2512/251648/25
544/2520/2525100/25
655/2530/2536180/25
744/2528/2549196/25
833/2524/2564192/25
922/2518/2581162/25
10112510/25100100/25
Total25/25=1150/25 = 61000/25=40

\begin{align*}
\mu_{\overline{X}} &= E(\overline{X}) = \Sigma \left[\overline{X}P(\overline{X})\right] = \frac{150}{25}=6\\
\sigma^2_{\overline{X}} &= E(\overline{X}^2) – [E(\overline{X}]^2=\Sigma [\overline{X}^2P(\overline{X})] – [\Sigma [\overline{X}P(\overline{X})]]^2\\
&= 40 – 6^2 = 4\\
\sigma_{\overline{X}} &= \sqrt{4}=2
\end{align*}

Mean, Variance, and Standard Deviation for Population

The following are computations for population values.

$X$24681030
$X^2$4163664100220

\begin{align*}
\mu &= \frac{\Sigma}{N} = \frac{30}{5} = 6\\
\sigma^2 &= \frac{\Sigma X^2}{N} – \left(\frac{\Sigma X}{n} \right)^2\\
&=\frac{220}{5} – (6)^2 = 8\\
\sigma&= \sqrt{8} = 2.82
\end{align*}

Verifications:

  1. Mean: $\mu_{\overline{X}} = \mu \Rightarrow 6=6$
  2. Variance: $\sigma^2_{\overline{X}} = \frac{\sigma^2}{n} \Rightarrow 4=\frac{8}{2}$
  3. Standard Deviation: $\sigma_{\overline{X}}=\frac{\sigma}{\sqrt{n}} \Rightarrow 2=\frac{2.82}{\sqrt{2}}=2$

Sampling without Replacement

The possible samples for sampling without replacement are: $\binom{5}{2}=10$

Samples$\overline{x}$Samples$\overline{x}$
2, 434, 86
2, 644, 107
2, 856, 87
2, 1066, 108
4, 648, 109

The sampling distribution sample means for sampling without replacement is

$\overline{x}$Freq$P(\overline{x})$$\overline{x}P(\overline{x})$$\overline{x}^2$$\overline{x}^2P(\overline{x})$
311/103/1099/10
411/104/101616/10
522/1010/102550/10
622/1012/103672/10
722/1014/104998/10
811/108/106464/10
911/209/108181/10
Total10/10=160/10=6390/10 = 39

\begin{align*}
\mu_{\overline{X}} &= E(\overline{X}) = \Sigma \left[\overline{X}P(\overline{X})\right] = \frac{60}{10}=6\\
\sigma^2_{\overline{X}} &= E(\overline{X}^2) – [E(\overline{X}]^2=\Sigma [\overline{X}^2P(\overline{X})] – [\Sigma [\overline{X}P(\overline{X})]]^2\\
&= 39 – 6^2 = 3\\
\sigma_{\overline{X}} &= \sqrt{3}=1.73
\end{align*}

Verifications:

  1. Mean: $\mu_{\overline{X}} = \mu \Rightarrow 6=6$
  2. Variance: $\sigma^2_{\overline{X}} = \frac{\sigma^2}{n}\cdot \left(\frac{N-n}{N-1}\right) \Rightarrow 3=\frac{8}{2}\cdot\left(\frac{5-2}{5-1}\right)=3$
  3. Standard Deviation: $\sigma_{\overline{X}}=\frac{\sigma}{\sqrt{n}} \Rightarrow 1.73=\sqrt{3}$

Why is Sampling Distribution Important?

  • Inference: Sampling distribution of means allows users to make inferences about the population mean based on sample data.
  • Hypothesis Testing: It is crucial for hypothesis testing, where the researcher compares sample statistics to population parameters.
  • Confidence Intervals: It helps construct confidence intervals, which provide a range of values likely to contain the population mean.
Sampling Distribution of Means

Note that the sampling distribution of means provides a framework for understanding how sample means vary from sample to sample and how they relate to the population mean. This understanding is fundamental to statistical inference and decision-making.

R and Data Analysis, Online Quiz Website

Poisson Probability Distribution

The Poisson Probability Distribution is discrete and deals with events that can only take on specific, whole number values (like the number of cars passing a certain point in an hour). Poisson Probability Distribution models the probability of a given number of events occurring in a fixed interval of time or space, given a known average rate of occurrence ($\mu$). The events must be independent of each other and occur randomly.

The Poisson probability function gives the probability for the number of events that occur in a given interval (often a period of time) assuming that events occur at a constant rate during the interval.

Poisson Random Variable

The Poisson random variable satisfies the following conditions:

  • The number of successes in two disjoint time intervals is independent
  • The probability of success during a small time interval is proportional to the entire length of the time interval.
  • The probability of two or more events occurring in a very short interval is negligible.

Apart from disjoint time intervals, the Poisson random variable is also applied to disjoint regions of space.

Applications of Poisson Probability Distribution

The following are a few of the applications of Poisson Probability Distribution:

  • The number of deaths by horse kicking in the Prussian Army (it was the first application).
  • Birth defects and genetic mutations.
  • Rare diseases (like Leukemia, but not AIDS because it is infectious and so not independent), especially in legal cases.
  • Car accidents
  • Traffic flow and ideal gap distance
  • Hairs found in McDonald’s hamburgers
  • Spread of an endangered animal in Africa
  • Failure of a machine in one month

The formula of Poisson Distribution

The probability distribution of a Poisson random variable $X$ representing the number of successes occurring in a given time interval or specified region of space is given by

\begin{align*}
P(X=x)&=\frac{e^{-\mu}\mu^x}{x!}\,\,\quad x=0,1,2,\cdots
\end{align*}

where $P(X=x)$ is the probability of $x$ events occurring, $e$ is the base of the natural logarithm (~2.71828), $\mu$ is the mean number of successes in the given time interval (or region of space), $x$ is the number of events we are interested in, and $x!$ is the factorial of $x$.

Poisson Probability Distribution

Mean and Variance of Poisson Distribution

If $\mu$ is the average number of successes occurring in a given time interval (or region) in the Poisson distribution, then the mean and the variance of the Poisson distribution are both equal to $\mu$. That is,

\begin{align*}
E(X) &= \mu\\
V(X) &= \sigma^2 =\mu
\end{align*}

A Poisson distribution has only one parameter, $\mu$ is needed to determine the probability of an event. For binomial experiments involving rare events (small $p$) and large values of $n$, the distribution of $X=$ the number of success out of $n$ trials is binomial, but it is also well approximated by the Poisson distribution with mean $\mu=np$.

When to Use Poisson Probability Distribution

The Poisson distribution is useful in various scenarios:

  • Modeling Rare Events: Like accidents, natural disasters, or equipment failures.
  • Counting Events in a Fixed Interval: Such as the number of customers arriving at a store in an hour, or the number of calls to a call center in a minute.
  • Approximating the Binomial Distribution: When the number of trials ($n$) is large and the probability of success ($p$) is small.

It is important to note that

  • The Poisson distribution is related to the exponential distribution, which models the time between events.
  • It is a fundamental tool in probability theory and statistics, with applications in fields like operations research, queuing theory, and reliability engineering.

R and Data Analysis, Test Preparation MCQs

Frequently Asked Questions about Poisson Distribution

  1. What is the Poisson Random Variable?
  2. What is Poisson Probability Distribution?
  3. Write the Formula of Poisson Probability Distribution.
  4. Poisson distribution is related to what distribution?
  5. Give some important applications of Poisson Distribution.
  6. Describe the general situations in which Poisson distribution can be used.
  7. Name the distribution that has equal mean and variance.
  8. What are the required conditions for poison random variables?

Online MCQs Basic Statistics 17

The post is about Online MCQs Basic Statistics with Answers. 20 Multiple-Choice questions covers the topics related to Tables, Frequency Distribution, Measures of Central Tendency, Measure of Dispersion, Coefficient of Variation, Skewness and Kurtosis, etc. Let us start with the Online MCQs Basic Statistics.

Online Multiple Choice Questions about Basic Statistics

1. The portion of the table containing row captions is called

 
 
 
 

2. For moderately skewed distribution, the empirical formula holds

 
 
 
 

3. If $b_2=3$ then the distribution is

 
 
 
 

4. If the third moment about mean is zero, the distribution is

 
 
 
 

5. The measures of dispersion remains unchanged by the change of

 
 
 
 

6. If $b_2 > 3$ the distribution is

 
 
 
 

7. The average is also called measures of

 
 
 
 

8. The first moment about mean is

 
 
 
 

9. In Skewed distribution, approximately 95% of cases are falling between

 
 
 
 

10. The standard deviation is independent of change of

 
 
 
 

11. The difference between the upper and lower class boundary is called

 
 
 
 

12. The second moment about mean is

 
 
 
 

13. The portion of the table containing column caption is called

 
 
 
 

14. Class Mark is also called

 
 
 
 

15. Title of a table should be in

 
 
 
 

16. The sum of absolute deviations is minimum if these deviations are taken from

 
 
 
 

17. The measures of dispersion are changed by the change of

 
 
 
 

18. The frequency of the class divided by the total frequency is called

 
 
 
 

19. If $x=40$ and $S^2=64$ then the coefficient of variation is

 
 
 
 

20. If $\overline{x} = 8$ which of the following is minimum?

 
 
 
 

Online MCQs Basic Statistics with Answers

  • If the third moment about mean is zero, the distribution is
  • The first moment about mean is
  • If $b_2=3$ then the distribution is
  • For moderately skewed distribution, the empirical formula holds
  • If $b_2 > 3$ the distribution is
  • The portion of the table containing row captions is called
  • The portion of the table containing column caption is called
  • Title of a table should be in
  • The difference between the upper and lower class boundary is called
  • Class Mark is also called
  • The frequency of the class divided by the total frequency is called
  • The sum of absolute deviations is minimum if these deviations are taken from
  • The measures of dispersion remains unchanged by the change of
  • The measures of dispersion are changed by the change of
  • The standard deviation is independent of change of
  • In Skewed distribution, approximately 95% of cases are falling between
  • If $\overline{x} = 8$ which of the following is minimum?
  • If $x=40$ and $S^2=64$ then the coefficient of variation is
  • The second moment about mean is
  • The average is also called measures of
Online MCQs Basic Statistics Quiz with Answers

Frequently Asked Questions about R Language
Test Preparation MCQs