Elementary Statistics Quiz 20

This Statistics Test is about MCQs Basic Elementary Statistics Quiz with Answers. There are 20 multiple-choice questions from Basics of Statistics, measures of central tendency, measures of dispersion, Measures of Position, and Distribution of Data. Let us start with the MCQS Basic Elementary Statistics Quiz with Answers

Elementary Statistics Quiz Questions

1. What is the 25th percentile of the following data set; 1, 3, 3, 4, 5, 6, 6, 7, 8, 8

 
 
 
 

2. The interquartile range (IQR) is which of the following?

 
 
 
 

3. What is the general tendency of a set of data to change over time called?

 
 
 
 

4. The formula of mid-range is

 
 
 
 

5. Which dispersion is used to compare the variation of two series?

 
 
 
 

6. Which one of the following is not included in measures of central tendency?

 
 
 
 

7. Which of the following measures of central tendency will always change if a single value in the data changes? MCQs in Statistics

 
 
 
 

8. What is a branch of mathematics dealing with the collection, analysis, interpretation, and presentation of numerical or quantitative data? MCQs General Knowledge

 
 
 
 

9. For the data 2, 3, 7, 0, -8. The Geometric mean will be

 
 
 
 

10. Which of the following is a measure of variability?

 
 
 
 

11. Under which of the following conditions would the standard deviation assume a negative value?

 
 
 
 

12. Which of the following is written at the top of the table?

 
 
 
 

13. Which data sets have a mean of 10 and a standard deviation of 0?

 
 
 
 

14. If the variance of a dataset is correctly computed with the formula using ($n – 1$) in the denominator, which of the following options is true?

 
 
 
 

15. What is meta data?

 
 
 
 

16. When you are calculating the middle value of a data field in a data set, actually, what are you calculating?

 
 
 
 

17. Which of the following is an example of categorical data?

 
 
 
 

18. The median represents a value in the data set where:

 
 
 
 

19. What is one of the common measures of Central Tendency?

 
 
 
 

20. Which of the following is NOT a descriptive statistic?

 
 
 
 

Elementary Statistics Quiz with Answers

  • What is the 25th percentile of the following data set; 1, 3, 3, 4, 5, 6, 6, 7, 8, 8
  • Which of the following is a measure of variability?
  • Which of the following measures of central tendency will always change if a single value in the data changes?
  • Which data sets have a mean of 10 and a standard deviation of 0?
  • What is meta data?
  • Which of the following is an example of categorical data?
  • The median represents a value in the data set where:
  • If the variance of a dataset is correctly computed with the formula using ($n – 1$) in the denominator, which of the following options is true?
  • Which of the following is NOT a descriptive statistic?
  • What is one of the common measures of Central Tendency?
  • What is a branch of mathematics dealing with the collection, analysis, interpretation, and presentation of numerical or quantitative data?
  • When you are calculating the middle value of a data field in a data set, actually, what are you calculating?
  • What is the general tendency of a set of data to change over time called?
  • The interquartile range (IQR) is which of the following?
  • Which dispersion is used to compare the variation of two series?
  • Which of the following is written at the top of the table?
  • The formula of mid-range is
  • Which one of the following is not included in measures of central tendency?
  • For the data 2, 3, 7, 0, -8. The Geometric mean will be
  • Under which of the following conditions would the standard deviation assume a negative value?
Basic Elementary Statistics Quiz with Answers

MCQs in Statistics

MCQs General Knowledge

Testing Population Proportion

Testing population proportion is a hypothesis testing procedure used to assess whether or not a sample from a population represents the true proportion of the entire population. Testing a sample population proportion is a widely used statistical method with various applications across different fields.

Purpose of Testing Population Proportion (one-sample)

The main purpose of testing a sample population proportion is to make inferences about an entire population based on the sample information. Testing a sample population proportion helps to determine whether an observed sample proportion is significantly different from a hypothesized population proportion.

Common Uses of Testing Population Proportion

The following are some common uses of population proportion:

  • Marketing research: To determine if a certain proportion of customers prefer one product compared to another.
  • Quality control: In manufacturing, population proportion tests can be used to test/check if the proportion of defective items in a production batch exceeds an acceptable threshold.
  • Medical research: To test the efficacy of a new treatment by comparing the proportion of patients who recover using the new treatment versus a standard treatment.
  • Political polling: To estimate the proportion of voters supporting a particular candidate or policy.
  • Social sciences: To examine the prevalence of certain behaviors or attitudes in a population.

Applications Population Proportion in Various Fields

  • Business: Testing customer satisfaction rates, conversion rates in A/B testing for websites, or employee retention rates.
  • Public health: Estimating vaccination rates, disease prevalence, or the effectiveness of public health campaigns.
  • Education: Assessing the proportion of students meeting certain academic standards or the effectiveness of new teaching methods.
  • Psychology: Evaluating the proportion of individuals exhibiting certain behaviors or responses in experiments.
  • Environmental science: Measuring the proportion of samples that exceed pollution thresholds.

Types of Testing Population Proportion

There are two types of population proportion tests.

  1. One-sample z-test for proportion: One-sample proportion tests are used when comparing a sample proportion to a known or hypothesized population proportion.
  2. Two-sample z-test for proportions: Two-sample proportion tests are used when comparing proportions from two independent samples.

Assumptions and Considerations

The following are assumptions and considerations when testing population proportion:

  • The sample should be randomly selected and representative of the population.
  • The sample size (number of observations in the sample) should be large enough (typically $np$ and $n(1-p)$ should both be greater than 5, where $n$ is the sample size and $p$ is the proportion).
  • For two-sample tests, the samples should be independent of each other.
  • Interpretation: The results of these tests are typically interpreted using p-values or confidence intervals, allowing researchers to make statistical inferences about the population based on the sample data.

Data Frive Decisions from Proportion Tests

By using tests for population proportions, researchers and professionals can make data-driven decisions, validate hypotheses, and gain insights into population characteristics across a wide range of fields and applications.

Suppose, a random sample is drawn and the population proportion (say) $\hat{p}$ is measured and $n\hat{p}\ge 5$, $n\hat{q}\ge5$, the distribution of $\hat{p}$ is approximately normal with $\mu_{\hat{p}} =p$ and $\sigma_{\hat{p}}=\sqrt{\frac{pq}{n}}$. Also, suppose that one of the possible null hypotheses of the following form, when testing a claim about a population proportion is:

$H_o: p=p_o$
$H_o:p\ge p_o$
$H_o\le p_o$

For simplicity, we will assume the null hypothesis $H_o:p=p_o$. The standardized test statistics for a one-sample proportion test is

\begin{align*}
Z&=\frac{\hat{p} – \mu_{\hat{p}}}{\sigma_{\hat{p}}}\\
&=\frac{\hat{p} -p_o }{\sqrt{\frac{p_oq_o}{n}}}
\end{align*}

This random variable will have a standard normal distribution. Therefore, the standard normal distribution will be used to compute critical values, regions of rejection, and p-values, as we use it to test a mean using a large sample.

Testing Population Proportion

Example 1 (Defective Items): Testing Population Proportion

A computer chip manufacturer tests microprocessors coming off the production line. In one sample of 577 processors, 37 were found to be defective. The company wants to claim that the proportion of defective processors is only 4%. Can the company claim be rejected at the $\alpha = 0.01$ level of significance?

Solution:

The null and alternative hypotheses for testing the one-sample population proportion will be

$H_o:p=0.04$
$H_1:p\ne 0.04$

By focusing on the alternative hypothesis symbol ($\ne$), the test is two-tailed with $p_o=0.04$.

The $\hat{p} = \frac{37}{577} \approx 0.064$.

the standardized test statistics is

\begin{align*}
Z &= \frac{\hat{p} – p_o}{\sqrt{\frac{p_oq_o}{n}}}\\
&=\frac{0.064 – 0.04}{\sqrt{\frac{(0.04)(0.96)}{577}}}\\
&=\frac{0.024}{0.008}\approx 3.0
\end{align*}

Looking up $Z=3.00$ in the standard normal table (area under the standard normal curve), we get a value of 0.9987. Therefore, $P(Z\ge 3.00) = 1-0.9987) = 0.0013$.
Note that the test is two-tailed, the p-value will be twice this amount or $0.0026$.

Since the p-value ($0.0026$) is less than the level of significance ($0.01$), that is $0.0025 < 0.01$ (p-value < level of significance), we will reject the company’s claim. It means that the proportion of defective processors is not 4%, it is either less than 4% or more than 4%.

Example 2 (Opinion Poll): Testing Population Proportion

An opinion poll of 1010 randomly chosen/selected adults finds that only 47% approve of the president’s job performance. The president’s political advisors want to know if this is sufficient data to show that less than half of adults approve of the president’s job performance using a 5% level of significance.

Solution:

The null and alternative hypothesis of the problem above will be

$H_o:p\ge 0.50$
$H_1:p< 0.50$

By focusing on the alternative hypothesis symbol (<), the test is left-tailed with $p_o=0.50$.

The $\hat{p} = 0.47$. The standardized test statistics for one-sample population proportion will be

\begin{align*}
Z &= \frac{\hat{p} – p_o}{\sqrt{\frac{p_oq_o}{n}}}\\
&=\frac{0.47 – 0.50}{\sqrt{\frac{(0.5)(0.5)}{1010}}}\\
&=\frac{-0.03}{0.01573}\approx -1.91
\end{align*}

For a left-tailed test (for $\alpha = 0.05$), the $Z_o=-1.645$. Since $-1.91 < -1.645$, the null hypothesis should be rejected. So the data does support the claim that $p<0.50$ at the $\alpha=0.05$ level of significance.

Performing Data Analysis in R Language

Intermediate First Year Mathematics Quiz

Efficiency of an Estimator

Introduction to Efficiency of an Estimator

The efficiency of an estimator is a measure of how well it estimates a population parameter compared to other estimators. It is possible to have more than one unbiased estimator of a parameter. We should have at least one additional criterion for choosing among the unbiased estimator of the parameter. Usually, unbiased estimators are compared in terms of their variances. Thus, the comparison of variances of estimators is described as a comparison of the efficiency of estimators.

Use of Efficiency

The efficiency of an estimator is often used to evaluate an estimator through the following concepts:

  • Bias: An estimator is unbiased if its expected value equals the true parameter value ($E[\hat{\theta}]=\theta$). The efficiency of an estimator can be influenced by bias; thus, unbiased estimators are often preferred.
  • Variance: Efficiency is commonly assessed by the variance of the estimator. An estimator having a lower variance is considered more efficient. The Cramér-Rao lower bound provides a theoretically lower limit for the variance of unbiased estimators.
  • Mean Squared Error (MSE): Efficiency can also be measured using MSE, which combines both variance and bias. MSE is given by: MSE = $Var(\hat{\theta}) + Bias (\hat{\theta})^2$. An estimator with a lower MSE is more efficient.
  • Relative Efficiency: The relative efficiency compares the efficiency of two estimators, often expressed as the ratio of their variances: Relative Efficiency = $\frac{Var(\hat{\theta}_2)}{Var(\hat{\theta}_1)}, where $\hat{\theta}_1$ is the estimator being compared, and $\hat{\theta}_2$ is a competitor.
Efficiency of an estimator

The efficiency of an estimator is stated in relative terms. If say two estimators $\hat{\theta}_1$ and $\hat{\theta}_2$ are unbiased estimators of the same population parameter $\theta$ and the variance of $\hat{\theta}_1$ is less than the variance of $\hat{\theta}_2$ (that is, $Var(\hat{\theta}_1) < Var(\hat{\theta}_2)$ then $\hat{\theta}_1$ is relatively more efficient than $\hat{\theta}_2$. The ration is $E=\frac{Var(\hat{\theta}_2)}{var(\hat{\theta}_1)}$ is a measure of relative efficiency of $\hat{\theta}_1$ with respect to the $\hat{\theta}_2$. If $E>1$, $\hat{\theta}_1$ is said to be more efficient than $\hat{\theta}_2$.

If $\hat{\theta}$ is an unbiased estimator of $\theta$ and $Var(\hat{\theta})$ is minimum compared to any other unbiased estimator for $\theta$, then $\hat{\theta}$ is said to be a minimum variance unbiased estimator for $\theta$.

It is preferable to make efficient comparisons based on the MSE instead of its variance.

\begin{align*}
MSE(\hat{\theta}) & = E(\hat{\theta} – \theta)^2\\
&= E\left[(\hat{\theta} – E(\hat{\theta}) + E(\hat{\theta}) – \theta \right]\\
&= E\left[ \left(\hat{\theta} – E(\hat{\theta})\right) ^2 + \left(E(\hat{\theta})-\hat{\theta}\right)^2 + 2(\hat{\theta}-E(\hat{\theta}))(E(\hat{\theta}) -\theta)\right]\\
&= E[\hat{\theta} – E(\hat{\theta})]^2 + [E(\hat{\theta})-\theta]^2 \\
&= Var(\hat{\theta}) + (Bias)^2
\end{align*}

where $E[\hat{\theta}-E(\hat{\theta})] = E(\hat{\theta}) – E(\hat{\theta})=0$

Question about the Efficiency of an Estimator

Question: Let $X_1, X_2, \cdots, X_n$ be a random sample of size 3 from a population with mean $\mu$ and variance \sigma^2$. Consider the following estimators of mean $\mu$:

\begin{align*}
T_1 &= \frac{X_1+X_2+X_3}{2}\qquad Sample\,\, mean\\
T_2 &- \frac{X_1 + 2X_2 + X_3}{4} \qquad Weighted \,\, mean
\end{align*}

which estimator should be preferred?

Solution

First, we check the unbiasedness of $T_1$ and $T_2.

\begin{align*}
E(T_1) &= \frac{1}{3} E(X_1 + X_2 + X_3)=\mu\\
E(T_2) &= \frac{1}{4}E(X_1+2X_2 + X_4) = \mu
\end{align*}

Therefore, $T_1$ and $T_2$ are unbiased estimators of $\mu$.

For efficiency, let us check the variances of these estimators.

\begin{align*}
Var(T_1) &= Var\left(\frac{X_1 + X_2 + X_3}{3} \right)\\
&= \frac{1}{9} \left(Var(X_1) + Var(X_2) + Var(X_3)\right)\\
&= \frac{1}{9} (\sigma^2 + \sigma^2 + \sigma^2) = \frac{\sigma^2}{3}\\
Var(T_2) &= Var\left(\frac{X_1 + 2X_2 + X_3}{4}\right)\\
&= \frac{1}{16} \left(Var(X_1) + 4Var(X_2) + Var(X_3)\right)\\
&= \frac{1}{16}(\sigma^2 + 4\sigma^2 + \sigma^2) = \frac{3\sigma^2}{8}
\end{align*}

Since $\frac{1}{3} < \frac{3}{8}$, that is, $Var(T_1) < Var(T_2). The $T_1$ is better estimator of $\mu$ than $T_2$.

Reasons to Use Efficiency of an Estimator

  1. Optimal Use of Data: An efficient estimator makes the best possible use of the available data, providing more accurate estimates. This is particularly important in research, where the goal is often to make inferences or predictions based on sample data.
  2. Reducing Uncertainty: Efficiency reduces the variance of the estimators, leading to more precise estimates. This is essential in fields like medicine, economics, and engineering, where precise measurements can significantly impact decision-making and outcomes.
  3. Resource Allocation: In practical applications, using an efficient estimator can lead to savings in money, time, and resources. For example, if an estimator provides a more accurate estimate with less data, it can result in fewer resources needed for data collection.
  4. Comparative Evaluation: Comparisons between different estimators help researchers and practitioners choose the best method for their specific context. Understanding efficiency allows one to select estimators that yield reliable results.
  5. Statistical Power: Efficient estimators contribute to higher statistical power, which is the probability of correctly rejecting a false null hypothesis. This is particularly important in hypothesis testing and experimental design.
  6. Robustness: While efficiency relates mostly to variance and bias, efficient estimators are often more robust to violations of assumptions (e.g., normality) in some contexts, leading to more reliable conclusions.

In summary, the efficiency of an estimator is vital as it directly influences the accuracy, reliability, and practical utility of statistical analyses, ultimately affecting the quality of decision-making based on those analyses.

statistics help https://itfeature.com

MCQs Functions and Limits

Packages in R for Data Analysis