Types of Hypothesis Tests in Statistics

Introduction to Types of Hypothesis Tests

In statistics, hypothesis tests are methods used to make inferences or draw conclusions about a population based on sample data. In this pose, we will discuss the Basic Types of Hypothesis Tests in Statistics. There are three basic types of hypothesis tests, namely (i) Left-Tailed Test, (ii) Right-Tailed Test, and (iii) Two-Tailed Test.

Note that I am not talking about Statistical tools used under specific conditions related to the data type and distribution. I am talking about the nature of the hypotheses being tested. Therefore, I will focus in this post on the area under the curve in the tails. In hypothesis testing, the distribution of the test’s rejection region can be characterized as either one-tailed or two-tailed. The one-tailed tests include both left- and right-tailed tests.

Hypothesis-Testing-Tails-Critical-Region

Left-Tailed Test

The left-tailed tests are used when the null hypothesis is being tested in a claim that the population parameter at least ($\ge$) a given value. Note that the alternative hypothesis then claims that the parameter is less than (<) the value. For example,

A tire manufacturer claims that their tires last on average more than 35000 miles. If one thinks that the claim is false, then one would write the claim as $H_o$, remembering to include the condition of equality. The hypothesis for this test would be: 
$$H_o:\mu\ge 35000$$
$$H_1: \mu<35000$$

One would hope that the sample data would allow the rejection of the null hypothesis, refuting the company’s claim.

The $H_o$ will be rejected in the case above if the sample mean is statistically significantly less than 35000. That is, if the sample mean is in the left-tail of the distribution of all sample means.

Right Tailed Test

The right-tailed test is used when the null hypothesis ($H_0$) being tested is a claim that the population parameter is at most ($\le$) a given value. Note that the alternative hypothesis ($H_1$) then claims that the parameter is greater than (>) the value.

Suppose, you worked for the tire company and wanted to gather evidence to support their claim then you will make the company's claim $H_1$ and remember that equality will not be included in the claim (H_o$). The hypothesis test will be

$$H_0:\mu \le 35000$$
$$H_1:\mu > 35000$$

If the sample data was able to support the rejection of $H_o$ this would be strong evidence to support the claim $H_1$ which is what the company believes to be true.

One should reject $H_o$ in this case if the sample mean was significantly more than 35000. That is, if the sample mean is in the right-tailed of the distribution of all sample means.

Two-Tailed Test

The two-tailed test is used when the null hypothesis ($H_o$ begins tested as a claim that the population parameter is equal to (=) a given value. Note that the alternative hypothesis ($H_1$) then claims that the parameter is not equal to ($\ne$) the value. For example, the Census Bureau claims that the percentage of Punjab’s area residents with a bachelor’s degree or higher is 24.4%. One may write the null and alternative hypotheses for this claim as:

$$H_o: P = 0.244$$
$$H_1: P \ne 0.244$$

In this case, one may reject $H_o$ if the sample percentage is either significantly more than 24.4% or significantly less than 24.4%. That is if the sample proportion was in either tail (both tails) of the distribution of all sample proportions.

Key Differences

  • Directionality: One-tailed tests look for evidence of an effect in one specific direction, while two-tailed tests consider effects in both directions.
  • Rejection Regions: In a one-tailed test, all of the rejection regions are in one tail of the distribution; in a two-tailed test, the rejection region is split between both tails.
Statistics and Data Analysis Types of Hypothesis Tests in Statistics

Online Quiz and Test Website

R Programming Language and Frequently Asked Questions

MCQs on Statistical Inference 9

The quiz is about MCQs on Statistical Inference with Answers. The quiz contains 20 questions about hypothesis testing and p-values. It covers the topics of formulation of the null and alternative hypotheses, level of significance, test statistics, region of rejection, and decision about acceptance and rejection of the hypothesis. Let us start with the Quiz MCQs on Statistical Inference.

Online MCQs on Statistical Inference with Answers

1. You perform two studies to test a potentially life-saving drug. Both studies have 80% power. What is the chance of two type 2 errors (of false negatives) in a row?

 
 
 
 

2. Suppose a research article indicates a value of $p = 0.001$ in the results section ($\alpha = 0.05$).

You have found the probability of the null hypothesis being true ($p = .001$).

 
 
 

3. When the null hypothesis is true, the probability of finding a specific p-value is ————-.

 
 
 
 

4. Suppose a research article indicates a $p = 0.001$ value in the results section ($\alpha = 0.05$).

Obtaining a statistically significant result implies that the effect detected is important.

 
 
 

5. Person A is very skeptical about homeopathy. Person B believes strongly in homeopathy. They both read a study about homeopathy, which reports a positive effect and $p < 0.05$. Person A would be more likely than Person B to conclude that ———-, and Person B would be more likely than Person A to think that ————-.

 
 
 
 

6. When $H_0$ is true, the probability that at least 1 out of a $X$ completely independent findings is a Type 1 error is equal to ————, this probability ———— when you look at your data and collect more data if a test is not significant.

 
 
 
 

7. Suppose a research article indicates a $p = 0.30$ value in the results section ($\alpha = 0.05$).

You have found the probability of the null hypothesis being true ($p = 0.30$).

 
 
 
 

8. When the difference between means is 5, and the standard deviation is 4, Cohen’s d is ————— which is ————— according to the benchmarks proposed by Cohen.

 
 
 
 

9. Suppose a research article indicates a $p = 0.001$ value in the results section ($\alpha = 0.05$).

The probability that the results of the given study are replicable is not equal to $1-p$.

 
 
 

10. Suppose a research article indicates a $p = 0.30$ value in the results section ($\alpha = 0.05$).

You have proven the null hypothesis (that is, you have proven that there is no difference between the population means).

 
 
 

11. Suppose a research article indicates a value of $p = 0.30$ in the results section ($\alpha = 0.05$).

Obtaining a statistically non-significant result implies that the effect detected is unimportant.

 
 
 

12. Study A and B are completely identical, except that all tests reported in Study A were pre-registered at a publicly available location (and the reported tests match the pre-registered tests), but all tests in Study B are not pre-registered. Both contain analyses with covariates. Based on research on flexibility in the data analysis, we can expect that on average study A will have ————; the covariate analyses are ————-.

 
 
 
 

13. Suppose a research article indicates a $p = 0.30$ value in the results section ($\alpha = 0.05$).

The p-value gives the probability of obtaining a significant result whenever a given experiment is replicated.

 
 
 

14. Suppose a research article indicates a $p = 0.001$ value in the results section ($\alpha = 0.05$).

You have absolutely proven your alternative hypothesis (that is, you have proven that there is a difference between the population means).

 
 
 

15. After finding a single statistically significant p-value we can conclude that ————-, but it would be incorrect to conclude that ————.

 
 
 
 

16. It is important to have access to all (and not just statistically significant) research findings to be able to ————. A consequence of publication bias is that ———–.

 
 
 
 

17. Suppose that a research article indicates a value of $p = 0.001$ in the results section ($\alpha = 0.05$).

The null hypothesis has been shown to be false.

 
 
 

18. A Type-I error is ————–, and the Type-I error rate is determined by ————–.

 
 
 
 

19. Suppose a research article indicates a $p = 0.30$ value in the results section ($\alpha = 0.05$).

The alternative hypothesis has been shown to be false.

 
 
 

20. Suppose a research article indicates a value of $p = 0.001$ in the results section ($\alpha = 0.05$).

The p-value of a statistical test is the probability of the observed result or a more extreme result, assuming the null hypothesis is true.

 
 
 

MCQs on Statistical Inference with Answers

  • A Type-I error is ————–, and the Type-I error rate is determined by ————–.
  • Suppose a research article indicates a $p = 0.30$ value in the results section ($\alpha = 0.05$). You have found the probability of the null hypothesis being true ($p = 0.30$).
  • Suppose a research article indicates a $p = 0.30$ value in the results section ($\alpha = 0.05$). You have proven the null hypothesis (that is, you have proven that there is no difference between the population means).
  • Suppose that a research article indicates a value of $p = 0.001$ in the results section ($\alpha = 0.05$). The null hypothesis has been shown to be false.
  • Suppose a research article indicates a $p = 0.30$ value in the results section ($\alpha = 0.05$). The p-value gives the probability of obtaining a significant result whenever a given experiment is replicated.
  • Suppose a research article indicates a value of $p = 0.30$ in the results section ($\alpha = 0.05$). Obtaining a statistically non-significant result implies that the effect detected is unimportant.
  • Suppose a research article indicates a value of $p = 0.001$ in the results section ($\alpha = 0.05$). The p-value of a statistical test is the probability of the observed result or a more extreme result, assuming the null hypothesis is true.
  • Suppose a research article indicates a $p = 0.001$ value in the results section ($\alpha = 0.05$). Obtaining a statistically significant result implies that the effect detected is important.
  • Suppose a research article indicates a $p = 0.001$ value in the results section ($\alpha = 0.05$). You have absolutely proven your alternative hypothesis (that is, you have proven that there is a difference between the population means).
  • Suppose a research article indicates a value of $p = 0.001$ in the results section ($\alpha = 0.05$). You have found the probability of the null hypothesis being true ($p = .001$).
  • Suppose a research article indicates a $p = 0.001$ value in the results section ($\alpha = 0.05$). The probability that the results of the given study are replicable is not equal to $1-p$.
  • Person A is very skeptical about homeopathy. Person B believes strongly in homeopathy. They both read a study about homeopathy, which reports a positive effect and $p < 0.05$. Person A would be more likely than Person B to conclude that ———-, and Person B would be more likely than Person A to think that ————-.
  • You perform two studies to test a potentially life-saving drug. Both studies have 80% power. What is the chance of two type 2 errors (of false negatives) in a row?
  • Study A and B are completely identical, except that all tests reported in Study A were pre-registered at a publicly available location (and the reported tests match the pre-registered tests), but all tests in Study B are not pre-registered. Both contain analyses with covariates. Based on research on flexibility in the data analysis, we can expect that on average study A will have ————; the covariate analyses are ————-.
  • When the null hypothesis is true, the probability of finding a specific p-value is ————-.
  • After finding a single statistically significant p-value we can conclude that ————-, but it would be incorrect to conclude that ————.
  • When $H_0$ is true, the probability that at least 1 out of a $X$ completely independent findings is a Type 1 error is equal to ————, this probability ———— when you look at your data and collect more data if a test is not significant.
  • It is important to have access to all (and not just statistically significant) research findings to be able to ————. A consequence of publication bias is that ———–.
  • When the difference between means is 5, and the standard deviation is 4, Cohen’s d is ————— which is ————— according to the benchmarks proposed by Cohen.
  • Suppose a research article indicates a $p = 0.30$ value in the results section ($\alpha = 0.05$). The alternative hypothesis has been shown to be false.
MCQs on Statistical Inference

Online MCQs and Test Website, gmstat.com

R Programming Language and Statistics

One Sample Hypothesis Test (t-test)

Introduction: One Sample Hypothesis Test

In this post, I will discuss One Sample Hypothesis Test (One Sample t-test). When testing a claim about the mean using sample data with a small number of observations (i.e., sample size), the appropriate t-distribution instead of the standard normal distribution should be used to determine the standardized test statistic, critical values, rejection region, and p-values.

Recall that if the sample of values drawn follows the normal distribution, the sample size (number of observations in the sample) is less than 30, and the population standard deviation is unknown, then the random variable

$$t=\frac{\overline{x} – \mu}{\frac{s}{\sqrt{n}}}$$

has the Student’s t-distribution with $n-1$ degrees of freedom.

The procedure of locating the rejection regions for a t-distribution hypotheses test is the same as for the normal distribution tests, however, the critical values will differ. To find the critical value(s) $t_0$ for a test, determine if the test is one-tailed or two-tailed and the significance level ($\alpha$). The critical values can be found in the t-distribution table by looking up the entry in the column giving the level of significance and the row showing the degrees of freedom.

Note that:

  • For a right-tailed test, $t_0$ is the positive value in the table
  • For a left-tailed test, $t_0$ is the negative of the value in the table
  • For a two-tailed test, there are two critical values $t_0$ both the value and its opposite.
One Sample Hypothesis Testing-Tails-Critical-Region

Assumptions of the One Sample Hypothesis Test (t-test)

  • Independence: Observations in the sample should be independent of each other.
  • Normality: The population from which the sample is drawn should be normally distributed. However, the t-test is relatively robust to violations of normality, especially for larger sample sizes.
  • Random Sampling: The sample should be a random sample from the population.

One Sample Hypothesis Test for Mean

Example 1: SAT Math scores are normally distributed. A sample of SAT Math scores for 16 students has an average score of 522.8 with a sample standard deviation of 154.5. Suppose, one wishes to support the claim that the average SAT Math score exceeds 500 using a level of significance of 0.05.

Solution

Step 1: The null and alternative hypotheses test in this case are

$H_0: \mu \le 500$ vs $H_1: \mu > 500$

From the alternative hypothesis, the test is right-hand-tailed with $\mu_0=500$.

Step 2: Level of Significance is 5% = 0.05

Step 3: Critical Value

Using the t-distribution table with one Tail, 5% level of significance, and $n-1=16-1=15$ degrees of freedom, the critical value is $t_0=1.753$. The rejection region is thus $t\ge 1.7535$.

Step 4: Test Statistics

The standardized test statistic is

\begin{align*}
t&=\frac{\overline{X} – \mu_0 }{\frac{s}{\sqrt{n}}}\\
&= \frac{522.8 – 500}{\frac{154.5}{\sqrt{16}}}\\
&= \frac{22.8}{38.625} = 0.59
\end{align*}

Step 5: Interpretation of One Sample Mean Test

Since the standardized test statistic is not in the region of rejection, therefore, one should not reject $H_0$ and so the sample data is not sufficient to support the claim that the average exceeds 500 at the 0.05 level of significance.

Example 2: A biologist measures the weights of anesthetized female grizzly bears during winter. A sample of 14 bears is found to have an average weight of $\overline{X} = 376.6$lbs. with a sample standard deviation of $s=32.5$lbs. Is there sufficient evidence to support the claim that the average weight of all female bears in the area is less than 400 lbs? Use $\alpha=0.01$ level of significance.

Solution:

Step 1: The null and alternative hypotheses in this case

$H_0:\mu \ge 400$ vs $H_1:\mu < 400$

From the alternative hypothesis, the test is left-hand-tailed with $\mu_0=400$.

Step 2: Level of Significance is 1% = 0.01

Step 3: Critical Value

Using the Student’s t-distribution table with one tail, the level of significance $\alpha = 0.01$, and $n-1=14-1=13$ degrees of freedom, the critical value $t_0=-2.650$. The region of rejection is thus $t\le -2.650$.

Step 4: The Test Statistics is

\begin{align*}
t&=\frac{\overline{X} – \mu_0 }{\frac{s}{\sqrt{n}}}\\
&= \frac{376.6 – 400}{\frac{32.5}{\sqrt{14}}}\\
&= \frac{-23.4}{8.686} = -2.694
\end{align*}

Since the standardized test statistic is in the rejection region, one should reject the null hypothesis ($H_0$), which supports the claim that the average bear weight is less than 400 lbs.

Statistics Help, One sample Hypothesis Test

https://rfaqs.com, https://gmstat.com