Properties of Measure of Central Tendency

Understanding the Properties of Measure of Central Tendency helps in selecting the appropriate measure for accurate data interpretation. This blog post explores the key properties of measures of central tendency: mean, median, and mode, along with their advantages and limitations.

Introduction: Properties of Measure of Central Tendency

In statistics, measures of central tendency are crucial for summarizing and interpreting data. Measures of central tendency provide a single value that represents the center or typical value of a dataset. The three most common measures of central tendency are the mean, median, and mode. Each central tendency has unique properties that make it suitable for different types of data and analytical purposes.

Mean (Arithmetic Average)

The mean (the most widely used measure of central tendency) is the sum of all values in a dataset divided by the number of values $\left(\frac{\sum\limits_{i=1}^n X_i}{n}\right)$.

Properties of Mean

  • Sensitive to All Data Points
    The mean considers every value in the dataset, making it highly responsive to changes. A single extreme value (outlier) can significantly affect the mean.
  • Algebraic Manipulability
    The mean is used in further mathematical operations (measures of dispersion, e.g., calculating variance, standard deviation). The sum of deviations from the mean ($x-\overline{x}$) is always zero:
    $$\sum\limits_{i=1}^n (X_i – \overline{X}) =0$$
  • Applicable to Interval and Ratio Data
    The mean is suitable for continuous numerical data (for example, height, weight, and income). It is not appropriate for nominal or ordinal data.
  • Affected by Skewness
    In skewed distributions, the mean is pulled toward the tail, making it less representative of central tendency.

Advantages of the Mean

  • Mean uses all data points, providing a comprehensive measure.
  • It is useful in statistical inferences and parametric tests.

Limitations of the Mean

  • Distorted by outliers.
  • Mean should not be used for highly skewed data.
properties of measures of central tendency

Median (Middle Value)

The median is the middle value (the most central data value) in an ordered dataset/array. If the dataset has an even number of observations, the median is the average of the two central values.

Properties of Median

  • Resistant to Outliers
    Unlike the mean, the median is not influenced/affected by extreme values (outliers). It is because the median only depends on the middle value(s) in the ordered dataset. It is also applicable to Ordinal, Interval, and Ratio Data. On the other hand, median works well for ranked (ordinal) and continuous numerical data. However, the median is not suitable for nominal data (categories without order).
  • Unaffected by Skewness
    The median remains stable in skewed distributions, making it a better measure than the mean in such cases.
  • Not Algebraically Manipulable
    Unlike the mean, the median cannot be used in further mathematical computations (for example, standard deviation).

Advantages of the Median

  • Median is robust against outliers.
  • Median better represents the central tendency in skewed distributions.

Limitations of the Median

  • Median does not consider all data points.
  • It is less efficient than the mean for normally distributed data.

Mode (Most Frequent Value)

The mode is the value that appears most frequently in a dataset. A dataset can have one mode (unimodal), two modes (bimodal), or multiple modes (multimodal). It is the only measure of central tendency that can have more than one value.

Properties of Mode

Mode applies to All Data Types (that is, it works with nominal, ordinal, interval, and ratio data). However, it is the only measure of central tendency suitable for categorical data (e.g., colors, brands).

  • Unaffected by Outliers
    Since the mode depends on frequency, extreme values do not impact the mode.
  • Not Necessarily Unique
    Some datasets have no mode (if all values are unique or no value repeats in the dataset,) or data may have multiple modes.
  • Not Useful for Small Datasets
    In small samples, the mode may not accurately represent central tendency.

Advantages of the Mode

  • Mode is useful for categorical data.
  • Mode helps identify peaks in frequency distributions.

Limitations of the Mode

  • May not exist in some datasets.
  • Less informative for continuous numerical data with no repeated values.

Comparison of Mean, Median, and Mode

PropertyMeanMedianMode
Sensitive to OutliersYesNoNo
Works with Skewed DataNoYesSometimes
Applicable to Nominal DataNoNoYes
Mathematical UsabilityHighLowLow
Best for Symmetric DataYesYesSometimes

Choosing the Right Measures of Central Tendency

The choice between mean, median, and mode depends on:

  • Data Type
    • Use the mean for normally distributed numerical data, that is, data points are homogeneous.
    • Use the median for ordinal or skewed numerical data, that is, data points are heterogeneous.
    • Use mode for categorical data, or when data points repeat.
  • Presence of Outliers
    • If outliers are present, the median is preferred.
    • If data is clean and normally distributed, the mean is ideal.
  • Purpose of Analysis
    • For statistical computations (e.g., regression), the mean is necessary.
    • For descriptive summaries (e.g., income distribution), the median is better.

Summary: Properties of Measures of Central Tendency

Measures of central tendency: mean, median, and mode, each has unique properties that determine their suitability for different datasets. The mean is precise but affected by outliers, the median is robust against skewness, and the mode is versatile for categorical data. Understanding these properties ensures accurate data interpretation and informed decision-making in statistical analysis.

By selecting the appropriate measure based on data characteristics, analysts can derive meaningful insights and avoid misleading conclusions. Whether summarizing exam scores, income levels, or survey responses, the right measure of central tendency provides clarity in a world of data.

General Knowledge Quiz

Basic Statistics MCQs Test 25

Test your knowledge of fundamental statistics concepts with this 20-question multiple-choice quiz! This Basic Statistics MCQs Test is perfect for students, statisticians, data analysts, and data scientists. This Basic Statistics MCQs Test Quiz covers key topics like:

Online Basic Statistics MCQs Test with Answers
  • Measures of central tendency (mean, median, mode)
  • Measures of dispersion (range, variance, standard deviation)
  • Frequency distributions (class width, relative & cumulative frequency)
  • Data summarization (five-number summary, quartiles)
  • Statistical inference (sample vs. population, descriptive vs. inferential stats)

Sharpen your skills for exams, job interviews, and competitive tests with these practical Basic Statistics MCQs Test. Whether you’re preparing for university tests, certifications, or data-related job roles, Basic Statistics MCQs Test Quiz helps reinforce core statistical concepts. Let us start with the Online Basic Statistics MCQs Test now.

Online Basic Statistics Quiz with Answers

1. The following data shows the number of hours worked by 200 statistics students:
frequency distribution mcqs


The class width for this distribution is

 
 
 
 

2. A researcher has collected the following sample data

5  12  6  8  5  6  7  5  12  4

The mean is

 
 
 
 

3. If a dataset has an even number of observations, the median

 
 
 
 

4. A researcher has collected the following sample data

5  12  6  8  5  6  7  5  12  4

The mode is

 
 
 
 

5. The difference between the largest and the smallest data values is the

 
 
 
 

6. In a five-number summary, which of the following is not used for data summarization?

 
 
 
 

7. $\mu$ is an example of

 
 
 
 

8. The value that has half of the observations above it and half the observations below it is called the

 
 
 
 

9. The standard deviation of a sample of 100 observations is 64. The variance of the sample equals

 
 
 
 

10. The following data shows the number of hours worked by 200 statistics students:
frequency distribution mcqs


The relative frequency of students working 9 hours or less

 
 
 
 

11. The following data shows the number of hours worked by 200 statistics students:
frequency distribution mcqs


The number of students working 19 hours or less

 
 
 
 

12. A tabular summary of a set of data showing the fraction of the total number of items in several classes is a

 
 
 
 

13. A statistics professor asked students in a class their ages. On the basis of this information, the professor states that the average age of all the students in the university is 21 years. This is an example of

 
 
 
 

14. A numerical value used as a summary measure for a sample, such as sample mean, is known as a

 
 
 
 

15. A researcher has collected the following sample data

5  12  6  8  5  6  7  5  12  4

The median is

 
 
 
 

16. The sum of deviations of the individual data elements from their mean is

 
 
 
 

17. The following data shows the number of hours worked by 200 statistics students:
frequency distribution mcqs


The cumulative relative frequency for the class of 10 — 19

 
 
 
 

18. In a sample of 800 students in a university, 160 or 20% are Business majors. Based on this information, the school’s University reported that “20% of all the students at the university are Business majors”. This report is an example of

 
 
 
 

19. If the variance of a dataset is correctly computed with the formula using $n-1$ in the denominator, which of the following is true?

 
 
 
 

20. The sum of the percentage frequencies for all classes will always equal ————?

 
 
 
 

Question 1 of 20

Online Basic Statistics MCQs Test with Answers

  • A numerical value used as a summary measure for a sample, such as sample mean, is known as a
  • $\mu$ is an example of
  • The sum of the percentage frequencies for all classes will always equal ————?
  • In a five-number summary, which of the following is not used for data summarization?
  • The following data shows the number of hours worked by 200 statistics students: The class width for this distribution is
  • The following data shows the number of hours worked by 200 statistics students: The number of students working 19 hours or less
  • The following data shows the number of hours worked by 200 statistics students: The relative frequency of students working 9 hours or less
  • The following data shows the number of hours worked by 200 statistics students: The cumulative relative frequency for the class of 10 — 19
  • The difference between the largest and the smallest data values is the
  • If a dataset has an even number of observations, the median
  • The sum of deviations of the individual data elements from their mean is
  • The value that has half of the observations above it and half the observations below it is called the
  • In a sample of 800 students in a university, 160 or 20% are Business majors. Based on this information, the school’s University reported that “20% of all the students at the university are Business majors”. This report is an example of
  • A statistics professor asked students in a class their ages. On the basis of this information, the professor states that the average age of all the students in the university is 21 years. This is an example of
  • A tabular summary of a set of data showing the fraction of the total number of items in several classes is a
  • The standard deviation of a sample of 100 observations is 64. The variance of the sample equals
  • A researcher has collected the following sample data 5  12  6  8  5  6  7  5  12  4. The median is
  • A researcher has collected the following sample data 5  12  6  8  5  6  7  5  12  4. The mode is
  • A researcher has collected the following sample data 5  12  6  8  5  6  7  5  12  4. The mean is
  • If the variance of a dataset is correctly computed with the formula using $n-1$ in the denominator, which of the following is true?

Learn R Programming Language

Data Preparation Quiz 24

Test your knowledge with this Data Preparation Quiz featuring 20 MCQs designed for researchers, statisticians, data analysts, and data scientists. Covering key concepts like data understanding, feature engineering, outlier detection, and best practices in data preparation, this Data Preparation Quiz helps assess your expertise in preparing datasets for analysis. Whether you are refining data for machine learning or ensuring accuracy in statistical models, these questions will challenge your understanding of this critical phase in the data science methodology. Perfect for professionals and students looking to sharpen their data preparation skills!

Online Data Preparation Quiz with answers
Please go to Data Preparation Quiz 24 to view the test

Online Data Preparation Quiz with Answers

  • The Data Preparation stage is the least time-consuming phase of a data science project, typically taking between 5 and 10 percent of the overall project time.
  • In the case study, during the Data Understanding stage, data scientists discovered that not all the expected congestive heart failure admissions were being captured. What action did they take to resolve the issue?
  • In the case study, while working through the Data Preparation stage, data scientists learned that the initial definition did not capture all expected congestive heart failure admissions.
  • Select the correct statement about the Data Preparation stage of the data science methodology.
  • What is the primary role of the data understanding phase in the data science methodology?
  • How does the Data Preparation stage affect the next steps in a data science project?
  • Why is the Data Preparation stage considered time-consuming for a data science project?
  • What is the purpose of feature engineering during the Data Preparation stage?
  • How does automating data collection and preparation processes affect the overall project time?
  • Which method is used to detect outliers by calculating the range between the 25th and 75th percentiles?
  • What is the best practice for handling extreme outliers in a dataset when analyzing average compensation?
  • What is the definition of data preparation?
  • When data is arranged, the middle value in a set of observations is classified as
  • A branch of statistics in which data is collected according to an ordinal scale or a nominal scale is classified as
  • Data measurement, which arises from a specific measuring process, is classified as
  • Types of structured questions do not include
  • Reports published by the International Labor Organization and the International Monetary Fund are considered
  • Which of the following has the same units as the units of the original data?
  • If the range and maximum value of a dataset are 30 and 20, respectively, then the minimum value of the dataset is
  • Advertising expenditure is an example of

Online Quiz Website