Basic Statistics - Statistics for Data Science & Analytics

Properties of Measure of Central Tendency

Aug 1, 2025 by Muhammad Imdad Ullah

Understanding the Properties of Measure of Central Tendency helps in selecting the appropriate measure for accurate data interpretation. This blog post explores the key properties of measures of central tendency: mean, median, and mode, along with their advantages and limitations.

Introduction: Properties of Measure of Central Tendency

In statistics, measures of central tendency are crucial for summarizing and interpreting data. Measures of central tendency provide a single value that represents the center or typical value of a dataset. The three most common measures of central tendency are the mean, median, and mode. Each central tendency has unique properties that make it suitable for different types of data and analytical purposes.

Mean (Arithmetic Average)

The mean (the most widely used measure of central tendency) is the sum of all values in a dataset divided by the number of values $\left(\frac{\sum\limits_{i=1}^n X_i}{n}\right)$.

Properties of Mean

Sensitive to All Data Points
The mean considers every value in the dataset, making it highly responsive to changes. A single extreme value (outlier) can significantly affect the mean.
Algebraic Manipulability
The mean is used in further mathematical operations (measures of dispersion, e.g., calculating variance, standard deviation). The sum of deviations from the mean ($x-\overline{x}$) is always zero:
$$\sum\limits_{i=1}^n (X_i – \overline{X}) =0$$
Applicable to Interval and Ratio Data
The mean is suitable for continuous numerical data (for example, height, weight, and income). It is not appropriate for nominal or ordinal data.
Affected by Skewness
In skewed distributions, the mean is pulled toward the tail, making it less representative of central tendency.

Advantages of the Mean

Mean uses all data points, providing a comprehensive measure.
It is useful in statistical inferences and parametric tests.

Limitations of the Mean

Distorted by outliers.
Mean should not be used for highly skewed data.

properties of measures of central tendency

Median (Middle Value)

The median is the middle value (the most central data value) in an ordered dataset/array. If the dataset has an even number of observations, the median is the average of the two central values.

Properties of Median

Resistant to Outliers
Unlike the mean, the median is not influenced/affected by extreme values (outliers). It is because the median only depends on the middle value(s) in the ordered dataset. It is also applicable to Ordinal, Interval, and Ratio Data. On the other hand, median works well for ranked (ordinal) and continuous numerical data. However, the median is not suitable for nominal data (categories without order).
Unaffected by Skewness
The median remains stable in skewed distributions, making it a better measure than the mean in such cases.
Not Algebraically Manipulable
Unlike the mean, the median cannot be used in further mathematical computations (for example, standard deviation).

Advantages of the Median

Median is robust against outliers.
Median better represents the central tendency in skewed distributions.

Limitations of the Median

Median does not consider all data points.
It is less efficient than the mean for normally distributed data.

Mode (Most Frequent Value)

The mode is the value that appears most frequently in a dataset. A dataset can have one mode (unimodal), two modes (bimodal), or multiple modes (multimodal). It is the only measure of central tendency that can have more than one value.

Properties of Mode

Mode applies to All Data Types (that is, it works with nominal, ordinal, interval, and ratio data). However, it is the only measure of central tendency suitable for categorical data (e.g., colors, brands).

Unaffected by Outliers
Since the mode depends on frequency, extreme values do not impact the mode.
Not Necessarily Unique
Some datasets have no mode (if all values are unique or no value repeats in the dataset,) or data may have multiple modes.
Not Useful for Small Datasets
In small samples, the mode may not accurately represent central tendency.

Advantages of the Mode

Mode is useful for categorical data.
Mode helps identify peaks in frequency distributions.

Limitations of the Mode

May not exist in some datasets.
Less informative for continuous numerical data with no repeated values.

Comparison of Mean, Median, and Mode

Property	Mean	Median	Mode
Sensitive to Outliers	Yes	No	No
Works with Skewed Data	No	Yes	Sometimes
Applicable to Nominal Data	No	No	Yes
Mathematical Usability	High	Low	Low
Best for Symmetric Data	Yes	Yes	Sometimes

Choosing the Right Measures of Central Tendency

The choice between mean, median, and mode depends on:

Data Type
- Use the mean for normally distributed numerical data, that is, data points are homogeneous.
- Use the median for ordinal or skewed numerical data, that is, data points are heterogeneous.
- Use mode for categorical data, or when data points repeat.
Presence of Outliers
- If outliers are present, the median is preferred.
- If data is clean and normally distributed, the mean is ideal.
Purpose of Analysis
- For statistical computations (e.g., regression), the mean is necessary.
- For descriptive summaries (e.g., income distribution), the median is better.

Summary: Properties of Measures of Central Tendency

Measures of central tendency: mean, median, and mode, each has unique properties that determine their suitability for different datasets. The mean is precise but affected by outliers, the median is robust against skewness, and the mode is versatile for categorical data. Understanding these properties ensures accurate data interpretation and informed decision-making in statistical analysis.

By selecting the appropriate measure based on data characteristics, analysts can derive meaningful insights and avoid misleading conclusions. Whether summarizing exam scores, income levels, or survey responses, the right measure of central tendency provides clarity in a world of data.

General Knowledge Quiz

Basic Statistics MCQs Test 25

Jul 21, 2025 by Muhammad Imdad Ullah

Test your knowledge of fundamental statistics concepts with this 20-question multiple-choice quiz! This Basic Statistics MCQs Test is perfect for students, statisticians, data analysts, and data scientists. This Basic Statistics MCQs Test Quiz covers key topics like:

Online Basic Statistics MCQs Test with Answers

Measures of central tendency (mean, median, mode)
Measures of dispersion (range, variance, standard deviation)
Frequency distributions (class width, relative & cumulative frequency)
Data summarization (five-number summary, quartiles)
Statistical inference (sample vs. population, descriptive vs. inferential stats)

Sharpen your skills for exams, job interviews, and competitive tests with these practical Basic Statistics MCQs Test. Whether you’re preparing for university tests, certifications, or data-related job roles, Basic Statistics MCQs Test Quiz helps reinforce core statistical concepts. Let us start with the Online Basic Statistics MCQs Test now.

Online Basic Statistics MCQs Test with Answers

A numerical value used as a summary measure for a sample, such as sample mean, is known as a
$\mu$ is an example of
The sum of the percentage frequencies for all classes will always equal ————?
In a five-number summary, which of the following is not used for data summarization?
The following data shows the number of hours worked by 200 statistics students: The class width for this distribution is
The following data shows the number of hours worked by 200 statistics students: The number of students working 19 hours or less
The following data shows the number of hours worked by 200 statistics students: The relative frequency of students working 9 hours or less
The following data shows the number of hours worked by 200 statistics students: The cumulative relative frequency for the class of 10 — 19
The difference between the largest and the smallest data values is the
If a dataset has an even number of observations, the median
The sum of deviations of the individual data elements from their mean is
The value that has half of the observations above it and half the observations below it is called the
In a sample of 800 students in a university, 160 or 20% are Business majors. Based on this information, the school’s University reported that “20% of all the students at the university are Business majors”. This report is an example of
A statistics professor asked students in a class their ages. On the basis of this information, the professor states that the average age of all the students in the university is 21 years. This is an example of
A tabular summary of a set of data showing the fraction of the total number of items in several classes is a
The standard deviation of a sample of 100 observations is 64. The variance of the sample equals
A researcher has collected the following sample data 5 12 6 8 5 6 7 5 12 4. The median is
A researcher has collected the following sample data 5 12 6 8 5 6 7 5 12 4. The mode is
A researcher has collected the following sample data 5 12 6 8 5 6 7 5 12 4. The mean is
If the variance of a dataset is correctly computed with the formula using $n-1$ in the denominator, which of the following is true?

Learn R Programming Language

Data Preparation Quiz 24

Jul 6, 2025 by Muhammad Imdad Ullah

Test your knowledge with this Data Preparation Quiz featuring 20 MCQs designed for researchers, statisticians, data analysts, and data scientists. Covering key concepts like data understanding, feature engineering, outlier detection, and best practices in data preparation, this Data Preparation Quiz helps assess your expertise in preparing datasets for analysis. Whether you are refining data for machine learning or ensuring accuracy in statistical models, these questions will challenge your understanding of this critical phase in the data science methodology. Perfect for professionals and students looking to sharpen their data preparation skills!

Please go to Data Preparation Quiz 24 to view the test

Online Data Preparation Quiz with Answers

The Data Preparation stage is the least time-consuming phase of a data science project, typically taking between 5 and 10 percent of the overall project time.
In the case study, during the Data Understanding stage, data scientists discovered that not all the expected congestive heart failure admissions were being captured. What action did they take to resolve the issue?
In the case study, while working through the Data Preparation stage, data scientists learned that the initial definition did not capture all expected congestive heart failure admissions.
Select the correct statement about the Data Preparation stage of the data science methodology.
What is the primary role of the data understanding phase in the data science methodology?
How does the Data Preparation stage affect the next steps in a data science project?
Why is the Data Preparation stage considered time-consuming for a data science project?
What is the purpose of feature engineering during the Data Preparation stage?
How does automating data collection and preparation processes affect the overall project time?
Which method is used to detect outliers by calculating the range between the 25th and 75th percentiles?
What is the best practice for handling extreme outliers in a dataset when analyzing average compensation?
What is the definition of data preparation?
When data is arranged, the middle value in a set of observations is classified as
A branch of statistics in which data is collected according to an ordinal scale or a nominal scale is classified as
Data measurement, which arises from a specific measuring process, is classified as
Types of structured questions do not include
Reports published by the International Labor Organization and the International Monetary Fund are considered
Which of the following has the same units as the units of the original data?
If the range and maximum value of a dataset are 30 and 20, respectively, then the minimum value of the dataset is
Advertising expenditure is an example of

Online Quiz Website

Table of Contents

Introduction: Properties of Measure of Central Tendency

Mean (Arithmetic Average)

Properties of Mean

Advantages of the Mean

Limitations of the Mean

Median (Middle Value)

Properties of Median

Advantages of the Median

Limitations of the Median

Mode (Most Frequent Value)

Properties of Mode

Advantages of the Mode

Limitations of the Mode

Comparison of Mean, Median, and Mode

Choosing the Right Measures of Central Tendency

Summary: Properties of Measures of Central Tendency

Share this:

Online Basic Statistics MCQs Test with Answers

Share this:

Online Data Preparation Quiz with Answers

Share this: