Importance of Statistics in Various Disciplines

Introduction to the Importance of Statistics

Statistics is used as a tool to make appropriate decisions in the face of uncertainty. We all apply statistical concepts in our daily life either we are educated or uneducated. Therefore the importance of Statistics cannot be ignored.

The information collected in the form of data (observation) from any field/discipline will almost always involve some sort of variability or uncertainty, so this subject has applications in almost all fields of research. The researchers use statistics in the analysis, interpretation, and communication of their research findings.

Some examples of the questions which statistics might help to answer with appropriate data are:

  1. How much better yield of wheat do we get if we use a new fertilizer as opposed to a commonly used fertilizer?
  2. Does the company’s sales are likely to increase in the next year as compared to the previous?
  3. What dose of insecticide is used successfully to monitor an insect population?
  4. What is the likely weather in the coming season?
Importance of Statistics
https://www.intellspot.com/applications-of-statistics/

Application of Statistics

Statistical techniques being powerful tools for analyzing numerical data are used in almost every branch of learning. Statistics plays a vital role in every field of human activity. Statistics has an important role in determining the existing position of per capita income, unemployment, population growth rate, housing, schooling medical facilities, etc in a country. Now statistics holds a central position in almost every field like Industry, Commerce, Biological and Physical sciences, Genetics, Agronomy, Anthropometry, Astronomy, Geology, Psychology, Sociology, etc are the main areas where statistical techniques have been developed and are being used increasingly.

Statistics has its application in almost every field where research is carried out and findings are reported. Application of statistics (by keeping in view the importance of statistics) in different fields as follows:

Social Sciences

In social sciences, one of the major objectives is to establish a relationship that exists between certain variables. This end is achieved through postulating hypothesis and their testing by using different statistical techniques. Most of the areas of our economy can be studied by econometric models because these help in forecasting and forecasts are important for future planning.

Plant Sciences

The most important aspect of statistics in plant sciences is its role in the efficient planning of experiments and drawing valid conclusions. A technique in statistics known as “Design of Experiments” helps introduce new varieties. Optimum plot sizes can be worked out for different crops like wheat, cotton, sugarcane, and others under different environmental conditions using statistical techniques.

Physical Sciences

The application of statistics in physical sciences is widely accepted. The researchers use these methods in the analysis, interpretation, and communication of their research findings, linear and nonlinear regression models are used to establish cause and effect relationships between different variables, and also these days computers have facilitated experimentation and it is possible to simulate the process rather than experimentation.

Medical Sciences

The interest may be in the effectiveness of new drugs, the effect of environmental factors, heritability, standardization of various records, and other related problems. Statistics come to the rescue. It helps to plan the next investigation to get trustworthy information from the limited resources. It also helps to analyze the current data and integrate the information with that previously existing.

How statistics is used by banks, insurance companies, Business and economic planning and administration, Accounting and controlling of events, Construction Companies, Politicians

Banks

Banks are a very important economic part of a country. They do their work on the guess that all the depositors do not take their money on the same day. Bankers use probability theory to approximate the deposits and claims to take out their money.

Insurance Companies

Insurance companies play an important role in increasing economic progress. These companies collect payment from the people. They approximate the death rate, accident rate, and average expected life of people from the life tables. The payment per month is decided on these rates.

Business

Business planning for the future is very important such as the price, quality, quantity, demand, etc of a particular product. Businessmen can make correct decisions about the location of the business, marketing of the products, financial resources, etc. Statistics helps a businessman to plan production according to the taste of the customers, the quality of the products can also be checked more efficiently by using statistical methods

The relationship between supply and demand is a very important topic of everyday life. The changes in prices and demands are studied by index numbers. The relation between supply and demand is determined by correlation and regression.

Economic Planning

Economic planning for the future is a very important problem for economists. For example (i) opening of new educational institutions such as schools, and colleges, revision of pay scales of employees, construction of new hospitals, and preparation of government budgets, etc. all these require estimates at some future time which is called forecasting which is done by regression analysis and the different sources of earning, planning of projects, forecasting of economic trends are administered by the use of various statistical techniques.

Accounting and Controlling of Events

All the events in the world are recorded, for example, births, deaths, imports, exports, and crops grown by the farmer etc. These are recorded as statistical data and analyzed to make better policies for the betterment of the nation.

Administrator

An administrator whether in the public or private sector leans on statistical data to provide a factual basis for appropriate decisions.

Politician

A politician uses statistical advantageously to lend support and credence to his argument while elucidating the problems he handles.

Construction Companies

All kinds of construction companies start and run their programs after making judgments about the total cost of the project (job, work). To guess the expenditure a very important statistical technique of estimation is used.

Biology

In biology correlation and regression are used for analysis of hereditary relations. To classify the organization into different classes according to their age, height, weight, hair color, eyebrow color, etc. the rules of classification are tabulation of statistics are used.

https://rfaqs.com

https://gmstat.com

Median of Ungrouped Data

Introduction to Median of Ungrouped Data

The post is about calculating the median ungrouped data. The median is the most central point (middlemost central value) of the data/set of observations, with the condition that the data or set of observations should be arranged in ascending or descending order. The median divides the data into two equal parts. That is the main objective of the median.

It is important to note that the criteria for finding the median for grouped and ungrouped data are different.

The primary and secondary data can be defined as:

  1. Primary data, also called raw or ungrouped data, does not undergo any statistical procedure/method, which is not in the form of frequency distribution.
  2. Secondary data may also be called group data if it is in the form of frequency distribution.

Let us discuss how to find the median for ungrouped data.

There are two cases for ungrouped data. These cases are based on no of observations which is $n$

When the number of observations is odd (Say $n$ i.e. $n$ is odd), and when the number of observations is even (Say $n$ i.e. $n$ is even).

Median Calculations

The data below contains the odd number of observations.

Observation No.
(Ascending Order)
1st2nd3rd4th5th6th7th8th9th10th11th
Data Values81899096100102103104108109118
(Descending Order)1110987654321

Since the number of observations is odd ($n = 11$), the central value after arranging in ascending order will be the 6th value. and the 6th value is 102. That is the median is 102 for the above data.

The position of the median can be located mathematically, as follows:

\begin{align*}
\tilde{x} &= \left( \frac{n+1}{2} \right)th\,\, \text{value}\\
&=\frac{11+1}{2} = 6th\,\, \text{value}
\end{align*}

The value at the 6th position (from sorted data) is 102. The $\tilde{x}$ can be read as “x-tild” which is the notation of the median.

Median for Even Numbers of Observations

Consider the following data that contains an even number of observations.

Observation No.12345678910
Data Values81100961089010210410310989

Data after sorting (either in ascending or descending order) is

Observations No.1st2nd3rd4th5th6th7th8th9th10th
x81899096100102103104108109

Since $n=10$ which is even, the central position (that is median) lies between the 5th value and the 6th value. This central value is the average of the 5th and 6th values (from the sorted data). The average of these two central observations is called the median. The two central positions are 100 and 102, take the average of these two numbers and find the median.

$$Median = \frac{100+102}{2} = 101$$

Median Formula for Large Data Sets

The median formula for large or small data sets can be represented mathematically.

  • For large data sets one can find the median of data mathematically. The formula for both odd number of observations and even numbers of observations is different.

The point to remember when computing the median is that

  • For an odd number of observations, the median is the centermost value after sorting the data
  • For an even number of observations, the median is the average of two central values after sorting the data

\begin{align*}
\tilde{x} &= \frac{1}{2} \left[ \left(\frac{n}{2}th \, \, value \right)+ \left(\frac{n}{2}+1 \right)the \,\, value \right]\quad \quad \text{(When observations are even)}\\
&= \frac{n+1}{2} \quad \quad \text{(when observations are odd)}
\end{align*}

The other way of the median formula is

Median of ungrouped data

Consider, a data set containing 157 observations. To compute the median, first of all, you need to sort the data in either ascending or descending order. The formula for this data will be

$$\tilde{x} = \frac{n+1}{2} = \frac{157+1}{2}=79th$$.

The 79th observation in the sorted data will be the median of the data.

In case, if there are even number of observations (say $n=396$, the median will be

\begin{align*}
\tilde{x} &= \frac{1}{2}\left[\left(\frac{n}{2}\right)th + \left(\frac{n+1}{2}\right)th \right]\\
&=\frac{1}{2} \left[\frac{396}{2}th + \frac{396}{2}+1 \right]\\
&= \frac{1}{2} \left[198th + 199th\right]
\end{align*}

The average of 198th value and 199th value from the sorted data will be the median of the data.

https://rfaqs.com

https://gmstat.com

Geometric Mean

Introduction to Geometric Mean

The geometric mean (GM) is a way of calculating an average, but instead of adding values like the regular (arithmetic) mean, it multiplies them and then takes a root. The geometric mean is defined as the $n$th root of the product of $n$ positive values.

If we have two observations let’s say 9 and 4, then the geometric mean is the square root of the product of these values, which is 6 ($\sqrt{9\times 4}=6$. If there are three values let’s say  3, 9, and 3 then the geometric average will be the $sqrt[3]{3\times 9 \times 3} = 3$. In a similar pattern, mathematically, for $n$ number of observations ($x_1, x_2, \cdots, x_n$) then the Geometric Average Formula will be

$$GM = (x_1 \times x_2 \times x_3 \times \cdots \times x_n)^{\frac{1}{n} }$$

Geometric Mean

Geometric Mean Example

Suppose we have the following set of values $x=32, 36, 36, 37, 39, 41, 45, 46, 48$. The Computation of Geometric Mean will be

\begin{align*}
GM &= (32\times 36 \times 36 \tmies 37 \times 39 \times 41 \times 45 \times 46 \times 48)^{\frac{1}{9}}\\
&=(243790484520960)^{\frac{1}{9}} = 39.7
\end{align*}

For a large number of observations one can compute the GM by taking the log of all observations using the following formula:

$$GM = antilog \left[\frac{\sum\limits_{i=1}^n log\, x}{n} \right]$$

$x$$log\, x$
32Log 32 = 1.5051
36log 36 = 1.5563
36log 36 = 1.5563
37log 37 = 1.5682
39log 39 = 1.5911
41log 41 = 1.6128
45log 45 = 1.6532
46log 46 = 1.6628
48log 48 = 1.6812
Total14.3870

\begin{align*}
GM &= antilog \left[ \frac{\sum\limits_{i=1}^n log\, x}{n} \right]\\
&= antilog \left[\frac{14.3870}{9}\right] = antilog [1.5986]\\
&= 38.7
\end{align*}

One important point that should be remembered is that if any value in the data set is zero or negative then the GM cannot be computed.

Geometric Mean for Grouped Data

The GM for grouped data can also be computed using the following formula:

$$GM = antilog \left[ \frac{\Sigma f\times log\, x}{\Sigma f} \right]$$

Suppose, we have the following frequency distribution as follows:

ClassesFrequency
65 to 849
85 to 10410
105 to 12417
125 to 14410
145 to 1645
165 to 1844
185 to 2045
Tota60

The GM of the above frequency distribution can be performed as follows

Classes$f$$X$$log\, X$$f \times log\, X$
65-84974.5log 74.5 = 1.872216.8494
85-1041094.5log 94.5 = 1.975419.7543
105-12417114.5log 114.5 = 2.058834.9997
125-14410134.5log 134.5 = 2.128721.2872
145-1645154.5log 154.5 = 2.188910.9446
165-1844174.5log 174.5 = 2.24188.9672
185-2045194.5log 194.5 = 2.288911.4446
Total60  124.2471

\begin{align*}
GM &= antilog \left[ \frac{124.2471}{60} \right]\\
&=antilog (2.0708) = 117.4
\end{align*}

The GM is particularly useful when dealing with rates of change or ratios, such as growth rates in investments. That is because geometric mean considers how things are multiplied over time, rather than simply added.

Use and Application of Geometric Mean

Geometric Mean is useful in situations like:

  • Investment returns: When one looks at average investment growth, one wants to consider how much one’s money is multiplied over time, not just the change each year. That is why the GM is better suited for this scenario.
  • Rates of change: Similar to investment returns, if something is increasing or decreasing by a percentage each time, the GM is a more accurate measure of the overall change.
  • Growth Rates: When dealing with percentages or ratios that change over time (like investment returns or population growth), the geometric mean provides a more accurate picture of the overall change compared to the arithmetic mean.
  • Proportional Changes: It is helpful for situations where changes are multiplied, not added. For example, if a recipe calls for doubling all ingredients, the geometric mean of the original quantities represents the final amount.

https://rfaqs.com

https://gmstat.com