Method of Semi Averages (2020)

The secular trends can also be measured by the method of semi averages. The steps are:

  • Divide the time series data into two equal portions. If observations are odd then either omit the middle value or include the middle value in each half.
  • Take the average of each part and place these average values against the midpoints of the two parts.
  • Plot the semi-averages in the graph of the original values.
  • Draw the required trend line through these two potted points and extend it to cover the whole period.
  • It is simple to compute the slope and y-intercept of the line drawn from two points. The trend values can be found from the semi-average trend line or by an estimated straight line as explained:

Let y1 and y2 be the semi-averages placed against the times x1 and x2. Let the estimated straight line y=a+bx is to pass through the points (x1, y1) and (x2, y2). The constant “a” and “b” can easily be determined. the equation of the line passing through the points (x1, y1) and (x2, y2) can be written as:

yy1=y2y1x2x1(xx1)=b(xx1)y=(y1bx1)+bx=a+bx, where a=y1bx1

For an even number of observations, the slope of the trend line can be found as:

b=1n/2(S2n/2S1n/2)=1n/2(S2S1n/2)=4(S2S1)n2,

where S1 is the sum of y-values for the first half of the period, S2 is the sum of y-values of the second half of the period, and n is the number of time units covered by the time series.

The following merits and demerits of the Method of Semi Averages are as described:

Merits of Method of Semi Averages

  • The method of semi-averages is simple, easy, and quick.
  • It smooths out seasonal variations
  • It gives a better approximation to the trend because it is based on a mathematical model.

Demerits of Method of Semi Averages

  • It is a rough and objective method.
  • The arithmetic mean used in Semi Average is greatly affected by very large or by very small values.
  • The method of semi-averages is applicable when the trend is linear. This method is not appropriate if the trend is not linear.

Numerical Example 1: Method of Semi Averages

The following table shows the property damaged by road accidents in Punjab for the year 1973 to 1979.

Year1973197419751976197719781979
Property Damage201238392507484648742
  1. Obtain the semi-averages trend line
  2. Find out the trend values.

Solution

Let x=t1973

YearProperty DamagedSemi TotalSemi AverageCoded YearTrend Values
1973201  0y=190+87(0)=190
19742388312771y=190+87(1)=277
1975392  2y=190+87(2)=364
1976507  3y=190+87(3)=451
1977484  4y=190+87(4)=538
197854918756255y=190+87(5)=625
1979742  6y=190+87(6)=712
method of semi-averages (trend values)

y1=277,x1=1,y2=625,x2=5b=y2y1x2x1=62527751=87a=y1bx1=27787(1)=190

The semi-average trend line y=190+87x (with the origin at 1973)

Numerical Example 2: Method of Semi Averages

The following table gives the number of books in thousands sold at a bookstore for the years 1973 to 1981

Year197319741975197619771978197919801981
No. of Books Sold423835253224201917
  1. Find the equation of the semi-average trend line
  2. Compute the trend values
  3. Estimate the number of books sold for the year 1982.

Solution

Let x=t1973

YearNo. of books (y)Semi TotalSemi AverageCoded yearTrend Values
197342  0y=39.53(0)=39.5
197438140351y=39.53(1)=36.5
1975352y=39.53(2)=33.5
197625  3y=39.53(3)=30.5
197732  4y=39.53(4)=27.5
197824  5y=39.53(5)=24.5
19792080206y=39.53(6)=21.5
198019  7y=39.53(7)=18.5
198117  8y=39.53(8)=15.5

y1=35,x1=1.5,y2=20,x2=6.5b=y2y1x2x1=20356.51.5=3a=y1bx1=35(3)(1.5)=39.5y=39.53x(with origin at 1973)

For the year 1982, the estimated number of books sold is y=39.53(9)=12.5.

Computer MCQs Test Online

R Programming Language

The Method of Free Hand Curve (2020)

Introduction to Free Hand Curve

The Method of Free Hand Curve is a simple and non-mathematical technique used to fit a curve to a set of data points by visually estimating the trend. It provides a quick, approximate visualization of the relationship between variables.

It is used to

  • Plot Data Points: Scatter plot the data points on a graph.
  • Draw Curve: Using a smooth, freehand line, draw a curve that best represents the overall trend of the data points.
  • No Rigid Rules: The curve is drawn based on the analyst’s judgment, without strict mathematical formulas.

Exploring Data Patterns

This method is subjective and less precise than mathematical curve-fitting techniques but is useful for initial data exploration or when data patterns are irregular.

The secular trend is measured by the method of the free hand curve in the following steps:

  • Take the time periods along the x-axis by taking appropriate scaling
  • Plot the points for observed values of the Y variable as the dependent variable against the given time periods
  • Join these plotted points by line segments to get a historigram
  • Draw a free-hand smooth curve (or a straight line) through the histogram

In this method we draw the given times series data on graph paper, then we draw a free-hand trend line through the plotted graph according to the trend of the graph. Then we read trend values from this free-hand trend line.

It is generally preferred to use a curve instead of a straight line to show the secular trend.

Merits (Free Hand Curve)

  • The free-hand curve method is simple, easy, and quick for measuring secular trends.
  • A well-fitted trend line (or curve) approximates the trend closely based on a mathematical model.

Demerits (Free Hand Curve)

  • It is a rough and crude method.
  • It is greatly affected by personal bias as different persons may fit different trends to the same data.
    The estimates are not reliable due to personal bias.

Practical Example of Curve

Question: The following time series shows the number of road accidents in Punjab from 1972 to 1978.

Year1972197319741975197619771978
No. of Accidents2493263826993038374540794688
  • Obtain the historigram showing the number of road accidents and a free-hand trend line by drawing a straight line
  • Find the trend values for this time series

Solution:

Method of Free Hand Curve
YearValueTotalMeanTrend value
19722493  2200
19732638  2550
19742699  2950
1975303823338033403340
19763745  3650
19774079  4050
19784688  4499

The usefulness of the Method of Free Hand Curve

The method of freehand curve is useful for:

  1. Exploratory Data Analysis (EDA): As a preliminary step free free-hand curve method helps us to understand the basic characteristics of the data and identify potential relationships between variables.
  2. Visual Communication: It also helps to present trends in the data in a clear and easily understandable way for non-statistical audiences.
  3. Limited Data: When you have a relatively small dataset, a free hand curve might be sufficient to get a basic idea of the central tendency.

Real-Life Examples of Free-Hand Curve

Some real-life examples where the Method of Free Hand Curve can be applied:

  • Economics:
    • Plotting and estimating trends in stock market prices over time.
    • Visualizing the relationship between inflation and unemployment rates.
  • Meteorology:
    • Drawing temperature trends over days or months based on scattered weather data.
    • Estimating rainfall patterns over a year.
  • Education:
    • Plotting student performance trends over semesters to identify improvements or declines.
    • Visualizing the relationship between study hours and exam scores.
  • Healthcare:
    • Tracking a patient’s blood pressure or glucose levels over time to observe trends.
    • Estimating the progression of a disease based on irregular data points.
  • Business:
    • Visualizing sales trends over months or years to identify seasonal patterns.
    • Estimating the relationship between advertising expenditure and revenue growth.
  • Sports:
    • Plotting an athlete’s performance metrics (e.g., running speed, scores) over time to observe trends.
    • Visualizing the relationship between training hours and performance improvement.

Summary

By understanding the method of free hand curves and its limitations, one can use it as a valuable tool for initial data exploration and visualization alongside other statistical techniques for a more robust analysis. The freehand curve provides a quick, intuitive way to understand trends or relationships without relying on complex mathematical models.

MCQs Intermediate Mathematics Part-I Quadratic Equations

Heteroscedasticity Residual Plot (2020)

The post is about Heteroscedasticity Residual Plot.

Heteroscedasticity and Heteroscedasticity Residual Plot

One of the assumptions of the classical linear regression model is that there is no heteroscedasticity (error terms have constant error terms) meaning that ordinary least square (OLS) estimators are (BLUE, best linear unbiased estimator) and their variances are the lowest of all other unbiased estimators (Gauss Markov Theorem).

If the assumption of constant variance does not hold then this means that the Gauss Markov Theorem does not apply. For heteroscedastic data, regression analysis provides an unbiased estimate of the relationship between the predictors and the outcome variables.

As we have discussed heteroscedasticity occurs when the error variance has non-constant variance.  In this case, we can think of the disturbance for each observation as being drawn from a different distribution with a different variance.  Stated equivalently, the variance of the observed value of the dependent variable around the regression line is non-constant. 

We can think of each observed value of the dependent variable as being drawn from a different conditional probability distribution with a different conditional variance. A general linear regression model with the assumption of heteroscedasticity can be expressed as follows

yi=β0+β1Xi1+β2Xi2++βpXip+εiVar(εi)=E(εi2)=σi2;i=1,2,,n

Note that we have a i subscript attached to sigma squared.  This indicates that the disturbance for each of the n units is drawn from a probability distribution that has a different variance.

If the error term has non-constant variance, but all other assumptions of the classical linear regression model are satisfied, then the consequences of using the OLS estimator to obtain estimates of the population parameters are:

  • The OLS estimator is still unbiased
  • The OLS estimator is inefficient; that is, it is not BLUE
  • The estimated variances and covariances of the OLS estimates are biased and inconsistent
  • Hypothesis tests are not valid

Detection of Heteroscedasticity Residual Plot

The residual for the ith observation, εi^, is an unbiased estimate of the unknown and unobservable error for that observation, εi^. Thus the squared residuals, εi2^, can be used as an estimate of the unknown and unobservable error variance,  σi2=E(εi^)

One can calculate the squared residuals and then plot them against an explanatory variable that you believe might be related to the error variance.  If you believe that the error variance may be related to more than one of the explanatory variables, you can plot the squared residuals against each one of these variables.  Alternatively, you could plot the squared residuals against the fitted value of the dependent variable obtained from the OLS estimates.  Most statistical programs (software) have a command to do these residual plots.  It must be emphasized that this is not a formal test for heteroscedasticity.  It would only suggest whether heteroscedasticity may exist.

Below there are residual plots showing the three typical patterns. The first plot shows a random pattern that indicates a good fit for a linear model. The other two plot patterns of residual plots are non-random (U-shaped and inverted U), suggesting a better fit for a non-linear model, than a linear regression model.

Heteroscedasticity Regression Residual Plot 3
Heteroscedasticity Residual Plot 1
Heteroscedasticity Residual Plot 1
Heteroscedasticity Residual Residual Plot 2
Heteroscedasticity Residual Plot 2
Heteroscedasticity Residual Plot 3

Learn R Language from R Frequently Asked Questions