A * histogram* is very similar to the bar chart for a

*based on*

**frequency distribution***showing the distribution of*

**quantitative data***. It is a useful graphical representation of data which helps to visualize the*

**qualitative data***.*

**distribution of data*** Histogram* is constructed from the

*by taking the*

**grouped data***(not class limits) along x-axis and the corresponding frequencies along y-axis. For ungrouped data we have to form the*

**class boundaries***grouped frequency distribution*before making a

*consists of a set of bars (like*

**histogram.****Histogram***) but these bars are adjacent to each other and the*

**bar chart***height*of bars is proportional to the frequency associated with respective classes. The area of each rectangle represented the respective class frequencies. When the class intervals are equal, the rectangles all have the same width and their heights directly represent the class frequencies. For the case in which

*class-intervals*are not all equal, the

*height of the rectangle*(bar) over an

*unequal class-interval*, is to be adjusted because it is area, and not height that measures frequency. This means that the height of a rectangle must be proportionally decreased if the length of the corresponding class-interval increases. For example, if the length of a class-interval becomes double, then the height of the rectangle is to be halved so that area, being the fundamental property of the rectangle of a histogram remains unchanged. This sort of rescaling is necessary to observe the correct pattern of distribution.

The feature of * histogram* is that there is no gap (space) between the vertical bars, because the variable plotted on the horizontal axis is

*quantitative*and variable is from measure of scale either

*interval*or

*ratio.*Thus, the

*provides an easy interpreted visual representation of a frequency distribution. Note that class midpoints are used as the labels for the classes.*

**histogram*** Histogram* allow us to analyze extremely large

*datasets*by reducing them to a single graphical representation which is used to show primary, secondary and tertiary peaks in data and also help us by giving a visual representation of the

*statistical significance*of those peaks.

An alternative to the * histogram* is

*kernel density estimation*, which uses a

*kernel*to

*smooth*samples. This will construct a smooth

*probability density function*, which will in general more accurately reflect the underlying variable.

**Histogram for continuous grouped data**

To draw a * histogram* from the

*continuous grouped frequency distribution*, the following steps are taken.

- Mark
*class boundaries*of the classes along x-axis. - Mark frequencies along y-axis.
- Draw a rectangle for each class such that the height of each rectangle is proportional to the frequency corresponding to that class. This is the case when classes are of equal width as they often are.
- If the classes are of unequal width, then the area instead of height of each rectangle is proportional to the frequency corresponding to that class and the height of each rectangle is obtained by dividing the frequency of the class by width of that class.

It may be noted that the area under a ** histogram** can be calculated by adding up the areas of all the rectangles that constitute the

*The area of one rectangle is obtained by the multiplication of width of the class by the corresponding frequency i.e.*

**histogram.****Area of a single rectangle = width of the class x frequency of the class**

**Histogram for Discrete Data**

Bar graphs are usually drawn for discrete and categorical data but there are some situations where there is need to make approximation, the * histogram* may be constructed. To construct a

*for*

**histogram***discrete grouped data*, following steps are taken:

- Mark possible values on x-axis.
- Mark frequencies along y-axis.
- Draw a rectangle centered on each value with equal width on each side possible 0.5 to either side of the value.

**Advantages:**

**Advantages:**

The advantages of the * histogram* as compared to the unprocessed data are:

- It gives range of the data.
- It gives location of the data.
- it gives clue about the skewness of the data.
- It gives information about the out of control situation.
- Histogram are density estimates (gives a good impression of the distribution of data.
- Can be compared to normal curve.

*Disadvantages:*

*Disadvantages:*

- Exact values cannot be read from
because data is grouped into categories and individuality of data vanishes in grouped data.**histogram** - It is more difficult it compare two data sets.
- It is used only for continuous data set.

**Download pdf file:**
**Histogram 90.80 KB**

**Histogram 90.80 KB**