Inverse Regression Analysis
In most regression problems we have to determine the value of
corresponding to a given value of
. We will consider the inverse problem, which is called inverse regression or calibration.
Assume we have known values of
and their corresponding
values, which both form a simple linear regression model and we have also an unknown value of
, such as
, which cannot be measured and we can observe the corresponding value of
, say
. Then,
can be estimated and a confidence interval for
can be obtained.
In regression analysis we want to investigate the relationship between variables. Regression has many applications, which occur in many fields: engineering, economics, the physical and chemical sciences, management, biological sciences and social sciences. We only consider the simple linear regression model, which is a model with one regressor
that has a linear relationship with a response
. It is not always easy to measure the regressor
or the response
.
We now consider a typical example for this problem. If
is the concentration of glucose in certain substances, then a spectrophotometric method is used to measure the absorbance. This absorbance depends on the concentration
. The response
is easy to measure with the spectrophotometric method, but the concentration on the other hand is not easy to measure. If we have
known concentrations, then the absorbance can be measured. If there is a linear relation between
and
, then a simple linear regression model can be made with these data. Suppose we have an unknown concentration, which is difficult to measure, but we can measure the absorbance of this concentration. Is it possible to estimate this concentration with the measured absorbance? This is called the calibration problem.
Suppose we have a linear model
and we have an observed value of the response
, but we do not have the corresponding value of
. How can we estimate this value of
? The two most important methods to estimate
are the classical method and the inverse method.
The classical method is based on the simple linear regression model
where 
where the parameters
and
are estimated by Least Squares as
and
. At least two of the
values of
have to be distinct, otherwise we cannot fit a reliable regression line. For a given value of
, say
(unknown), a
value, say
(or random sample of
values of
) is observed at the
value. The problem is to estimate
. The classical method uses a
value (or the mean of
values of
) to estimate
, which is then estimated by
.
The inverse estimator is the simple linear regression of
on
. In this case, we have to fit the model
![itfeature.com \[X=\alpha_0+\alpha_1 Y + \epsilon\]](http://itfeature.com/wp-content/ql-cache/quicklatex.com-f426d77937be281359e8190851521db1_l3.png)
where
![itfeature.com \[\epsilon \sim N(0,\sigma^2\]](http://itfeature.com/wp-content/ql-cache/quicklatex.com-bae3851e50b14028243c3f156bbf6c7a_l3.png)
to obtain the estimator. Then the inverse estimator of
is
![itfeature.com \[X_0=\alpha_0+\alpha_1 Y + \epsilon\]](http://itfeature.com/wp-content/ql-cache/quicklatex.com-fb34e9f88d3342de4fe25d929ab36e41_l3.png)

explained by the regression. Ris the correlation between
and is usually the multiple correlation coefficient.
when all the values are different. When repeats runs exists in the data the value of 




