EigenValues and EigenVectors (2020)

Introduction to Eigen Values and Eigen Vectors

Eigenvalues and eigenvectors of matrices are needed for some of the methods such as Principal Component Analysis (PCA), Principal Component Regression (PCR), and assessment of the input of collinearity.

Eigenvalues and Eigenvectors

For a real, symmetric matrix $A_{ntimes n}$ there exists a set of $n$ scalars $lambda_i$, and $n$ non-zero vectors $Z_i,,(i=1,2,cdots,n)$ such that

begin{align*}
AZ_i &=lambda_i,Z_i\
AZ_i – lambda_i, Z_i &=0\
Rightarrow (A-lambda_i ,I)Z_i &=0
end{align*}

The $lambda_i$ are the $n$ eigenvalues (characteristic roots or latent root) of the matrix $A$ and the $Z_i$ are the corresponding (column) eigenvectors (characteristic vectors or latent vectors).

There are non-zero solutions to $(A-lambda_i,I)=0$ only if the matrix ($A-lambda_i,I$) is less than full rank (only if the determinant of $(A-lambda_i,I)$ is zero). $lambda_i$ are obtained by solving the general determinantal equation $|A-lambda,I|=0$.

The determinant of $(A-lambda,I)$ is an $n$th degree polynomial in $lambda$. Solving this equation gives the $n$ values of $lambda$, which are not necessarily distinct. Each value of $lambda$ is used in equation $(A-lambda_i,I)Z_i=0$ to find the companion eigenvectors $Z_i$.

When the eigenvalues are distinct, the vector solution to $(A-lambda_i,I)Z_i=0$ is unique except for an arbitrary scale factor and sign. By convention, each eigenvector is defined to be the solution vector scaled to have unit length; that is, $Z_i’Z_i=1$. Furthermore, the eigenvectors are mutually orthogonal; ($Z_i’Z_i=0$ when $ine j$).

When the eigenvalues are not distinct, there is an additional degree of arbitrariness in defining the subsets of vectors corresponding to each subset of non-distinct eigenvalues.

Eigen Values and Eigen Vectors Examples

Example: Let the matrix $A=begin{bmatrix}10&3\3 & 8end{bmatrix}$.

The eigenvalues of $A$ can be found by $|A-lambda,I|=0$. Therefore,

begin{align*}
|A-lambda, I|&=Big|begin{matrix}10-lambda & 3\ 3& 8-lambdaend{matrix}Big|\
Rightarrow (10-lambda)(8-lambda)-9 &= lambda^2 -18lambda+71 =0
end{align*}

By Quadratic formula, $lambda_1 = 12.16228$ and $lambda_2=5.83772$, arbitrarily ordered from largest to smallest. Thus the matrix of eigenvalues of $A$ is

$$L=begin{bmatrix}12.16228 & 0 \ 0 & 5.83772end{bmatrix}$$

The eigenvectors corresponding to $lambda_1=12.16228$ are obtained by solving

$(A-lambda_2,I)Z_i=0$ for the element of $Z_i$;

begin{align*}
(A-12.16228I)begin{bmatrix}Z_{11}\Z_{21}end{bmatrix} &=0\
left(begin{bmatrix}10&3\3&8end{bmatrix}-begin{bmatrix}12.162281&0\0&12.162281end{bmatrix}right)begin{bmatrix}Z_{11}\Z_{21}end{bmatrix}&=0\
begin{bmatrix}-2.162276 & 3\ 3 & -4.162276end{bmatrix}begin{bmatrix}Z_{11}\Z_{21}end{bmatrix}&=0
end{align*}

Arbitrary setting $Z_{11}=1$ and solving for $Z_{11}$, using first equation gives $Z_{21}=0.720759$. Thus the vector $Z_1’=begin{bmatrix}1 & 0.72759end{bmatrix}$ statisfy first equation.

$Length(Z_1)=sqrt{Z_1’Z_1}=sqrt{1.5194935}=1.232677$, where $Z’=0.999997$.

begin{align*}
Z_1 &=begin{bmatrix} 0.81124&0.58471end{bmatrix}\
Z_2 &=begin{bmatrix}-0.58471&0.81124end{bmatrix}
end{align*}

The elements of $Z_2$ are found in the same manner. Thus the matrix of eigenvectors for $A$ is

$$Z=begin{bmatrix}0.81124 &-0.58471\0.8471&0.81124end{bmatrix}$$

Note that matrix $A$ is of rank two because both eigenvalues are non-zero. The decomposition of $A$ into two orthogonal matrices each of rank one.

begin{align*}
A &=A_1+A_2\
A_1 &=lambda_1Z_1Z_1′ = 12.16228 begin{bmatrix}0.81124\0.58471end{bmatrix}begin{bmatrix}0.81124 & 0.58471end{bmatrix}\
&= begin{bmatrix}8.0042 & 5.7691\ 5.7691&4.1581end{bmatrix}\
A_2 &= lambda_2Z_2Z_2′ = begin{bmatrix}1.9958 & -2.7691\-2.7691&3.8419end{bmatrix}
end{align*}

EigenValues and EigenVectors (2020)

Thus the sum of eigenvalues $lambda_1+lambda_2=18$ is $trace(A)$. Thus the sum of the eigenvalues for any square symmetric matrix is equal to the trace of the matrix. The trace of each of the component rank $-1$ matrix is equal to its eigenvalue. $trace(A_1)=lambda_1$ and $trace(A_2)=lambda_2$.

In summary, understanding eigenvalues and eigenvectors is essential for various mathematical and scientific applications. They provide valuable tools for analyzing linear transformations, solving systems of equations, and understanding complex systems in various fields.

itfeature.com

Computer MCQs Test Online

R and Data Analysis

Mathematical Expressions Used in Math Word Problems

Mathematical Expressions

To solve any mathematical problem, it is important to understand and know the mathematical meaning of the words used in the problem. Many mathematical expressions or even general expressions have the same meaning and may indicate a relationship between quantities or a mathematical operation (such as addition, subtraction, multiplication, and division).

Equality Expressions

All of the following expressions represent that two quantities are equal (=).

  • is equal to (or equals)
  • is the same as
  • the result is
  • yields
  • gives

For example, 2+5 is equal to 7.

Note: The word “is” is also used to mean “equals”. For example, 8 is 5 more than 3, or 8 equals 5 + 3.

Mathematical Expressions

Addition Expressions

Among mathematical expressions. all of the following expression shows that the numbers $X$ and $Y$ are added

Mathematical Expressions DescriptionExample
$X+Y$$2+3$
The sum of $X$ and $Y$The sum of 2 and 3
The total of $X$ and $Y$The total of 2 and 3
$X$ added to $Y$2 added to 3
$X$ increased by $Y$2 increased by 3
$X$ more than $Y$2 more than 3
$X$ greater than $Y$2 greater than 3

Subtraction Expressions

All of the following expressions show that the number $Y$ is to be subtracted from the number $X$

Mathematical Expressions DescriptionExample
$X-Y$$7-2$
$X$ minus $Y$7 minus 3
$X$ less $Y$7 less 3
The difference of $X$ and $Y$The difference between 7 and 3
from $X$ subtract $Y$from 7 subtract 3
$X$ take away $Y$7 take away 3
$X$ decreased by $Y$7 decreased by 3
$X$ diminished by $Y$7 diminished by 3
$Y$ is subtracted from $X$7 is subtracted from 3
$Y$ less than $X$3 less than 7

Multiplication Expressions

The following expression can be used if the numbers $X$ and $Y$ need to be multiplied.

Expression DescriptionExample
$X \times Y$$2\times 3$
$X$ multiplied by $Y$2 multiplied by 3
The product of $X$ and $Y$The product of 2 and 3
$X$ times $Y$2 times 3

For multiplication of two or more numbers symbol $\times$ or $\cdot$ is used. In algebra, a number before a variable is a coefficient, such as $4Y$ means 4 times $Y$, where 4 is the coefficient.

Division Expressions

All of the following expressions indicate the division of the numbers $X$ and $Y$ (in the order $X \div Y$.

Mathematical ExpressionsExample
$X \div Y$$10 \div 2$
$X$ divided by $Y$10 divided by 2
The quotient of $X$ and $Y$The quotient of 10 and 2

Read about Significant Figures

Learn R Programming Language

Statistical Data: Introduction and Real Life Examples (2020)

By statistical Data we mean, the piece of information collected for descriptive or inferential statistical analysis of the data. Data is everywhere. Therefore, everything that has past and/ or features is called statistical data.

One can find the Statistical data

  • Any financial/ economics data
  • Transactional data (from stores, or banks)
  • The survey, or census (of unemployment, houses, population, roads, etc)
  • Medical history
  • Price of product
  • Production, and yields of a crop
  • My history, your history is also statistical data

Data

Data is the plural of datum — it is a piece of information. The value of the variable (understudy) associated with one element of a population or sample is called a datum (or data in a singular sense or data point). For example, Mr. Asif entered college at the age of 18 years, his hair is black, has a height of 5 feet 7 inches, and he weighs about 140 pounds. The set of values collected for the variable from each of the elements belonging to the sample is called data (or data in a plural sense). For example, a set of 25 weights was collected from the 25 students.

Types of Data

The data can be classified into two general categories: quantitative data and qualitative data. The quantitative data can further be classified as numerical data that can be either discrete or continuous. The qualitative data can be further subdivided into nominal, ordinal, and binary data.

Qualitative data represent information that can be classified by some quality, characteristics, or criterion—for example, the color of a car, religion, blood type, and marital status.

When the characteristic being studied is non-numeric it is called a qualitative variable or an attribute. A qualitative variable is also known as a categorical variable. A categorical variable is not comparable to taking numerical measurements. Observations falling in each category (group, class) can only be counted for examples, gender (either male or female), general knowledge (poor, moderate, or good), religious affiliation, type of automobile owned, city of birth, eye color (red, green, blue, etc), etc. Qualitative variables are often summarized in charts graphs etc. Other examples are what percent of the total number of cars sold last month were Suzuki, what percent of the population has blue eyes?

Quantitative data result from a process that quantifies, such as how much or how many. These quantities are measured on a numerical scale. For example, weight, height, length, and volume.

When the variables studied can be reported numerically, the variable is called a quantitative variable. e.g. the age of the company president, the life of an automobile battery, the number of children in a family, etc. Quantitative variables are either discrete or continuous.

Statistical Data

Note that some data can be classified as either qualitative or quantitative, depending on how it is used. If a numerical is used as a label for identification, then it is qualitative; otherwise, it is quantitative. For example, if a serial number on a car is used to identify the number of cars manufactured up to that point then it is a quantitative measure. However, if this number is used only for identification purposes then it is qualitative data.

Binary Data

The binary data has only two possible values/states; such as, defected or non-defective, yes or no, and true or false, etc. If both of the values are equally important then it is binary symmetric data (for example, gender). However, if both of the values are not equally important then it can be called binary asymmetric data (for example, result: pass or fail, cancer detected: yes or no).

For quantitative data, a count will always give discrete data, for example, the number of leaves on a tree. On the other hand, a measure of a quantity will usually be continuous, for example, weigh 160 pounds, to the nearest pound. This weight could be any value in the interval say 159.5 to 160.5.

The following are some examples of Qualitative Data. Note that the outcomes of all examples of Qualitative Variables are non-numeric.

  • The type of payment (cheque, cash, or credit) used by customers in a store
  • The color of your new cell phone
  • Your eyes color
  • The make of the types on your car
  • The obtained exam grade

The following are some examples of Quantitative Data. Note that the outcomes of all examples of Quantitative Variables are numeric.

  • The age of the customer in a stock
  • The length of telephone calls recorded at a switchboard
  • The cost of your new refrigerator
  • The weight of your watch
  • The air pressure in a tire
  • the weight of a shipment of tomatoes
  • The duration of a flight from place A to B
  • The grade point average

Learn about the Measures of Central Tendency

Visit Online MCQs Quiz Website

Hierarchical Multiple Regression SPSS

In this tutorial, we will learn how to perform hierarchical multiple regression analysis SPSS, which is a variant of the basic multiple regression analysis that allows specifying a fixed order of entry for variables (regressors) to control for the effects of covariates or to test the effects of certain predictors independent of the influence of other.

Step By Step Procedure of Hierarchical Multiple Regression SPSS

The basic command for hierarchical multiple regression analysis SPSS is “regression -> linear”:

Hierarchical Multiple Regression SPSS

In the main dialog box of linear regression (as given below), input the dependent variable. For example “income” variable from the sample file of customer_dbase.sav available in the SPSS installation directory.

Next, enter a set of predictor variables into an independent(s) pan. These variables that you want SPSS to put into the regression model first (that you want to control for when testing the variables). For example, in this analysis, we want to find out whether the “Number of people in the house” predicts the “Household income in thousands”.

We are also concerned that other variables like age, education, gender, union member, or retirement might be associated with both the “number of people in the house” and “household income in thousands”. To make sure that these variables (age, education, gender, union member, and retired) do not explain away the entire association between the “number of people in the house” and “Household income in thousands”, let’s put them into the model first.

This ensures that they will get credit for any shared variability that they may have with the predictor that we are interested in, “Number of people in the house”. any observed effect of “Number of people in the house” can then be said to be “independent of the effects of these variables that already have been controlled for. See the figure below

Linear Regression Variable

In the next step put the variable that we are interested in, which is the “number of people in the house”. To include it in the model click the “NEXT” button. You will see all of the predictors (that were entered previously) disappear. Note that they are still in the model, just not on the current screen (block). You will also see Block 2 of 2 above the “independent(s)” pan.

Hierarchical Regression

Now click the “OK” button to run the analysis.

Note you can also hit the “NEXT” button again if you are interested in entering a third or fourth (and so on) block of variables.

Often researchers enter variables as related sets. For example demographic variables in the first step, all potentially confounding variables in the second step, and then the variables that you are most interested in in the third step. However, it is not necessary to follow. One can also enter each variable as a separate step if that seems more logical based on the design of your experiment.

Output Hierarchical Multiple Regression Analysis

Using just the default “Enter” method, with all the variables in Block 1 (demographics) entered together, followed by “number of people in the house” as a predictor in Block 2, we get the following output:

Output Hierarchical Regression

The first table of output windows confirms that variables are entered in each step.

The summary table shows the percentage of explained variation in the dependent variable that can be accounted for by all the predictors together. The change in $R^2$ (R-squared) is a way to evaluate how much predictive power was added to the model by the addition of another variable in STEP 2. In our example, predictive power does not improve with the addition of another predictor in STEP 2.

Hierarchical Regression Output

The overall significance of the model can be checked from this ANOVA table. In this case, both models are statistically significant.

Hierarchical Regression Output

The coefficient table is used to check the individual significance of predictors. For model 2, the Number of people in the household is statistically non-significant, therefore excluded from the model.

Learn about Multiple Regression Analysis

R Language Frequently Asked Questions