Split Plot Design in Agriculture

The article is about the use and application of split plot design in Agriculture, here we will discuss the conditions in which split plot design should be used in agriculture, the related real-life examples of split plot design, and the model of the design. In factorial experiments, there are certain situations where it becomes difficult to handle all the combinations of different levels of the factors. This may be because of the following reasons:

  • The nature of the factors may be such that levels of one factor require large experimental units as compared to the levels of other factors. For example, If the two factors are Rowing Methods and Nitrogen Levels”, then in the two-factor experiment the rowing methods require machinery, so they require large experimental units, and the nitrogen levels can be applied to the smaller units.
  • Greater precision may be required for levels of one factor as compared to the levels of other factors. For example, If we want to compare two factors, varieties, and fertilizers, and more precision is required for fertilizers, then varieties would be in the larger units and the fertilizers would be in the smaller units.
  • It may be that new treatments have to be introduced into an experiment that is already in progress.

Conditions in which Split Plot Design Used

The split plot design (and a variation, the split block) is frequently used for factorial experiments in which the nature of the experimental material or the operations involved makes it difficult to handle all factor combinations in the same manner.

  • If irrigation is more difficult to vary on a small scale and fields are large enough to be split, a split-plot design becomes appropriate.
  • Usually used with factorial sets when the assignment of treatments at random can cause difficulties, large-scale machinery can required for one factor but not another irrigation and tillage.
  • Plots that receive the same treatment must be grouped.
  • Degree of Precision: For greater precision for Factor $B$ than for factor $A$, the factor $B$ should be assigned to the subplot and factor $A$ to the main plot.
  • Relative Size of the Main Effects: If the main effect of (say factor $B$) is much larger and easier to detect than that of the other factor (factor $A$), the factor $B$ can be assigned to the main plot, and factor $A$ to the subplot. This increases the chance of detecting the difference among levels of factor $A$ which has a smaller effect.
  • Management Practices: The cultural practices required by a factor may dictate the use of large plots. For example, in an experiment to evaluate water management and variety, it may be desirable to assign water management to the main plot to minimize water movement between adjacent plots, facilitate the simulation of the water level required, and reduce border effects.

Split Plot Design in Agriculture: Irrigation and Fertilizer (Example 1)

In agricultural experiments involving two factors “irrigation” and “nitrogen” fertilizer. Sometimes, it is very convenient to apply different levels of irrigation to small neighbouring plots but there is no such difficulty for the application of different levels of nitrogen fertilizer. To meet such situations, it is desirable to have different sizes of the experimental units in the same experiment. For this purpose, we have two sizes of the experimental units. First, a design with bigger plots is taken to accommodate the factors that require bigger plots. Next, each of the bigger plots is split into as many plots as the number of treatments coming from the other factors.

The bigger plots are called main plots. The treatments allotted to them are called main plot treatments or simply main treatments. The consequent parts of the main plots are called sub-plots or split plots and the treatments allotted to them are called sub-plot treatments. The different types of treatments are allotted at random to their respective plot. Such a design is called split-plot design.

Split Plot design in Agriculture

Split Plot Design in Agriculture: Irrigation and Fertilizer (Example 2)

Let there be 3 levels of irrigation prescribing 3 different amounts of water per plot and 4 doses of nitrogen fertilizer.

First, a randomized block design with a suitable plot is taken with 3 levels of irrigation as treatments say with 5 replications of the design. The irrigation treatments are then allotted at random to each five blocks, each consisting of 4 sub-plots.

Next, each of these main plots is split into 4 sub-plots to accommodate the 4 levels of nitrogen. The main 15 plots serve as 15 replications of the subplot treatments. Treatments are allotted at random to sub-plots of each of the main plots. The split-plot design is the combination of two or more randomized designs depending on several factors, such as the plots of one design from the block of another design. The main plot treatment or the levels of one factor or different factors each of which requires a similar plot size.

Model of Split Plot Design

\begin{align} y_{ijk} &= \mu + \tau_i + \beta_j + (\tau \beta){ij} + \gamma_k + (\tau \gamma){ik} + (\beta\gamma){jk}+(\tau \beta\gamma){ijk} + \varepsilon_{ijk}\\
i &= 1,2,\cdots, a \text{ levels of factor } A\\
j &= 1,2,\cdots, b \text{ levels of factor } B\\
k &= 1,2,\cdots, c \text{ levels of factor } C
\end{align}

Model Terms

  • Linear Terms
    • $\mu$: Overall mean
    • $\tau_i$: Effect of $i$th level of $A$
    • $\beta_j$: Effect of $j$th level of $B$
    • $\gamma_k$: Effect of $k$th level of $C$
  • Interactions Terms
    • $(\tau \beta){ij}$: Interaction effect of $A$ and $B$\ $(\tau \gamma){ik}$: Interaction effect of $A$ and $C$\
    • $(\beta\gamma){jk}$: Interaction effect of $B$ and $C$\ $(\tau\beta\gamma){ijk}$:Interaction effect of $A$, $B$ and $C$ \item \textbf{Error} $\varepsilon{ijk}$: Random error at $i$th level of $A$, $j$th level of $B$ and $k$th level of $C$\
    • $\varepsilon_{ijk} \sim NID(0,\sigma_{\varepsilon}^2)$
  • Response
    • $y_{ijk}$: Response of $i$th level of $A$, $j$th level of $B$ and $k$th level of $C$

https://rfaqs.com

https://gmstat.com

Important MCQs DOE Quiz 4

The quiz contains MCQs on the Design of Experiments DOE Quiz. Most MCQs on the DOE Quiz are from Basics of Design of Experiments.

Online Multiple Choice Questions about Design of Experiments with Answers

1. When treatments are continuous quantitative variables we use?

 
 
 
 

2. What is the design of the experiment?

 
 
 
 

3. What is the purpose of the experiment?

 
 
 
 

4. What is the main characteristic of a designed experiment?

 
 
 
 

5. Robustness against missing observations means?

 
 
 
 

6. Probability theory is based on the paradigm of:

 
 
 
 

7. Conducting Bayesian experimentation we use:

 
 
 
 

8. Robustness against outliers means?

 
 
 
 

9. Randomized complete block design is used in agriculture when?

 
 
 
 

10. Evaluation and comparison of basic design configuration is important applications in:

 
 
 
 

11. The most simple blocked design is:

 
 
 
 

12. The first step in the random experiment is:

 
 
 
 

13. One of the main objectives of an experiment?

 
 
 
 

14. When prior knowledge of variables is available we should use?

 
 
 
 

15. Common types of DOE for environmental sciences include.

 
 
 
 

16. The important use of DOE in life sciences is?

 
 
 
 

17. What treatments are continuous quantitative variables we should use?

 
 
 
 

18. When the experiment is to be repeated a large number of times under similar conditions, this is called?

 
 
 
 

19. The important use of DOE in engineering is?

 
 
 
 

20. What is a random experiment?

 
 
 
 

Design of experiments (DOE) is a systematic method used to plan, conduct, analyze, and interpret controlled tests to study the relationship between factors and outcomes. Design of Experiment is a powerful tool used in various fields, including science, engineering, and business, to gain insights and optimize processes.

Design of Experiments DOE Quiz

By following the principles of DOE, one can conduct more efficient and informative experiments, ultimately leading to better decision-making and improved outcomes in various fields.

DOE Quiz with Answers

  • What is the purpose of the experiment?
  • What is a random experiment?
  • Probability theory is based on the paradigm of:
  • What is the design of the experiment?
  • What is the main characteristic of a designed experiment?
  • The first step in the random experiment is:
  • One of the main objectives of an experiment?
  • Robustness against missing observations means?
  • Robustness against outliers means?
  • Randomized complete block design is used in agriculture when?
  • When treatments are continuous quantitative variables we use?
  • The most simple blocked design is:
  • The important use of DOE in engineering is?
  • What treatments are continuous quantitative variables we should use?
  • Evaluation and comparison of basic design configuration is important applications in:
  • The important use of DOE in life sciences is?
  • When prior knowledge of variables is available we should use?
  • Conducting Bayesian experimentation we use:
  • Common types of DOE for environmental sciences include.
  • When the experiment is to be repeated a large number of times under similar conditions, this is called?

https://gmstat.com

https://rfaqs.com

Student’s t Table Free Download, 2024

The t-distribution was discovered by W. S. Gosset and R.A. Fisher. The entries in Student’s t table entries are the critical values (percentiles) for the t distribution. The applications of Student’s t distribution are related to (i) the sampling distribution of the mean $\overline{x}$, (ii) the distribution of a difference $(\overline{x}_1 – \overline{x}_2)$ of two independent populations, (iii) the distribution of two paired (dependent) populations, and (iv) the significance of correlation coefficient. It is also used for constructing confidence intervals for small samples. The Student’s t distribution is a crucial tool in statistical analysis, especially when dealing with small sample sizes. It helps us make informed decisions based on our data, even when the population standard deviation is unknown.

student's t table, Student's t Distribution

The Student’s t variable can be generated by dividing the standard normal random variable ($Z$) with the square root of a $\chi^2_{v}$ random variable. The $\chi^2_v$ is itself divided by its parameter $v$. That is

\begin{align*}
t_v &= \frac{x – \mu }{s_v} = \frac{\tfrac{(x-\mu)}{\sigma} }{\sqrt{\dfrac{\frac{v\times s^2_v}{\sigma^2} } {v}}}\\
&= \frac{Z}{\sqrt{\dfrac{\chi^2_v}{v}}}
\end{align*}

where

PDF of Student’s t Distribution

The PDF of t having $v$ degrees of freedom is

$$p(t_v) = K_v (1+\frac{t^2}{v})^{-\frac{v+1}{2}}$$

where

$$K_v = \frac{\Gamma \left[ \frac{(v+1}{2} \right]}{\sqrt{v\pi} \left(\frac{v}{2}\right) }$$

The t distribution is symmetric about zero and wider than normal density. It has one mode and it tends to be normal as $v\rightarrow \infty$. Note that $\Gamma(x)$ indicates the Gamma function.

Moments of t Distribution

Since the t distribution is symmetric and its PDF is centered at zero, the expectation (average), the median, and the mode are all zero for the t distribution with $v$ degrees of freedom. The variance ($\sigma^2$) equals $\frac{v}{v-2}$ and kurtosis is $\frac{6}{v-4}$.

For bivariate normal population, the distribution of correlation coefficient $r$ is linked with Student’s t distribution through transformation:

$$\frac{r}{\sqrt{\frac{1-r^2}{n-2}}}\rightarrow t_{n-2}$$

Generation of Pseudo Random t Variates

The following algorithm can be used to generate random variates from Student’s $t(v)$ distribution using serially generated independent uniform $U(0,1)$ random variates. For example,

Let $n=v$ (the degrees of freedom)

$C = -2n$

Repeat
$t = 2 \times U(0, 1) – 1$
$u = 2 \times U(0, 1) – 1$
$r = t^2 + u^2$
Until
$r < 1$
Return
$t \times \sqrt{\frac{n \times (r^C – 1)}{r}}$

Student’s t Table

students-t-table

Online Quiz Website: https://gmstat.com

https://itfeature.com

Simple Linear Regression Model

Frequently, we measure two or more variables on each individual and try to express the nature of the relationship between these variables (for example in simple linear regression model and correlation analysis). Using the regression technique, we estimate the relationship of one variable with another by expressing the one in terms of a linear (or more complex) function of another. We also predict the values of one variable in terms of the other. The variables involved in regression and correlation analysis are continuous. In this post we will learn about Simple Linear Regression Model.

We are interested in establishing significant functional relationships between two (or more) variables. For example, the function $Y=f(X)=a+bx$ (read as $Y$ is function of $X$) establishes a relationship to predict the values of variable $Y$ for the given values of variable $X$. In statistics (biostatistics), the function is called a simple linear regression model or simply the regression equation.

The variable $Y$ is called the dependent (response) variable, and $X$ is called the independent (regressor or explanatory) variable.

In biology, many relationships can be appropriate over only a limited range of values of $X$. Negative values are meaningless in many cases, such as age, height, weight, and body temperature.

The method of linear regression is used to estimate the best-fitting straight line to describe the relationship between variables. The linear regression gives the equation of the straight line that best describes how the outcome of $Y$ increases/decreases with an increase/decrease in the explanatory variable $X$. The equation of the regression line is
$$Y=\beta_0 + \beta_1 X,$$
where $\beta_0$ is the intercept (value of $Y$ when $X=0$) and $\beta_1$ is the slope of the line. Both $\beta_0$ and $\beta_1$ are the parameters (or regression coefficients) of the linear equation.

Estimation of Regression Coefficients in Simple Linear Regression Model

The best-fitting line is derived using the method of the \textit{Least Squares} by finding the values of the parameters $\beta_0$ and $\beta_1$ that minimize the sum of the squared vertical distances of the points from the regression line,

The dotted-line (best-fit) line passes through the point ($\overline{X}, \overline{Y}$).

The regression line $Y=\beta_0+\beta_1X$ is fit by the least-squares methods. The regression coefficients $\beta_0$ and $\beta_1$ both are calculated to minimize the sum of squares of the vertical deviations of the points about the regression line. Each deviation equals the difference between the observed value of $Y$ and the estimated value of $Y$ (the corresponding point on the regression.

The following table shows the \textit{body weight} and \textit{plasma volume} of eight healthy men.

SubjectBody Weight (KG)Plasma Volume (liters)
158.02.75
270.02.86
374.03.37
463.52.76
562.02.62
670.53.49
771.03.05
866.03.12
Simple Linear Regression Models: Scatter plot with regression line

The parameters $\beta_0$ and $\beta_1$ are estimated using the following formula (for simple linear regression model):

\begin{align}
\beta_1 &= \frac{n\sum\limits_{i=1}^{n} x_iy_i -\sum\limits_{i=1}^{n} x_i \sum\limits_{i=1}^{n} y_i} {n \sum\limits_{i=1}^{n} x_i^2 – \left(\sum\limits_{i=1}^{n} x_i \right)^2}\\
\beta_0 &= \overline{Y} – \beta_1 \overline{X}
\end{align}

Regression coefficients are sometimes known as “beta-coefficients”. When slope ($\beta_1=0$) then there is no relationship between $X$ and $Y$ variable. For the data above, the best-fitting straight line describing the relationship between plasma volume with body weight is
$$Plasma\, Volume = 0.0857 +0.0436\times Weight$$
Note that the calculated values for $\beta_0$ and $\beta_1$ are estimates of the population values, therefore, subject to sampling variations.

Simple linear regression model equation

https://gmstat.com

https://rfaqs.com