A simple linear regression

A simple linear regression uses the following notation:

\(y\) is the outcome/dependent variable,
\(x\) is the explanatory/independent variable,
\(\beta\) is the slope, also called a coefficient,
\(\beta_0\) is the y-intercept, and
\(\epsilon\) is the error term that captures our uncertainty over the estimated values.

The equation can be expressed as:

\[\begin{align} y &= \beta x + \beta_0 + \epsilon. \end{align}\]

This is similar to a standard linear equation, with new notation (i.e. \(\beta_0\) instead of \(c\) or \(b\) for the y-intercept/constant).

Note that there are many nonlinear regression functions that are more complex than a straight line in one dimension. A slightly more complex function, called multiple linear regression, will be covered in the second part of this document.