In principle, a SARIMAX model is a linear regression model that uses a SARIMA-type process. This model is useful in cases where we suspect that residuals may exhibit a seasonal trend or pattern.$$w_t = y_t - \beta_1 x_{1,t}-\beta_2 x_{2,t} - \cdots - \beta_b x_{b,t}$$ $$(1-\sum_{i=1}^p {\phi_i L^i})(1-\sum_{j=1}^P {\Phi_j L^{j \times s}})(1-L)^d (1-L^s)^D w_t -\eta = (1+\sum_{i=1}^q {\theta_i L^i})(1+\sum_{j=1}^Q {\Theta_j L^{j \times s}}) a_t$$ $$a_t \sim \textrm{i.i.d} \sim \Phi(0,\sigma^2)$$

Where:

- $L$ is the lag (aka back-shift) operator.
- $y_t$ is the observed output at time $t$.
- $x_{k,t}$ is the k-th exogenous input variable at time $t$.
- $\beta_k$ is the coefficient value for the k-th exogenous (explanatory) input variable.
- $b$ is the number of exogenous input variables.
- $w_t$ is the auto-correlated regression residuals.
- $p$ is the order of the non-seasonal AR component.
- $P$ is the order of the seasonal AR component.
- $q$ is the order of the non-seasonal MA component.
- $Q$ is the order of the seasonal MA component.
- $s$ is the seasonal length.
- $D$ is the seasonal integration order of the time series.
- $\eta$ is a constant in the SARIMA model.
- $a_t$ is the innovation, shock, or error term at time $t$.
- $\{a_t\}$ time series observations are independent and identically distributed (i.e., $i.i.d$) and follow a Gaussian distribution (i.e., $\Phi(0,\sigma^2)$).

Re-ordering the terms in the equation above and assuming the differenced (both seasonal and non-seasonal) results in a stationary time series ($z_t$) yields the following:$$z_t = (1-L)^d(1-L^s)^D w_t$$ $$\mu = E[z_t] = \frac{\eta}{(1-\phi_1-\phi_2-\cdots-\phi_p)(1-\Phi_1-\Phi_2-\cdots-\Phi_P)}$$ $$(1-\sum_{i=1}^p {\phi_i L^i})(1-\sum_{j=1}^P {\Phi_j L^{j \times s}})(1-L)^d (1-L^s)^D (w_t -\mu) = (1+\sum_{i=1}^q {\theta_i L^i})(1+\sum_{j=1}^Q {\Theta_j L^{j \times s}}) a_t$$

## Remarks

- The variance of the shocks is constant or time-invariant.
- The order of an AR component process is solely determined by the order of the last lagged auto-regressive variable with a non-zero coefficient (i.e., $w_{t-p}$).
- The order of an MA component process is solely determined by the order of the last moving average variable with a non-zero coefficient (i.e., $a_{t-q}$).
- In principle, you can have fewer parameters than the orders of the model.

## Related Links

## References

- James Douglas Hamilton; Time Series Analysis, Princeton University Press; 1st edition(Jan 11, 1994), ISBN: 691042896.
- Tsay, Ruey S.; Analysis of Financial Time Series, John Wiley & SONS; 2nd edition(Aug 30, 2005), ISBN: 0-471-690740.

## Comments

Article is closed for comments.