This chapter addresses specifying ARIMA() models for time series, focusing on selecting orders , , and .
It introduces tools like the sample autocorrelation function (ACF), partial autocorrelation function (PACF), extended autocorrelation function (EACF), and the Dickey-Fuller unit-root test, applying them to simulated and real data.
Overview of Model Specification
ARIMA models cover stationary and nonstationary series. The specification process, popularized as the Box-Jenkins method, involves:
- Tentatively selecting , , and .
- Estimating parameters (, , ).
- Checking model adequacy, iterating if needed.
Properties of the Sample Autocorrelation Function
The Autocorrelation Function (ACF) measures the correlation between a time series and its own lagged values, showing how values at time relate to values at previous times . Values range from -1 to 1; significant spikes indicate strong correlation at specific lags .
The sample ACF, , estimates the theoretical ACF . Its sampling properties are complex, but for a stationary series (with , ), is asymptotically normal with variance:
Special Cases
- White Noise: , ().
- AR(1), : , e.g., .
- MA(1): , ().
- MA(q): ().
Model | , | ||
---|---|---|---|
White Noise | 0 | ||
AR(1), | (large ) | ||
MA(1), |
Partial and Extended Autocorrelation Functions
Partial Autocorrelation Function (PACF)
- Measures the correlation between a time series and its lagged values, after removing the effects of earlier lags.
- For lag , PACF shows the direct correlation between and , adjusting for correlations at lags .
The PACF, , measures correlation between and after removing effects of :
- , where is the best linear predictor.
- AR(p): ().
- MA(1): (), decays exponentially.
- Computed via Yule-Walker equations: , solved recursively.
Sample PACF, , uses in place of ; for AR(p), ().
Extended Autocorrelation Function (EACF)
- A tool to identify both AR and MA orders ( and ) in ARIMA models by analyzing the autocorrelation of residuals after fitting an AR model.
- Presented as a table, where rows are AR orders and columns are MA orders; a “zero triangle” of insignificant autocorrelations indicates the appropriate .
For ARMA(), the EACF filters out AR components, leaving an MA(q) process. Define , where are estimated iteratively. The sample ACF of cuts off after lag when , forming a triangle of zeros in an EACF table.
Model Behavior | ACF | PACF |
---|---|---|
AR(p) | Tails off | Cuts off at |
MA(q) | Cuts off at | Tails off |
ARMA(p,q) | Tails off | Tails off |
Specification of Simulated Time Series
Simulations (e.g., MA(1) with , AR(1) with , ARMA(1,1)) show ACF and PACF patterns often match theory, though sample variability can mislead (e.g., MA(2) resembling MA(1)).
Nonstationarity
Nonstationary series (e.g., random walks, oil prices) exhibit slowly decaying sample ACFs. Differencing reduces them to stationary ARMA models (e.g., IMA(1,1) for oil prices). Overdifferencing introduces spurious MA terms (e.g., for a random walk).
Dickey-Fuller Unit-Root Test
Tests in ( stationary):
- Regression: , under .
- ADF statistic (t-statistic of ) has a nonstandard distribution; applied to a random walk, it supports a unit root.
Other Specification Methods
- AIC: , penalizes complexity.
- AIC: Adds for bias correction.
- BIC: , consistent for true ARMA orders.
- Hannan-Rissanen: Fits high-order AR, uses residuals to estimate ARMA via regression.
Subset ARMA models (e.g., ) are identified via best-subset regression.
Specification of Actual Time Series
- LA Rainfall: Logarithms suggest white noise.
- Chemical Color: AR(1) from PACF.
- Hare Abundance: Square root, AR(2) or AR(3).
- Oil Prices: IMA(1,1) or ARIMA(1,1,0) on logarithms.
Summary
Tools like sample ACF, PACF, EACF, and unit-root tests guide ARIMA specification. Iteratively refining models balances simplicity and fit, as demonstrated with diverse examples.