A tidy forecasting workflow
Foundations
Forecasting requires methodological discipline:
- Split the data (train/test)
- Establish benchmark models
- Forecast
- Measure accuracy
- Select a baseline
- Refit and produce the final forecast
Example dataset
We use a built-in dataset from fpp3: aus_production.
- It is already a tsibble
- It contains multiple economic production series
- We focus on the Gas series
Train/test split
- Split the series into training and test sets.
- The test set length should match the forecast horizon.
For this example, we use the last 8 quarters as test data.
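One way to create this split (a sketch using tsibble's filter_index; it assumes aus_production ends in 2010 Q2, as in current fpp3 releases, so the last 8 quarters begin in 2008 Q3):

```r
library(fpp3)

# Training set: everything up to 2008 Q2
gas_train <- aus_production |>
  filter_index(. ~ "2008 Q2") |>
  select(Quarter, Gas)

# Test set: the last 8 quarters (2008 Q3 - 2010 Q2)
gas_test <- aus_production |>
  filter_index("2008 Q3" ~ .) |>
  select(Quarter, Gas)
```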
Benchmark models
Before fitting complex models, we establish benchmarks.
Common benchmark methods in tidyverts:
- Mean method: MEAN()
- Naïve method: NAIVE()
- Seasonal naïve method: SNAIVE()
- Drift method: RW(y ~ drift())
bench_fit <- gas_train |>
  model(
    mean   = MEAN(Gas),
    naive  = NAIVE(Gas),
    snaive = SNAIVE(Gas),
    drift  = RW(Gas ~ drift())
  )
Forecast
Generate forecasts with a horizon equal to the test set length.
bench_fcst <- bench_fit |>
  forecast(h = nrow(gas_test))
(Optional) Plot forecasts:
bench_fcst |>
  autoplot(gas_train, level = 95) +
  autolayer(gas_test, Gas, alpha = 0.7)
Forecast accuracy
We measure forecast accuracy using forecast errors:
e_{T+h} = y_{T+h} - \hat{y}_{T+h|T}
where y_{T+h} is the observed value and \hat{y}_{T+h|T} is the forecast of y_{T+h} made using data up to time T.
Compute accuracy metrics:
bench_acc <- bench_fcst |>
  accuracy(aus_production) |>  # pass the full data so scaled errors (MASE) use the training set
  arrange(MASE)
bench_acc
Selection rule:
- Prefer models that perform well on MASE (scale-independent).
- For seasonal data, the seasonal naïve method (snaive) is often a strong benchmark.
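The selection can also be done programmatically from the accuracy table (a sketch; assumes bench_acc as computed above):

```r
# Name of the benchmark with the lowest MASE
best_model <- bench_acc |>
  slice_min(MASE, n = 1) |>
  pull(.model)
best_model
```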
Error metrics
Using forecast errors, we can compute summary metrics:
Common error metrics

| Type | Metric | Formula |
|---|---|---|
| Scale-dependent | Root Mean Squared Error (RMSE) | \sqrt{\text{mean}(e_{t}^{2})} |
| Scale-dependent | Mean Absolute Error (MAE) | \text{mean}(\lvert e_{t} \rvert) |
| Scale-independent | Mean Absolute Percentage Error (MAPE) | \text{mean}(\lvert p_{t} \rvert) |
| Scale-independent | Mean Absolute Scaled Error (MASE) | \text{mean}(\lvert q_{t} \rvert) |
| Scale-independent | Root Mean Squared Scaled Error (RMSSE) | \sqrt{\text{mean}(q_{t}^{2})} |

Here p_{t} = 100 e_{t} / y_{t} is a percentage error, and q_{t} is the error scaled by the in-sample MAE of a (seasonal) naïve forecast.
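These definitions can be made concrete by hand-computing the metrics in base R (a sketch with toy quarterly vectors; the MASE scaling uses in-sample seasonal naïve errors with lag m = 4):

```r
# Toy data: two years of training values, one year of test actuals and forecasts
train  <- c(210, 205, 230, 250, 215, 208, 235, 255)
actual <- c(220, 212, 240, 260)
fcst   <- c(215, 208, 235, 255)  # seasonal naive: repeat the last observed year

e <- actual - fcst  # forecast errors

rmse <- sqrt(mean(e^2))
mae  <- mean(abs(e))
mape <- mean(abs(100 * e / actual))

# Scaling factor: in-sample MAE of the seasonal naive method (m = 4 for quarterly data)
m     <- 4
scale <- mean(abs(diff(train, lag = m)))
mase  <- mean(abs(e / scale))
```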
Refit and forecast
Once a benchmark is selected based on the test set, refit it using all available data, then forecast the desired future horizon.
Example: refit snaive and forecast the next 8 quarters.
final_fit <- aus_production |>
  model(
    final_model = SNAIVE(Gas)
  )

final_fcst <- final_fit |>
  forecast(h = "8 quarters")
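A quick plot of the refit model is a useful sanity check (a sketch; assumes final_fcst from above, and that Gas is measured in petajoules as documented in fpp3):

```r
final_fcst |>
  autoplot(aus_production) +
  labs(title = "Gas production with seasonal naive forecast",
       y = "Petajoules")
```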
Communicate
Forecasting is not finished when numbers are produced.
Results must be communicated clearly and honestly.
Minimum communication checklist:
- Plot: history + forecast + prediction intervals
- Horizon: what the forecast period represents
- Baseline: state the benchmark model used
- Uncertainty: prediction intervals are essential
- Limitations: structural breaks, short samples, changing conditions