May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.Econometric Models Econometric models , also called causal or based models, us
Trang 1DECISION MAKING
Time Series Analysis and Forecasting
12
Trang 2 There are two problems with this approach:
It is not always easy to undercover historical patterns
There are no guarantees that past patterns will
continue in the future.
Trang 3© 2015 Cengage Learning All Rights Reserved May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Forecasting Methods:
An Overview
There are many forecasting methods
available, and there is little agreement
as to the best forecasting method.
The methods can be divided into three
groups:
1 Judgmental methods
2 Extrapolation (or time series) methods
3 Econometric (or causal) methods
The first method is basically
nonquantitative; the last two are
quantitative.
Trang 4 All of these methods look for patterns in the historical
series and then extrapolate these patterns into the future.
Complex models are not always better than simpler
models.
Simpler models track only the most basic underlying patterns and can be more flexible and accurate in forecasting the future.
Trang 5© 2015 Cengage Learning All Rights Reserved May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Econometric Models
Econometric models , also called causal or based models, use regression to forecast a time series variable by using other explanatory time series variables.
regression- Prediction from regression equation:
Causal regression models present mathematical
Trang 6Combining Forecasts
This method combines two or more
forecasts to obtain the final forecast.
The reasoning is simple: The forecast
errors from different forecasting
methods might cancel one another.
Forecasts that are combined can be of
the same general type, or of different
Trang 7© 2015 Cengage Learning All Rights Reserved May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Components of Time Series Data
Exponential trend—occurs when observations increase at a tremendous rate.
S-shape trend—occurs when it takes a while for observations to start
increasing, but then a rapid increase occurs, before finally tapering off to a fairly constant level.
Trang 8Components of Time Series Data
(slide 2 of 4)
seasonal pattern tends to repeat itself every
year.
Trang 9© 2015 Cengage Learning All Rights Reserved May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Components of Time Series Data
(slide 3 of 4)
A time series has a cyclic component when business cycles
affect the variables in similar ways.
The cyclic component is more difficult to predict than the seasonal
component, because seasonal variation is much more regular.
The length of the business cycle varies, sometimes substantially.
The length of a seasonal cycle is generally one year, while the length of
a business cycle is generally longer than one year and its actual length
is difficult to predict.
Trang 10Components of Time Series Data
These other factors combine to create a certain amount of
unpredictability in almost all time series.
Trang 11© 2015 Cengage Learning All Rights Reserved May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Measures of Accuracy
(slide 1 of 2)
The forecast error is the difference between the actual value
and the forecast It is denoted by E with appropriate subscripts.
Forecasting software packages typically report several summary measures of the forecast errors:
MAE (Mean Absolute Error) :
RMSE (Root Mean Square Error) :
MAPE (Mean Absolute Percentage Error) :
One other measure of forecast errors is the average of the
errors
Trang 12Measures of Accuracy
(slide 2 of 2)
Some forecasting software packages choose the best model from a given class by minimizing MAE, RMSE, or MAPE.
However, small values of these measures guarantee only
that the model tracks the historical observations well.
There is still no guarantee that the model will forecast
future values accurately.
Unlike residuals from the regression equation, forecast errors are not guaranteed to always average to zero.
If the average of the forecast errors is negative, this
implies a bias, or that the forecasts tend to be too high.
If the average is positive, the forecasts tend to be too low.
Trang 13© 2015 Cengage Learning All Rights Reserved May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Testing for Randomness
(slide 1 of 2)
All forecasting models have the general form
shown in the equation below:
The fitted value is the part calculated from past data and any other available information.
The residual is the forecast error.
The fitted value should include all components of the original series that can possibly be forecast, and the leftover residuals should be unpredictable noise.
The simplest way to determine whether a time
series of residuals is random noise is to examine time series graphs of residuals visually—although this is not always reliable.
Trang 14Testing for Randomness
(slide 2 of 2)
Some common nonrandom patterns are
shown below.
Trang 15© 2015 Cengage Learning All Rights Reserved May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Runs Test
of testing for randomness It is a formal test of the null hypothesis of
randomness.
First, choose a base value, which could be the average value of the series, the median value, or even some other value.
Then a run is defined as a consecutive
series of observations that remain on one side of this base level.
If there are too many or too few runs in the
series, the null hypothesis of randomness can
be rejected.
Trang 16Example 12.1:
Stereo Sales.xlsx (slide 1 of 2)
Objective: To use StatTools’s Runs Test procedure to check
whether the residuals from this simple forecasting model
represent random noise.
Solution: Data file contains monthly sales for a chain of stereo
retailers from the beginning of 2009 to the end of 2012, during which there was no upward or downward trend in sales and no
clear seasonality.
A simple forecast model of sales is to use the average of the
series, 182.67, as a forecast of sales for each month.
The residuals for this forecasting model are found by subtracting the average from each observation.
Use the runs test to see whether there are too many or too few runs around the base of 0
Select Runs Test for Randomness from the StatTools Time Series and Forecasting dropdown, choose Residual as the variable to
analyze, and choose Mean of Series as the cutoff value.
Trang 17© 2015 Cengage Learning All Rights Reserved May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 12.1:
Stereo Sales.xlsx (slide 2 of 2)
Trang 18 Another way to check for randomness of a time series of
residuals is to examine the autocorrelations of the residuals.
An autocorrelation is a type of correlation used to measure whether values of a time series are related to their own past values.
In positive autocorrelation, large observations tend to follow large
observations, and small observations tend to follow small observations.
The autocorrelation of lag k is essentially the correlation between the original series and the lag k version of the series.
Lags are previous observations, removed by a certain number of periods from the present time.
To lag a time series in a spreadsheet by one month, “push down” the series
by one row, as shown below.
Trang 19© 2015 Cengage Learning All Rights Reserved May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 12.1 (Continued):
Stereo Sales.xlsx (slide 1 of 2)
from the forecasting model for evidence of nonrandomness.
the StatTools Time Series and Forecasting dropdown list.
Specify the times series variable (Residual), the number of lags you want, and whether you want a chart of the autocorrelations, called a
correlogram
It is common practice to ask for no more lags than 25% of the number of observations.
Any autocorrelation that is larger than two standard errors in
magnitude is worth your attention.
One measure of the lag 1 autocorrelation is provided by the Watson (DW) statistic
Durbin-A DW value of 2 indicates no lag 1 autocorrelation.
A DW value less than 2 indicates positive autocorrelation.
A DW value greater than 2 indicates negative autocorrelation.
Trang 20Example 12.1 (Continued):
Stereo Sales.xlsx (slide 2 of 2)
residuals are shown below.
Trang 21© 2015 Cengage Learning All Rights Reserved May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Regression-Based Trend
Models
Many time series follow a long-term
trend except for random variation.
This trend can be upward or downward.
A straightforward way to model this trend is
to estimate a regression equation for Y t ,
using time t as the single explanatory
variable.
The two most frequently used trend models
are the linear trend and the exponential
trend.
Trang 22Linear Trend
changes by a constant amount each time period
The equation for the linear trend model is:
The interpretation of b is that it represents the
expected change in the series from one period to the next
If b is positive, the trend is upward.
If b is negative, the trend is downward.
The intercept term a is less important: It literally
represents the expected value of the series at time t =
0.
A graph of the time series indicates whether a
linear trend is likely to provide a good fit.
Trang 23© 2015 Cengage Learning All Rights Reserved May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 12.2:
US Population.xlsx (slide 1 of 2)
Objective: To fit a linear trend line to monthly population and
examine its residuals for randomness.
Solution: Data file contains monthly population data for the
United States from January 1952 to December 2011 During this period, the population has increased steadily from about 156
million to about 313 million.
To estimate the trend with regression, use a numeric time variable representing consecutive months 1 through 720
Then run a simple regression of Population versus Time.
Trang 25© 2015 Cengage Learning All Rights Reserved May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Exponential Trend
An exponential trend is appropriate when the time series
changes by a constant percentage (as opposed to a constant
dollar amount) each period
The appropriate regression equation contains a multiplicative error term ut:
This equation is not useful for estimation; for that, a linear
equation is required.
You can achieve linearity by taking natural logarithms of both sides
of the equation, as shown below, where a = ln(c) and et = ln(ut).
The coefficient b (expressed as a percentage) is approximately the
percentage change per period For example, if b = 0.05, then the series is
increasing by approximately 5% per period.
If a time series exhibits an exponential trend, then a plot of its logarithm should be approximately linear.
Trang 26Example 12.3:
PC Device Sales.xlsx (slide 1 of 2)
see whether it has been maintained during the entire period from
1999 until the end of 2013.
Solution: Data file contains quarterly sales data for a large PC
device manufacturer from the first quarter of 1999 through the
fourth quarter of 2013.
First, estimate and interpret an exponential trend for the years
1999 through 2008
Use Excel’s Trendline tool,
with the Exponential
option, to superimpose an
exponential trend line and
the corresponding
equation on the time
series graph through 2008.
Trang 27© 2015 Cengage Learning All Rights Reserved May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 12.3:
PC Device Sales.xlsx (slide 2 of 2)
To use this equation for forecasting the future, substitute later values of Time into the regression equation, so that each future forecast is about 6.54% larger than the
previous forecast.
Check whether the exponential growth continued beyond
2008 by creating the Forecast column shown below (by
substituting into the regression equation for the entire
period through Q4-13).
Then use StatTools to create a time series graph of the two series Sales and Forecast, also shown below.
Trang 28The Random Walk Model
The random walk model is an example of using random series
as building blocks for other time series models.
In this model, the series itself is not random
However, its differences—that is, changes from one period to the
mean 0 and a standard deviation that remains constant through time.
A series that behaves according to this random walk model has
random differences, and the series tends to trend upward (if m > 0),
or downward (if m < 0) by an amount m each period.
If you are standing in period t and want to forecast Yt+1, then a
reasonable forecast is given by the equation below:
Trang 29© 2015 Cengage Learning All Rights Reserved May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 12.4:
Stock Prices.xlsx (slide 1 of 2)
Objective: To check whether
the company’s monthly
closing prices follow a random
walk model with an upward
trend and to see how future
prices can be forecast.
Solution: The monthly closing
prices of the manufacturing
company’s stock from January
2006 through April 2012 are
shown to the right.
To check for the adequacy of a
random walk model, a series
of differences is required
Calculate this series with an
Excel formula or generate it
automatically by selecting
Difference from the StatTools
Data Utilities dropdown menu.
Trang 30Example 12.4:
Stock Prices.xlsx (slide 2 of 2)
Next, plot the differences, as shown below.
To forecast future closing prices, multiply the mean
difference by the number of periods ahead, and add this to the final closing price.
Trang 31© 2015 Cengage Learning All Rights Reserved May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Moving Averages Forecasts
One of the simplest and the most frequently used
extrapolation models is the moving averages model
A moving average is the average of the observations in the past few periods, where the number of terms in the average is the span
If the span is large, extreme values have relatively little effect
on the forecasts, and the resulting series of forecasts will be much smoother than the original series.
For this reason, this method is called a smoothing method.
If the span is small, extreme observations have a larger effect
on the forecasts, and the forecast series will be much less
smooth.
Using a span requires some judgment:
If you believe the ups and downs in the series are random noise, use
a relatively large span.
If you believe each up and down is predictable, use a smaller span.