SAS/ETS 9.22 User''''s Guide 190 ppt

Using Specification of Weight Constants Any number of weighting constants can be specified.. The middle constant or the constant to the right of the middle if an even number of weight co

Trang 1

1882 F Chapter 29: The TIMESERIES Procedure

Table 29.3 Seasonal Adjustment Formulas

LOGADD log.Ot/D T CtC St C It

PSEUDOADD Ot D T Ct.St C It 1/

LOGADD centered moving average of l og.Ot/ PSEUDOADD centered moving average of Ot

seasonal-irregular component SIC MULT SIt D StIt D Ot=T Ct

LOGADD SIt D StC It D log.Ot/ T Ct

PSEUDOADD SIt D StC It 1D Ot=T Ct

LOGADD seasonal Averages of SIt

PSEUDOADD seasonal Averages of SIt

PSEUDOADD It D SIt St C 1 trend-cycle-seasonal component TCS MULT T CSt D T CtSt D Ot=It

LOGADD T CSt D T Ct C St D Ot It

PSEUDOADD T CSt D T CtSt

PSEUDOADD Tt D T Ct Ct

PSEUDOADD Ct D T Ct Tt

LOGADD SAt D Ot=exp.St/D exp.T CtC It/ PSEUDOADD SAt D T CtIt

The trend-cycle component is computed from the s-period centered moving average as follows:

T Ct D

bs=2c

X

kD bs=2c

yt Ck=s

Trang 2

The seasonal component is obtained by averaging the seasonal-irregular component for each season.

t Dk mod s

SIt

T =s

where 0j T =s and 1ks The seasonal components are normalized to sum to one (multiplica-tive) or zero (addi(multiplica-tive)

Correlation Analysis

Correlation analysis can be performed on the working series by specifying the OUTCORR= option

or one of the PLOTS= options that are associated with correlation The CORR statement enables you to specify options that are related to correlation analysis

Autocovariance Statistics

LAGS h2 f0; : : : ; H g

N Nhis the number of observed products at lag h, ignoring missing values

t DhC1.yt y/.yt h y/

h

PT

t DhC1.yt y/.yt h y/ when embedded missing values are present

Autocorrelation Statistics

ACF

r

1 T

1C 2Ph 1

j D1 O.j /2

ACFNORM Norm.O.h// D O.h/=Std O.h//

ACFPROB P rob.O.h// D 2 1 ˆ jNorm O.h//j//

ACFLPROB LogP rob.O.h// D log10.P rob.O.h//

ACF2STD F lag.O.h// D

8

<

:

1 O.h/ > 2Std O.h//

0 2S t d.O.h// < O.h/ < 2Std O.h//

1 O.h/ < 2Std O.h//

Partial Autocorrelation Statistics

PACF O'.h/ D .0;h 1/ jghj D1

PACFSTD S t d.O'.h// D 1=pN0

PCFNORM Norm.O'.h// D O'.h/=Std O'.h//

PACFPROB P rob.O'.h// D 2 1 ˆ jNorm O'.h//j//

Trang 3

PACFLPROB LogP rob.O'.h// D log10.P rob.O'.h//

PACF2STD F lag.O'.h// D

8

<

:

1 O'.h/ > 2Std O'.h//

0 2S t d.O'.h// < O'.h/ < 2Std O'.h//

1 O'.h/ < 2Std O'.h//

Inverse Autocorrelation Statistics

IACFSTD S t d O h//D 1=pN0

IACFNORM Norm O h//D O.h/=Std O.h//

IACFPROB P rob O h//D 21 ˆ

jNorm O.h//j

IACFLPROB LogP rob O h//D log10.P rob O h//

IACF2STD F lag O h//D

8 ˆ

ˆ

1 O.h/ > 2Std O.h//

0 2S t d O h// < O h/ < 2S t d O h//

1 O.h/ < 2Std O.h//

White Noise Statistics

j D1.j /2=.T j /

j D1Nj.j /2when embedded missing values are present WNPROB P rob.Q.h//D max.1;h p/.Q.h//

WNLPROB LogP rob.Q.h//D log10.P rob.Q.h//

Cross-Correlation Analysis

Cross-correlation analysis can be performed on the working series by specifying the OUTCROSS-CORR= option or one of the CROSSPLOTS= options that are associated with cross-correlation The CROSSCORR statement enables you to specify options that are related to cross-correlation analysis

Cross-Correlation Statistics

The cross-correlation statistics for the variable x supplied in a VAR statement and variable y supplied

in a CROSSVAR statement are:

LAGS h2 f0; : : : ; H g

N Nhis the number of observed products at lag h, ignoring missing values

PT

t DhC1.xt x/.yt h y/

Trang 4

CCOV x;y.h/D N1hPT

t DhC1.xt x/.yt h y/ when embedded missing values are present

CCFSTD S t d.Ox;y.h//D 1=pN0

CCFNORM Norm.Ox;y.h//D Ox;y.h/=S t d.Ox;y.h//

CCFPROB P rob.Ox;y.h//D 2 1 ˆ jNorm Ox;y.h//j

CCFLPROB LogP rob.Ox;y.h//D log10.P rob.Ox;y.h//

CCF2STD F lag.Ox;y.h//D

8

<

:

1 Ox;y.h/ > 2S t d.Ox;y.h//

0 2S t d.Ox;y.h// < Ox;y.h/ < 2S t d.Ox;y.h//

1 Ox;y.h/ < 2S t d.Ox;y.h//

Spectral Density Analysis

Spectral analysis can be performed on the working series by specifying the OUTSPECTRA= option or by specifying the PLOTS=PERIODOGRAM or PLOTS=SPECTRUM option in the PROC TIMESERIES statement PROC TIMESERIES uses the finite Fourier transform to decompose data series into a sum of sine and cosine terms of different amplitudes and wavelengths The Fourier transform decomposition of the series xt is

xt D a20 C

m

X

kD1

Œakcos.!kt /C bksin.!kt /

where

t is the time subscript, t D 1; 2; : : : ; n

xt are the equally spaced time series data

n is the number of observations in the time series

m is the number of frequencies in the Fourier decomposition: mD n2 if n is even,

mD n 12 if n is odd

ak are the cosine coefficients

bk are the sine coefficients

!k are the Fourier frequencies: !k D 2kn

Functions of the Fourier coefficients ak and bk can be plotted against frequency or against wave length to form periodograms The amplitude periodogram Jk is defined as follows:

Jk D n

2.a

2

kC bk2/

The Fourier decomposition is performed after the ACCUMULATE=, DIF=, SDIF= and TRANS-FORM= options in the ID and VAR statements have been applied

Trang 5

Computational Method

If the number of observations, n, factors into prime integers that are less than or equal to 23, and the product of the square-free factors of n is less than 210, then the procedure uses the fast Fourier transform developed by Cooley and Tukey (1965) and implemented by Singleton (1969) If n cannot

be factored in this way, then the procedure uses a Chirp-Z algorithm similar to that proposed by Monro and Branch (1976)

Missing Values

Missing values are replaced with an estimate of the mean to perform spectral analyses This treatment

of a series with missing values is consistent with the approach used by Priestley (1981)

Using Specification of Weight Constants

Any number of weighting constants can be specified The constants are interpreted symmetrically about the middle weight The middle constant (or the constant to the right of the middle if an even number of weight constants is specified) is the relative weight of the current periodogram ordinate The constant immediately following the middle one is the relative weight of the next periodogram ordinate, and so on The actual weights used in the smoothing process are the weights specified in the WEIGHTS option, scaled so that they sum to 1

The moving average calculation reflects at each end of the periodogram to accommodate the period-icity of the periodogram function

For example, a simple triangular weighting can be specified using the following WEIGHTS option:

spectra / weights 1 2 3 2 1;

Using Kernel Specifications

You can specify one of ten different kernels in the SPECTRA statement The two parameters c 0 and e 0 are used to compute the bandwidth parameter

M D cqe

where q is the number of periodogram ordinates + 1,

qD floor.n=2/ C 1

To specify the bandwidth explicitly, set c D to the desired bandwidth and e D 0

For example, a Parzen kernel with a support of 11 periodogram ordinates can be specified using the following kernel option:

spectra / parzen c=5 expon=0;

Trang 6

Kernels are used to smooth the periodogram by using a weighted moving average of nearby points.

A smoothed periodogram is defined by the equation

O

Ji.M /D

q

X

D q

w

M

Q

Ji C

where w.x/ is the kernel or weight function

At the endpoints, the moving average is computed cyclically; that is,

Q

Ji C D

8

ˆ

Ji C 0 i C q

J .i C/ iC < 0

J2q i C/ iC > q where Ji is the i th periodogram ordinate

The TIMESERIES procedure supports the following kernels:

BART: Bartlett kernel

(

1 jxj jxj1

PARZEN: Parzen kernel

8 ˆ

ˆ

1 6jxj2C 6jxj3 0jxj12 2.1 jxj/3 12jxj1

QS: quadratic spectral kernel

2

.x/2

sin.x=M /

TUKEY: Tukey-Hanning kernel

( 1C cos.x//=2 jxj1

TRUNCAT: truncated kernel

(

1 jxj1

0 otherwise

Trang 7

Alternatively, kernel functions can be applied as filters that estimate the autocovariance function in the time domain prior to computing the periodogram by using the DOMAIN=TIME option as

where .h/ D w.h/ To approximate this operation, complementary kernel weighting functions, w. /, can be used to smooth the periodogram by using the same cyclical moving average computation described previously The frequencies used to weight periodogram ordinates are D =q The five complementary weighting functions available to smooth the periodogram in this manner are: BART: Bartlett equivalent lag window filter

2M

sin.M =2/

sin.=2/

2

PARZEN: Parzen equivalent lag window filter

M3

sin.M =4/

sin.=2/

4

3sin

2.=2/

QS: quadratic spectral equivalent lag window filter

w. / D

(3M 4.1 M =/2/ jj =M

TUKEY: Tukey-Hanning equivalent lag window filter

w. / D 1

4DM. =M /C1

2DM. /C1

4DM. C =M /

DM. / D 1

2

sinŒ.M C 1=2/

sin.=2/

TRUNC: truncated equivalent lag window filter

w. / D DM. /

Singular Spectrum Analysis

Given a time series, yt, for t D 1; : : : ; T , and a window length, 2 L < T =2, singular spectrum analysis Golyandina, Nekrutkin, and Zhigljavsky (2001) decompose the time series into spectral groupings using the following steps:

Trang 8

Embedding Step

Using the time series, form a K L trajectory matrix, X, with elements

XD fxk;lgK;LkD1;lD1

such that xk;l D yk lC1 for k D 1; : : : ; Kand l D 1; : : : ; L and where K D T LC 1 By definition L K < T , because 2 L < T =2

Decomposition Step

Using the trajectory matrix, X, apply singular value decomposition to the trajectory matrix

where U represents the K L matrix that contains the left-hand-side (LHS) eigenvectors, where Q represents the diagonal L L matrix that contains the singular values, and where V represents the

L L matrix that conatins the right-hand-side (RHS) eigenvectors

Therefore,

XD

L

X

lD1

X.l/ D

L

X

lD1

ulqlvTl

where X.l/represents the K L principal component matrix, ul represents the K 1 left-hand-side (LHS) eigenvector, ql represents the singular value, and vl represents the L 1 right-hand-side (RHS) eigenvector associated with the l th window index

Grouping Step

For each group index, mD 1; : : : ; M , define a group of window indices Im f1; : : : ; Lg Let

XI m D X

l2I m

X.l/D X

l2I m

ulqlvTl

represent the grouped trajectory matrix for group Im If groupings represent a spectral partition,

M

[

mD1

ImD f1; : : : ; Lg and Im\ InD ; for m ¤ n

then according to the singular value decomposition theory,

XD

M

X

mD1

XI m

Averaging Step

For each group index, mD 1; : : : ; M , compute the diagonal average of XI m,

Qxt.m/D 1

nt

e t

X

lDs

xt lC1;l.m/

Trang 9

where

st D T t 1; et D L; nt D T tC 1 for T LC 1 < t T

If the groupings represent a spectral partition, then by definition

yt D

M

X

mD1

Qxt.m/

Hence, singular spectrum analysis additively decomposes the original time series, yt, into m compo-nent series Qxt.m/for mD 1; : : : ; M

Specifying the Window Length

You can explicitly specify the maximum window length, 2 L 1000, using the LENGTH= option

or implicitly specify the window length using the INTERVAL= option in the ID statement or the SEASONALITY= option in the PROC TIMESERIES statement

Either way the window length is reduced based on the accumulated time series length, T , to enforce the requirement that 2 L T =2

Specifying the Groups

You can use the GROUPS= option to explicitly specify the composition and number of groups,

Im f1; : : : ; Lg or use the THRESHOLDPCT= option in the SSA statement to implicitly specify the grouping The THRESHOLDPCT= option is useful for removing noise or less dominant patterns from the accumulated time series

Let 0 < ˛ < 1 be the cumulative percent singular value THRESHOLDPCT= Then the last group,

IM D fl˛; : : : ; Lg, is determined by the smallest value such that

0

@

l ˛ 1

X

lD1

ql

X

lD1

ql 1

A ˛ where 1 < l˛ L

Using this rule, the last group, IM, describes the least dominant patterns in the time series and the size of the last group is at least one and is less than the window length, L 2

Data Set Output

The TIMESERIES procedure can create the OUT=, OUTCORR=, OUTCROSSCORR=, OUTDE-COMP=, OUTSEASON=, OUTSPECTRA=, OUTSSA=, OUTSUM=, and OUTTREND= data sets

In general, these data sets contain the variables listed in the BY statement If an analysis step that is related to an output data step fails, the values of this step are not recorded or are set to missing in the related output data set and appropriate error and/or warning messages are recorded in the log

Trang 10

OUT= Data Set

The OUT= data set contains the variables specified in the BY, ID, VAR, and CROSSVAR statements

If the ID statement is specified, the ID variable values are aligned and extended based on the ALIGN= and INTERVAL= options The values of the variables specified in the VAR and CROSSVAR statements are accumulated based on the ACCUMULATE= option, and missing values are interpreted based on the SETMISSING= option

OUTCORR= Data Set

The OUTCORR= data set contains the variables specified in the BY statement as well as the variables listed below The OUTCORR= data set records the correlations for each variable specified in a VAR statement (not the CROSSVAR statement)

When the CORR statement TRANSPOSE=NO option is omitted or specified explicitly, the variable namesare related to correlation statistics specified in the CORR statement options and the variable valuesare related to the NLAG= or LAGS= option

ACFSTD autocorrelation standard errors

ACF2STD an indicator of whether autocorrelations are less than (–1), greater than (1), or

within (0) two standard errors of zero ACFNORM normalized autocorrelations

ACFPROB autocorrelation probabilities

ACFLPROB autocorrelation log probabilities

PACF partial autocorrelations

PACFSTD partial autocorrelation standard errors

PACF2STD an indicator of whether partial autocorrelations are less than (–1), greater than

(1), or within (0) two standard errors of zero PACFNORM partial normalized autocorrelations

PACFPROB partial autocorrelation probabilities

PACFLPROB partial autocorrelation log probabilities

IACF inverse autocorrelations

IACFSTD an indicator of whether inverse autocorrelations are less than (–1), greater than

(1), or within (0) two standard errors of zero

Định dạng
Số trang	10
Dung lượng	283,74 KB