 Home Page         STATE SPACE FORECASTING           The concept of a "State" vector came from the physical sciences. A vector includes all the information about the system that carries over into the future. For example, the space shuttle moving through space. Six laws of nature impact its trajectory. You do not need to know anything about the shuttle's past, but only the six components of its vector state. Only the current information is needed to predict its future course. This is the simplest case. Suppose that same shuttle is moving through the atmosphere and there exists measurement error. Our system is not DETERMINISTIC now, but STOCHASTIC. Therefore, the current state of affairs is described not only on current things, but also on a past past "process error". This system can be described by linear equations that we must first determine from the data itself - not just the six laws of nature. Since in reality we do not know the true vectors (determinants) associated with a forecast series, we used the data itself to give us information about the process. The traditional Box-Jenkins approach does not do this. It tries to identify the correct form of the ARIMA processes and then estimate and test to see if patterns remain in the error component. In State Space, the data associated with the dependent and independent variables are used to extract sets of linear combinations of past and future values. This extraction is performed by a common multivariate technique called CANONICAL CORRELATION. In canonical correlation, our interest centers on the linear relationship between one battery of variables (y1,y2) and another battery of variables (x1,x2). The objective is to find sets of linear composites subject to certain conditions. First, we compute the linear composite of y1 called t1. These linear combinations are the results of applying weights to the data. X1's linear composite, u1 is also calculated. These linear composites are calculated in such a way as to obtain the maximum correlation between t1 and u1. Once this correlation is calculated on the linear composites, its y1 and x2's turn. Y2 and x2's linear composites (t2 and u2) are calculated for maximum correlation - BUT subject to being uncorrelated with t1 and t2. Therefore, the correlations between successive pairs will decline in size. Canonical correlation seeks to find successive pairs of linear composites that are maximally correlated, subject to being uncorrelated with the previously found pair. Recapping, the State of a time series consists of all the linear indep. combinations of past and present data that correlates significantly with the future of the endogenous variables. Because of the "all", there is no more information that can be extracted from the past. Because of linear indep endence, there is no erroneous information. Both past and present data is blended together. State Space Models exploit past and present autocorrelations. FOr multivariate cases, cross-correlations between the variables are also exploited. State Space Forecasting has characteristics much like ARIMA - i.e. stationarity requirements, etc. All Box Jenkings models can be restated in their State Space form, except State Space tends to over-parameterize the models. This, however, generally happens by including more moving average terms than would have been estimated by a ARIMA like procedure. Because State Space will work with one or more data series simultaneously, joint stationarity conditions must be considered. As a practical stand, this may mean differencing all variables in the system by the same degree of differencing. If joint stationarity is not attained, autocorrelations among the data may be distorted.        