Time series forecasting with a non-linear model and the scatter search meta-heuristic
|Article code||Year of publication||English article||Persian translation||Word count|
|8050||2008||11-page PDF||Available to order||5956 words|
Publisher : Elsevier - Science Direct
Journal : Information Sciences, Volume 178, Issue 16, 15 August 2008, Pages 3288–3299
Forecasting the behavior of variables (e.g., economic, financial, physical) is of strategic value for organizations, which helps to sustain practical interest in the development of alternative models and resolution procedures. This paper presents a non-linear model that combines radial basis functions and the ARMA(p, q) structure. The optimal set of parameters for such a model is difficult to find. In this paper, a scatter search meta-heuristic is used to find this optimal set. Five time series are analyzed to assess and illustrate the pertinence of the proposed meta-heuristic method.
Many planning activities require predicting the behavior of variables (e.g., economic, financial and physical). These predictions support the strategic decisions of organizations, which in turn sustains practical interest in forecasting methods. Several methods, both causal and non-causal, have been proposed to forecast such variable behavior. The causal methods try to explain one variable in terms of a set of other variables. This category includes the widely studied classification problem, which consists of assigning objects to a predefined group given a number of observed attributes of these objects (reviews of neural networks applied to this problem are available in the literature). The non-causal methods, on the other hand, study the historic values of a variable in order to forecast its future value. This category includes the time series methods. Time series methods are generally used when there is not much information about the generation process of the underlying variable and when other variables provide no clear explanation of the studied variable. Gooijer and Hyndman have published a review of the literature on the methods used to model time series over the last 25 years. The most popular methods are the moving average, exponential smoothing and ARIMA (autoregressive integrated moving average) methods. These methods all assume linear relationships among the past values of the forecast variable. Although the linear assumption makes it easier to manipulate the models mathematically, it can lead to inappropriate representations of many real-world patterns in which non-linear relationships are prevalent. Non-linear models can also capture abrupt changes in real data. Nonetheless, incorporating non-linearities into models can lead to very difficult mathematical problems, in which the optimal set of parameters may be difficult to determine. This difficulty could explain why there have been fewer studies devoted to non-linear time series than to linear time series.
Specific non-linear patterns can be taken into account using TAR (threshold autoregressive) and ARCH (autoregressive conditional heteroscedastic) models. The usefulness of alternative resolution methods, approximate in nature but less restrictive, for general non-linearities has been noted by several authors. Approaches reported in the literature include a genetic fuzzy predictor ensemble; probability instructions combined with genetic programming; genetic algorithms to tune radial basis functions; a neural model; neural networks for quarterly time series forecasting; the ARIMA structure combined with neural networks; hybrid ARIMA and artificial neural network models; fuzzy time series models; a neural ensemble model that incorporates noise into the data used to build different training sets; and enhanced polynomial artificial neural networks. Although neural networks constitute one of the most popular models for dealing with non-linear time series, the method's general applicability is now being questioned, and the model's over-parameterization remains a drawback. (Comparisons of neural network forecasting applications with the classical ARMA(p, q) structures have also been published.) This paper investigates the potential of a non-linear model, whose parameters are estimated with the scatter search meta-heuristic, to forecast a given time series. This model is based on the linear ARMA(p, q) model, expressed mathematically as:
$$Y(t) = w_0 + \sum_{i=1}^{p} w_i\,Y(t-i) + \sum_{i=p+1}^{p+q} w_i\,u(t-i) + u(t), \qquad t = p+1, \ldots, n, \tag{1}$$
where $Y(t)$ is the value, in period $t$, of the variable $Y$ to be forecast, $w_i$ ($i = 0, \ldots, p+q$) are parameters of the model, $u(t)$ is the error term and $n$ is the number of available/used periods of time.
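The recursion behind Eq. (1) can be illustrated with a minimal Python sketch. It is not taken from the paper: the function name `arma_one_step` is hypothetical, pre-sample errors are assumed to be zero, and the moving-average weights $w_{p+1}, \ldots, w_{p+q}$ are assumed to apply to the $q$ most recent errors $u(t-1), \ldots, u(t-q)$, which is the usual ARMA convention and only one possible reading of the indices in Eq. (1).

```python
def arma_one_step(y, w, p, q):
    """One-step-ahead fitted values and residuals for an ARMA(p, q) structure.

    y : list of observations, y[0] is the first period
    w : list [w0, w1, ..., w_{p+q}] of model weights
    Assumptions (not from the source): errors before the first forecast are
    zero, and w[p + k] multiplies the error lagged by k periods.
    """
    if len(w) != p + q + 1:
        raise ValueError("w must hold p + q + 1 weights")
    n = len(y)
    u = [0.0] * n        # error terms u(t); unknown early errors stay at zero
    y_hat = [None] * n   # no forecast exists for the first p periods
    for t in range(p, n):
        # autoregressive part: weighted sum of the p previous observations
        ar = sum(w[i] * y[t - i] for i in range(1, p + 1))
        # moving-average part: weighted sum of the q previous errors
        ma = sum(w[p + k] * u[t - k] for k in range(1, q + 1) if t - k >= 0)
        y_hat[t] = w[0] + ar + ma
        u[t] = y[t] - y_hat[t]
    return y_hat, u


# Example call on a short toy series with p = 1 and q = 1
fitted, errors = arma_one_step([1.0, 2.0, 3.0, 4.0], [0.5, 1.0, 0.5], p=1, q=1)
```

Because each error depends on the previous forecast, the errors have to be computed in time order, which is why the loop cannot be replaced by a single closed-form expression.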
The non-linear model NARMA(p, q, l) can thus be defined as:
$$Y(t) = \alpha_0 + \sum_{j=1}^{l} \alpha_j\, h_j\!\left( w_0 + \sum_{i=1}^{p} w_i\,Y(t-i) + \sum_{i=p+1}^{p+q} w_i\,u(t-i) \right) + u(t), \qquad t = p+1, \ldots, n, \tag{2}$$
where $h_j(x)$ is a non-linear function, with $x$ being an input vector, and (p, q, l) is the order of the model. This NARMA(p, q, l) model uses $l$ non-linear functions to transform the ARMA(p, q) output, and then $Y(t)$ is approximated by a linear combination of those transformations. This model is also related to the radial basis function models, which incorporate an approximation property, provided that a large number of radial basis functions are used. This paper focuses on the non-linear Gaussian function (3), which is commonly used in studies of neural networks. The logistic function, $h_j(x) = \frac{1}{1 + \exp(-x)}$, can also be used as an alternative, as has been done elsewhere; however, the computational experiments conducted for this paper produced worse results for this function. For this reason, only the results for the Gaussian function
$$h_j(x) = \exp\!\left( -\left( \frac{x - c_j}{r_j} \right)^{2} \right) \tag{3}$$
are presented. Let $w = (w_0, w_1, \ldots, w_p, w_{p+1}, \ldots, w_{p+q})$, $\alpha = (\alpha_0, \alpha_1, \ldots, \alpha_l)$, $c = (c_1, \ldots, c_l)$ and $r = (r_1, \ldots, r_l)$ be the parameters to be estimated, represented in a compact form by $X \equiv (w, \alpha, c, r)$. Using a Gaussian function and the NARMA(p, q, l) model, there are a total of $p + q + 3l + 2$ parameters to be found. The aim of this paper is to find the set of parameters $X$, also called the solution set, that minimizes the mean squared error (MSE), which is used to evaluate the results:
$$\mathrm{MSE}(X) = \frac{1}{n} \sum_{t=1}^{n} \left( Y(t) - \hat{Y}(t) \right)^{2}, \tag{4}$$
where $\hat{Y}(t)$ is the estimated value of $Y(t)$ using the found parameters, $X$. The problem of finding such a set of parameters for (2) is complex, and alternative methods for deriving these parameters are of significant interest in time series studies. When dealing with non-linear models, two approaches are usually followed.
The first seeks to develop exact methods for deriving the optimal parameters for the model. The second tries to find a good "near-optimal" set of parameters using meta-heuristic procedures. The meta-heuristic methods attempt to lead the parameter search to an interesting region discovered by an evolutionary process. Due to the complexity of the underlying problem, the second approach is appealing because meta-heuristic procedures require no assumptions about the parameters and offer the possibility of obtaining a global optimum of (4). Proposed by Fred Glover, the scatter search meta-heuristic used in this paper is based on the progressive combination of interesting solutions. This method has been successfully applied to many optimization problems. (See http://www.uv.es/~rmarti for an updated list of studies based on the scatter search meta-heuristic.) The rest of the paper is organized as follows. Section 2 describes how the scatter search is applied to the estimation problem. Section 3 shows the computational experiments conducted to test the proposed method. Section 4 summarizes the main conclusions drawn from the research presented in the paper.
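To make the estimation problem concrete, the following Python sketch evaluates the MSE criterion (4) for a NARMA(p, q, l) model with Gaussian functions and then searches for a good parameter set X by repeatedly combining the best solutions found so far. It is only an illustration under stated assumptions, not the procedure of Section 2: all function names are hypothetical, the sampling ranges are arbitrary, pre-sample errors are taken as zero, the moving-average weights are applied to the q most recent errors, the MSE is averaged over the periods that can actually be forecast, and the loop keeps just the combination and reference-set update steps of a scatter search (no improvement method and no diversity criterion).

```python
import math
import random


def gaussian(x, c, r):
    """Gaussian function of Eq. (3) with centre c and radius r."""
    return math.exp(-((x - c) / r) ** 2)


def narma_mse(y, X, p, q, l):
    """MSE of a NARMA(p, q, l) model; X is the flat list (w, alpha, c, r)."""
    w = X[:p + q + 1]
    alpha = X[p + q + 1:p + q + l + 2]
    c = X[p + q + l + 2:p + q + 2 * l + 2]
    r = X[p + q + 2 * l + 2:]
    u = [0.0] * len(y)  # assumption: unknown early errors are zero
    total = 0.0
    for t in range(p, len(y)):
        # linear ARMA-style combination fed to the non-linear functions
        z = w[0] + sum(w[i] * y[t - i] for i in range(1, p + 1))
        z += sum(w[p + k] * u[t - k] for k in range(1, q + 1) if t - k >= 0)
        y_hat = alpha[0] + sum(alpha[j + 1] * gaussian(z, c[j], r[j])
                               for j in range(l))
        u[t] = y[t] - y_hat
        total += u[t] ** 2
    return total / (len(y) - p)  # assumption: average over forecast periods


def scatter_search_sketch(y, p, q, l, pop=30, b=5, iters=20, seed=0):
    """Simplified scatter-search-style loop; returns (best X, best MSE)."""
    rng = random.Random(seed)
    dim = p + q + 3 * l + 2  # number of parameters, as counted in the text

    def random_solution():
        X = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
        # radii occupy the last l positions and are kept away from zero
        X[dim - l:] = [rng.uniform(0.1, 2.0) for _ in range(l)]
        return X

    def cost(X):
        return narma_mse(y, X, p, q, l)

    # diversification followed by selection of the b best solutions
    ref = sorted((random_solution() for _ in range(pop)), key=cost)[:b]
    for _ in range(iters):
        trials = []
        for i in range(b):
            for j in range(i + 1, b):
                lam = rng.random()  # convex combination of two solutions
                trials.append([lam * a + (1.0 - lam) * d
                               for a, d in zip(ref[i], ref[j])])
        # reference-set update: keep the b best of old and new solutions
        ref = sorted(ref + trials, key=cost)[:b]
    return ref[0], cost(ref[0])


# Example call on a synthetic series (hypothetical data, not from the paper)
series = [math.sin(0.3 * t) for t in range(40)]
best_X, best_mse = scatter_search_sketch(series, p=2, q=1, l=2)
```

Restricting the combinations to convex ones keeps every radius positive, which avoids a division by zero in the Gaussian function; a fuller scatter search would typically also explore points outside the segment joining two reference solutions.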
Conclusion
This paper applied the scatter search meta-heuristic to find the parameters of a non-linear model for the time series forecasting problem. The structure of the procedure proposed here was shown to be more than adequate for finding the parameters for NARMA(p, q, l) models, at least compared to the results available in the literature. The empirical results showed that the procedure is also noteworthy for its capacity to forecast the future values of the underlying variables. In the computational experiments conducted, the best performances in two of the five cases presented occurred in the test data set and not in the training data set. When evaluating the enhancement of the results by incorporating noise effects, the conclusions were not clear. In one time series, any of the evaluation measures improved. To quantify the improvement of the presented results, hybrid models combining the forecasting procedure presented here with exact methods, as proposed by Zhang, for instance, need to be developed. Extending the approach to include the automatic tuning of the order of the NARMA(p, q, l) model could also be interesting. Applying meta-heuristics, namely the scatter search meta-heuristic, to find the parameters of a non-linear model for the classification problem is one possible direction for future research, since the classification problem is one of the most studied problems in the forecasting context and has several applications. In terms of the procedure defined in this paper, only the improvement method would need to be modified in order to generate a new model associated with the classification problem.