Optimal Combined Prediction of Industry Structure Base on GM(1, 1)
by user
Comments
Transcript
Optimal Combined Prediction of Industry Structure Base on GM(1, 1)
Optimal Combined Prediction of Industry Structure Base on GM(1, 1) and ARMA(p, q) Model ZHANG Yuchun, SHI Xiaolei School of Economics and Management, Lanzhou University of Technology, China, 730050 [email protected] Abstract: The optimal combined prediction model of industry structure base on ARMA(p, q) and GM(1,1) model is proposed, it bases minimal residual square sum, power weightiness is calculated by linear programming. The combined prediction model absorbs two model’s excellence, it has more prediction precision, the difficult problem of industry structure prediction is availability settled. Jiangsu’s industry structure is predicted by combined prediction model, that provide scientific gist for adjust industry structure. Keyword: Industry structure, ARMA (p,q) model, GM(1,1) model, combined prediction model 1. Introduction Adjust the industrial structure still is focus to promote economic coordinated development. In order to realize the strategic adjustment of industrial structure, we must grasp the changes of industrial structure, which requires to accurately predicting the future development trends of the industrial structure. There are many factors that impact changes of industrial structure; it is difficult using classical econometric accurate prediction. Time series model is no longer a different causal relationship between variables, but rather look for time-series changes of their own, only the past behavior of the prediction to predict the future [1]. Gray prediction requires small sample data, convenient operation, and high precision short-term prediction [2]. In this paper, to predict the industrial structure of ARMA (p, q) and GM (1, 1) portfolio model, in accordance with the thinking of smallest residual sum of squares, the use of linear programming optimization calculate the weight, so that the model absorb the merits of the two models, with prediction accuracy, an effective solution to the problem of the industrial structure difficult prediction. And predict industrial structure in Jiangsu. 2. ARMA (p, q) model Time series model has three kinds of types: autoregressive model AR, moving average model MA and autoregressive moving average model ARMA [3]. ARMA (p, q) model is commonly used for a class of stochastic time series model, be able to understand better the nature of time series of structural characteristics, to achieve minimum variance within the meaning of the optimal prediction. 2.1. Autoregressive Model AR (p) X t for its pre-value and a linear function of random items, can be expressed as X t = ϕ1 X t −1 + ϕ 2 X t − 2 + L + ϕ p X t − p + u t (1) Time-series Where: p autoregressive model for the order; error. ϕi (i = 1,2, ..., p) for the model coefficients; u t for 2.2. Moving Average model MA (q) If u t is not a hundred noise, which for the current and pre-and random error of the linear function. Can be expressed as u t = ε t − θ 1ε t −1 − L − θ q ε t − q (2) 157 Where: q as a model of order; θj (j = 1,2, ..., q) for the model coefficients; ε t for hundred noise. 2.3. Autoregressive moving average model ARMA (p, q) ARMA (p, q) are auto-regressive model and the combination of moving average models, time series X t for the current and pre-error and random items, as well as its early stage of the value of a linear function. Can be expressed as X t = ϕ1 X t −1 + ϕ 2 X t − 2 + L + ϕ p X t − p + ε t − θ1ε t −1 − L − θ q ε t − q (3) The introduction of lag operator B, the model can be expressed as[4] (4) ϕ ( B ) X t = θ ( B )u t ϕ ( B) = (1 − ϕ1 B − ϕ 2 B 2 − L − ϕ p B P ) θ ( B) = (1 − θ1 B − θ 2 B − L − θ q B q ARMA (p, q) process conditions are stable characteristic equation ϕ ( B ) = 0 , and reversible condition θ ( B) = 0 that all roots outside the B plane unit circle. If the original time series non-stationary time series, the steady after the difference, and then ARMA modeling, is referred to as ARMA (p, d, q) model, in which the number of d for the difference time. 3. Identify and estimate ARMA ((p, q) model 3.1. Model Identification 3.1.1 AR (p) model. k period variance for lag (5) γ k = E ( X t − k X t ) = ϕ1γ k −1 + ϕ 2γ k − 2 + L + ϕ p γ k − p autocorrelation function ρk = γk = ϕ1 ρ k −1 + ϕ 2 ρ k − 2 + L + ϕ p ρ k − p γ0 (6) Autocorrelation function ACF give the Xt and Xt-1 overall relevance, but the overall correlation between variables may be covered up completely different from the implied relations. In contrast, Xt and Xt-k between the partial autocorrelation function PACF is eliminated between variables Xt-1, ..., Xt-k+1 brought after indirectly related to the direct relevance, it values are known sequence Xt-1, ..., Xt-k +1 conditions, Xt and Xt-k inter-relations measure. 3.1.2 MA (q) model. k period lag variance for σ ε2 (1 + θ12 + θ 22 + L + θ q2 ) k =0 2 rk = E ( X t X t −1 ) = σ ε (−θ k + θ1θ k +1 + L + θ q − k ϑq ) 1 ≤ k ≤ q 0 k>q (7) autocorrelation function k =0 1 rk ρ k = = (−θ k + θ 1θ k +1 + L + θ q −k θ q ) /(1 + θ 12 + L + θ q2 ) r0 k>q 0 1≤ k ≤ q (8) MA (q) model identification rules: if the random sequence of auto-correlation function of cut-off tail, that is, since the q after, ρ k = 0 (k> q); and its partial autocorrelation function are trailing, then this sequence are moving average MA (q) sequence. 158 3.1.3 ARMA (p, q) autocorrelation function can be regarded as the mixture of the autocorrelation function of the MA (q) and the autocorrelation function of the AR (p). When p = 0 it has censored nature; when q = 0 it has a trailing character; when p, q are not 0, it has a trailing character. From the identification point of view, usually, ARMA (p, q) process partial autocorrelation function (PACF) may be p lag before the tip has several distinct spikes, but p steps lag of the start gradually moving to zero ; and its autocorrelation function (ACF) is at the q steps lag before there are a few obvious tip column items from the q steps lag and begin to tend to zero. 3.2. The model parameters estimated AR (p), MA (q), ARMA (p, q) select the order of the model further to calculate the unknown parameters of model. Here to choose the least square method to estimate the parameter values, the parameters can use Eviews4.0 to estimate[5]. 4. GM (1, 1) modeling steps: 1) Acquisition of raw data out[6] X (0) = {x ( 0) (1), x ( 0) (2),L , x (0) (n)} Where: n is the number of collected samples. Where: n is the number of collected samples. 2) The original sequence to do I-AGO data generation X (1) = {x (1) (1), x (1) (2),L , x (1) (n)} k Where: x (1) = ∑ x ( 0) (i ), k = 1,2,L, n i =1 Z (1) For X (1) the neighboring mean generation sequence[7] Z (1) = {z (1) (2), z (1) (3),L , z (1) (n)} Among them, z (1) (k ) = 1 (1) ( x (k ) + x (1) (k − 1)) , k = 2,3,L , n 2 3) Set up differential albino equations dx (1) + ax (1) = b , a, b Parameters are to be determined. dt 4) The use of least-squares method to solve the parameters a, b x (0) (2) (0) x (3) Y = M (0) x (n) − z (1) (2) 1 (1) a − z (3) 1 T −1 T B= b = ( B B) B Y M (1) − z (3) 1 ( 0) ( 0) The time response sequence of GM (1,1) model x ( k ) + az ( k ) = b for b b xˆ (1) (k + 1) = x (0) (1) − e −ak + k = 1,2,L , n a a ; (9) Restore the value b xˆ ( 0) (k + 1) = (1 − e a ) x (0) (1) − e −ak a 5. Combination forecasting model ; k = 1,2,L, n (10) Combination prediction is a prediction methods that weighted appropriate average of the prediction outcome of each individual as the ultimate outcome, portfolio prediction theory suggests that: for a 159 problem of the prediction, since the combination prediction methods contained in all of single prediction useful information, so that the number of prediction based on different assumptions of the linear combination method can effectively improve the model fitting capabilities, enhance its ability to adapt to future changes as well as the prediction accuracy. The key of prediction combination model that combination weights are determined, the smallest residual sum of squares as the basis for calculating the optimal combination weights of standards in the paper, use linear programming optimization to calculate weights. The principle is: set up the actual value of an industry in a certain period for y it (i = 1,2,3; t = 1,2, ..., n), if the industry has m kinds of feasible prediction method, predictive value for y ijt (i = 1,2,3; j = 1, ..., m; t = 1,2, ..., n), also located m kinds of prediction methods of weighting coefficients normalized to satisfy constraint conditions: m ∑W ij =1 (i = 1,2,3; j = 1,L m) (11) j =1 Combination forecasting model can be expressed as m yˆ it = ∑ Wij y ijt (i = 1,2,3; j = 1,L m; t = 1,2,L n) (12) j =1 m u it = y it − yˆ it = y it − ∑ Wij y ijt (i = 1,2,3; j = 1,L m; t = 1,2,L n) (13) j =1 Weighting coefficients must satisfy non-negative constraint conditions so that the residual u it sum of squares smaller as possible. The above model can be translated into the following linear programming model n n n t =1 t =1 t =1 min U = ∑ u12t + ∑ u 22t + ∑ u 32t (14) m − y 1t ∑ W1 j y1 jt = u1t j =1 m − y 2t ∑ W 2 j y 2 jt = u 2t j =1 m y 3t − ∑ W3 j y 3 jt = u 3t j =1 m m m ∑ W1 j yˆ1 jt + ∑ W 2 j y 2 jt + ∑ W3 j y 3 jt = 1 s.t. j =1 j =1 j =1 m ∑ W1 j = 1 j =1 m ∑ W 2 j = 1 j =1 m W =1 3j ∑ j =1 Wij ≥ 0 i = 1,2,3; j = 1,L , m. 160 (15) Solute the linear programming problem, you can get the weighting coefficients Wij of optimal combination based on the smallest residual sum of squares, when the new historical data come in ever-increasing, to retake the recent historical data, from the formula to recalculated Wij so as to arrive at optimal portfolio predictive value. 6. Study According to Statistical Yearbook of Jiangsu acquire the 1978-2005 year observed values of the proportion of primary industry, secondary industry, tertiary industry y1 , y 2 and y 3 (%) in Jiangsu ①. ARMA (p, q) theory can only be used for stationary time series, therefore required to test the proportion of three industries smooth nature in Jiangsu. The three industries respectively ADF unit root test and found that the proportion of three industries are non-stationary time series. After carried out the first-order differential unit root test, three industries are smooth time series, that is all I (1). Three industries in accordance with observations by the smallest AIC criteria use Eviews4.0 to set up three industries differential time series ARMA (p, q) model②. ∆y11t = −0.950 − 0.739∆y11t −1 − 0.215∆y11t − 2 + ε t − 1.212ε t −1 − 0.994ε t −2 (16) 2 (-2.246) (-3.456) (-1.376) (16.317) (6.979) R =0.546 DW=1.966 ∆y 21t = 0.177 − 0.959∆y 21t −1 − 0.423∆y 21t − 2 + ε t − 1.037ε t −1 − 0.994ε t − 2 (17) 2 (0.429) (-4.628) (-2.448) (27.248) (12.732) R =0.429 DW=1.790 ∆y 31t = 0.779 + 0.266∆y 31t −1 + 0.296∆y 31t − 2 − 0.274∆y 31t −3 + ε t − 0.209ε t −1 (1.378) (1.034) (1.159) (-1.137) − 0.196ε t − 2 − 0.931ε t −3 (-2.856) (14.933) (-2.738) (18) 2 R =0.419 DW=2.164 ∆y i1t (i = 1,2,3; t = 1,2, ..., n) express the first prediction method of ARMA (p, q) differential three industries. y i1t (i = 1,2,3; t = 1,2, ..., n) express the three industries observations of the first prediction method. Difference model of the above appropriate transform available three industries ARMA (p, q) model, and predict the proportion of three industries predictive value in Jiangsu's, as shown in table 1. y11t = −0.950 + 0.261 y11t −1 + 0.527 y11t −2 + 0.215 y11t −3 + ε t − 1.212ε t −1 − 0.994ε t − 2 (19) y 21t = 0.177 + 0.041 y 21t −1 + 0.527 y 21t − 2 + 0.432 y 21t −3 + ε t − 1.037ε t −1 − 0.994ε t − 2 (20) y 31t = 0.779 + 1.266 y 31t −1 + 0.030 y 31t − 2 − 0.057 y 31t −3 + 0.274 y31t − 4 + ε t − 0.209ε t −1 − 0.196ε t − 2 − 0.931ε t −3 (21) According to original data appropriate metabolism, and set up three industries gray GM (1,1) prediction model is as follows in Jiangsu, y i 2t ( k + 1) (i = 1,2,3; t = 1,2, ..., n; k = 1,2 ... , n) that the second prediction method of GM (1,1) three industries time series, and forecast the proportion of three industries in Jiangsu, in table 1. ① Source: Jiangsu Statistical Bureau, Jiangsu Statistical Yearbook 2005 ② The number in brackets below the parameters estimated in Regression model are t statistics of corresponding parameters. 161 Table 1 ARMA (p, q) and GM (1,1) model prediction of the proportion of three industries ARMA(p, q) Model Model GM(1,1) Model Primary industry Secondary industry Tertiary industry Primary industry Secondary industry Tertiary industry 2007 5.07 55.26 38.73 7.42 56.27 38.71 2008 5.12 55.37 39.51 6.86 55.14 38 2009 4.87 55.4 39.73 6.62 55.23 38.15 2010 3.22 55.71 41.07 6.04 55.31 38.65 Years y12t (k + 1) = −665.582 exp(−0.057 * k ) + 695.082 y 22t (k + 1) = 9912.845 exp(0.004 * k ) − 9865.245 y 32 t (k + 1) = 704.653 exp(0.026 * k ) − 684.853 (22) (23) (24) The data into the equation (14) and (15), acquire a combination of the primary industry weight (0.529, 0.471), the combination of second industry weight (0.529, 0.471) and the combination of the tertiary industry weight (0.125, 0.875). Therefore acquire three industries final optimal prediction model in Jiangsu. The combination prediction model of three industries in Jiangsu: yˆ1t = 0.529 y11t + 0.471 y12t (25) yˆ 2t = 0.529 y 21t + 0.471 y 22t (26) yˆ 3t = 0.125 y 31t + 0.875 y 32 t (27) Where: yˆ 1t , yˆ 2 t , yˆ 3t for the respective three industries combination prediction; y i1t for the respective three industries ARMA (p, q) model prediction; y i 2 t for the respective three industries GM (1,1) model prediction. Use combination forecasting models to predict the future proportion of the first industry, secondary industry and tertiary industry in Jiangsu. in table 2. Years Table 2 portfolio model prediction of the proportion of three industries Primary industry Secondary industry Tertiary industry 2007 6.10 56.00 37.90 2008 5.94 55.88 38.18 2009 5.69 55.96 38.35 2010 5.55 56.08 38.37 7. Conclusion In this paper put forward the ARMA (p, q) and GM (1,1) model prediction model combination, in accordance with the thinking of the smallest residual sum of squares, use linear programming calculate the weight, so that the combination forecasting model absorb the advantages of ARMA (p , q) and GM (1,1) model and have high accuracy characteristics of with the prediction. Analyze the industrial structure prediction in Jiangsu. Effectively solve the difficult problem of the industrial structure prediction, will provide a scientific basis for realize the strategic adjustment of industrial structure. 162 About the author: ZHANG Yu Chun Doctor, an associate professor of School of Economics and Management, which belongs to Lanzhou University of Technology, The main research are directions and methods of management decision-making theory, economic analysis and evaluation of industry SHI Xiao Lei a student of School of Economics and Management, which belongs to Lanzhou University of Technology. Postal Code: 730050 Telephone number: 13993183818, 13679487620 E-mail: [email protected], [email protected] References [1]. Li Zi nai, Pan Wen-qing. Econometrics [M]. Beijing: higher education press, 2000.336-356(in Chinese) [2]. Bao Yi dan, Wu Yan ping, He Yong. A New Forecasting Model Based on the Combination of GM(l,l) Model and Linear Regression[M]. Journal of Systems Engineering Theory & Practice, 2004,(3):95-98(in Chinese) [3]. Li Zi nai, YE A-zhong. Higher econometrics[M]. Beijing: Tsinghua university press, 2005.44-60(in Chinese) [4]. Wu Hui yu. Time series analysis and control[M]. Wuhan: Wuhan university press, 2004.37-95(in Chinese) [5]. Zhang Xiao tong. The use guide of econometrics software Eviews[M].Tianjin: university press, 2004.119-139(in Chinese) [6]. Jia Zheng yuan, Wu Fang-qin, Li Wei-ming. Research on Optimal Combined Prediction of Power Consum Ption Based on Econometric and Gray Model[J]. Journal of North China Electric Power University. 2001, (4):23-28 (in Chinese) [7]. Liu Si feng, Dang Yao guo, Fang Zhi-gen et al. The theory and application of grey system[M]. Beijing: Science press, 2004.125-162(in Chinese) 163