Computer Modeling in Engineering & Sciences |
DOI: 10.32604/cmes.2022.017822
ARTICLE
Wavelet Decomposition Impacts on Traditional Forecasting Time Series Models
1Department of Mathematics and Statistics, Quaid-e-Awam University of Engineering, Science & Technology, Nawabshah, Sindh, Pakistan
2Department of Basic Sciences & Related Studies, Mehran University of Engineering & Technology, Jamshoro, Sindh, Pakistan
3Department of General Studies, Jubail University College, Al Jubail, Saudi Arabia
*Corresponding Author: W. A. Shaikh. Email: wajid@quest.edu.pk
Received: 09 June 2021; Accepted: 10 September 2021
Abstract: This investigative study is focused on the impact of wavelet on traditional forecasting time-series models, which significantly shows the usage of wavelet algorithms. Wavelet Decomposition (WD) algorithm has been combined with various traditional forecasting time-series models, such as Least Square Support Vector Machine (LSSVM), Artificial Neural Network (ANN) and Multivariate Adaptive Regression Splines (MARS) and their effects are examined in terms of the statistical estimations. The WD has been used as a mathematical application in traditional forecast modelling to collect periodically measured parameters, which has yielded tremendous constructive outcomes. Further, it is observed that the wavelet combined models are classy compared to the various time series models in terms of performance basis. Therefore, combining wavelet forecasting models has yielded much better results.
Keywords: Impact; wavelet decomposition; combined; traditional forecasting models; statistical analysis
Due to the predictive importance, researchers have developed various forecasting models. As better environmental forecasting arts can be used to make appropriate management decisions, researchers are continually striving to improve the effectiveness and efficiency of the models. For decades the term wavelet has been used for the exploration of signal processing and geophysics. Therefore, this article looks at the WD combined with various traditional forecast time-series models. For decades the term wavelet has been used for the exploration of signal processing and geophysics. The last decade has shown vast interest in wavelets; it is a subject area that can be appropriate applicable and coalesced in various fields such as applied mathematics, physics, electrical engineering, etc.
Consequently, the WD has significantly impacted various fields, such as image processing, differential equations, statistics, and chemical signal processing [1,2]. WD algorithm is used as a mathematical approach for extracting nontrivial and potentially useful information from different data types such as historical data information, re-analysis, local or global climate model simulations, etc. Usually, WD based models are required to analyse the datasets and decomposing them thoroughly. Its hypothesis is to get better re-construct datasets with minimal losses. The wavelet transform technique uses the signal frequency from the time domain to the wavelet domain. Therefore, the newly acquired domain has contained more complex basis functions called wavelets, mother wavelets, or analysing wavelets [3].
Wavelet-based models are a noteworthy edge in de-noising the datasets to develop an efficient model. It has made it easy to analyse streamflow processes on different parameters without eliminating the effects of the time-frequency accompanied by conventional bandpass filters. The WD tool can let on information within the signal in both frequencies, time and scale domains [4,5]. The WD application controls the time-frequency or signal scale content and judges the temporal variation spectrum [6]. In contrast, the Fourier transforms interpret a quite different perspective that allows estimating the signal frequency but is not suitable to estimate the time-frequency dependence. Therefore, the wavelet transform has its origins in the Fourier transform. Thus, the WD has allowed for tracking the time evolution of processes at various scales in the signal as it has both time and scale measures of localisation. The WD tool signals can be classified as high limited frequency events or a significant number of scale-variable methods because it provides explicit information for rate forecasting classification [7]. In a review of the applications of the wavelet transform in hydrological dataset modelling describing the multifaceted information that can be obtained from such an analysis and recognition of seasonality, streamflow trends, and data de-noising.
The performance and accuracy of the traditional time series forecasting models continuously may be improved. Therefore, it can be inspired by the researchers to intend an improved version of the models [8]. This study describes the performance impact of the WD as an optimization in the traditional time series models in which the optimal response is continuously exchanged during the simulation and can be approved. The effectiveness of the models has been tested on two different streamflow datasets, including Indus and Chenab Rivers.
2 Study Areas and Data Utilization
To endorse the discussed TS forecast models and forecast the rivers, streamflow of the rate of the rivers have been collected 484 and 550 months, respectively, from two renowned Indus and Chenab Rivers of Pakistan (Figs. 1–2).
The use of wavelet application with the various traditional forecasting models such as LSSVM, ANN and MARS has improved the efficiency of the models and found excellent outcomes. These tractable combined models have been implemented as efficient tools on streamflow datasets to forecast phenomena that provide comprehensive signals information. The developed combined wavelet with AI models implements the following two-step protocol for forecasting activities.
1. The WD methodology has been used as a preprocessor of input datasets. As a result, it has a time-frequency signal analysis at distinct intervals in the time-domain and considerable detail about input datasets.
2. After obtaining the input signal by WD, it has been used for further processes as AI input in various traditional forecasting models.
Initially, the forecasting time-series datasets have been decomposed into a sub-time-series
3.1 Wavelet Decomposition (WD)
The algorithm of WD ability to de-noise non-stationary signals into sub-signals at different levels has a suitable resource for improved streamflow elucidation [10]. The novel developed forecasting models have been individual capacities that comprised WD and some conventional AI model techniques. The combined wavelet with various AI forecasting models has been an appropriate methodology [11]. According to forecasting time series models, the wavelet technique is becoming an ever more effective and essential tool used in models. The primary reason for WD has analysed the time series datasets in the time and frequency domain of the valuable decomposition of the original time series by taking useful information in various frequency levels using wavelet functions. The main advantage of using the WD is its robustness, as it does not include any potentially erroneous assumption or parametric testing procedure. The following WD mathematical structure is defined as a continuous time-series
where defined
Therefore, the continuous wavelet transform (CWT) can be defined as follows [12];
where,
where
where,
Decomposition levels
or inverse WD also in a simple format defined as follows:
where,
3.2 Wavelet Artificial Neural Networks (WANN)
The human brain's functioning principle influenced an artificial neural network (ANN) as a forecasting model. Several architectures in the literature are available to forecast the streamflow and many other applications, one of which is the ANN algorithm mostly used. It is comprised of a network system with many interconnected nodes called neurons. The number of layers of an ANN is used to classify it, and layer(s) exists between an input and an output layer. Therefore, a single-layer feed-forward (SLFF) neural network is an architecture with just one layer for establishing connection among the nodes of the input, middle, and output layers. This type of system is characterised as a multi-layer feed-forward (MLFF) neural network built by more than one middle layer [14]. ANN has the merits of fault-tolerance, an efficient nonlinear developing capability that provides a well-organised model for streamflow forecasting. Though the model has some drawbacks, such as slow optimization processing, model complexities, and the approximation of the applications could not be overlooked. Therefore, this forecasting architectural model needed to be improved. The WD has been compounded to the ANN model for this purpose. The WANN model has been combined with the strengths of the WD and ANN applications and to achieve nonlinear strong approximation capability. WANN model architectural design has based on multi-layer perceptron (MLP). WANN is quite corresponding to that of
Layer-i. input layer, which is introduced to the network and takes one or more than one inputs.
Layer-ii. the hidden layer, where data is manipulating procedure with a feed-forward neural network accompanied with orthonormal WD basis by activation functions developed.
Layer-iii. output layer, which contains one or more linear combiners and the corresponding estimations are consistent with the given inputs.
The training process has acquired the weights of the network connections. The WANN model can have various TF of different nodes in identical or different layers. The TF such as sigmoid, hyperbolic tangent functions are used for hidden layers, and there is no appropriation for the output layer. The WANN model has been successfully used for forecasting estimations. Two key approaches to developing the WANN model technique are described as following:
• The WD technique and the ANN model processing are used separately. Firstly decomposed, the input signal employs various WD basis functions (Eq. (1)) with neurons in the hidden layer. Then, the wavelet coefficients have one or more output activities in their input weights have adjusted according to the certain learning algorithm.
• In this case, two structures, WD mathematical and ANN artificial intelligence algorithms have been combined and performed. The transferal and dilation of the WD accomplished weights that have been adjusted according to a certain learning algorithm.
Only dyadic dilations and translations of the WD have developed the wavelet basis function whenever the first approach occurs. Therefore, this objective approach of WANN has often been known as a wavenet.
3.3 Wavelet Least-Squares Support Vector Machine (WLSSVM)
The following reprocessed support vector machine (SVM) classifier governs the application of minimisation [16]
Under the following equality condition
The usage of the LSSVM classifier is implicitly compatible with the definition of regression using binary conditions
Therefore, developing a sense for least square (LS) data fitting is equivalent to LSSVM classifier development
where,
Since
After eliminating
Since
Here, pick
The wavelet least-squares support vector machine (WLSSVM) model has been developed with the potential worth of the WD algorithm and LSSVM processing and obtained optimum nonlinear approximation ability. WLSSVM model has been consists of an input layer, hidden layer, and output layer and the model successfully has been used for forecasting approximations [16].
3.4 Multivariate Adaptive Regression Splines (MARS)
The MARS model schemes discoveries to forecasting continuous numeric outcomes. Appropriate, the MARS model scheme has been implemented in two stages containing forward-backward stepwise techniques. The stepwise forward technique has observed a large set of input variables (basis function) with a different knot; though, this stepwise technique might be developing complexity and a multi-layered model [19]. Determination of such type of model has weak forecasting presentation. For increasing forecast accuracy, the backward stepwise technique eradicates the pointless variables amongst the before chosen set, which may have fewer effects on the approximation procedure pruned by the MARS. For the projection of x the input variable to a novel y output variable based on appropriation, named basis functions that defines point of inflection along with the input range [20]:
In these y functions, x treat as input and c chosen is a threshold value is said to be a knot. The function is useful in forward-backward stepwise techniques for each input unknown to classify the position of knots, where the value of the function changes. These y basis functions are called Spline functions, which is a
where the output variable y estimated by the MARS model,
where,
where d is the penalty for each basis function consisted in the model, M is the number of basis functions. Wavelet Multivariate Adaptive Regression Splines (WMARS) model has combined the prospective techniques of WD and MARS for achieving robust nonlinear estimation potentiality. WMARS model binding designed is found on a multi-layer perception (MLP). The WMARS model is used for forecast estimations and has been dependent on an input layer, hidden layer, and output layer [22].
3.5 Statistical Parameters Assessment
The statistical parameters are used to demonstrate the effectiveness in terms of forecastability of the models assessed by comparing the actual and forecasted values. Usually, the Mean Absolute Error (MAE), the Root Mean Square Error (RMSE), and the correlation coefficients (CC), are used to determine the efficiency of the models and outcomes fitted to the best fit line [23--24]
where
The statistical parameters (21)–(23) have been used to assess the impact of discussed different models. MAE statistics provide an appropriate picture of the actual position in terms of the projected value error, whereas the RMSE statistics are the deviation of the models between the observed and projected values, the least values of these statistics are evaluation criteria for the best model. Similarly, the degree of the linear correlation coefficient is measured by the CC and observed the best flow effect with its high value [25].
This article examines the wavelet impact on traditional forecasting models by fitting input hydrological time-series datasets collected from Indus and Chenab Rivers. The computational code of the conversed forecasting models has been written in the MATLAB application, including the wavelet toolbox.
The six
The training dataset of the models is described for approximated parameters and the testing dataset has characterised by choosing the best combination model amongst every number of hidden layers considered. A trial-and-error technique has estimated the optimum complexity of conversed models. The statistical approaches, such as the Correlation Coefficient (CC), the Mean Absolute Error (MAE) and the Root Mean Square Error (RMSE), respectively, have estimated the outcomes. The estimates of both streamflow datasets are described in Tables 2–7 regarding accuracy and errors of the forecasting time-series models, such as LSSVM, WLSSVM, ANN, WANN, MARS and WMARS, respectively.
In Table 2, the testing phase outcomes of the LSSVM model are observed that the
Additionally, in Table 4 is noticed that
Furthermore, in Table 6 is analysed that
Figs. 3–4 are described the predicted linear trend line scatter displaying the LSSVM, WLSSVM, ANN, WANN, MARS and WMARS models employing the testing phase and show the performance as relatively to observed data on Indus and Chenab banks. The scatter lines are described in the equation of regression line
Clearly, Fig. 3 shows that the forecasting model with wavelet transforms performs remarkably because scatter graphs are more appropriate than traditional models.
Similarly, Fig. 4 shows that forecasting models with wavelet transform have better forecasting scatter graphs than traditional models.
The approximations of each model are shown in Table 8, scrutinised by various statistical parameters with small errors and large coefficient values with the evidence that models with wavelet decomposition are more efficient than the traditional models.
The CC-values of WD combined models are close to 100% for Indus and Chenab datasets compared to traditional models. Therefore, the WD algorithm combined with traditional models has made a tremendous impact and performs the role of a gadget to deliver improved estimations of both streamflow rivers. Consequently, the combined WLSSVM, WANN, and WMARS forecasting methodologies have been used as the second type models and provide excellent results instead of the first type traditional models LSSVM, ANN and MARS.
It is concluded that the use of wavelet application with the addressed forecast time-series models has improved the efficiency and yielded tremendous results. The traditional forecasting time-series models have been prescribed by utilising the impact of the wavelet algorithm. The significance of wavelet information is to improve the efficiency of the models that determines the appropriate outcomes for time-series models. The performance of the wavelet combined models mapping with their associated resampling outcomes. Filtrations of the streamflow time-series datasets have been interpreted from the WD application and these features have not been observed in traditional models. The nonlinear input combination models have been constructed with the WD application and used as input estimators with traditional models that improve the forecast efficiency of the combined models. Therefore, the WD application has become an efficient and interesting valuable tool to analysed simulations of time-series datasets models in various domains.
Thus researchers have a good argument in the future for the extensive usage of the wavelet algorithm to build up the novelty in the model or improve the existing models by suitable transform other than wavelet. The WD algorithm provides the optimum ability to pick the appropriate input and logically improves the output of the traditional forecasting models.
Acknowledgement: This study has been reinforced by the Department of Basic Sciences & Related Studies, Mehran University of Engineering & Technology, Jamshoro, Sindh, Pakistan. The authors have gratefully acknowledged the institute for its support and cooperation in the research activity and providing a healthy research environment and facilities.
Funding Statement: The authors have not received any financial support for the research, authorship, and publication of this article.
Conflicts of Interest: The authors declare that they have no conflicts of interest to report regarding the present study.
1. AL W. S., Ismail, M. T., Karim, S. A. A. (2010). Forecasting financial time series data base on wavelet transforms and neural network model. International Conference on Mathematical Applications in Engineering, pp. 3–5. Kuala Lumpur, Malaysia. [Google Scholar]
2. Shao, X. G., Leung, A. K. M., Chau, F. T. (2003). Wavelet: A new trend in chemistry. Accounts of Chemical Research, 36(4), 276–283. DOI 10.1021/ar990163w. [Google Scholar] [CrossRef]
3. Kisi, O. (2010). Wavelet regression model for short-term streamflow forecasting. Journal of Hydrology, 389(3–4), 344–353. DOI 10.1016/j.jhydrol.2010.06.013. [Google Scholar] [CrossRef]
4. Kaboudan, M. (2004). Wavelets in forecasting. School of Business, University of Redlands, Redlands. [Google Scholar]
5. Nourani, V., Baghanam, A. H., Adamowski, J., Kisi, O. (2014). Applications of hybrid wavelet–artificial intelligence models in hydrology: A review. Journal of Hydrology, 514, 358–377. DOI 10.1016/j.jhydrol.2014.03.057. [Google Scholar] [CrossRef]
6. Lee, H. Y., Beh, W. L., Lem, K. H. (2020). Wavelet as a viable alternative for time series forecasting. Austrian Journal of Statistics, 49(3), 38–47. DOI 10.17713/ajs.v49i3.1030. [Google Scholar] [CrossRef]
7. Esfetanaj, N. N., Nojavan, S. (2018). The use of hybrid neural networks, wavelet transform and heuristic algorithm of WIPSO in smart grids to improve short-term prediction of load, solar power, and wind energy. Operation of Distributed Energy Resources in Smart Distribution Networks, 75–100. DOI 10.1016/B978-0-12-814891-4.00004-7. [Google Scholar] [CrossRef]
8. Pritpal, S. (2021). FQTSFM: A fuzzy-quantum time series forecasting model. Information Sciences, 566, 57–79. DOI 10.1016/j.ins.2021.02.024. [Google Scholar] [CrossRef]
9. Wang, W., Ding, J. (2003). Wavelet network model and its application to the prediction of hydrology. Nature and Science, 1(1), 67–71. [Google Scholar]
10. Sang, Y., Wang, D. (2008). Wavelets selection method in hydrologic series wavelet analysis. Journal of Hydraulic Engineering, 39(3), 295–300. [Google Scholar]
11. Mirbagheri, S. A., Nourani, V., Rajaee, T., Alikhani, A. (2010). Neuro-fuzzy models employing wavelet analysis for suspended sediment concentration prediction in rivers. Hydrological Sciences, 55(7), 1175–1189. DOI 10.1080/02626667.2010.508871. [Google Scholar] [CrossRef]
12. Pandhiani, S. M., Shabri, A. B. (2015). Time series forecasting by using hybrid models for monthly streamflow data. Applied Mathematical Sciences, 9(57), 2809–2829. DOI 10.12988/ams.2015.52164. [Google Scholar] [CrossRef]
13. Senkal, S., Ozgonenel, O. (2013). Performance analysis of artificial and wavelet neural networks for short term wind speed prediction. 8th International Conference on Electrical and Electronics Engineering, pp. 196–198. Antalya, Turkey. [Google Scholar]
14. Pritpal, S., Huang, Y. P. (2019). A high-order neutrosophic-neuro-gradient descent algorithm-based expert system for time series forecasting. International Journal of Fuzzy Systems, 21(7), 2245–2257. DOI 10.1007/s40815-019-00690-2. [Google Scholar] [CrossRef]
15. Yonaba, H., Anctil, F., Fortin, V. (2010). Comparing sigmoid transfer functions for neural network multistep ahead streamflow forecasting. Journal of Hydrologic Engineering, 15(4), 275–283. DOI 10.1061/(ASCE)HE.1943-5584.0000188. [Google Scholar] [CrossRef]
16. Pandhiani, S. M., Shabri, A. B. (2013). Time series forecasting using wavelet-least squares support vector machines and wavelet regression models for monthly stream flow data. Open Journal of Statistics, 3(3), 183. DOI 10.4236/ojs.2013.33021. [Google Scholar] [CrossRef]
17. Shaikh, W., Shah, S., Solangi, M., Pandhiani, S. (2019). Forecasting analysis of GMDH model with LSSVM and MARS models for hydrological datasets (case study). Indian Journal of Science and Technology, 12, 1–6. DOI 10.17485/ijst/2019/v12i39/147941. [Google Scholar] [CrossRef]
18. Ismail, S., Shabri, A., Samsudin, R. (2012). A hybrid model of self organising maps and least square support vector machine for river flow forecasting. Hydrology and Earth System Sciences, 16(11), 4417–4433. DOI 10.5194/hess-16-4417-2012. [Google Scholar] [CrossRef]
19. Bansal, P., Salling, J. (2013). Multivariate Adaptive Regression Splines (MARS). http://www.ideal.ece.utexas.edu/courses/ee380l_ese/2013/mars.pdf. [Google Scholar]
20. Adamowski, J., Chan, H. F., Prasher, S. O., Sharda, V. N. (2012). Comparison of multivariate adaptive regression splines with coupled wavelet transform artificial neural networks for runoff forecasting in himalayan micro-watersheds with limited data. Journal of Hydroinformatics, 14(3), 731–744. DOI 10.2166/hydro.2011.044. [Google Scholar] [CrossRef]
21. Fathian, F., Mehdizadeh, S., Kozekalani Sales, A., Safari, M. J. S. (2019). Hybrid models to improve the monthly river flow prediction: Integrating artificial intelligence and non-linear time series models. Journal of Hydrology, 575, 1200–1213. DOI 10.1016/j.jhydrol.2019.06.025. [Google Scholar] [CrossRef]
22. Kao, L. J., Chiu, C. C., Lu, C. J., Chang, C. H. (2013). A hybrid approach by integrating wavelet-based feature extraction with MARS and SVR for stock index forecasting. Decision Support Systems, 54(3), 1228–1244. DOI 10.1016/j.dss.2012.11.012. [Google Scholar] [CrossRef]
23. Sahoo, B. B., Jha, R., Singh, A., Kumar, D. (2019). Application of support vector regression for modeling low flow time series. KSCE Journal of Civil Engineering, 23(2), 923–934. DOI 10.1007/s12205-018-0128-1. [Google Scholar] [CrossRef]
24. Pritpal, S. (2020). A novel hybrid time series forecasting model based on neutrosophic–PSO approach. International Journal of Machine Learning and Cybernetics, 11, 1643–1658. DOI 10.1007/s13042-020-01064-z. [Google Scholar] [CrossRef]
25. Sun, Y., Niu, J., Sivakumar, B. (2019). A comparative study of models for short-term streamflow forecasting with emphasis on wavelet-based approach. Stochastic Environmental Research and Risk Assessment, 33, 1875–1891. DOI 10.1007/s00477-019-01734-7. [Google Scholar] [CrossRef]
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |