Computer Modeling in Engineering & Sciences |
DOI: 10.32604/cmes.2021.019027
ARTICLE
Skew t Distribution-Based Nonlinear Filter with Asymmetric Measurement Noise Using Variational Bayesian Inference
1Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), Jiangnan University, Wuxi, 214122, China
2School of Science, Jiangnan University, Wuxi, 214122, China
3Department of Chemical and Materials Engineering, University of Alberta, Edmonton, AB T6G 2G6, Canada
*Corresponding Author: Hongtian Chen. Email: chtbaylor@163.com
Received: 30 August 2021; Accepted: 20 October 2021
Abstract: This paper is focused on the state estimation problem for nonlinear systems with unknown statistics of measurement noise. Based on the cubature Kalman filter, we propose a new nonlinear filtering algorithm that employs a skew t distribution to characterize the asymmetry of the measurement noise. The system states and the statistics of skew t noise distribution, including the shape matrix, the scale matrix, and the degree of freedom (DOF) are estimated jointly by employing variational Bayesian (VB) inference. The proposed method is validated in a target tracking example. Results of the simulation indicate that the proposed nonlinear filter can perform satisfactorily in the presence of unknown statistics of measurement noise and outperform than the existing state-of-the-art nonlinear filters.
Keywords: Nonlinear filter; asymmetric measurement noise; skew t distribution; unknown noise statistics; variational Bayesian inference
State estimation serves as an important role in various fields, such as control, signal processing, fault detection and diagnosis, and many more [1–7]. Due to its effectiveness and optimality, the Kalman filter (KF) is the state estimation method of the most widespread used for linear systems with Gaussian noise distribution [8–10]. Limited by the assumption of linear, many nonlinear filters have been presented [11–13], the most famous of which is the extended Kalman filter (EKF) [14]. To solve the error caused by linearization in EKF, the cubature Kalman filter (CKF) and unscented Kalman filter (UKF) were developed by using sigma points to approximate the posterior distribution [15,16]. Among them, CKF is proven to have better estimation performance in high-dimensional nonlinear estimation. However, the above state estimation methods assume Gaussian noise distribution and their noise statistics are completely known, which is not available in practice.
To deal with the unknown noise statistics, various adaptive and robust filters were designed for joint state estimation [17–19]. For example, a recursive state estimation method was presented with unknown Gaussian noise covariance for linear systems [20]. Further, an adaptive variational Bayesian (VB)-based filter was designed for estimating the covariance of process noise and measurement noise by selecting inverse Wishart priors [21]. Combining with maximum correntropy criterion, an adaptive and robust filter was developed by estimating the Gaussian measurement noise covariance [22]. However, the above Gaussian-based estimation methods are unsuitable for the heavy-tailed noise which caused by outliers or impulse interferences. In the case where both the measurement noise and the process noise are Student’s t distributed noise, the Student’s t filter was first proposed in [23]. By minimizing the Kullback-Leibler divergence, an adaptive t-filter was developed to estimate the scale matrix of Student’s t distribution [24]. For nonlinear system, a recursive outlier-robust nonlinear filter was proposed for Student’s t distributed noise in [25] and a robust Gaussian approximate filter was presented with unknown statistics of Student’s t noise distribution in [26].
Due to the complex environment, not only the noise distribution with heavy-tailed characteristics but also the asymmetry of noise distribution should be considered. As shown in Fig. 1, skew t distribution obtains better fitting performance than the Gaussian distribution and Student’s t distribution, which are symmetric distributions. Thus, several estimation methods were presented for the skew t distribution, which has both skewness and heavy-tails [27–29]. For example, a skew t variational Beyasian filter was designed for measurement noise with heavy-tails and skewness in [30] and the estimation accuracy was improved by covariance matrix approximation in [31]. In [32], a robust filter was developed to estimate the skew t distribution, consisting of the scale matrix and degree of freedom (DOF). Moreover, some other filtering algorithms that can describe asymmetric noise distribution also have been proposed in [33,34]. Unfortunately, the above skew t distribution-based methods are all in linear systems and cannot be applied to nonlinear systems.
In this paper, a new skew t cubature Kalman filter (STCKF) is proposed for nonlinear system with heavy-tailed and skewed measurement noise. The skew t distribution is adopted to describe the measurement noise and the prior distributions of the shape matrix, scale matrix and DOF are chosen as Gaussian, inverse Wishart and Gamma distributions, respectively. The unknown statistics including shape matrix, scale matrix and DOF are inferred with the VB approach and the posterior of states is also simultaneously obtained. The results of simulation demonstrate that the proposed STCKF has better estimation accuracy as compared with the CKF and Student’s t distribution-based CKF.
The paper is structured as follows: Section 2 describes the problem studied in this paper. Section 3 proposes a skew t cubature Kalman filter based on VB inference. In Section 4, an example of target tracking is presented to verify the estimation performance of the proposed STCKF. The conclusions of this paper are given in Section 5.
Notations:
Consider the nonlinear state-space model
where n is the discrete time index,
The skew t distribution is used to describe the heavy-tailed and asymmetric of noise, therefore, the measurement noise vn:
where
Fig. 2 shows the different
In the following, based on the CKF, we will design a nonlinear filter under nonlinear model (1)–(2) with the measurement noise followed by skew t distribution. Specifically, the statistics of skew t distribution including the shape matrix, the scale matrix and the DOF are unknown and need to be estimated together with the system states by using VB inference.
3 Proposed Skew t Cubature Kalman Filter Using VB Inference
3.1 Prior Distributions Update
Similar to CKF, the predicted distribution of system state xn is
where
where
The predicted cubature points are
Hence, the predicted state and the corresponding error covariance are given by
In Bayesian theory, the conjugate prior needs ensure posterior distribution have the same functional form with prior distribution. Therefore, to infer shape matrix
where
where Dn|n −1 and cn|n −1 are the inverse scale matrix and DOF, respectively. The prior distribution of
where an|n −1 and bn|n −1 are the shape parameter and the rate parameter, respectively.
To obtain (10)–(12), the dynamic model
Because of the skewed t distribution does not have a strictly closed form, the state posterior distribution will be difficult to obtain. With the introduction of two hidden variables un and
In order to estimate xn from (5), (10)–(12), and (16)–(18), the joint posterior (
where
where
where
where
Based on Bayesian theory, we can obtain the joint posterior distribution as follows:
Substituting (5), (10)–(12), and (16)–(18) into (23) results in
When
where xn|n and Pn|n are the estimate of state and the corresponding covariance respectively, and can be obtained by
where
When
where the mean
where
When
where the location un|n and covariance Un|n are obtained by
The derivation of (36)–(40) can be seen in Appendix B.
When
where the shape parameter
where
When
where the DOF cn|n and inverse scale matrix Dn|n are obtained by
where
When
where the parameters an and bn are given by
The derivation of (47)–(49) can be seen in Appendix D.
Using (25), (33), (36), (41), (44), and (47), the following expectations are required:
where
After taking
Combining prediction steps (5) and (10)–(12) with measurement updates (25), (33), (36), (41), (44) and (47), the proposed STCKF can be realized recursively. To implement the proposed filter, the initial shape matrix
To validate the estimation performance of the proposed STCKF, a target tracking simulation is introduced to perform an evaluation of the results obtained. The STCKF is compared with the CKF [16], the Student’s t distribution based-CKF (T-CKF) [25], and the robust Student’s t distribution-based CKF (RT-CKF) [26].
In this paper, a typical air traffic control scenario is considered, in which the aircraft performs maneuvering turns on the horizontal plane at a constant but unknown turning rate
where the state
The associated parameters are set as:
In this simulation, we consider three cases for measurement noise:
Case 1: Gaussian distribution, that is,
Case 2: Contaminated Gaussian distribution (mixture Gaussian distribution). The mixture Gaussian distributed noise is generated according to [33]
where
Case 3: Contaminated skew t distribution (mixture skew t distribution).
According to [32], the measurement noise vn is generated by
where
In this paper, the root mean square error (RMSE) is adopted to test the filtering performance, and its formula is
where
Figs. 3–5 show that the RMSEs of position, velocity, and tuning rate based on 20 Monte-Carlo runs for three cases. From Fig. 3, the CKF, T-CKF, RT-CKF and the proposed STCKF almost have the same estimation accuracy under Gaussian distribution noise. However, the methods based on the non-Gaussian distribution outperform the methods based on the Gaussian distribution when the measurement noise no longer satisfies the Gaussian distribution. As shown in Fig. 4, the non-Gaussian filters (T-CKF, RT-CKF and STCKF) perform better than the CKF. From Fig. 5, the proposed STCKF obtains the best accuracy in the case of asymmetric noise distribution and unknown noise statistics. As also observed in Table 1, the filter based on the skewed t distribution and the filters based on the Student’s t distribution perform better than the filter based on the Gaussian distribution in Cases 2 and 3, and the proposed STCKF is significantly better than other filters for the asymmetric noise distribution.
In this work, we consider the joint estimation problem of system states and unknown noise statistics for nonlinear discrete-time systems. Combining with the properties of skew t distribution, a hierarchical nonlinear Gaussian model is developed. Based on this model, a skew t cubature Kalman filter is proposed, in which the states, shape matrix, scale matrix and DOF are simultaneously estimated by using VB approach. The results of simulation show that the proposed filter in this paper has better estimation accuracy than the conventional CKF and the Student’s t distribution-based CKF under heavy-tailed and skewed measurement noise. It should be noted that the proposed method in this paper can only realize state estimation of asymmetric measurement noise. How to extend the proposed method to handle asymmetric process and measurement noise is still an open problem.
Funding Statement: This work was supported in part by National Natural Science Foundation of China under Grants 62103167 and 61833007, and in part by the Natural Science Foundation of Jiangsu Province under Grant BK20210451.
Conflicts of Interest: The authors declare that they have no conflicts of interest to report regarding the present study.
1. Frueh, C. (2016). Modeling impacts on space situational awareness phd filter tracking. Computer Modeling in Engineering & Sciences, 111(2), 171–201. DOI 10.3970/cmes.2016.111.171. [Google Scholar] [CrossRef]
2. Zhao, S., Shmaliy, Y. S., Ahn, C. K., Liu, F. (2017). Adaptive-horizon iterative ufir filtering algorithm with applications. IEEE Transactions on Industrial Electronics, 65(8), 6393–6402. DOI 10.1109/TIE.41. [Google Scholar] [CrossRef]
3. Chen, H., Jiang, B., Ding, S. X., Huang, B. (2020). Data-driven fault diagnosis for traction systems in high-speed trains: A survey, challenges, and perspectives. IEEE Transactions on Intelligent Transportation Systems, Early Access. DOI 10.1109/TITS.6979. [Google Scholar] [CrossRef]
4. Jiang, Q., Fu, X., Yan, S., Li, R., Du, W. et al. (2021). Neural network aided approximation and parameter inference of non-markovian models of gene expression. Nature Communications, 12(1), 1–12. DOI 10.1038/s41467-021-22919-1. [Google Scholar] [CrossRef]
5. Chen, H., Jiang, B. (2019). A review of fault detection and diagnosis for the traction system in high-speed trains. IEEE Transactions on Intelligent Transportation Systems, 21(2), 450–465. DOI 10.1109/TITS.6979. [Google Scholar] [CrossRef]
6. Hou, X., Qiao, G. (2020). Observability analysis in parameters estimation of an uncooperative space target. Computer Modeling in Engineering & Sciences, 122(1), 175–206. DOI 10.32604/cmes.2020.08452. [Google Scholar] [CrossRef]
7. Zhao, S., Huang, B., Liu, F. (2016). Linear optimal unbiased filter for time-variant systems without apriori information on initial conditions. IEEE Transactions on Automatic Control, 62(2), 882–887. DOI 10.1109/TAC.2016.2557999. [Google Scholar] [CrossRef]
8. Simon, D. (2006). Optimal state estimation: Kalman, H infinity, and nonlinear approaches. Hoboken, New Jersey: John Wiley & Sons. [Google Scholar]
9. Geng, H., Wang, Z., Cheng, Y., Alsaadi, F. E., Dobaie, A. M. (2019). State estimation under non-Gaussian lévy and time-correlated additive sensor noises: A modified tobit kalman filtering approach. Signal Processing, 154, 120–128. DOI 10.1016/j.sigpro.2018.08.005. [Google Scholar] [CrossRef]
10. Zhao, S., Huang, B. (2020). Trial-and-error or avoiding a guess? Initialization of the kalman filter. Automatica, 121, 109184. DOI 10.1016/j.automatica.2020.109184. [Google Scholar] [CrossRef]
11. Myers, M., Jorge, A., Yuhas, D., Walker, D. (2012). An adaptive extended kalman filter incorporating state model uncertainty for localizing a high heat flux point source using an ultrasonic sensor array. Computer Modeling in Engineering & Sciences, 83(3), 221–248. DOI 10.3970/cmes.2012.083.221. [Google Scholar] [CrossRef]
12. Wang, H., Haynes, R., Huang, H., Dong, L., Atluri, S. N. (2015). The use of high-performance fatigue mechanics and the extended kalman/particle filters, for diagnostics and prognostics of aircraft structures. Computer Modeling in Engineering & Sciences, 105(1), 1–24. DOI 10.3970/cmes.2015.105.001. [Google Scholar] [CrossRef]
13. Geng, H., Haile, M. A., Fang, H. (2021). Ssue: Simultaneous state and uncertainty estimation for dynamical systems. International Journal of Robust and Nonlinear Control, 31(4), 1068–1083. DOI 10.1002/rnc.5344. [Google Scholar] [CrossRef]
14. Welch, G., Bishop, G. (2006). An introduction to the Kalman filter. Technical Report. University of North Carolina, Chapel Hill, North Carolina, USA. [Google Scholar]
15. Julier, S., Uhlmann, J., Durrant-Whyte, H. F. (2000). A new method for the nonlinear transformation of means and covariances in filters and estimators. IEEE Transactions on Automatic Control, 45(3), 477–482. DOI 10.1109/9.847726. [Google Scholar] [CrossRef]
16. Arasaratnam, I., Haykin, S. (2009). Cubature kalman filters. IEEE Transactions on Automatic Control, 54(6), 1254–1269. DOI 10.1109/TAC.2009.2019800. [Google Scholar] [CrossRef]
17. Xu, C., Zhao, S., Ma, Y., Huang, B., Liu, F. et al. (2021). Sensor fault estimation in a probabilistic framework for industrial processes and its applications. IEEE Transactions on Industrial Informatics, Early Access. DOI 10.1109/TII.2021.3063838. [Google Scholar] [CrossRef]
18. Stojanovic, V., He, S., Zhang, B. (2020). State and parameter joint estimation of linear stochastic systems in presence of faults and non-Gaussian noises. International Journal of Robust and Nonlinear Control, 30(16), 6683–6700. DOI 10.1002/rnc.5131. [Google Scholar] [CrossRef]
19. Beelen, H., Bergveld, H. J., Donkers, M. (2020). Joint estimation of battery parameters and state of charge using an extended kalman filter: A single-parameter tuning approach. IEEE Transactions on Control Systems Technology, 29(3), 1087–1101. DOI 10.1109/TCST.2020.2992523. [Google Scholar] [CrossRef]
20. Sarkka, S., Nummenmaa, A. (2009). Recursive noise adaptive kalman filtering by variational Bayesian approximations. IEEE Transactions on Automatic Control, 54(3), 596–600. DOI 10.1109/TAC.2008.2008348. [Google Scholar] [CrossRef]
21. Huang, Y., Zhang, Y., Wu, Z., Li, N., Chambers, J. (2017). A novel adaptive kalman filter with inaccurate process and measurement noise covariance matrices. IEEE Transactions on Automatic Control, 63(2), 594–601. DOI 10.1109/TAC.9. [Google Scholar] [CrossRef]
22. He, J., Sun, C., Zhang, B., Wang, P. (2020). Variational Bayesian-based maximum correntropy cubature kalman filter with both adaptivity and robustness. IEEE Sensors Journal, 21(2), 1982–1992. DOI 10.1109/JSEN.7361. [Google Scholar] [CrossRef]
23. Roth, M., Özkan, E., Gustafsson, F. (2013). A student’s t filter for heavy tailed process and measurement noise. IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 5770–5774. Vancouver, Canada. [Google Scholar]
24. Huang, Y., Zhang, Y., Chambers, J. A. (2019). A novel kullback–leibler divergence minimization-based adaptive Student’s t-filter. IEEE Transactions on Signal Processing, 67(20), 5417–5432. DOI 10.1109/TSP.78. [Google Scholar] [CrossRef]
25. Piche, R., Sarkka, S., Hartikainen, J. (2012). Recursive outlier-robust filtering and smoothing for nonlinear systems using the multivariate Student-t distribution. IEEE International Workshop on Machine Learning for Signal Processing, pp. 1–6. Santander, Spain. [Google Scholar]
26. Huang, Y., Zhang, Y., Li, N., Chambers, J. (2016). A robust Gaussian approximate filter for nonlinear systems with heavy tailed measurement noises. IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4209–4213. Shanghai, China. [Google Scholar]
27. Naveau, P., Genton, M. G., Shen, X. (2005). A skewed kalman filter. Journal of Multivariate Analysis, 94(2), 382–400. DOI 10.1016/j.jmva.2004.06.002. [Google Scholar] [CrossRef]
28. Kim, H. M., Ryu, D., Mallick, B. K., Genton, M. G. (2014). Mixtures of skewed kalman filters. Journal of Multivariate Analysis, 123, 228–251. DOI 10.1016/j.jmva.2013.09.002. [Google Scholar] [CrossRef]
29. Lu, C., Zhang, Y., Ge, Q. (2020). Kalman filter based on multiple scaled multivariate skew normal variance mean mixture distributions with application to target tracking. IEEE Transactions on Circuits and Systems II: Express Briefs, 68(2), 802–806. DOI 10.1109/TCSII.8920. [Google Scholar] [CrossRef]
30. Nurminen, H., Ardeshiri, T., Piche, R., Gustafsson, F. (2015). Robust inference for state-space models with skewed measurement noise. IEEE Signal Processing Letters, 22(11), 1898–1902. DOI 10.1109/LSP.2015.2437456. [Google Scholar] [CrossRef]
31. Nurminen, H., Ardeshiri, T., Piché, R., Gustafsson, F. (2018). Skew-t filter and smoother with improved covariance matrix approximation. IEEE Transactions on Signal Processing, 66(21), 5618–5633. DOI 10.1109/TSP.2018.2865434. [Google Scholar] [CrossRef]
32. Xu, C., Zhao, S., Ma, Y., Huang, B., Liu, F. (2019). Robust filter design for asymmetric measurement noise using variational Bayesian inference. IET Control Theory & Applications, 13(11), 1656–1664. DOI 10.1049/iet-cta.2018.6016. [Google Scholar] [CrossRef]
33. Huang, Y., Zhang, Y., Shi, P., Wu, Z., Qian, J. et al. (2017). Robust kalman filters based on Gaussian scale mixture distributions with application to target tracking. IEEE transactions on systems. Man, and Cybernetics: Systems, 49(10), 2082–2096. DOI 10.1109/TSMC.2017.2778269. [Google Scholar] [CrossRef]
34. Li, S., Feng, X., He, R., Pan, F. (2021). Joint parameter and state estimation for stochastic uncertain system with multivariate skew t noises. Chinese Journal of Aeronautics, Early Access. DOI 10.1016/j.cja.2021.04.032. [Google Scholar] [CrossRef]
35. Ma, Y., Huang, B. (2017). Bayesian learning for dynamic feature extraction with application in soft sensing. IEEE Transactions on Industrial Electronics, 64(9), 7171–7180. DOI 10.1109/TIE.2017.2688970. [Google Scholar] [CrossRef]
36. Zhao, S., Shmaliy, Y. S., Ahn, C. K., Zhao, C. (2019). Probabilistic monitoring of correlated sensors for nonlinear processes in state space. IEEE Transactions on Industrial Electronics, 67(3), 2294–2303. DOI 10.1109/TIE.41. [Google Scholar] [CrossRef]
37. Xu, C., Zhao, S., Liu, F. (2019). Sensor fault detection and diagnosis in the presence of outliers. Neurocomputing, 349, 156–163. DOI 10.1016/j.neucom.2019.01.025. [Google Scholar] [CrossRef]
38. Barr, D. R., Sherrill, E. T. (1999). Mean and variance of truncated normal distributions. The American Statistician, 53(4), 357–361. DOI 10.2307/2686057. [Google Scholar] [CrossRef]
Substituting
Defining the modified likelihood distribution p(zn|xn) as
and using (5) and (67) in (66), we have
According to (5) and (66)–(68), (25)–(32) can be obtained.
Substituting
where the auxiliary parameter
Substituting
Similar to (66)–(68), (33)–(40) can be obtained.
Substituting
where
Substituting
where
According to (72) and (74), (41)–(46) can be obtained.
Substituting
Using Stirling’s approximation:
According to (77), (41)–(43) can be obtained.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |