Pattern-Moving-Based Parameter Identification of Output Error Models with Multi-Threshold Quantized Observations

Li, Xiangquan; Xu, Zhengguang; Han, Cheng; Li, Ning

doi:10.32604/cmes.2022.017799

1School of Automation and Electrical Engineering, University of Science and Technology Beijing; Key Laboratory of Knowledge Automation for Industrial Processes, Ministry of Education, Beijing, 100083, China
2School of Information Engineering, Jingdezhen University, Jingdezhen, 333000, China
*Corresponding Author: Zhengguang Xu. Email: xzgustb@163.com
Received: 08 June 2021; Accepted: 03 September 2021

Abstract: This paper addresses a modified auxiliary model stochastic gradient recursive parameter identification algorithm (M-AM-SGRPIA) for a class of single input single output (SISO) linear output error models with multi-threshold quantized observations. It proves the convergence of the designed algorithm. A pattern-moving-based system dynamics description method with hybrid metrics is proposed for a kind of practical single input multiple output (SIMO) or SISO nonlinear systems, and a SISO linear output error model with multi-threshold quantized observations is adopted to approximate the unknown system. The system input design is accomplished using the measurement technology of random repeatability test, and the probabilistic characteristic of the explicit metric value is employed to estimate the implicit metric value of the pattern class variable. A modified auxiliary model stochastic gradient recursive algorithm (M-AM-SGRA) is designed to identify the model parameters, and the contraction mapping principle proves its convergence. Two numerical examples are given to demonstrate the feasibility and effectiveness of the achieved identification algorithm.

Keywords: Pattern moving; multi-threshold quantized observations; output error model; auxiliary model; parameter identification

In the metallurgical, petroleum-chemical and steel industries, there are technologically complicated, highly energy-consuming and polluting large-scale equipments such as electrolytic tank, sintering machine, blast furnace cement rotary kiln and so on. The production process of this kind of equipments presents the following characteristics [1]. 1) The complex system mechanism beyonds the accurate description of mathematical and physical equations; 2) Working conditions and quality parameters are in large quantities, and the system moving mode is full of distributiveness, nonlinearity and parameter perturbations; 3) Some physical and chemical processes are in conformity with statistical law of moving. A feasible method of system modeling and control is the pattern recognition technology for these considered processes [2] and most researchers’ practice is to design the corresponding model and controller according to the different pattern class of the system working condition [3,4]. A novel pattern-moving-based system dynamics description method was proposed in [5]. Its basic idea is to take the pattern class as a moving variable, and it is mapped to a computable space by class centers [6,7], interval numbers [8], and cells [9] due to its lack of arithmetic operation attribute. Furthermore, in view of various metric methods of pattern class, the linear autoregressive model with exogenous input (ARX) or interval ARX (IARX) model was established, and the parameter identification algorithm based on least square [6], minimum-variance-based controller [5], optimal controller [10], state-feedback controller [7] and predictive controller [11] were designed.

Although a series of research achievements have been made in the system modeling, parameter identification and control based on pattern moving, the differences of each pattern sample in one pattern class have never been considered. Since a pattern class is a set of pattern samples with the same or similar characteristics, the hybrid metrics, that is, the combination of implicit metric D¯(⋅) and class center explicit metric D(⋅) is proposed. The way of class center explicit metric indicates the statistical attribute of pattern class, and the way of implicit metric denotes the difference of each pattern sample in one pattern class. By using the hybrid metrics, the outputs of pattern-moving-based system dynamics description are in-line with multi-threshold quantized observations. Therefore, this paper will study the pattern-moving-based parameter identification of a kind of linear system models with multi-threshold quantized observations. To our best knowledge, for the model parameter identification based on accurate measured values, a great number of research results were introduced in [12], such as least square method [13], maximum likelihood and prediction error methods [14], unbiased finite impulse response filter algorithm [15], Kalman linear filtering and prediction algorithm [16,17], Bayesian approach [18], and so forth.

The system with multi-threshold quantized observations is considered as a set-valued system which is different from a conventional one with accurate measurement outputs [19,20]. Although the identification of set-valued systems is not easy task, a series of important research results have been achieved in the past decade. For the linear system identification with threshold quantized observations, a full rank input design method (such as repetitive input design) and an empirical distribution function method were proposed in [21–23]. For the identification of Hammerstein and Wiener systems with binary outputs, the methods of proportional full rank signal, joint identifiability and strong full rank periodic signal inputs were proposed in [24,25], which effectively overcame the problems of system nonlinearity and rough binary output information. The parameter identification of Wiener system was also studied in terms of quantized inputs and binary outputs in [26]. Based on the truncated empirical measurement method of probabilistic statistics, a non-truncated empirical measurement method was proposed for the finite impulse response (FIR) system model with binary outputs, and the progressive effectiveness of the algorithm was demonstrated in [27]. A two-segment design method for a class of FIR systems with binary-valued observations was investigated in [28]. The parameter estimation problem of set-valued system was explained comprehensively from the perspective of combining system principle and practical application in [29]. The idea of parameter identification based on an auxiliary model was systematically expounded in [30]. An auxiliary-model-based least squares recursive algorithm was proposed for a quantized control system with communication constraints in [31].

This work investigates a M-AM-SGRPIA for a kind of linear output error models with multi-threshold quantized observations which is established from the perspective of pattern moving and hybrid metrics. Compared with the existed research results, the main differences and contributions of this paper are summarized as follows:

1) Different from the previous system identification problem of ARX or IARX models based on pattern moving and single metric [6,7], this paper considers hybrid metrics, the model noise distribution and proves the convergence of the designed M-AM-SGRPIA.

2) Compared with the parameter identification algorithms of set-valued system in [21–29], this paper adopts an auxiliary model and designs a M-AM-SGRPIA, which will reduce the estimation error radio to some extern.

3) An auxiliary-model-based least squares recursive algorithm has been designed for a class of linear systems with communication constraints in [31], which is different from the designed algorithm and physical background in this paper.

The outline of this work is organized as follows. Section 2 presents a pattern-moving-based system dynamics description and problem formulation. Section 3 proposes a M-AM-SGRPIA and its convergence is proved by the contraction mapping principle in Section 4. Section 5 demonstrates the feasibility and effectiveness of the M-AM-SGRPIA by two numerical examples. The conclusion comes in Section 6.

Notation: R stands for the real number field; Z+ denotes the positive integer field; λmax[A] is the maximum eigenvalue of matrix A, and λmin[A] is its minimum eigenvalue; f(t)=o(g(t)) represents g(t)>0, limt→∞f(t)g(t)=0; arg⁡maxi(⋅) is the value of i when (⋅) gets the maximum value; ‖⋅‖ is the Euclidean norm; the superscript T denotes the transposition; F(⋅) and F−1(⋅) represent the probability distribution function and its inverse function respectively; E(⋅) is the mathematical expectation; P(⋅) denotes the probability.

Consider a class of unknown SIMO non-affine nonlinear discrete-time systems as follows:

{y1(k)=f1(y1(k−1),⋯,y1(k−n1),u(k−1),⋯,u(k−m1)),y2(k)=f2(y2(k−1),⋯,y2(k−n2),u(k−1),⋯,u(k−m2)),⋮yq(k)=fq(yn(k−1),⋯,yn(k−nq),u(k−1),⋯,u(k−mq)), (1)

where q>0; yi(k)∈R and u(k)∈R denote the output of fi(⋅) and the whole system's input respectively; ni∈Z+ and mi∈Z+ are the unknown output and input orders; fi(⋅) denotes an unknown nonlinear discrete-time function; i∈{1,⋯,q}.

Assumption 2.1 The input of system (1) is bounded, i.e., a constant M1 exists and satisfies that |u(k)|≤M1.

A pattern-moving-based system dynamics description [32] corresponding to system (1) is proposed in the following three steps:

1) Feature extraction (T(⋅)) and output dimension reduction. A large number of input and output data are collected offline, and the input data set {u(k)} and q-dimensional output vector set {[y1(k),⋯,yq(k)]} are obtained. Drawing on the data-driven modeling ideas [33,34] and through the principal component analysis (PCA) feature extraction [35] of the output data, the first principal component information is obtained, and then one-dimensional principal component information set {y(k)} will be obtained.

2) Classification (M(⋅)) and metrics (D(⋅),D¯(⋅)). Using pattern classification technology to classify the first principal component information, the number of pattern class (N), the class center value (si), class radius (ri) and threshold value (Ci) of each pattern class (dxi) can be obtained, i=[1,…,N]. At any time instant k, the system output vector [y1(k),⋯,yq(k)] corresponds to a working condition and a pattern class dx(k) at the same time. Since the pattern class does not have the arithmetic operation attribute, that is, pattern class 1+ pattern class 2 ≠ pattern class 3, the pattern class variable needs to be measured. Since the pattern class is a set of pattern samples with the same or similar attributes, the method of combining the class center explicit metric D(⋅) and implicit metric D¯(⋅) is adopted, that is, si=D(dxi) and dxi¯=D¯(dxi). The implicit metric values are unknown, but there is a definite relationship between an implicit metric value and a class center explicit metric value by choosing a reasonable classification method, such as |si−dxi¯|≤ri.

3) Establishing the pattern-moving-based system dynamics equations. The input sequence {u(k)}, implicit metric sequence {dx¯(k)} and class center explicit metric sequence {s(k)} are employed to construct the following dynamics equations of the system:

s(k)=D(M(dx¯(k)))={s1,dx¯(k)∈[s1−r1,s1+r1];s2,dx¯(k)∈[s2−r2,s2+r2];⋮sN,dx¯(k)∈[sN−rN,sN+rN], (3)

where f(⋅) is a unknown system function; m,n denote the input and output orders of system (2) respectively. It exits a classification-metric deviation e(k) between the dx¯(k) and s(k), and |e(k)|=|s(k)−dx¯(k)|≤r1 while s(k)=ci.

If the contribution rate of the first principal component information obtained by feature extraction T(⋅) is greater than 85%, it can be considered that the output information of source system (1) is not lost or loses very little. For many practical SIMO physical systems, the contribution rate of the first principal component information can reach or even exceed 85%. Taking sintering machine as an example, through data analysis, it is found that the contribution rate of first principal component information is far greater than 85% when the temperatures of four positions near the sintering node under the excitation of ignition input are considered and feature extracted [5].

Remark 2.1 In the process of establishing the pattern-moving-based dynamics Eqs. (2) and (3), the condition of parameter configuration of classification method is to ensure that a certain pattern class corresponds to a specific quality index of the product [36]. Furthermore, a physical SISO nonlinear discrete-time system can also be transformed into a pattern-moving-based SISO system, but it does not need the first step of feature extraction process.

Although there inevitably unmodeled dynamics problems, it is common to employ a linear model to approximate the situation that the system (2) is unknown. Choosing a reasonable classification method such as a modified quantized control classification [36], the following linear output error model with multi-threshold quantized observations can be constructed for system (2)–(3).

{dx¯(k)=a1dx¯(k−1)+…+andx¯(k−n)+b1u(k−d−1)+…+bmu(k−d−m);dx¯∧(k)=dx¯(k)+υ(k);s(k)=D(M(dx¯∧(k)))={c1,c1−r1<dx¯∧(k)≤c1+r1;c2,c2−r2<dx¯∧(k)≤c2+r2;⋮cN,cN−rN<dx¯∧(k)≤cN+rN, (4)

where v(k) is the model noise; s(k)∈{c1,c2,⋯,cN} is the class center explicit metric value; D(M(⋅)) denotes a classification-metric process; Ci=ci+ri=ci+1−ri+1 is satisfied under a reasonable classification.

Assumption 2.2 The input and output orders of the model are known and equal, that is, n=m.

Assumption 2.3 v(k) is an independent and identically distributed (i.i.d.) random signal noise, and its probability distribution function F(s) and inverse function F−1(s) are known and continuously differentiable.

Under the Assumption 2.2, the first expression of model (4) can be written as follows:

where A(q)=1−a1q−…−anqn; B(q)=b1q+b2q2+…+bnqn;φ(k)=[dx¯(k−1),⋯,dx¯(k−n),u(k−1),⋯,u(k−n)]T; the parameter vector θ=[a1,⋯,an,b1,⋯,bn]T is to be identified.

If the input sequence {u(k)} and output sequence {dx¯(k)} can be obtained directly, it is known to all that there exists a lot of methods to identify the model parameters. However, one can only get the observation sequence {s(k)} and {u(k)}. Therefore, this paper will design a parameter identification algorithm based on these two sequences.

Remark 2.2 The model orders and cumulative distribution function of v(k) are supposed to be known in Assumption 2.2 and Assumption 2.3, respectively. These two problems are not the focus of this paper. If the model orders are unknown, in practice, the order selection may mainly depend on the experiential knowledge of the system. From a theoretical point of view, the model order selection could be addressed by means of the parameter estimation technique with quantized observations and drawing on the classical order selection methods of cross-validation, information criteria, the F-test and the statistical tests on the residuals. In view of the unknown distribution function of model noise, it is not easy task to identify that. However, one can refer to the parameterization technique proposed in [37].

For the convenience of calculation, s(k) is denoted by ss(k)=[ssk1,ssk2,⋯,sskN]T, and the corresponding indicator function of each element sski is depicted as follows:

where N is the number of the pattern class; Ci denotes the threshold value and it satisfies that Ci=ci+ri.

The measurement technology of random repeatability test is employed to design the input u(k). The size of input cycle is set to be t(t>2n) and the number of cycles is l. Therefore, the noise free output dx¯(k) is an unmeasurable signal with cycle equal to t. Totally t⋅l rows of vector data {ssT(k)} and a data matrix Q(t⋅l)⋅N is produced. It is known from Assumption 2.3 that the probability distribution of υ(k) is F(s), and it can be derived from the random repeatability test technology that the occurrence probability of event dx¯^(k)≤Ci at time instant k is pi, e.g., the occurrence probability of event sski=1 is pi=P{dx¯^(k)≤Ci}=P{dx¯(k)+υ(k)≤Ci}=F(Ci−dx¯(k)), and then dx¯(k)=Ci−F−1(pi). Due to the influence of υ(k), the corresponding output s(k) obtained from each test may differ under the identical test conditions. Let ξli=1l∑j=0l−1ssk+t⋅ji. It is known from the probability statistics that {ssi(k+t⋅l)} is the result of Bernoulli tests and it satisfies that E(ξli)=pi). Therefore, it follows that ξli is the estimated value of pi. Further, the estimated value of dx¯(k) can be gained that dx¯~(k)=Ci−F−1(ξli) and the estimation error e(k)=dx¯~(k)−dx¯(k). In terms of the estimation error e(k), there exists two Lemmas and one conclusion as follows.

Lemma 3.1 ([31]) For the model (4) and the output sequence {ssi(k+t⋅l)} generated by repeatability test, the estimation error sequence {e(k):e(k)=dx¯~(k)−dx¯(k)} is a time-varying sequence with mean 0 and bounded variance, and the estimation error e(k) converges to 0 according to probability 1.

Lemma 3.2 ([31]) Under the full order input sequence {u(k+t⋅j)},1≤k≤t,0≤j≤l−1 with cycle equal to t(t>2n), the signals to be identified generated by model (4) have the property of persistent excitation, e.g.,

According to Lemma 3.1 and Lemma 3.2, the estimation error sequence {e(k),Γk} is regarded as a martingale difference sequence defined in a probability space {Ω,Γ,P}, and it satisfies the following conditions:

To accomplish the identification task, it is necessary to determine the probability estimate ξli and find the corresponding threshold Ci firstly. Considering the presence of υ(k) , it is required to determine the corresponding threshold segmentation point Ci of s(k) at the occurrence time of event sski=1. For the matrix Q(t⋅l)⋅N , the corresponding row k is the demonstrative vector ssT(k), and it is known from its definition that if ssT(k), then sskj=1, j≥i. The ξli and threshold value Ci are determined by the following formula:

ξli={arg⁡maxi{ξli:ξli=1l∑j=0l−1ssk+t⋅ji},0<ξli<1,i=1,2,…,N;Num;ξli=0 or ξlj=1,i,j∈[1,N], (8)

1) If ξli∈(0,1), the maximum value of ξli and its corresponding interval number i can be found. Further, the corresponding threshold Ci is determined.

2) If ξli is equal to 0 or 1, then corresponding interval number i can be determined when the first ξli=1. Further, the corresponding threshold Ci is determined. Furthermore, e.g., let ξli=Num=0.99,which is a truncation data.

The {e(k),Γk} is considered as a non-stationary martingale difference sequence with mean 0 in Section 3.1. Therefore, one can establish a new identification model by using the estimated value dx¯~(k) such as:

Due to the unknown variables dx¯(k−1),⋯,dx¯(k−n) exist in (9), an auxiliary model is established as follows:

where θ~(k) is an estimate of θ, and φ~(k)=[dx¯a(k−1),⋯,dx¯a(k−n),u(k−1),⋯,u(k−n)]T is an estimate of φ(k) at time instant k.

By introducing a convergence index ε(12<ε≤1) and using the auxiliary model (10), a modified auxiliary model stochastic gradient recursive algorithm (M-AM-SGRA) for the parameter vector θ is designed as follows:

dx¯~(k)=Ci−F−1(ξli),ξli=arg⁡maxi(ξli:ξli=1l∑j=0l−1ssk+t⋅ji)or Num. (15)

Remark 3.1 A truncation method is adopted to get the value ξli. The determination of truncation rules depends on prior information. The truncated empirical measure method was proposed in [25], and the asymptotic efficiency of the algorithm was proved. This paper chooses a reasonable truncation value of ξli by referring to [25].

For the designed M-AM-SGRA, the following Lemma is given and its convergence is to be proved.

Lemma 4.1 ([31]) For the given M-AM-SGRA (11)–(15), the following inequalities hold.

Theorem 4.1 For the model (9) and the corresponding M-AM-SGRA (11)–(15), under the conditions that A1) and A2) are satisfied, if A(q) is strictly positive real, r(k)→∞, letting

θˇ(k)=θˇ(k−1)+φ~(k)rε(k)π(k)=θˇ(k−1)+φ~(k)rε(k)[φ~T(k)θ+e(k)−φ~T(k)θ~(k−1)]

=θˇ(k−1)+φ~(k)rε(k)[−φ~T(k)θˇ(k−1)+e(k)]=θˇ(k−1)+φ~(k)rε(k)[−yˇ(k)+e(k)], (21)

=‖θˇ(k−1)‖2+2rε(k)θˇT(k−1)φ~(k)[−yˇ(k)+e(k)]+‖φ~(k)‖2r2ε(k)[−yˇ(k)+e(k)]2

=‖θˇ(k−1)‖2−2rε(k)−‖φ~(k)‖2r2ε(k)yˇ2(k)+2[rε(k)−‖φ~(k)‖2]r2ε(k)yˇ(k)e(k)+‖φ~(k)‖2r2ε(k)e2(k)

≤‖θˇ(k−1)‖2−2rε(k)−‖φ~(k)‖2r2ε(k)yˇ2(k)+‖φ~(k)‖2r2ε(k)e2(k)+2rε(k)yˇ(k)e(k). (22)

Since e(k) is uncorrelated with θˇ(k), yˇ(k), r(k), and φ~(k), and it is Γk−1 measurable, according to (16)-(17) and taking the conditional expectation of (22) with respect to Γk−1, one gets

E[‖θˇ(k)‖2|Γk−1]≤‖θˇ(k−1)‖2−2rε(k)−‖φ~(k)‖2r2ε(k)yˇ2(k)+‖φ~(k)‖2r2ε(k)Mσς2 (23)

According to Lemma 4.1, the sum of the third term on the right side of (23) from t=1 to t=∞ is finite. Based on the condition (19), it can be concluded from the martingale convergence Theorem (Lemma d.5.3 in [39]) that ‖θˇ(k)‖2 uniformly converges to a finite random variable C and it is noted as

∑k=1∞2rε(k)−‖φ~(k)‖2r2ε(k)yˇ2(k)=[∑k=1∞2r(k)−‖φ~(k)‖2r2(k)yˇ2(k)]ε=1=∑k=1∞r(k)+r(k−1)r2(k)yˇ2(k)

∑k=1∞[1r(k)yˇ2(k)]≤∑k=1∞[2rε(k)−‖φ~(k)‖2r2ε(k)yˇ2(k)], (27)

[∑k=1∞1r2−k(k)yˇ2(k)]12<κ≤1≤∑k=1∞[2rε(k)−‖φ~(k)‖2r2ε(k)yˇ2(k)]. (28)

Further, using the Kronecker Lemma (Lemma d.5.3 in [38]) for inequalities (27), (28) to yield

It is known from the above proof process that the parameter estimation error is continuous and bounded. Using (21), one obtains

θˇ(k)=θˇ(k−i)+∑j=0i−1φ~(k−j)rε(k−j)[−yˇ(k−j)+e(k−j)],i≥1.

φ~T(k−i)θˇ(k)=yˇ(k−i)+φ~T(k−i)∑j=0iφ~(k−j)rε(k−j)[−yˇ(k−j)+e(k−j)]. (31)

Taking the square on both sides of (31) and adopting the inequality (a+b)2≤2(a2+b2), one gets

[φ~T(k−i)θˇ(k)]2={yˇ(k−i)+φ~T(k−i)∑j=0iφ~(k−j)rε(k−j)[−yˇ(k−j)+e(k−j)]}2

≤2yˇ2(k−i)+2‖φ~(k−i)‖2‖∑j=0iφ~(k−j)rε(k−j)[−yˇ(k−j)+e(k−j)]‖2. (32)

Since e(k−j) is uncorrelated with yˇ(k−j), r(k−j), and φ~(k−j), and it is Γk−1 measurable, according to (16), (17) and taking the conditional expectation on both sides of (32) with respect to Γk−1, one has

E{[φ~T(k−i)θˇ(k)]2|Γk−1}≤2yˇ2(k−i)+2‖φ~(k−i)‖2∑j=0i{‖φ~(k−i)‖2r2ε(k−j)[yˇ2(k−i)+Mσς2]}. (33)

Taking the sum from i=0 to i=k−1 on both sides of (33) and dividing by r2−ε(k) yield

E{∑i=0k−1[φ˜ T(k−i)θˇ(k)]2|Γk−1}1r2−ε(k)≤2r2−ε(k)∑i=0k−1yˇ2(k−i)+2∑i=1k−1‖φ˜(k−i)‖2r2−ε(k)∑j=0i[‖φ˜(k−j)‖2r2ε(k−j)yˇ2(k−j)]+2∑i=1k−1‖φ˜(k−i)‖2r2−ε(k)∑j=0i[‖φ˜(k−j)‖2r2ε(k−j)Mσς2],

ψ1(k)=2∑i=1k−1‖φ~(k−i)‖2r2−ε(k)∑j=0i[‖φ~(k−j)‖2r2ε(k−j)Mσς2],

ψ2(k)=2∑i=1k−1‖φ~(k−i)‖2r2−ε(k)∑j=0i[‖φ~(k−j)‖2r2ε(k−j)yˇ2(k−j)].

ψ1(k)=2∑i=1k−1‖φ~(k−i)‖2r2−ε(k)∑j=0i[‖φ~(k−j)‖2r2ε(k−j)Mσς2]

=22r2−ε(k){[r(k−1)−r(0)]‖φ~(k)‖2r2ε(k)Mσς2+∑i=1k−1[[r(k−1)−r(0)]‖φ~(k)‖2r2ε(k)Mσς2]}

≤2r2−ε(k)‖φ~(k)‖2r2ε−1(k)Mσς2+2r2−ε(k)∑i=1k−1[‖φ~(k)‖2r2ε−1(i)Mσς2]

=2Mσς2r2−ε(k)∑i=1k[r2−2ε‖φ~(i)‖2r(i)]≤2Mσς2ln(r(k))rε(k)→0,a.s.,k→∞,12<ε≤1.

ψ2(k)=2∑i=1k−1‖φ~(k−i)‖2r2−ε(k)∑j=0i[‖φ~(k−j)‖2r2ε(k−j)yˇ2(k−j)]

=2‖φ~(k−1)‖2r2−ε(k)[‖φ~(k)‖2r2ε(k)yˇ2(k)+‖φ~(k−1)‖2r2ε(k−1)yˇ2(k−1)]

+2‖φ~(k−2)‖2r2−ε(k)[‖φ~(k)‖2r2ε(k)yˇ2(k)+‖φ~(k−1)‖2r2ε(k−1)yˇ2(k−1)+‖φ~(k−2)‖2r2ε(k−2)yˇ2(k−2)]

+2‖φ~(1)‖2r2−ε(k)[‖φ~(k)‖2r2ε(k)yˇ2(k)+‖φ~(k−1)‖2r2ε(k−1)yˇ2(k−1)+…+‖φ~(1)‖2r2ε(1)yˇ2(1)]

=2r2−ε(k)[r(k−1)−r(0)]‖φ~(k)‖2r2ε(k)yˇ2(k)+2r2−ε(k)∑i=1k−1[[r(i)−r(0)]‖φ~(i)‖2r2ε(i)yˇ2(i)]

≤2r2−ε(k)2rε(k)r2ε−1(k)yˇ2(k)+2r2−ε(k)∑i=1k−1[2rε(k)r2ε−1(i)yˇ2(i)]

=4r2−ε(k)∑i=1k[1rε−1(i)yˇ2(i)]≤4r(k)∑i=1kyˇ2(i)→0,a.s.,k→∞,12<ε≤1.

E{∑i=0k−1[φ~T(k−i)θˇ(k)]2|Γk−1}1r2−ε(k)=E{θˇT(k)R(k)θˇ(k)|Γk−1}r2−ε(k)

Example 1: Consider a SISO linear output error model with multi-threshold quantized observations which has been established based on pattern-moving and hybrid metrics as follows.

{dx¯∧(k)=0.5dx¯(k−1)+0.3dx¯(k−2)+u(k−1)+0.7u(k−2)+v(k)s(k)=D(M(dx¯∧(k)))={c1,c1−r1≤dx¯∧(k)≤c1+r1;c2,c2−r2<dx¯∧(k)≤c2+r2;⋮c13,c13−r13<dx¯∧(k)≤c13+r13,

where ϑ(k)∼N(0,0.252) denotes the model noise; u(k)∼N(0,22) represents the system input; θ=[a1,a2,b1,b2]T=[0.5,0.3,1.0,0.7]T is the model parameter vector to be identified; the detail property values of pattern classes are known and shown in Table 1.

The size of input cycle is initially set as t=3000, and the number of cycles is l=300. The initial conditions are dx¯(1+i⋅l)=dx¯(2+i⋅l)=0;i=0,1,⋯,l−1. It can be collected 900000 groups of data with the corresponding input sequence {u(k)} and output sequence {s(k)}. Since the distribution of model noise and pattern class thresholds are known, {dx¯~(k)} can be estimated by the sequence {ssT(k)}. Using the input series {u(k)} and {dx¯~(k)}, the model parameters can be identified by the designed M-AM-SGRA, and the identification processes with different convergence indexes such as ε=0.99,0.98,0.97,0.96 respectively are shown in Fig. 1. The parameter estimates and the estimation error ratios (δ=‖θ~(k)−θ‖/‖θ‖) with different convergence indexes at time instant k=3000 are shown in Table 2.

It is shown from Fig. 1 and Table 2 that the convergence index ε is smaller, the parameter estimate converges faster, and the estimation error ratio becomes larger. One can get a good strategy which is to choose a small convergence index at the beginning, and then let it gradually increase with time instant k, and finally approach unity. Therefore, the system identification accuracy can be satisfied by choosing and adjusting the appropriate convergence index ε.

{y1(k)=u(k−1)1+u2(k−1)+u(k−2)+ϑ(k)y2(k)=0.1y2(k−1)+0.5y2(k−2)+u(k−1)1+u2(k−1)+u(k−2)+ϑ(k)y3(k)=0.2y3(k−1)+u(k−1)1+u2(k−1)+u(k−2)+ϑ(k)

where ϑ(k)∼N(0,0.12) denotes the system noise; u(k)∼N(0,1) is the system input.

1) Design of System input. The size of input cycle and the number of cycle are set the same as Example 1. The initial condition is set as yj(1+i⋅l)=yj(2+i⋅l)=0;i=0,1,⋯,l−1;j=1,2,3. It can be collected 900000 groups of data with the corresponding input sequence {u(k)} and output sequences {yj(k)}. The outputs are normalized and a PCA method [32] is used to handle the normalized data to get the first principal component information {y(k)}.

2) Classification and determining the actual output s(k), class quantity N, class center ci, class radius ri, and class threshold Ci. A modified quantized control classification method (M(⋅)) [39] is adopted and described in the following formula:

s(k+1)=D(M(y(k+1)))={y˘(k+1), if 11+Δκi<y(k+1)≤11−Δκi0, if −11+ΔκN<y(k+1)≤11+ΔκN−y˘(k+1), if −11−Δκi<y(k+1)≤−11+Δκi (34)

where y˘(k+1)=1+ρ04κi(ρ0i−1+ρ0i);ρ0∈(0,1);κi=ρ0iκ0;Δ=1−ρ01+ρ0; κ0 is the maximum working range of y(k)(κ0≥max(|y(k)|));i=1,2,⋯,N.

Given the upper limit of the initial class radius r0 at the working point 0 and the other parameters such as ρ0,κ0, one can obtain L≥ln⁡(r0(1+Δ)κ0)ln⁡ρ0, and the first principal component information {y(k)} is divided into 2L+1 segments. Furthermore, the number of pattern class (N=2L+1), the class center ci, class radius ri=1+ρ24ρ and threshold value Ci can be obtained respectively, i=1,⋯,N. The initial parameters are set as ρ0=0.5,κ0=5,r0=0.4,L=ln⁡(r0(1+Δ)κ0)ln⁡ρ0. The inputs (u(k)), first principal component information (y(k)) and class center metric values (s(k)) in the first cycle are shown in Fig. 2, and the detail property values of pattern classes are shown in Table 3.

3) Establishing a pattern-moving-based linear output error model with multi-threshold quantized observations. According to the information obtained from the above steps, the pattern-moving-based system dynamics description Eqs. (2) and (3) is to be obtained. The input and output orders are assumed to be known (m=n=2), and the distribution of model noise is assumed to be known (ϑ(k)∼N(0,0.12)). Therefore, a following linear output error model with multi-threshold quantized observations is established to approximate the system (2)–(3).

{dx¯∧(k)=a1dx¯(k−1)+a2dx¯(k−2)+b1u(k−1)+b2u(k−2)+v(k);s(k)=D(M(dx¯∧(k)))={c1,c1−r1≤dx¯∧(k)≤c1+r1;c2,c2−r2<dx¯∧(k)≤c2+r2;⋮c9,c9−r9<dx¯∧(k)≤c9+r9,

4) Model parameters identification. The designed M-AM-SGRA is employed to identify the model parameters. Referring to the conclusion of Example 1, the convergence factor ε is set by segments in this example. Fig. 3 shows the curves of parameters identification with a fixed value ε=0.995 and some interval variable values such as ε=0.9,k≤100;ε=0.96,k≤200;ε=0.99,k≤500;ε=0.995,k≤1000;ε=0.999,k>1000. At time instant k=3000, the estimation vector of θ is [0.0673,0.1655,0.3315,0.9200]T while ε=0.995, and it is [0.0815,0.1577,0.3270,0.9278]T while ε is set as the interval variable value. Fig. 3 shows that using the time-varying convergence indexs can not only obtain a fast convergence speed at the beginning, but also gradually achieve a stable convergence effect.

Remark 5.1 It is known to all that there exists many classifications and clustering methods in pattern recognition technology, such as C-means, ISODATA, and so on. Here a modified quantized control classification and class center explicit metric method is utilized. The initial parameters setting principle is based on the quality parameters of the product. In Example 2, it is assumed that the good quality is obtained with the output y(k)∈(−0.2344,0.2344], so the initial parameters are set to ensure that the working condition data y(k)∈(−0.2344,0.2344] belongs to one pattern class.

According to the characteristics of classification and hybrid metrics, mappings are in line with the set-valued system, a linear output error model with multi-threshold quantized observations is adopted to approximate the unknown system, and an M-AM-SGRA is designed with its convergence proved. Finally, the validity and desired effect of the parameter identification algorithm is demonstrated by two numerical examples. Future works will focus on pattern-moving-based set-valued system modeling and control methods for achieving optimal robustness.

Funding Statement: This work was supported by the National Natural Science Foundation of China (62076025).

Conflicts of Interest: The authors declare that they have no conflicts of interest to report regarding the present study.

References

1. Qu, S. D. (1998). Pattern recognition approach to intelligent automation for complex industrial processes. Journal of University of Science and Technology Beijing, 20(4), 385–389. DOI 10.13374/j.issn1001-053x.1998.04.018. [Google Scholar] [CrossRef]

2. Saridis, G. N. (1981). Application of pattern recognition methods to control systems. IEEE Transactions on Automatic Control, 26(3), 638–645. DOI 10.1109/TAC.1981.1102685. [Google Scholar] [CrossRef]

3. Li, X., Zhang, L. (2011). Intelligent controller based on pattern recognition. ICECC, 1512–1515. DOI 10.1109/ICECC.2011.6066661. [Google Scholar] [CrossRef]

4. Yu, B., Zhang, X., Wu, L., Chen, X. (2020). A novel postprocessing method for robust myoelectric pattern-recognition control through movement pattern transition detection. IEEE Transactions on Human-Machine Systems, 50(1), 32–41. DOI 10.1109/THMS.2019.2953262. [Google Scholar] [CrossRef]

5. Xu, Z. G. (2001). Pattern recognition method of intelligent automation and its implementation in engineering (Ph.D. Thesis). University of Science and Technology Beijing [Google Scholar]

6. Wang, M. S., Xu, Z. G., Guo, L. L. (2017). Pattern recognition based dynamics description of production processes in metric spaces. Asian Journal of Control, 19(4), 1–13. DOI 10.1002/asjc.1471. [Google Scholar] [CrossRef]

7. Wang, M. S., Xu, Z. G., Guo, L. L. (2019). Stability and stabilization for a class of complex production processes via LMIs. Optimal Control Applications and Methods, 40(3), 460–478. DOI 10.1002/oca.2488. [Google Scholar] [CrossRef]

8. Sun, C. P., Xu, Z. G. (2016). Multi-dimensional moving pattern prediction based on multi-dimensional interval T-S fuzzy model. Control and Decision, 31(9), 1569–1576. DOI 10.13195/j.kzyjc.2015.0944. [Google Scholar] [CrossRef]

9. Guo, L. L., Xu, Z. G., Wang, Y. (2014). Dynamic modeling and optimal control for complex systems with statistical trajectory. Discrete Dynamics in Nature & Society, 2014(1), 1–8. DOI 10.1155/2015/245685. [Google Scholar] [CrossRef]

10. Xu, Z. G., Wu, J. X., Guo, L. L. (2013). Modeling and optimal control based on moving pattern. Chinese Control Conference, pp. 7894–7899. Xi'an, China. [Google Scholar]

11. Xu, Z. G., Wu, J. X. (2012). Data-driven pattern moving and generalized predictive control. IEEE International Conference on Systems, Man and Cybernetics, 1604–1609. DOI 10.1109/ICSMC.2012.6377966. [Google Scholar] [CrossRef]

12. Ljung, L. (1999). System identification: Theory for the user. 2nd ed. Englewood Cliffs: Prentice-Hall. [Google Scholar]

13. Clarke, D. W. (1967). Generalized least squares estimation of the parameters of a dynamic model. Proceeding of IFAC Symposium on Identification in Automatic Control Systems. Prague. [Google Scholar]

14. Astrom, K. J. (1980). Maximum likelihood and prediction error methods. Automatica, 16(5), 551–574. DOI 10.1016/0005-1098(80)90078-3. [Google Scholar] [CrossRef]

15. Zhao, S. Y., Huang, B. (2020). Trial-and-error or avoiding a guess? Initialization of the Kalman filter. Automatica, 121(21), 109184. DOI 10.1016/j.automatica.2020.109184. [Google Scholar] [CrossRef]

16. Zhao, S. Y., Yuriy, S. S., Ahn, C., Liu, F. (2018). Adaptive-horizon iterative UFIR filtering algorithm with applications. IEEE Transactions on Industrial Electronics, 65(8), 6393–6402. DOI 10.1109/TIE.2017.2784405. [Google Scholar] [CrossRef]

17. Kalman, R. E. (1960). A new approach to linear filtering and prediction problems. Journal of Basic Engineering, 82(1), 35–45. DOI 10.1115/1.3662552. [Google Scholar] [CrossRef]

18. Ho, Y., Lee, R. (1964). A Bayesian approach to problems in stochastic estimation and control. IEEE Transactions on Automatic Control, 9(4), 333–339. DOI 10.1109/TAC.1964.1105763. [Google Scholar] [CrossRef]

19. Zhao, Y. L., Zhang, J. F., Guo, J. (2012). System identification and adaptive control of set-valued systems. Journal of Systems Science and Mathematical Sciences, 32(10), 1257–1265. DOI 10.12341/jssms11944. [Google Scholar] [CrossRef]

20. Wang, T., Bi, W., Zhao, Y. L., Xue, W. (2016). Radar target recognition algorithm based on RCS observation sequence set-valued identification method. Journal of Systems Science and Complexity, 29(3), 573–588. DOI 10.1007/s11424-015-4151-8. [Google Scholar] [CrossRef]

21. Wang, L. Y., Zhang, J. F., Yin, G. G. (2003). System identification using binary sensors. IEEE Transactions on Automatic Control, 48(11), 1892–1907. DOI 10.1109/TAC.2003.819073. [Google Scholar] [CrossRef]

22. Wang, L. Y., Yin, G. G., Zhang, J. F., Zhao, Y. L. (2008). Space and time complexities and sensor threshold selection in quantized identification. Automatica, 44(12), 3014–3024. DOI 10.1016/j.automatica.2008.04.022. [Google Scholar] [CrossRef]

23. Wang, L. Y., Yin, G., Zhang, J. F., Zhao, Y. L. (2010). System identification with quantized observations, theory and applications. Birkhäuser, Boston: Springer. [Google Scholar]

24. Zhao, Y. L., Wang, L. Y., Yin, G. G., Zhang, J. F. (2007). Identification of Wiener systems with binary-valued output observations. Automatica, 43(10), 1752–1765. DOI 10.1016/j.automatica.2007.03.006. [Google Scholar] [CrossRef]

25. Zhao, Y. L., Zhang, J. F., Wang, L. Y., Yin, G. G. (2010). Identification of Hammerstein systems with quantized observations. SIAM Journal on Control and Optimization, 48(7), 4352–4376. DOI 10.1137/070707877. [Google Scholar] [CrossRef]

26. Guo, J., Wang, L. Y., Yin, G. G., Zhao, Y. L. (2017). Identification of Wiener systems with quantized inputs and binary-valued output observations. Automatica, 78(12), 280–286. DOI 10.1016/j.automatica.2016.12.034. [Google Scholar] [CrossRef]

27. Wang, T., Tan, J. W., Zhao, Y. L. (2018). Asymptotically efficient nontruncated identification for FIR systems with binary-valued outputs. Science China Information Sciences, 61(12), 052201. DOI 10.1007/s11432-018-9646-7. [Google Scholar] [CrossRef]

28. Li, X. Q., Xu, Z. G., Cui, J. R., Zhang, L. X. (2021). Suboptimal adaptive tracking control for FIR systems with binary-valued observations. Science China Information Sciences, 64(7), 172202: 1–172202: 11. DOI 10.1007/s11432-020-2914-2. [Google Scholar] [CrossRef]

29. Wang, T., Zhang, H., Zhao, Y. L. (2019). Parameter estimation based on set-valued signals: Theory and application. Acta Mathematicae Applicatae Sinica, English Series, 35(2), 255–263. DOI 10.1007/s10255-019-0822-x. [Google Scholar] [CrossRef]

30. Ding, F. (2017). System identification: Auxiliary model identification idea and methods. Beijing, Science-Press. [Google Scholar]

31. Xie, L. B., Ding, F., Wang, Y. (2009). Auxiliary model-based identification method for quantized control systems. Control Theory and Applications, 26(3), 277–282. [Google Scholar]

32. Li, X. Q., Xu, Z. G., Lu, Y. L., Cui, J. R., Zhang, L. X. (2021). Modified model free adaptive control for a class of nonlinear systems with multi-threshold quantized observations. International Journal of Control, Automation and Systems. DOI 10.1007/s12555-020-0289-9. [Google Scholar] [CrossRef]

33. Yang, X., Gao, J. J., Li, L. L., Luo, H., Ding, S. X. et al. (2020). Data-driven design of fault-tolerant control systems based on recursive stable image representation. Automatica, 122(6), 109246. DOI 10.1016/j.automatica.2020.109246. [Google Scholar] [CrossRef]

34. Yang, X., Gao, J. J., Huang, B. (2021). Data-driven design of fault detection and isolation method for distributed homogeneous systems. Journal of the Franklin Institute, 358(9), 4929. DOI 10.1016/j.jfranklin.2021.04.016. [Google Scholar] [CrossRef]

35. Yang, J., Zhang, D., Frangi, A. F., Yang, J. Y. (2004). Two-dimensional PCA: A dew approach to appearance-based face representation and recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(1), 131–137. DOI 10.1109/TPAMI.2004.1261097. [Google Scholar] [CrossRef]

36. Xu, Z. G., Qu, S. D., Yu, L. (2003). Self-learning pattern recognition method based on statistical space mapping. Journal of University of Science and Technology Beijing, 25(5), 480–482. DOI 10.13374/j.issn1001-053x.2003.05.050. [Google Scholar] [CrossRef]

37. Guo, J., Yin, G., Zhao, Y., Zhang, J. F. (2015). Asymptotically efficient identification of FIR systems with quantized observations and general quantized inputs. Automatica, 57(5), 113–122. DOI 10.1016/j.automatica.2015.04.009. [Google Scholar] [CrossRef]

38. Goodwin, G. C., Sin, K. S. (1984). Adaptive filtering, prediction and control. New Jersey: Prentice-Hall. [Google Scholar]

39. Guo, G. J., Chen, T. W. (2008). A new approach to quantized feedback control systems. Automatica, 44(2), 534–542. DOI 10.1016/j.automatica.2007.06.015. [Google Scholar] [CrossRef]