ArticlePDF Available

Data-Driven Sensor Selection for Signal Estimation of Vertical Wheel Forces in Vehicles

Authors:

Abstract

Sensor selection is one of the key factors that dictate the performance of estimating vertical wheel forces in vehicle durability design. To select K most relevant sensors among S candidate ones that best fit the response of one vertical wheel force, it has S!/(K!(S-K)!) possible choices to evaluate, which is not practical unless K or S is small. In order to tackle this issue, this paper proposes a data-driven method based on maximizing the marginal likelihood of the data of the vertical wheel force without knowing the dynamics of vehicle systems. Although the resulting optimization problem is a mixed-integer programming problem, it is relaxed to a convex problem with continuous variables and linear constraints. The proposed sensor selection method is flexible and easy to implement, and no additional hyper-parameters needed to be tuned using cross-validation. The feasibility and effectiveness of the proposed method are verified using experimental data in vehicle durability design. The results show that the proposed method has good performance with different data sizes and model orders, in providing sub-optimal sensor configurations for estimating vertical wheel forces in vehicles.
Xueke Zheng
UM-SJTU Joint Institute,
Shanghai Jiao Tong University,
Shanghai 200240, China
e-mail: zxk2046@sjtu.edu.cn
Ying Wang
UM-SJTU Joint Institute,
Shanghai Jiao Tong University,
Shanghai 200240, China
e-mail: dadide@sjtu.edu.cn
Le Wang
UM-SJTU Joint Institute,
Shanghai Jiao Tong University,
Shanghai 200240, China
e-mail: wangle@sjtu.edu.cn
Runze Cai
UM-SJTU Joint Institute,
Shanghai Jiao Tong University,
Shanghai 200240, China
e-mail: cairunze@sjtu.edu.cn
Mian Li
1
Professor
Fellow ASME
UM-SJTU Joint Institute,
Global Institute of Future Technology,
Department of Automation,
Shanghai Jiao Tong University,
Shanghai 200240, China
e-mail: mianli@sjtu.edu.cn
Yu Qiu
SAIC Motor Corporation Limited,
Shanghai 201800, China
e-mail: qiuyu_1123@163.com
Data-Driven Sensor Selection
for Signal Estimation of Vertical
Wheel Forces in Vehicles
Sensor selection is one of the key factors that dictate the performance of estimating vertical
wheel forces in vehicle durability design. To select K most relevant sensors among S can-
didate ones that best t the response of one vertical wheel force, it has S
K

possible
choices to evaluate, which is not practical unless K or S is small. In order to tackle this
issue, this paper proposes a data-driven method based on maximizing the marginal likeli-
hood of the data of the vertical wheel force without knowing the dynamics of vehicle
systems. Although the resulting optimization problem is a mixed-integer programming
problem, it is relaxed to a convex problem with continuous variables and linear constraints.
The proposed sensor selection method is exible and easy to implement, and the hyper-
parameters do not need to be tuned using additional validation data sets. The feasibility
and effectiveness of the proposed method are veried using numerical examples and exper-
imental data. In the results of different data sizes and model orders, the proposed method
has better tting performance than that of the group lasso method in the sense of the 2-norm
based metric. Also, the computational time of the proposed method is much less than that of
the enumeration-based method. [DOI: 10.1115/1.4055514]
Keywords: data-driven engineering, industrial internet of things, machine learning for
engineering applications, qualication, verication and validation of computational models
1 Introduction
When designing a new vehicle, vertical wheel force (i.e., the ver-
tical force acting at a wheel center) is one of the important quantities
to assess the durability performance of prototypes. In practice, ver-
tical wheel forces are usually measured directly by the so-called
wheel force transducers (WFTs) during road load data acquisition
(RLDA) testing campaigns in which prototype vehicles are driven
on proving grounds or public roads [1,2]. However, WFTs are
expensive, intrusive, and time-consuming to install in vehicles,
especially multiple prototypes are under tested simultaneously.
Thus, it is of great interest to alternatively obtaining vertical
wheel forces without measuring them directly.
Soft sensors are combination of easy-to-measure variables in the
processing plants with models for delivering estimates of hard-to-
measure variables [35]. Note that the data of hard-to-measure var-
iables are usually not available in most times (e.g., the online esti-
mation process) other than a short of time (e.g., the ofine training
process). The sensors for measuring easy-to-measure variables are
called easy-to-measure sensors (e.g., the accelerometers and displa-
cement sensors), and the sensors for measuring hard-to-measure
variables are called hard-to-measure sensors (e.g., vertical wheel
forces or other force sensors in vehicle durability design). Soft
sensors based on multibody simulation models have been applied
to estimate vertical wheel forces [1,6,7]. However, high-accuracy
multibody simulation models are usually difcult to obtain in
many scenarios, where data-driven soft sensors are preferred to esti-
mate vertical wheel forces [8]. Furthermore, due to good properties
such as bounded-input bounded-output (BIBO) stability, multiple-
inputs single-output (MISO) nite impulse response (FIR) models
have proven to be good candidate ones to estimate vertical wheel
forces and other hard-to-measure variables in vehicle durability
design [7,9]. Specically, FIR models are identied in the ofine
training process, and then the vertical wheel forces are estimated
using the easy-to-measure sensors and identied FIR models in
the online estimation process.
In previous work [7], a large amount of easy-to-measure sensors
are installed specically only to estimate vertical wheel forces, and
these easy-to-measure sensors are available for severing as inputs of
FIR models. Note that some of the easy-to-measure sensors may be
irrelevant to the vertical wheel forces in identifying FIR models.
Thus, the economic cost could be further reduced if these irrelevant
1
Corresponding author.
Contributed by the Computers and Information Division of ASME for publication
in the JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING. Manuscript
received April 15, 2022; nal manuscript received August 31, 2022; published
online December 9, 2022. Assoc. Editor: Bin He.
Journal of Computing and Information Science in Engineering JUNE 2023, Vol. 23 / 031010-1
Copyright © 2022 by ASME
Downloaded from http://asmedigitalcollection.asme.org/computingengineering/article-pdf/23/3/031010/6958817/jcise_23_3_031010.pdf by Shanghai Jiaotong University, Mian Li on 12 January 2023
sensors are removed, which is particularly important in RLDA
testing campaigns. Also, it has been shown by many studies that
the tting performance of regression models can be tremendously
improved if only vital sensors are included in the development of
regression models [10,11]. However, in previous work [7], the strat-
egy of selecting relevant or vital easy-to-measure variables as inputs
of FIR models is based on physical insights or trial and error.
This work aims to effectively and efciently select Ksensors
among Seasy-to-measure sensors that t vertical wheel forces in
identifying MISO FIR models. The determination of Kshould be
done considering several factors, e.g., the tting performance and
the cost of installation and maintenance. One simple method for
solving this sensor selection problem is to evaluate the performance
of all S
K

candidate solutions. Similar works have been con-
ducted in the cases of small Kand S[12,13]. However, obviously
it is impractical in the scenarios when the value of S
K

becomes large. For example, in the estimation of one vertical
wheel force (not to mention considering other hard-to-measure var-
iables in the vehicle durability design), for K=10 and S=34, to
nd the globally optimally best sensor selection by direct enumer-
ation would require evaluating the performance approximately
34
10

times, which is on the order of 10
9
times and clearly very
difcult, if not practically impossible. Also, some global optimiza-
tion techniques, such as branch and bound [14,15], can and often
run for very long time [16]. Thus, an effective and efcient
sensor selection strategy is in high demand for estimating vertical
wheel forces when the value of S
K

is large.
Prior and related work: Many researches on sensor selection or
optimal sensor placement aim at developing effective and efcient
methods to deal with specic applications, such as system monitor-
ing [17], manufacturing process control [18], health management
allocation [19], and structural design [20]. Many works focus on
sensor selection in the scenarios where the system dynamics is
known in advance. For example, optimal sensor selection strategies
are proposed in terms of system observability in the Kalman-
ltering framework [1,6]. Some heuristic methods such as generic
algorithm or particle swarm optimization technique are applied to
sensor selection eld by dening objective functions based on spe-
cic properties of nite element models [21,22]. Sensor selection
methods based on the conditioning analysis of the Markov param-
eters matrix or Bayesian formulation to select groups of features are
used in the identication of the input force [23,24]. The authors in
Refs. [16,25,26] propose sensor selection strategies based on max-
imizing the norm or determinant of Fisher information matrices for
modal analysis of large-scale structures. Sensor selection based on
information entropy that minimizes the uncertainty in the model
parameters of structures is also discussed in Refs. [27,28]. These
sensor selection methods rely on (accurate) rst-principle models
and have proven to be very effective and efcient. However, as
mentioned previously, the rst-principle models are not often
achievable in vehicle durability design, and hence these methods
may not be directly applicable to sensor selection in tting vertical
wheel forces.
On the other hand, some studies focus on data-driven methods for
sensor selection. Different from the model-based sensor selection
methods, most data-driven methods do not rely on rst-principle
models. For example, typical machine learning methods combining
with domain knowledge are applied to automatically select an
optimal set of real and virtual sensors to improve the system
energy performance through fault detection and system health mon-
itoring [29]. Data-driven methods based on clustering algorithms,
information loss approach, and Pareto principle are used to derive
the optimal sensor placement strategies for intelligent building
energy management [30]. Due to the simple implementation
using modeling packages such as CVX [31], the group lasso
method has been proven as a popular, effective, and efcient
sensor selection strategy in many signal processing applications
[24,3234]. In these applications, multiple coefcients of each
feature are grouped together and share similar structures, and the
group lasso method provides both inter-group and intra-group spar-
sity to remove irrelevant coefcients. In this work, the group lasso
method could be easily adopted in identifying FIR models. It is
worthwhile to note that the hyper-parameters in the group lasso
method need to be carefully tuned to balance the regularization
terms and the original cost function by using an additional round
of cross-validation [34]. Also, as mentioned in Refs. [17,35], the
results of the group lasso may not return the smallest possible set
of groups that are sufcient to obtain an accurate data-driven model.
In this work, a simple and effective sensor selection method is
developed and presented by maximizing the marginal likelihood
of the data of the output in FIR models. First, we innovatively for-
mulate the problem into a standard mixed-integer programming
problem by maximizing the marginal likelihood of the output
data. Then, we approximately relax the mixed-integer programming
problem into a convex optimization problem with continuous vari-
ables and linear constraints, which can be solved efciently. To the
authorsbest knowledge, this work is the rst one in data-driven
sensor selection for estimating hard-to-measure variables in
vehicle durability design. For example, in Refs. [1,6], the authors
discuss the optimal sensor selection strategies relying on multibody
simulation models in estimating wheel center loads in vehicles. It is
believed that using the proposed method with the real test data will
facilitate the durability design of new vehicles.
Compared to the group lasso method and other data-driven
methods, the proposed method brings the benets of
(i) small possible subsets of sensors while retaining the same
model accuracy;
(ii) no additional data sets are needed to tune hyper-parameters.
Test results of sensor selection in vehicle durability design power-
fully prove that the proposed method provides small possible
subsets of sensors to realize the balance of the tting performance
and deployment cost of easy-to-measure sensors. Without loss of
generality, we focus on the sensor selection of the vertical wheel
force on the front left of the vehicle hereafter, the sensor selection
of the other hard-to-measure variables in vehicles can be performed
in a similar manner.
The rest content of this paper is organized as follows: In Sec. 2,
we formally dene the sensor selection problem in the context of
parameter estimation. In Sec. 3, we present the proposed method
based on convex optimization that can efciently solve this
problem. We illustrate the proposed sensor selection method with
numerical examples and experimental data in Secs. 4and 5, respec-
tively. Some concluding remarks are presented in Sec. 6.
2 Sensor Selection
In this section, preliminaries on identifying MISO FIR models
are given in Sec. 2.1. Then, the sensor selection problem is pre-
sented in Sec. 2.2.
Fig. 1 The transmissibility Tfrom the easy-to-measure vari-
ables y
I
to the hard-to-measure variable y
O
. The input uis
unknown.
031010-2 / Vol. 23, JUNE 2023 Transactions of the ASME
Downloaded from http://asmedigitalcollection.asme.org/computingengineering/article-pdf/23/3/031010/6958817/jcise_23_3_031010.pdf by Shanghai Jiaotong University, Mian Li on 12 January 2023
2.1 Parameter Estimation. Consider an unknown linear time
invariant (LTI) system as shown in Fig. 1, where udenotes the
unknown excitation or input to the system, and y
I
as well as y
O
are the outputs of the system. The relationship from y
I
to y
O
can
be described as a time-domain transmissibility [13,36,37],
denoted by T. Here y
I
refers to easy-to-measure variables (e.g.,
the accelerations and displacements), and y
O
refers to the
hard-to-measure variable (e.g., the vertical wheel force).
According to Refs. [7,13,37], the time-domain transmissibility T
can be estimated by a noncausal MISO FIR model
yO(k)=
S
s=1
ϕs(k)Tθs+e(k),k=d,...,N+d1(1)
where ϕ
s
(k)=[y
I,s
(kd)···y
I,s
(k+r)]
T
is the sth regressor associ-
ated with sth easy-to-measure variable denoted by yI,s(k)R,r
and dare model orders, e(k) is the residual modeled by white
noise with mean zero and variance σ
2
, and yO(k)Ris the
hard-to-measure variable. Let θsRndenote the model parameters
associated with y
I,s
, where n=r+d+1. In the ofine training
process, Eq. (1) can be extended to
Y=
S
s=1
Φsθs+E(2)
=Φϑ+E(3)
where
Y=yO(d)···yO(N+d1)

TRN(4)
Φs=ϕs(d)···ϕs(N+d1)

TRN×n(5)
E=e(d)···e(N+d1)

TRN(6)
Φ=Φ1···ΦS

RN×Sn (7)
and ϑ=[θT
1··· θT
S]TRSn, by stacking all historic data together.
It is a well known fact [16] that the least squares estimator of ϑis
ˆ
ϑLS =(ΦTΦ)1ΦTY(8)
When Φ
T
Φis close to being singular, which indicates that the
easy-to-measure variables are not persistently exciting [37,38],
the least squares estimation may cause large variances [3941]. In
these cases, the ridge regression strikes a good balance between
the biases and variances of the estimates [40,41]. The solution of
ridge regression can be obtained from the Bayesian point of view
[41]. Under Bayesian assumption that ϑis Gaussian with zero
mean and covariance matrix λI
Sn
, i.e., ϑN(0,λISn), the posterior
probability density function (pdf) is given by Bayes rule
p(ϑY)=p(Yϑ)p(ϑ)
p(Y)(9)
where p(Yϑ) is the likelihood function corresponding to
YN(Φϑ,σ2IN), and p(ϑ) is the prior pdf corresponding to
ϑN(0,λISn), and the pdf p(Y)isaϑ-independent normalization
which will be clear shortly. Apart from p(Y) and other
ϑ-independent terms, twice the negative logarithm of (9)
2lnp(ϑY)=2( ln p(Yϑ)+ln p(ϑ)ln p(Y))
=YΦϑ2
2/σ2+N(ln(2π)+ln (σ2)) (10)
+ϑ2
2/λ+Sn(ln(σ2)+ln (2π)) +2lnp(Y)(11)
=YΦϑ2
2/σ2+ϑ2
2/λ+constant (12)
Let
L(ϑ):= YΦϑ2
2/σ2+ϑ2
2/λ(13)
Minimizing Eq. (13) leads to ridge regression estimator of ϑ
ˆ
ϑR=ΦTΦ+σ2
λISn1
ΦTY(14)
where λand σ
2
are hyper-parameters and can be estimated by
maximizing the marginal likelihood p(Y|λ,σ
2
), i.e., the probability
of Yconditioned on λand σ
2
, using empirical Bayes method
[4042]. Assume ϑN(0,λISn). According to Eq. (3),
ΦϑN(0,λS
s=1Ψs), where Ψs=ΦsΦT
s, is a positive semi-
denite matrix, and s=1, S. Thus, Ywill be a Gaussian
random vector with mean zero and covariance matrix
P(λ,σ2)=λ
S
s=1
Ψs+σ2IN(15)
Thus, the marginal likelihood of Ycorresponds to
YN(0,λS
s=1Ψs+σ2IN). As similar to Eq. (13), it can be ver-
ied that twice the negative logarithm of p(Y|λ,σ
2
) leads to (omit the
terms independent of λor σ
2
)
W(λ,σ2):= YTP(λ,σ2)Y+log det P(λ,σ2)(16)
and maximizing the marginal likelihood function p(Y|λ,σ
2
) w.r.t. λ
and σ
2
leads to
ˆ
λ,ˆσ2=argmin
λ>0,σ2>0
W(λ,σ2)
=argmin
λ>0,σ2>0
YTλ
S
s=1
Ψs+σ2IN1
Y(17)
+log det λ
S
s=1
Ψs+σ2IN(18)
Problem (18) is difference of convex programming (DCP) problem
and some nonconvex optimization methods can be used to obtain
high-accuracy locally optimal solutions [40,43].
2.2 Sensor Selection Problem. One may use many
easy-to-measure sensors to estimate the hard-to-measure variable
y
O
[1,6,7]. For the sake of simplicity, one sensor corresponds to
one easy-to-measure variable. Then there may have a scenario
where some sensors are not related to the hard-to-measure variable
y
O
, that is, do not contribute to tting the hard-to-measure variable
y
O
, and hence these irrelevant sensors are aimed to be removed from
Seasy-to-measure sensors. Whereas we do not know which
easy-to-measure sensors are irrelevant ones, since the system
dynamics is unknown.
In addition to estimating the hyper-parameters λand σ
2
, we can
further consider the situation that a subset of only Ksensors out of
Seasy-to-measure sensors (K<S) is chosen in the context of max-
imizing the marginal likelihood of Y. Following Eqs. (15)(18), this
can be expressed by the optimization problem
minimize
c1,...,cS,λ,σ2YTλ
S
s=1
csΨs+σ2IN1
Y
+log detλ
S
s=1
csΨs+σ2IN
subject to cs{0,1},s=1,...,S
S
s=1
cs=K
λ>0,σ2>0
(19)
Journal of Computing and Information Science in Engineering JUNE 2023, Vol. 23 / 031010-3
Downloaded from http://asmedigitalcollection.asme.org/computingengineering/article-pdf/23/3/031010/6958817/jcise_23_3_031010.pdf by Shanghai Jiaotong University, Mian Li on 12 January 2023
which is a mixed-integer programming problem. The rst term in
the objective function (19) is related to the C-optimum design
[44], i.e., minimizing the inverse of the covariance matrix
λS
s=1csΨs+σ2INin the direction of vector Y. Here c
s
encodes
whether the sth sensor is used to t the data of the hard-to-measure
sensor. Note that the data of sth sensor (i.e., Ψ
s
) can be easily
removed from the data of Seasy-to-measure sensors by simply
setting c
s
to be zero.
3 Approximate Relaxation of Sensor Selection
In this section, we will describe the method solving the sensor
selection problem in Sec. 2approximately but efciently, while
retaining a good accuracy.
3.1 Approximate Relaxed Optimization Problems. By
replacing the nonconvex constraints c
s
{0, 1} with convex con-
straints c
s
[0, 1] and minimizing an upper bound of the objective
function of (19), we obtain the approximate relaxation of the sensor
selection problem (19) as
minimize
c1,...,cS,λ,σ2YTλ
S
s=1
csΨs+σ2IN1
Y
+λ
S
s=1
cstr(Ψs)+Nσ2N
subject to
S
s=1
cs=K
0cs1,s=1,...,S
λ>0,σ2>0
(20)
where the upper bound of the objective function of (19) is given
using the inequality log det(X)tr(XIN) for any positive denite
matrix XRN×N. This inequality is often used to approximate the
log-determinant term, e.g., in Refs. [45,46]. The problem (20) is not
equivalent to the original sensor selection problem (19), and a sub-
optimal subset selection can be obtained in indices associated with
the Klargest values of the solution c
1
,,c
S
to the problem (20).
Also, note that the objective function of the problem (20) is noncon-
vex w.r.t. c
1
,,c
s
,λ,σ
2
, and thus it is difcult to obtain a global
optimum. Nevertheless, this difculty can be eliminated by dening
new variable h
s
=λc
s
minimize
h1,...,hS,λ,σ2YT
S
s=1
hsΨs+σ2IN1
Y
+
S
s=1
hstr(Ψs)+Nσ2N
subject to
S
s=1
hs=Kλ
0hsλ,s=1,...,S
λ>0,σ2>0
(21)
which comes down to a convex problem. Note that the calculation
of inverting the matrix S
s=1hsΨs+σ2INis computationally inten-
sive for large data size Nsince the size of this matrix depends on the
data size N. Also, it would be problematic to solve problem (21)
when the matrix S
s=1hsΨs+σ2INis close to being singular.
These two issues will be tackled in Sec. 3.2.
For problems with the large data size N,itisusefultoefciently
supply the gradient and Hessian of the objective function in problem
(21). This avoids expensive computation of their nite-difference
counterparts. Fortunately, the analytical expressions of the gradient
and Hessian are tractable and are given in details in the Appendix.
In addition, we can impose the selection constraints of the sensors
as linear equalities or inequalities on the variable h=[h
1
,,h
S
]
T
in
the approximate relaxed problem (21). For example, the following
two types of constraints are common and useful in practice.
Logical constraints: For example, the constraint that sensor ican
be chosen only when sensor jis also chosen is expressed as h
i
h
j
;
Budget constraints: A maximum allowed cost for the selection,
as z
T
hλB, where z=[z
1
···z
S
]
T
,z
i
is the cost, say, in
dollars, power, or weight associated with choosing sensor
i, and Bis the budget.
More details on these additional constraints can be found in
Ref. [16]. It is worthwhile to note that it may not be easy for
other sensor selection methods (e.g., the group lasso [34]) to take
these constraints into account.
Let h=[h
1···h
S]Tdenote a solution to the problem (21). The
greater of the value h
s, the higher likelihood that the sth sensor is
chosen. A simplest possible method to generate a sub-optimal
subset selection denoted by ˆ
Kis as follows [16]. Let h
i1,...,h
iS
denote the elements of hsorted in descending order. The subset
selection is then
ˆ
K={i1,...,iK}(22)
where the indices are the Klargest elements of h.
According to Eq. (18) and ˆ
K, the hyper-parameters λand σ
2
can
further be optimized as follows:
ˆ
λ,ˆσ2=argmin
λ>0,σ2>0
YTλ
K
s=1
Ψis+σ2IN1
Y
+log detλ
K
s=1
Ψis+σ2IN(23)
=argmin
λ>0,σ2>0
YTλ
K
s=1
Ψis+σ2IN1
Y
+Nlog σ2+log detλ
σ2
K
s=1
Ψis+IN(24)
=argmin
λ>0,σ2>0
YTλ
K
s=1
Ψis+σ2IN1
Y
+Nlog σ2+log detλ
σ2ΦT
ˆ
KΦˆ
K+IKn(25)
=argmin
λ>0,σ2>0
YTλ
K
s=1
Ψis+σ2IN1
Y
+(NKn) log σ2+log det(λΦT
ˆ
KΦˆ
K+σ2IKn)(26)
where Φˆ
K=[Φi1···ΦiK], and the third equality holds using Sylve-
sters determinant theorem [40], and the initial guess of λand σ
2
,
denoted by ˆ
λ0and ˆσ2
0respectively, is provided from the solution
to the convex problem (21). The algorithm of identifying FIR
models is as follows.
Algorithm 1 (Identication of FIR models.) The algorithm
consists of following steps.
(1) Solve the convex problem (21) to obtain sensor selection ˆ
Kas well as
initial guess ˆ
λ0and ˆσ2
0;
(2) Solve the nonconvex problem (26) according to ˆ
K,ˆ
λ0and ˆσ2
0, to obtain
high-accuracy local optimum ˆ
λand ˆσ2;
(3) Estimate ϑusing ridge regression to obtain ˆ
ϑR=ΦT
ˆ
KΦˆ
K+ˆσ2
ˆ
λIKn

1
ˆ
KTY.
031010-4 / Vol. 23, JUNE 2023 Transactions of the ASME
Downloaded from http://asmedigitalcollection.asme.org/computingengineering/article-pdf/23/3/031010/6958817/jcise_23_3_031010.pdf by Shanghai Jiaotong University, Mian Li on 12 January 2023
The estimate of the model parameter ϑcan also be obtained by least
squares method once the sensor selection ˆ
Kis chosen. The results of
the comparative study between the least squares method and the
ridge regression will be shown in Sec. 5.
3.2 Computation of σ2IN+S
s=1hsΨs1.In this subsec-
tion, we consider how to deal with the computational issue of cal-
culating σ2IN+S
s=1hsΨs1in the problem (21), when the data
size Nis very large or the matrix σ2IN+S
s=1hsΨs, denoted by
X
S
, is close to being singular.
The computational burden of the convex problem (21) increases
vastly as the data size Ngoes to innity. The computational com-
plexity of calculating σ2IN+S
s=1hsΨs1in the problem (21)
is O(N
3
+SN
2
), which could be problematic when Nis large,
e.g., Nis several thousand or even larger.
We attempt to reduce the computational complexity of calculat-
ing X
S
, such that solving the problem (21) could be possibly ef-
cient when Nis large. Here we calculate the X
S
in a recursive
way. Denote X
0
=σ
2
I
N
, and X
s
=h
s
Ψ
s
+X
s1
, where s=1, ,S.
Then, we can compute X1
sfrom X1
s1using matrix inversion lemma
X1
s=X1
s1hsX1
s1ΦsIn+hsΦT
sX1
s1Φs1ΦT
sX1
s1(27)
where s=1, ,Sand Ψs=ΦsΦT
s. The computational complexity
of computing X1
sfrom X1
s1is O(N
2
n+Nn
2
+n
3
). Thus, the com-
putational complexity of recursively computing X1
Sfrom X1
0is
O(SN
2
n+SN n
2
+Sn
3
), which could be far more efcient than
directly computing X1
Sif NSn.
Now we tackle the issue when X
S
is close to being singular. Note
that the magnitude of σ
2
I
N
could be small compared to that of
S
s=1hsΨs. In this case, XS=σ2IN+S
s=1hsΨscan be ill-
conditioned if S
s=1hsΨsis very ill-conditioned, which can cause
numerical issues, for example, the failure or inaccuracy of the cal-
culation of X1
S.
According to matrix inversion lemma, we have
X1
S=σ2IN+
S
s=1
hsΨs1
=σ2IN+ΦCΦT1
=σ2INσ2ΦC1/2(σ2ISn +C1/2ΦTΦC1/2)1C1/2ΦT
(28)
where C1/2=diag( h
1,...,
h
S)In, and is the Kronecker
product. Then, we resort to the thin QR factorization [40,47] to cal-
culate (σ
2
I
Sn
+C
1/2
Φ
T
ΦC
1/2
)
1
more accurately, since it also could
be close to being singular. More specically, consider the thin QR
factorization of
ΦC1/2
σISn

(N+Sn)×Sn
=Q1
Q2

R(29)
where Q1RN×Sn,Q2RSn×Sn , and RRSn×Sn is an upper trian-
gular matrix and is nonsingular. Note that
QT
1Q1+QT
2Q2=ISn (30)
ΦC1/2=Q1R(31)
σISn =Q2R(32)
σ2ISn +C1/2ΦTΦC1/2=RTR(33)
Substituting Eqs. (31) and (33) into Eq. (28) yields
X1
S=σ2INσ2Q1R(RTR)1RTQT
1
=σ2(INQ1QT
1)(34)
such that X1
Scan be accurately obtained without explicitly invert-
ing R.
The major computation complexity from Eq. (34) relies on the QR
factorization (29), which depends on the data size N. Note that Φis
xed when solving the numerical optimization problem (21).Thus,
we can make use of the observation to compute QR factorization
(29) inamoreefcient way (to make the computational complexity
independent of N). Now consider the thin QR factorization of
Φ=QdRd(35)
where QdRN×Sn whose columns are orthogonal unit vectors such
that
QT
dQd=ISn (36)
and RdRSn×Sn is an upper triangular matrix. Now consider further
another thin QR factorization of
RdC1/2
σISn

2Sn×Sn
=Qc1
Qc2

Rc(37)
where Qc1,Qc2RSn×Sn ,andRcRSn×Sn is an upper triangular
matrix. Note that the matrix in factorization (37) is 2Sn-by-Sn dimen-
sional, which is independent of data size N. Also, we have
QT
c1Qc1+QT
c2Qc2=ISn (38)
RdC1/2=Qc1Rc(39)
σISn =Qc2Rc(40)
AccordingtoEqs.(35),(39),and(40),wehave
ΦC1/2
σISn

=QdRdC1/2
σISn

=QdQc1Rc
Qc2Rc

=QdQc1
Qc2

Rc(41)
AccordingtoEqs.(36) and (38),wehave
QT
c1QT
dQT
c2

QdQc1
Qc2

=QT
c1QT
dQdQc1+QT
c2Qc2=ISn (42)
which indicates that R=R
c
,and
Q1
Q2

=QdQc1
Qc2
 (43)
AccordingtoEq.(43),wehave
Q1=QdQc1(44)
Substituting Eq. (44) into Eq. (34) yields
X1
s=σ2(INQdQc1QT
c1QT
d)(45)
Therefore, we obtain an efcient and accurate way to calculate X1
S=
(σ2IN+S
s=1hsΨs)1using Eqs. (35),(37),and(45). The algorithm
of calculating (σ2IN+S
s=1hsΨs)1is as follows.
Algorithm 2 Assume that the thin QR factorization (35) has
been computed, and Qd,Rdis available. Then the algorithm consists
of following steps to compute (σ2IN+S
s=1hsΨs)1.
(Computation of (σ2IN+S
s=1hsΨs)1).
(1) Compute the diagonal matrix C1/2;
(2) Compute Qc1according to QR factorization (37);
(3) Compute X1
S=(σ2IN+S
s=1hsΨs)1according to (45).
In MATLAB, the command qr(·,0) can be used to compute the thin QR
factorization (37), which costs O(S3n3)without involving the data size
N[48]. The total computational cost of calculating X1
Sis
O(S3n3+NS2n2+N2Sn). Note that the computation of (σ2IN+
S
s=1hsΨs)1above is performed when NSn, the computation can be
performed in a similar way with minor revision when N<Sn.
Journal of Computing and Information Science in Engineering JUNE 2023, Vol. 23 / 031010-5
Downloaded from http://asmedigitalcollection.asme.org/computingengineering/article-pdf/23/3/031010/6958817/jcise_23_3_031010.pdf by Shanghai Jiaotong University, Mian Li on 12 January 2023
4 Numerical Validation
In this section, we demonstrate the effectiveness of the proposed
sensor selection method through numerical examples.
All systems are multiple-inputs multiple-outputs (MIMO)
systems with n
u
=4 inputs and n
o
=9 outputs. Note that the
inputs are generated using the command idinput which produces
random Gaussian signals. The rst eight outputs serve as relevant
easy-to-measure variables, and the last output serves as the
hard-to-measure variable aimed to be estimated. In addition, 32
random Gaussian signals are generated using Matlab command
randn, to serve as irrelevant easy-to-measure variables. Thus,
there are 40 signals available where eight ones are relevant
easy-to-measure variables and the remaining 32 ones are irrelevant
easy-to-measure variables.
All n
o
=9 outputs are added by zero-mean white Gaussian noise
using the command awgn in MATLAB, and the signal-to-noise ratio
(SNR) is 40 dB. All data samples are normalized using the
command normalize, the training data size is N=1000, the
test data size for model validation is 500, and the FIR model
orders are set as r=d=30. The performance of the proposed
sensor selection method is compared with that of the group lasso
method [17,24,33,34]. The group lasso could be adopted in the
identication of FIR models by simply grouping the model param-
eters of each sensor together
minimize
θ1,...θS
Y
S
s=1
Φsθs
2
2
+κ
S
s=1θs2(46)
The group lasso (46) is a convex problem and can be efciently
solved. Here θ
s
,Φ
1
,,Φ
S
, and Yare shown in Eq. (2). The
group lasso requires the tuning of a tradeoff hyper-parameter
denoted by κbetween a penalty term of the form S
s=1θs2and
the sum of squares of the training error. This hyper-parameter κis
usually tuned in the experiments by using an additional round of
cross-validation. In this work, the tradeoff hyper-parameter is
selected from the candidate set {1 × 10
5
,5×10
5
,,1,,5×
10
4
,1×10
5
}. Once the hyper-parameter and the model parameters
are determined, the rank order of sensors in the group lasso is deter-
mined using the norm of the model parameters for each
easy-to-measure variable, i.e., by sorting the easy-to-measure vari-
ables according to θs2, where s=1, ,S. After that, the associ-
ated model parameters of selected easy-to-measure variables are
re-calculated by least square methods.
The trained models are cross-validated using test (unseen) data
sets. To quantitatively evaluate the performance of the methods in
tting the hard-to-measure variable, we dene the measure of FIT
(i.e., 2-norm based metric) as [49]
FIT =100 ·1YΦˆ
K
ˆ
ϑ
2
Y
Y2%(47)
where Yis the data of the test data set,
Yis the mean of all entries of
Y,Φˆ
Kis the regressor matrix generated by the selection of
easy-to-measure variables in test data. Also, ˆ
ϑis the estimate
from the least squares method or the ridge regression after the
subset selection ˆ
Kis determined. The performance gets better as
the value of FIT gets larger.
Figure 2shows the tting performance versus the number of
sensors selected. All results are averaged over 100 independent
runs of the experiment. Note that the proposed method with least
squaresrefers to the tting performance obtained from least
squares method and the proposed sensor selection method, and
the proposed method with ridge regressionrefers to the algorithm
1 shown in Sec. 3.1. The ideal FIT with least squaresrefers to the
tting performance obtained using eight relevant easy-to-measure
variables and least squares method (as a benchmark). It is clear that
(i) The overall results of the proposed methods are better than
those from the group lasso method in the sense of the tting
performance.
(ii) The results based on ridge regression is slightly better than
those based on least squares method, which indicates the
small advantage of the ridge regression over ordinary
linear regression in the numerical examples.
(iii) When K=8, the tting performance of the proposed method
with least squares is very close to the ideal FIT with least
squares, which indicates that the proposed algorithm
enables to correctly choose the relevant easy-to-measure
variables.
Thus, the proposed methods show the applicability of sensor selec-
tion and have the stronger capability of choosing the most relevant
easy-to-measure variables to t the hard-to-measure variable in
comparison with the group lasso method. This conclusion can
also be veried in more details through the real-world application
in the next section.
5 Experimental Validation
In this section, we demonstrate the applicability and effectiveness
of the proposed sensor selection method through the real data in
tting the vertical wheel force.
Figure 3shows six easy-to-measure accelerometers installed in
the vehicle body. In Fig. 4, an accelerometer is installed close to
one of four wheel centers, and a displacement sensor is also
installed to measure the deection of the shock absorber. Note
that all these accelerometers measure the accelerations in X,Y,Z
directions. Eighteen accelerations measured by six accelerometers
in the vehicle body are denoted by BX
i,BY
i,BZ
i, where i=1, ,6.
Four deections measured by displacement sensors are denoted
by D
1
,,D
4
. Twelve accelerations measured by four accelerome-
ters near the wheel centers are denoted by WX
i,WY
i,WZ
i, where
i=1,...,4. In this test the vehicle is running at the speed of
30 kmh
1
straightly on a Belgium block road, and over 1 × 10
4
data samples are collected. The sampling rate is 512 Hz.
5.1 The Comparative Study Between the Proposed Method
and the Group Lasso Method. Figures 5and 6show the tting
performance as the number of sensors selected is varied within
K=1, 2, , 10 in different scenarios. In Fig. 5, we investigate
the inuence of the training data size (N=1000, 1300, 1600,
2000) on the tting performance given the model orders r=0,
d=50 (in Ref. [7], it is reported that it is sufcient to use causal
FIR models to t the vertical wheel force). In Fig. 6, we investigate
the inuence of the model order (r=0, d=30, 40, 50, 60) on the
tting performance given the training data size N=1000. It is
Fig. 2 The tting performance (FIT) versus the number of
sensors selected
031010-6 / Vol. 23, JUNE 2023 Transactions of the ASME
Downloaded from http://asmedigitalcollection.asme.org/computingengineering/article-pdf/23/3/031010/6958817/jcise_23_3_031010.pdf by Shanghai Jiaotong University, Mian Li on 12 January 2023
clear to see that the overall results of the proposed methods are
better than those from the group lasso in different training data
sizes and the model orders. Furthermore, it can be seen that the pro-
posed methods work robustly when the data size and the model
order are varied, whereas the performance of the group lasso is sen-
sitive to the variation of the data size and the mode order. The
results indicate that the proposed methods can return smaller possi-
ble subsets of sensors while retaining the same model accuracy,
compared to the group lasso method. Therefore, it clearly validates
the effectiveness and applicability of the proposed methods when
the training data size and the model order are varied.
From Figs. 5and 6, we also notice that the tting performance is
not improved signicantly when K4. The possible reason is that
some related easy-to-measure sensors are redundant when K4
(the corresponding easy-to-measure variables can be estimated by
others), and these additional sensors involved do not provide
more information for tting the vertical wheel force. Also, the
overall performance of the ridge regression is marginally better
than that of least squares method, which indicates that it may not
be necessary using ridge regression after sensor selection.
Particularly, we list the selected sensors based on the results of
the proposed method in the scenario when N=1000 and d=50
(which corresponds to Fig. 5(a)), to see whether the results of
sensor selection coincide with the physical insights. Table 1
shows the selected sensors when K=3, 6:
(i) When K=3, the proposed method selects the displacement
sensors D
1
,D
2
and the acceleration WZ
1, on the front of the
vehicle. Selecting the acceleration WZ
1coincides with the
physical insights since the vertical wheel force and the accel-
eration WZ
1both are measured in the Zdirection, and the
locations of associated sensors are very close to each other
(both are on the front left of the vehicle). Selecting D
2
also
improves the result, which indicates that the two suspensions
on the front may be coupled with each other, i.e., the vibra-
tion of the suspension on the front right may also inuence
the vibration of the suspension on the front left, and vice
versa (this was veried when we t the vertical wheel
force on the front right of the vehicle).
(ii) When K=6, the deections and the accelerations in Xand Z
directions on the front of the vehicle are involved in improv-
ing the tting performance of the vertical wheel force on the
front left of the vehicle. Note that the accelerations in the Y
direction is not selected, which coincides with the physical
insight of the vehicle dynamics because the accelerations
in the Ydirection may not contribute to tting the vertical
wheel force when the vehicle is running straightly. Also,
the sensors on the rear right are not selected. That is
because the locations of these sensors on the rear right are
far from the wheel on the front left, and hence are less
likely to be relevant to the vertical wheel force on the front
left.
From the results of Table 1, we see that the results of the proposed
method coincide with the physical insights and hence are
explainable.
5.2 The Comparative Study Between the Proposed Method
and Enumeration-Based Method. Furthermore, the proposed
method is compared with the enumeration-based method for
small K, i.e., K=1, , 6. Here the enumeration-based method
refers to the method based on enumerating all S
K

candidate
choices: Once a candidate choice according enumeration is given,
Fig. 4 The locations of the wheel force transducer and the
accelerometers: (a) the wheel force transducer measuring
WCLs is installed on the wheels and (b) the accelerometer is
installed close to the wheel center
Fig. 3 The locations of the accelerometers and the wheel force
transducer (a) the locations of accelerometers installed in the
vehicle body (3D demonstration) and (b) the locations of acceler-
ometers installed in the vehicle body (top view)
Journal of Computing and Information Science in Engineering JUNE 2023, Vol. 23 / 031010-7
Downloaded from http://asmedigitalcollection.asme.org/computingengineering/article-pdf/23/3/031010/6958817/jcise_23_3_031010.pdf by Shanghai Jiaotong University, Mian Li on 12 January 2023
Fig. 5 The tting performance (FIT) versus the number of sensors selected when r=50: (a) data size N=1000,
(b) data size N=1300, (c) data size N=1600, and (d) data size N=2000.
Fig. 6 The tting performance (FIT) versus the number of sensors selected when the training data size N=1000: (a)
model order r=0, d=30, (b) model order r=0, d=40, (c) model order r=0, d=50, and (d) model order r=0, d=60.
031010-8 / Vol. 23, JUNE 2023 Transactions of the ASME
Downloaded from http://asmedigitalcollection.asme.org/computingengineering/article-pdf/23/3/031010/6958817/jcise_23_3_031010.pdf by Shanghai Jiaotong University, Mian Li on 12 January 2023
the tting performance corresponding to the candidate choice is
cross-validated using least squares in the additional validation
data set. Then, the optimal sensor selection can be obtained after
going through all candidate sensor selection choices. Although
the enumeration-based method is time-consuming when the value
of S
K

is large, it can be regarded as a benchmark when Kand
Sare small.
From the results in Sec. 5.1, it can be seen that the
easy-to-measure sensors on the front of vehicles may be related to
the vertical wheel force on the front left. Thus, 14 easy-to-measure
sensors, i.e., BX
i,BY
i,BZ
i,WX
i,WY
i,WZ
i,D
i
, where i=1, 2, are
selected as candidate ones. Figure 7shows the tting performance
of the proposed method using least squares and the enumeration-
based method in the case of K=1, ,6,S=14, N=2000, and
d=50. It can be seen that the tting performance of the proposed
method gets closer to that of the enumeration-based method as K
gets larger, especially when K3. This indicates it is possible to
obtain the optimal sensor selection using the proposed method.
Note that the performance of the proposed method is poor when
K=1, 2. This may indicate the limitation of the proposed method
on small K, which will be further addressed in the future work.
5.3 The Computational Time. Now we compare the compu-
tational efciencies of the proposed method, the group lasso, and
the enumeration-based method when K=1, 6 and S=14. The
experiments are conducted with a 2.50 GHz Intel Core i5-7200U
processor, and all implementations are done in Matlab 2021b.
The group lasso is solved using the MATLAB package CVX
[31,50], and the convex problem (21) is solved using Matlab
command fmincon with default setting. In Fig 8, we report the
run-time of the proposed method and the enumeration-based
method when the training data size N=2000 and the model order
r=0, d=50. Also, the run-time of the group lasso is 36.5 s. We
can see that the group lasso is faster than the proposed method
for going through all results when K=1, 2, , 6. The disadvantage
of the proposed method is that it needs to solve the optimization
problem for each K, which indicates that the overall run-time gets
larger as Kgets larger. On the contrary, the run-time of the group
lasso is irrelevant to the value of K. Nevertheless, given the value
of K, the convex optimization (which takes a few seconds) is
much faster than the group lasso method, since the hyper-
parameters of the group lasso method need to be tuned using addi-
tional round of cross-validation and is time-consuming. Further-
more, it can be seen that the run-time of the enumeration-based
method gets larger vastly as Kgets larger, since the it needs to enu-
merate all possible candidate solutions. That is very time-
consuming when the value of S
K

is large or there are many
hard-to-measure variables in the vehicle durability design.
6 Conclusion
The problem of choosing Ksensors, among a large number of
easy-to-measure sensors, to identify the best FIR model is a very
important and difcult combinatorial problem. This work presents
a novel data-driven method to remove the irrelevant sensors to
the vertical wheel force in vehicle durability design. The problem
is formulated into a mixed-integer programming problem, which
is relaxed into a convex optimization problem with continuous
design variables and linear constraints, such that the sensor selec-
tion problem can be solved effectively and efciently.
The numerical examples and real data in the vehicle durability
design are used to verify the applicability and effectiveness of the
proposed method. It has been shown that the proposed method is
capable of solving sensor selection problems, and working robustly
with different training data sizes and model orders. Compared to the
results from the group lasso or other data-driven methods, the pro-
posed method has two advantages:
(i) the results of the proposed method return smaller possible
subsets of sensors while retaining the same model accuracy,
which brings benets to the applications where the cost of
sensors is one of the key issues;
(ii) no additional data sets are needed to tune hyper-parameters,
which brings benets to the scenarios where data are limited
to achieve.
The proposed method presented is simple to implement and may
be applicable in various other signal processing applications where
multiple coefcients of each feature are grouped together and share
the similar structures [24,34].
Acknowledgment
The work presented here is supported in part by VMAP program
of VMware. Such support does not constitute an endorsement by
the funding agency of the opinions expressed in the article.
Table 1 Sensor selection results of the proposed in tting the
vertical wheel force on the front left shown in Fig. 4(a)
K=3K=6
D
1
D
2
WZ
1D
1
D
2
WZ
1WX
1BZ
1BZ
2
Fig. 7 The tting performance of the proposed method and the
enumeration-based method
Fig. 8 The run-time of the proposed method and the
enumeration-based method.
Journal of Computing and Information Science in Engineering JUNE 2023, Vol. 23 / 031010-9
Downloaded from http://asmedigitalcollection.asme.org/computingengineering/article-pdf/23/3/031010/6958817/jcise_23_3_031010.pdf by Shanghai Jiaotong University, Mian Li on 12 January 2023
Conict of Interest
There are no conicts of interest.
Data Availability Statement
The datasets generated and supporting the ndings of this article
are obtainable from the corresponding author upon reasonable
request.
Appendix: Computation of the Gradient and Hessian
in the Problem (21)
We give the analytic expression of the gradient and the Hessian
of the objective function in the problem (21) w.r.t. h
1
,,h
S
,σ
2
for
numerical optimization. Let f:RS+1R
f(h1,...,hS,σ2)=YTX1
SY+
S
s=1
hstr(Ψs)+Nσ2N(A1)
where YRN, and
XS=σ2IN+
S
s=1
hsΨs,hs0(A2)
is a positive denite matrix. Then, we have (to simplify the notation,
we omit the subscript Sin X
S
)
X1
hs
=X1X
hs
X1=X1ΨsX1,s=1,...,S(A3)
X1
σ2=X1X
σ2X1=X2(A4)
The gradient of fw.r.t. h
1
,,h
S
,σ
2
is
f=
YTX1Ψ1X1Ytr(Ψ1)
.
.
.
YTX1ΨSX1Ytr(ΨS)
YTX2YN
(A5)
=IS+1(YTX1)˜
ΨX1Y+η(A6)
where ˜
Ψ=[Ψ1··· ΨSIN]T,η=[tr(Ψ1)···tr(ΨS)N]T, and
denotes the Kronecker product. Furthermore, we have
2YTX1Y
hsh
=YTX1
h
ΨsX1+X1Ψs
X1
hY
=2YTX1ΨsX1ΨsX1Y(A7)
and
2YTX1Y
hsσ2=YTX1
σ2ΨsX1+X1Ψs
X1
hY
=2YTX1ΨsX2Y
(A8)
According to Eqs. (A7) and (A8), the Hessian matrix is
H=2IS+1(YTX1)˜
ΨX1˜
ΨT(IS+1(X1Y)) (A9)
which is the positive semi-denite.
References
[1] Risaliti, E., Tamarozzi, T., Vermaut, M., Cornelis, B., and Desmet, W., 2019,
Multibody Model Based Estimation of Multiple Loads and Strain Field on a
Vehicle Suspension System,Mech. Syst. Signal Process.,123(3), pp. 125.
[2] Johannesson, P., and Speckert, M., 2013, Guide to Load Analysis for Durability in
Vehicle Engineering, John Wiley & Sons, Hoboken, NJ.
[3] Kadlec, P., Gabrys, B., and Strandt, S., 2009, Data-Driven Soft Sensors in the
Process Industry,Comput. Chem. Eng.,33(4), pp. 795814.
[4] Jiang, Y., Yin, S., Dong, J., and Kaynak, O., 2020, A Review on Soft Sensors for
Monitoring, Control and Optimization of Industrial Processes,IEEE Sensors J.,
21(11), pp. 1286812881.
[5] Wang, D., Gong, J., Kang, Q., Fan, D., and Yang, J., 2019, Soft Sensing for
Gas-Condensate Field Production Using Parallel-Genetic-Algorithm-Based
Data Reconciliation,ASME J. Comput. Inf. Sci. Eng.,19(4), p. 044501.
[6] Cumbo, R., Mazzanti, L., Tamarozzi, T., Jiranek, P., Desmet, W., and Naets, F.,
2021, Advanced Optimal Sensor Placement for Kalman-Based Multiple-Input
Estimation,Mech. Syst. Signal Process.,160(6), p. 107830.
[7] Zheng, X., Cai, R., Xiao, S., Qiu, Y., Zhang, J., and Li, M., 2021,
Primary-Auxiliary Model Scheduling Based Estimation of the Vertical Wheel
Force in A Full Vehicle System,arXiv preprint arXiv:2107.11511, under review.
[8] El-kafafy, M., Csurcsia, P., Cornelis, B., Enrico, R., and Janssens, K., 2020,
Machine Learning and System Identication for the Estimation of Data-
Driven Models: An Experimental Case Study Illustrated on a Tire-Suspension
System,Proceedings of the International Conference on Noise and Vibration
Engineering (ISMA), October, KU Leuven, pp. 32873301.
[9] Zheng, X., Zhuang, C., Xiao, S., Qiu, Y., Zhang, J., and Li, M., 2023, Signal
Estimation for Vehicle Body Accelerations Using Piecewise Linear System
Identication in the Frequency Domain,ASME J. Comput. Inf. Sci. Eng.,
23(2), p. 021003.
[10] Wang, Z. X., He, Q. P., and Wang, J., 2015, Comparison of Variable
Selection Methods for PLS-Based Soft Sensor Modeling,J. Process Control,
26(6), pp. 5672.
[11] Andersen, C. M., and Bro, R., 2010, Variable Selection in RegressionA
Tutorial,J. Chemom.,24(1112), pp. 728737.
[12] Aljanaideh, K. F., and Bernstein, D. S., 2015, Time-Domain Analysis of Motion
Transmissibilities in Force-Driven and Displacement-Driven Structures,
J. Sound Vib.,347(4), pp. 169183.
[13] Aljanaideh, K. F., and Bernstein, D. S., 2018, Experimental Application of
Time-Domain Transmissibility Identication to Fault Detection and
Localization in Acoustic Systems,ASME J. Vib. Acoust.,140(2), p. 021017.
[14] Welch, W. J., 1982, Branch-and-Bound Search for Experimental Designs Based
on D-Optimality and Other Criteria,Technometrics,24(1), pp. 4148.
[15] Lawler, E. L., and Wood, D. E., 1966, Branch-and-Bound Methods: A Survey,
Oper. Res.,14(4), pp. 699719.
[16] Joshi, S., and Boyd, S., 2008, Sensor Selection Via Convex Optimization,IEEE
Trans. Signal Process.,57(2), pp. 451462.
[17] Subrahmanya, N., Shin, Y. C., and Meckl, P. H., 2010, A Bayesian Machine
Learning Method for Sensor Selection and Fusion With Application to
On-Board Fault Diagnostics,Mech. Syst. Signal Process.,24(1), pp. 182192.
[18] Zhang, W., Yu, J. Z., Zhu, F., Zhu, Y., Yang, Z., Ulu, N. G., Arisoy, B., and Kara,
L. B., 2019, High Degree of Freedom Hand Pose Tracking Using Limited Strain
Sensing and Optical Training,ASME J. Comput. Inf. Sci. Eng.,19(3),
p. 031014.
[19] Hoyle, C., Tumer, I. Y., Mehr, A. F., and Chen, W., 2009, Health Management
Allocation During Conceptual System Design,ASME J. Comput. Inf. Sci. Eng.,
9(2), p. 021002.
[20] Singh, V., and Willcox, K. E., 2021, Decision-Making Under Uncertainty for a
Digital Thread-Enabled Design Process,ASME J. Mech. Des.,143(9),
p. 091707.
[21] Yao, L., Sethares, W. A., and Kammer, D. C., 1993, Sensor Placement for
On-Orbit Modal Identication Via a Genetic Algorithm,AIAA J.,31(10),
pp. 19221928.
[22] Rao, A. R. M., and Anandakumar, G., 2007, Optimal Placement of Sensors for
Structural System Identication and Health Monitoring Using a Hybrid Swarm
Intelligence Technique,Smart Mater. Struct.,16(6), p. 2658.
[23] Wang, J., Law, S., and Yang, Q., 2013, Sensor Placement Methods for an
Improved Force Identication in State Space,Mech. Syst. Signal Process.,
41(12), pp. 254267.
[24] Feng, W., Li, Q., Lu, Q., Li, C., and Wang, B., 2021, Group Relevance Vector
Machine for Sparse Force Localization and Reconstruction,Mech. Syst. Signal
Process.,161(6), p. 107900.
[25] Stephan, C., 2012, Sensor Placement for Modal Identication,Mech. Syst.
Signal Process.,27(1), pp. 461470.
[26] Kammer, D. C., 1991, Sensor Placement for On-Orbit Modal Identi cation
and Correlation of Large Space Structures,J. Guidance Control Dyn.,14(2),
pp. 251259.
[27] Yin, T., Yuen, K.-V., Lam, H.-F., and Zhu, H.-p., 2017, Entropy-Based Optimal
Sensor Placement for Model Identication of Periodic Structures Endowed With
Bolted Joints,Comput.-Aided Civil Infrastruct. Eng.,32(12), pp. 10071024.
[28] Yuen, K.-V., Katafygiotis, L. S., Papadimitriou, C., and Mickleborough, N. C.,
2001, Optimal Sensor Placement Methodology for Identication With
Unmeasured Excitation,ASME J. Dyn. Syst. Meas. Control,123(4), pp. 677
686.
[29] Hu, R., Granderson, J., Auslander, D., and Agogino, A., 2019, Design of
Machine Learning Models With Domain Experts for Automated Sensor
Selection for Energy Fault Detection,Appl. Energy,235(1), pp. 117128.
[30] Yoganathan, D., Kondepudi, S., Kalluri, B., and Manthapuri, S., 2018, Optimal
Sensor Placement Strategy for Ofce Buildings Using Clustering Algorithms,
Energy Build.,158(1), pp. 12061225.
[31] CVX Research, I., 2012. CVX: Matlab Software for Disciplined Convex
Programming, Version 2.0, August.
031010-10 / Vol. 23, JUNE 2023 Transactions of the ASME
Downloaded from http://asmedigitalcollection.asme.org/computingengineering/article-pdf/23/3/031010/6958817/jcise_23_3_031010.pdf by Shanghai Jiaotong University, Mian Li on 12 January 2023
[32] Klingspor, M., Hansson, A., and Löfberg, J., 2018, Input Selection in ARX
Model Estimation Using Group Lasso Regularization,IFAC-PapersOnLine,
51(15), pp. 897902.
[33] Fujiwara, K., and Kano, M., 2015, Efcient Input Variable Selection for
Soft-Sensor Design Based on Nearest Correlation Spectral Clustering and
Group Lasso,ISA Trans.,58(4), pp. 367379.
[34] Li, B., Zhao, Y.-P., Wu, H., and Tan, H.-J., 2021, Optimal Sensor Placement
Using Data-Driven Sparse Learning Method With Application to Pattern
Classication of Hypersonic Inlet,Mech. Syst. Signal Process.,147(1), p.
107110.
[35] Wang, L., Chen, G., and Li, H., 2007, Group Scad Regression Analysis for
Microarray Time Course Gene Expression Data,Bioinformatics,23(12),
pp. 14861494.
[36] Aljanaideh, K. F., and Bernstein, D. S., 2015, Time-Domain Analysis of
Sensor-to-Sensor Transmissibility Operators,Automatica,53(2), pp. 312319.
[37] Aljanaideh, K. F., and Bernstein, D. S., 2020, Output-Only Identication of
InputOutput Models,Automatica,113(2), p. 108686.
[38] Lennart, L., 1999, System Identication: Theory for the User, PTR Prentice Hall,
Upper Saddle River, NJ, pp. 114.
[39] Zhang, Z., Poudel, L., Sha, Z., Zhou, W., and Wu, D., 2020, Data-Driven
Predictive Modeling of Tensile Behavior of Parts Fabricated by Cooperative
3D Printing,ASME J. Comput. Inf. Sci. Eng.,20(2), p. 021002.
[40] Chen, T., and Ljung, L., 2013, Implementation of Algorithms for Tuning
Parameters in Regularized Least Squares Problems in System Identication,
Automatica,49(7), pp. 22132220.
[41] Ljung, L., and Chen, T., 2013, What Can Regularization Offer for Estimation of
Dynamical Systems?,11th IFAC International Workshop on Adaptation and
Learning in Control and Signal Processing (ALCOSP13), Caen, France, July
35, 2013, IFAC, pp. 18.
[42] Carlin, B., and Louis, T., 2000, Bayes and Empirical Bayes Methods for Data
Analysis, 2nd ed., Chapman & Hall/CRC Texts in Statistical Science, Taylor &
Francis.
[43] Chen, T., Andersen, M. S., Ljung, L., Chiuso, A., and Pillonetto, G., 2014,
System Identication Via Sparse Multiple Kernel-Based Regularization Using
Sequential Convex Optimization Techniques,IEEE Trans. Autom. Control,
59(11), pp. 29332945.
[44] Pronzato, L., and Pázman, A., 2013, Design of Experiments in Nonlinear
Models,Lect. Notes Stat., 212(1), p. 1.
[45] Ang, M. A., and Gillis, N., 2018, Volume Regularized Non-negative Matrix
Factorizations,2018 Ninth Workshop on Hyperspectral Image and Signal
Processing: Evolution in Remote Sensing (WHISPERS), Sept. 2326, IEEE,
pp. 15.
[46] Leplat, V., Ang, A. M., and Gillis, N., 2019, Minimum-Volume Rank-Decient
Non-Negative Matrix Factorizations,IEEE International Conference on
Acoustics, Speech and Signal Processing (ICASSP), May 1217, pp. 34023406.
[47] Gene, H. G., and Van Loan, C. F. 2013, Matrix Computations, JHU Press,
Baltimore, MD.
[48] Boyd, S., Boyd, S. P., and Vandenberghe, L., 2004, Convex Optimization,
Cambridge University Press, Cambridge, UK.
[49] Ljung, L., 2018, The System Identication Toolbox: The Manual,Natick, MA,
https://www.mathworks.com/products/sysid.html
[50] Grant, M., and Boyd, S., 2008, Graph Implementations for Nonsmooth Convex
Programs.Recent Advances in Learning and Control, V. Blondel, S. Boyd, and
H. Kimura, eds., Lecture Notes in Control and Information Sciences,
Springer-Verlag Limited, pp. 95110.
Journal of Computing and Information Science in Engineering JUNE 2023, Vol. 23 / 031010-11
Downloaded from http://asmedigitalcollection.asme.org/computingengineering/article-pdf/23/3/031010/6958817/jcise_23_3_031010.pdf by Shanghai Jiaotong University, Mian Li on 12 January 2023
... In the literature, many researchers considered various settings. For example, [12] proposed a data-driven method to select the K most relevant sensors among S candidate ones that best fit the response of one vertical wheel force; [13] proposed sparse group lasso to do a less restrictive form of structured sparse feature selection; [14] proposed a LASSO-based variable selection algorithm based on linear combinations of the gene expression. In [15], a tumor classification method based on sparse group lasso and support vector machine was introduced. ...
... Zheng proposed a data-driven method to select the K most relevant sensors among S candidate ones that best fit the response of one vertical wheel force [11]. Rao proposed sparse group lasso to do a less restrictive form of structured sparse feature selection [12]. ...
Preprint
Full-text available
Soft sensing is a way to indirectly obtain information of signals for which direct sensing is difficult or prohibitively expensive. It may not a priori be evident which sensors provide useful information about the target signal. There may be sensors irrelevant for the estimation as well as sensors for which the information is very poor. It is often required that the soft sensor should cover a wide range of operating points. This means that some sensors may be useful in certain operating conditions while irrelevant in others, while others may have no bearing on the target signal whatsoever. However, this type of structural information is typically not available but has to be deduced from data. A further compounding issue is that multiple operating conditions may be described by the same model, but which ones is not known in advance either. In this contribution, we provide a systematic method to construct a soft sensor that can deal with these issues. While the different models can be used, we adopt the multi-input single output finite impulse response models since they are linear in the parameters. We propose a single estimation criterion, where the objectives are encoded in terms of model fit, model sparsity (reducing the number of different models), and model parameter coefficient sparsity (to exclude irrelevant sensors). A post-processing model clustering step is also included. As proof of concept, the method is tested on field test datasets from a prototype vehicle.
... In static systems, where the underlying conditions are time-independent, the focus is on optimizing the placement and configuration of sensors to capture essential information about the system state. Classical methods for sensor subset selection in static systems include greedy algorithms which iteratively select sensors to maximize information gain or reduce uncertainty [21][22][23][24], information theory-based methods utilizing metrics like mutual information and entropy [25][26][27], optimization techniques aiming to maximize system performance subject to constraints [28][29][30][31], heuristic approaches employing intuitive strategies such as thresholding or random selection [32,33], and statistical methods modeling sensor-system relationships through regression analysis or principal component analysis [34,35]. ...
Article
Full-text available
Coupled multidisciplinary systems involve different disciplines/subsystems with feedback-coupled interactions, illustrating the complex interdependencies inherent in real-world engineering systems. Effective monitoring of a coupled multidisciplinary system is crucial for real-time assessment of the interactions between various disciplines within the system. This monitoring provides the data necessary for detecting and addressing issues in a timely manner and facilitates adaptive decision-making for taking reliable design or control actions. However, processing and analyzing data in real time is computationally intensive, and limited resources, such as computational power, sensor capabilities, and budget, may constrain the extent to which a system can be monitored comprehensively. To address this, this paper develops a particle-based approach that dynamically selects a subset of sensors that provides the highest information about the state of the system in real time. The proposed approach first predicts the amount of uncertainty in the estimation of the state of the system given noisy measurements from different subsets of available sensors. Then, it selects the sensors that reduce this uncertainty the most, enhancing the precision and efficiency of the monitoring process. The efficacy of the proposed framework is demonstrated via two coupled multidisciplinary systems in the numerical experiments.
Article
Precise decoupling and calibration of multi-axis force sensor (MFS) is crucial in engineering applications. This work presents a novel nonlinear decoupling and calibration approach to meet the physical coupling characteristics of the structural strain-deformation transducers. It deals with the most force-voltage responses by the linear prime modeling and the gross error deviation by the nonlinear error modeling. Such a prime-error framework is naturally derived from the conventional least-square decoupling model with the delicate nonlinear error modeling by the multivariate Bernstein polynomials. The two-axis and three-axis force sensor are tested and compared with the proposed Bernstein-based Prime-Error Model (BPEM), the Least-Square-based Method (LSM), and error-based Neural Network learning model (eNN), Extreme Learning Machine (ELM) as well as the error-based Support Vector Model (e-SVM), the Interpretable Nonlinear Decoupling model (IND). Results demonstrate that the proposed BPEM provides an accurate, practical, and effective scheme for modeling and calibrating MFSs.
Article
Full-text available
In this work, we investigate a signal estimation problem which is common and critical for durability design of vehicle bodies. The relation between the frequency responses of accelerometers is the target to model so that the ones of easy-to-measure accelerometers can estimate the responses of hard-to-measure accelerometers. A piecewise linear frequency-domain identification method relying on Finite Impulse Response (FIR) models is proposed and performed to tackle the nonlinearity issue in the signal estimation problems: first, the interesting frequency range is segmented into three sub-ranges which are clearly identified by peak histograms of frequency signals. Then, FIR models which provide a satisfactory description of the system are constructed to estimate the frequency responses of the interesting signals at sub-ranges, one for each. The performance of the proposed approach is validated by using real-world data under multiple working conditions. The results show that the proposed method has a good estimation accuracy, and it brings the benefit that the number of accelerometers can be significantly reduced during the durability design of vehicle bodies.
Article
Full-text available
Locations and amplitudes of external forces are essential information for design, loading assessment, and health monitoring of structures. In this paper, we propose an original time domain group sparsity regularization method, named Group Relevance Vector Machine, to localize and reconstruct external forces on structures based on structure responses only. Group Relevance Vector Machine constructs a structured regularization on the unknown forces, by binding the unknown amplitudes associated with different potential locations into separate groups and promoting the group-level sparsity between the potential locations. With this technique, we can adaptively localize and reconstruct dynamic point-forces in an underdetermined sensor configuration. Group Relevance Vector Machine is constructed and derived in detail under the hierarchical Bayesian framework. Its adaptivity, computational efficiency, and robustness with respect to noise and applied structure under underdetermined sensor configurations are comprehensively validated numerically on a cantilever beam and a cantilever plate, and experimentally on a cantilever plate and an engineering-scale tank.
Conference Paper
Full-text available
In this paper, we are investigating the capabilities of both classical system identification and modern machine learning (time regression neural networks) to derive predictive black-box models which can predict the wheel center loads (WCLs) by making use of either the road disturbances acting on the wheel-patch or strain measurements on the suspension components. The WCLs are very important for the durability analyses of a vehicle and their direct measurement requires the use of expensive force transducers. Thus, predictive black-box models can augment data sets by predictions of the WCLs when multiple vehicles need to be tested and where it is infeasible to instrument dedicated transducers in each vehicle. As identification techniques to be used to derive those black-box models, the linear ARX model, the polynomial nonlinear state space (PNLSS) method, and the time regression neural networks (NNs) will be reviewed and benchmarked using experimental data measured on a McPherson suspension test rig.
Article
Full-text available
Over the past twenty years, numerous research outcomes have been published, related to the design and implementation of soft sensors. In modern industrial processes, various types of soft sensors are used, which play essential roles in process monitoring, control and optimization. Emerging new theories, advanced techniques and the information infrastructure have enabled the elevation of the performance of soft sensing. However, novel opportunities are accompanied by novel challenges. This work is motivated by these observations and aims to present a comprehensive review of the developments since the start of the millennium. While a few books and review articles are published on the related topics, more focus on the most up-to-the-date advancement is put in this work, from the perspective of systems and control.
Article
Full-text available
In applications of system identification where the input signal is unknown, transmissibility identification can be used to estimate the operator between a pair of output signals. A limitation of transmissibility identification is the fact that the numerator and denominator of a transmissibility are numerators of the transfer functions from the input to the sensors. Consequently, transmissibility identification does not capture the poles of the system. To overcome this limitation, this paper introduces transmissibility-based system identification (TBSID), which uses estimates of two transmissibilities to identify the complete dynamics from two unmeasured inputs to two measured outputs. The method assumes that the system can be excited nonsimultaneously by two input signals at two distinct locations.
Article
In this work we study the estimation problem of the vertical wheel force in a vehicle system subject to non-stationary and unknown excitation, which is a common and critical problem in vehicle durability design. The vehicle system can be approximated by a linear time invariant system under each working condition. However, the working conditions are unknown during the estimation process. A primary–auxiliary model scheduling procedure based on time-domain transmissibilities is proposed and performed to identify the unknown working conditions and estimate the unknown vertical wheel force: In addition to constructing a primary transmissibility family from the pseudo-inputs to the output during the offline stage, an auxiliary transmissibility family is constructed by further decomposing the pseudo-input vector into two parts. The auxiliary family enables identifying the unknown working condition under which the system is currently running, and then an appropriate transmissibility from the primary transmissibility family for estimating the unknown output can be selected during the online estimation stage. Moreover, Finite Impulse Response (FIR) models, ridge regression, and Bayes classifiers are applied to realize the model scheduling procedure. A numerical example and a real-world application to the estimation of the vertical wheel force in a full vehicle system are, respectively, conducted to demonstrate the effectiveness of the proposed method. The results show that the estimates are in good agreement with reference measurements.
Article
The direct measurement of the external loads acting on a mechanical component represents often a challenge in many engineering applications. In this context, the attention of many researchers is focused on the field of inverse load identification. Several techniques are proposed in literature to address this problem by means of experimental methodologies, often coupled with simulation solutions. In this paper, a Kalman-based methodology is considered, which solves the problem of inverse load identification in a predictive manner. The common issue of most of the techniques is the selection of an optimal set of sensors which gives the best load estimation. In the Kalman-filtering framework, an Optimal Sensor Placement (OSP) strategy has been proposed by the authors and it aims to find the best set of sensors in terms of system observability, which is the minimum requirement of a stable estimation. However, this does not guarantee the most accurate load estimation. In this contribution, two alternative metrics are proposed, based on: i) steady-state error covariance of the estimation, ii) estimator bandwidth, with respect to the available set of measurements. Both criteria will be included in the existing OSP to improve the sensors selection. A comparison of the three strategies for multiple input/state estimation is discussed on an industrial-scale Finite Element Model, in order to show the improvement in the accuracy of the estimated quantities.
Article
Digital Thread is a data-driven architecture that links together information from all stages of the product lifecycle. Although it is finding increasing application in manufacturing, maintenance/operations, and design related tasks, a principled formulation of analyzing the decision making problem under uncertainty for the Digital Thread remains absent. The contribution of this paper is to present a formulation using Bayesian statistics and decision theory. First, we address how uncertainty propagates in the product lifecycle and how the Digital Thread evolves based on the decisions we make and the data we collect. Using these mechanics, we explore designing over multiple product generations or iterations and provide an algorithm to solve the underlying multistage decision problem. We illustrate our method on an example structural design problem where our method can quantify and optimize different types and sequences of decisions, ranging from experimentation, manufacturing, and sensor placement/selection, in order to minimize total accrued costs.
Article
Placing the right amount of sensors in key locations is critical for system monitoring. In real applications, the determination of sensor placement is a compromise between monitoring performance and the costs of installation and maintenance. Given the well-interpretability of sparse learning, this paper proposes an efficient data-driven method to obtain the optimal sensor subset from the entire candidate sensor set. In order to make our model more robust to outliers and overcome the limitation of inconsistent coefficients for multiple class optimization problem, our proposed method introduces a special norm to realize the similar sparse structures of coefficients. Considering that the redundant data cannot effectively improve the real-time condition monitoring performance of engineering systems, our proposed method also includes a redundant information elimination model, which is rarely investigated in data-driven methods for optimal sensor placement problem, and this elimination model is designed by exploring the diversity of measurement data of different sensors. What’s more, we provide an alternating iteration algorithm to solve the non-smoothness convex problem of our proposed data-driven method, and the proof of its convergence has also been presented. The optimal sensor subset can be determined by the rank of the coefficients obtained by the alternating iteration algorithm. Finally, the effectiveness and feasibility of our proposed method are verified by a large number of experiments, including validation experiments on benchmark data sets and a real engineering example on the inlet model of hypersonic aircraft engine.
Article
3D printing has been extensively used for rapid prototyping as well as low-volume production in aerospace, automotive, and medical industries. However, conventional manufacturing processes (i.e., injection molding and CNC machining) are more economical than 3D printing for high-volume mass production. In addition, current 3D printing techniques are not capable of fabricating large components due to the limited build size of commercially available 3D printers. To increase 3D printing throughput and build volume, a novel cooperative 3D printing technique has been recently introduced. Cooperative 3D printing is an additive manufacturing process where individual mobile 3D printers collaborate on printing a part simultaneously, thereby increasing printing speed and build volume. While cooperative 3D printing has the potential to fabricate larger components more efficiently, the mechanical properties of the components fabricated by cooperative 3D printing have not been systematically characterized. This paper aims to develop a data-driven predictive model that predicts the tensile strength of the components fabricated by cooperative 3D printing. Experimental results have shown that the predictive model is capable of predicting tensile strength as well as identifying the significant factors that affect tensile strength.