ArticlePDF Available

Abstract and Figures

As emissions regulations for transportation become stricter, it is increasingly important to develop accurate nitrogen oxide (NOx) emissions models for heavy-duty vehicles. However, estimation of transient NOx emissions using physics-based models is challenging due to its highly dynamic nature, which arises from the complex interactions between power demand, engine operation, and exhaust aftertreatment efficiency. As an alternative to physics-based models, a multi-dimensional data-driven approach is proposed as a framework to estimate NOx emissions across an extensive set of representative engine and exhaust aftertreatment system operating conditions. This paper employs Deep Neural Networks (DNN) to develop two models, an engine-out NOx and a tailpipe NOx model, to predict heavy-duty vehicle NOx emissions. The DNN models were developed using variables that are available from On-board Diagnostics from two datasets, an engine dynamometer and a chassis dynamometer dataset. Results from trained DNN models using the engine dynamometer dataset showed that the proposed approach can predict NOx emissions with high accuracy, where R2 scores are higher than 0.99 for both engine-out and tailpipe NOx models on cold/hot Federal Test Procedure (FTP) and Ramped Mode Cycle (RMC) data. Similarly, the engine-out and tailpipe NOx models using the chassis dynamometer dataset achieved R2 scores of 0.97 and 0.93, respectively. All models developed in this study have a mean absolute error percentage of approximately 1% relative to maximum NOx in the datasets, which is comparable to that of physical NOx emissions measurement analyzers. The input feature importance studies conducted in this work indicate that high accuracy DNN models (R2 = 0.92–0.95) could be developed by utilizing minimal significant engine and aftertreatment inputs. This study also demonstrates that DNN NOx emissions models can be very effective tools for fault detection in Selective Catalytic Reduction (SCR) systems.
Content may be subject to copyright.
Modeling and Predicting Heavy-Duty
Vehicle Engine-Out and Tailpipe
Nitrogen Oxide (NO
x
) Emissions Using
Deep Learning
Rinav Pillai
1
, Vassilis Triantopoulos
1
,
2
, Albert S. Berahas
3
, Matthew Brusstar
4
, Ruonan Sun
4
,
Tim Nevius
5
and André L. Boehman
1
*
1
Department of Mechanical Engineering, University of Michigan, Ann Arbor, MI, United States,
2
Plasma Science and Fusion
Center, Massachusetts Institute of Technology, Cambridge, MA, United States,
3
Department of Industrial and Operations
Engineering, University of Michigan, Ann Arbor, MI, United States,
4
National Vehicle and Fuel Emissions Laboratory, U.S. EPA,
Ann Arbor, MI, United States,
5
Horiba Instruments Inc., Saline, MI, United States
As emissions regulations for transportation become stricter, it is increasingly important to
develop accurate nitrogen oxide (NO
x
) emissions models for heavy-duty vehicles.
However, estimation of transient NO
x
emissions using physics-based models is
challenging due to its highly dynamic nature, which arises from the complex
interactions between power demand, engine operation, and exhaust aftertreatment
efciency. As an alternative to physics-based models, a multi-dimensional data-driven
approach is proposed as a framework to estimate NO
x
emissions across an extensive set
of representative engine and exhaust aftertreatment system operating conditions. This
paper employs Deep Neural Networks (DNN) to develop two models, an engine-out NO
x
and a tailpipe NO
x
model, to predict heavy-duty vehicle NO
x
emissions. The DNN models
were developed using variables that are available from On-board Diagnostics from two
datasets, an engine dynamometer and a chassis dynamometer dataset. Results from
trained DNN models using the engine dynamometer dataset showed that the proposed
approach can predict NO
x
emissions with high accuracy, where R
2
scores are higher than
0.99 for both engine-out and tailpipe NO
x
models on cold/hot Federal Test Procedure
(FTP) and Ramped Mode Cycle (RMC) data. Similarly, the engine-out and tailpipe NO
x
models using the chassis dynamometer dataset achieved R
2
scores of 0.97 and 0.93,
respectively. All models developed in this study have a mean absolute error percentage of
approximately 1% relative to maximum NO
x
in the datasets, which is comparable to that of
physical NO
x
emissions measurement analyzers. The input feature importance studies
conducted in this work indicate that high accuracy DNN models (R
2
= 0.920.95) could be
developed by utilizing minimal signicant engine and aftertreatment inputs. This study also
demonstrates that DNN NO
x
emissions models can be very effective tools for fault
detection in Selective Catalytic Reduction (SCR) systems.
Keywords: heavy-duty vehicles, nitrogen oxide emissions, data-driven modelling, deep learning, articial neural
networks, optimization
Edited by:
Weiqi Ji,
Robert Bosch, United States
Reviewed by:
Florian Vom Lehn,
RWTH Aachen University, Germany
Weiyu Cao,
Rivian Automotive LLC, United States
Opeoluwa Owoyele,
Louisiana State University,
United States
*Correspondence:
André L. Boehman
boehman@umich.edu
Specialty section:
This article was submitted to
Engine and Automotive Engineering,
a section of the journal
Frontiers in Mechanical Engineering
Received: 21 December 2021
Accepted: 02 February 2022
Published: 03 March 2022
Citation:
Pillai R, Triantopoulos V, Berahas AS,
Brusstar M, Sun R, Nevius T and
Boehman AL (2022) Modeling and
Predicting Heavy-Duty Vehicle Engine-
Out and Tailpipe Nitrogen Oxide (NO
x
)
Emissions Using Deep Learning.
Front. Mech. Eng 8:840310.
doi: 10.3389/fmech.2022.840310
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 8403101
ORIGINAL RESEARCH
published: 03 March 2022
doi: 10.3389/fmech.2022.840310
1 INTRODUCTION
Heavy-duty vehicles employ compression ignition engines due to
their high power density, reliability and powertrain efciency.
Even with the anticipated changes in Greenhouse Gas
regulations, diesel engine-powered trucks will continue to be
used in heavy-duty transportation for several years, especially
in the legacy eet (EPA, 2021b). Also, the heavy-duty
transportation sector is more challenging to electrify due to
the need for high energy storage, fast charging rates and high
ranges for long-haul movement of goods (Askin et al., 2015;
Brown et al., 2020). However, diesel engines emit signicant
amounts of NO
x
(nitric oxide and nitrogen dioxide) which is
designated as a criteria pollutant by the EPA (Winkler et al.,
2018), and has been shown to cause respiratory illness such as
asthma and chronic lung disease upon prolonged exposure. NO
x
is also a contributor to the formation of smog, acid rain and ozone
at ground levels (Boningari and Smirniotis, 2016). Stringent
emissions regulations have therefore been put in place to curb
vehicular NO
x
emissions (EPA, 2021b). This has put tremendous
pressure on the diesel engine industry to design and develop
technologies that limit NO
x
emissions from the engine and from
the tailpipe using exhaust aftertreatment systems. Accurate
estimation of instantaneous engine-out NO
x
emissions has
therefore become essential to improve engine control strategies
for NO
x
reduction. From a regulations perspective, accurate
models for tailpipe NO
x
predictions are important in
understanding the potential for future emissions reductions,
and as a tool for identifying possible modes of non-
compliance during in-use operation.
Formation of engine-out NO
x
is the result of complex chemical
reactions at high temperature within the combustion chamber,
and therefore strongly depends on the engine operating
condition. On the other hand, tailpipe NO
x
emissions are
highly dependent on the performance of the Selective Catalytic
Reduction (SCR) aftertreatment system. Past studies have made
use of thermophysical and chemical models to estimate NO
x
emissions. Mentink et al. (2017) developed a virtual engine-out
NO
x
sensor using a physics-based nitric oxide (NO) formation
model and an empirical correlation to determine nitrogen dioxide
(NO
2
) fraction of NO
x
. A semi-empirical two-zone model was
developed by Provataris et al. (2017) that made use of measured
in-cylinder pressure data and a physics-based model to estimate
NO formation in the combustion chamber. Camporeale et al.
(2017) used in-cylinder pressure sensor signal to create a grey-
box NO
x
raw emissions model. The model uses combustion
parameters such as adiabatic ame temperature and heat
release rate to estimate engine-out NO
x
emissions. Multiple
studies have also used computational uid dynamics models to
model the changes in temperature and composition in the
combustion chamber to better estimate NO
x
emissions
production (Mobasheri et al., 2012;Dahifale and Patil, 2017).
These models use rst principles to estimate NO
x
emissions and
therefore can achieve high extrapolation capabilities. However,
they require high computational time and cost, large number of
assumptions and the need for laborious manual conguration for
different engines.
In the past several years, increasing large quantities of data is
being collected through engine and chassis dynamometer
laboratory tests due to complex powertrain units with greater
number of actuators and ner control. This has led to growing
interest in the use of machine learning to develop predictive
models for NO
x
emissions. Using machine learning, accurate
data-driven models can be developed without requiring explicit
solution to the governing equations that describe the physics of
the system. Selvam et al. (2021) used measurements from On-
board Diagnostics (OBD) sensors to calculate combustion
variables like adiabatic ame temperature, oxygen
concentration and combustion time. These variables were then
used as inputs to an ensemble based method called Random
Forests to estimate engine-out NO
x
emissions for ve different
heavy-duty engines. The model was evaluated to have an average
R
2
value of 0.72 and a mean absolute error (MAE) of 78 parts per
million (ppm). A hybrid model consisting of a physics-based
model and a machine learning approach was proposed by
Mohammad et al. (2021). The model combined a physical and
chemical model developed in GT-Suite with a Support Vector
Machine and Feed-Forward Articial Neural Network (FFNN).
The model was validated using 772 steady state operating points
for a 13L heavy-duty diesel engine and showed good accuracy
with an R
2
score of 0.99 and root mean square error (RMSE) of
23 ppm. However, this model was not tested on transient
operating conditions. Johri and Filipi (2014) describes the
development of a Neuro-Fuzzy Model to predict transient NO
x
and soot emissions. This model divides the problem into multiple
sub-problems which are individually identied using a simpler
class of models. Polynomial and neural network models were
used as choices for the local models with validity functions that
determine the regions of input space where the local model is
active. The model was tested on Unites States Federal city-driving
schedule (FTP75) cycle data for a 6.4L heavy-duty engine. The
model predictions were in good agreement with the total
cumulative NO
x
measured over the test cycle.
Deep Learning has been shown to be adept at discovering
intricate structures in high-dimensional data and has applications
in various domains such as science, business and government
(LeCun et al., 2015;Goodfellow et al., 2016). A virtual NO
x
sensor
using Recurrent Neural Network (RNN) was proposed by Arsie
et al. (2013). Data for training was collected by running different
test cycles like the New European Driving Cycle (NEDC) on a
1.3L light-duty diesel engine. Engine Control Unit (ECU)
variables including engine speed, air mass ow, boost pressure,
fuel mass, start of injection (SOI) and air fuel ratio (AFR) were
used as inputs to the network. Pruning techniques were used to
improve generalization capability of the model. The authors
reported R
2
values between 0.830.91 for different test sets
with RMSE values ranging from 47122 ppm. Fischer (2013)
developed a virtual NO
x
sensor for a 2.2L light-duty diesel engine
using Self-Organizing Map algorithma type of ANN which
makes use of a selector and estimator layer. Six input parameters
including engine speed, fuel quantity, lambda, air mass ow,
boost pressure and exhaust gas temperature were used to estimate
engine-out NO
x
. The model showed good accuracy on the
Artemis Urban Test Cycle with an error of 1.57% between the
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 8403102
Pillai et al. Modeling NOx Using Deep Learning
total measured and predicted NO
x
over the cycle. Zhang et al.
(2015) proposed using an ANN model to predict engine-out NO
x
emissions using ECU variables like engine speed, torque, injection
timing, air ow rate, rail pressure and oil temperature as inputs.
The train and test data consisted of a Chirpcycle, Hot and Cold
Start drive cycles collected from a 2L light-duty diesel engine.
Total NO
x
estimated from the model deviated on an average
about 5.8% from the measured total NO
x
over the test cycle.
Bellone et al. (2020) compared a Convolutional Neural Network
(CNN) and Long-Short Term Memory (LSTM) network models
to predict engine-out NO
x
emission, soot and fuel consumption
for a heavy-duty 8L diesel engine. Input parameters included
ECU variables such as engine speed, fuel ow/cyl., injection angle,
rail pressure, wastegate position, exhaust gas recirculation (EGR)
position, exhaust temperature, main, post and pre-injection
quantity, inlet pressure, pre-injection angle and throttle
position. The CNN model captured 98.64% of the total test
cycle NO
x
emissions with an R
2
of 0.993, while the LSTM
model captured 99% of the total test cycle NO
x
emissions with
and R
2
of 0.995.
Shin et al. (2020) developed an engine-out NO
x
emissions
model for 1.6L light-duty diesel engine using Deep Neural
Networks (DNN) by training the model on the Worldwide
Harmonized Light Vehicles Test Procedure (WLTP) cycle.
They used Bayesian hyperparameter optimization to nd the
optimal DNN architecture. The accuracy of the model was
indicated by an R
2
value of 0.9675 and MAE of 17 ppm using
14 input variables from the ECU. Yu et al. (2021) presented a
method for estimating tailpipe NO
x
emissions by complete
ensemble empirical model decomposition with adaptive noise
and an LSTM network. They used on-road data from the OBD
sensors of a diesel bus to train the network. They reported good
model accuracy with an R
2
value of 0.98 with RMSE of 46.11 ppm
on the test data. A steady state engine-out NO
x
emissions model
was proposed by Lee et al. (2021) using a DNN model. The model
used 8 ECU parameters including engine speed, brake mean
effective pressure, EGR rate, air mass, fuel mass, injection timing,
boost pressure and injection pressure as inputs. 696 steady state
conditions were evaluated using the model with good accuracy
indicated by an R
2
of 0.98.
However, the previously conducted studies have developed
DNN models for engine-out and tailpipe NO
x
emissions without
taking into account the effect of SCR performance which is an
essential component affecting overall NO
x
production. With the
light-duty industry expected to be increasingly electried in the
near future, more emphasis also needs to be placed on developing
accurate models for estimation of heavy-duty diesel engine NO
x
emissions - both engine-out and tailpipe. Additionally, many of
the models also make use of ECU variables, such as fuel injection
timing, swirl ratio, and injection angle (Bellone et al., 2020;Shin
et al., 2020;Lee et al., 2021), which may not be readily available
except with proprietary access. Deep Learning models can be
highly accurate, but are inherently considered to be black-box
models, and therefore it is difcult to interpret their predictions.
However, it is crucial to understand for example why NO
x
emissions are higher than expected under given engine
operating conditions. Multiple papers have used percent of
NO
x
captured or total test cycle NO
x
error as an evaluation
metric for NO
x
prediction using DNN (Fischer, 2013;Johri
and Filipi, 2014;Bellone et al., 2020). The limitations of using
this metric in evaluating DNN model accuracy for predicting NO
x
emissions has been discussed in the current work and new
improved instantaneous error metrics for this application have
been proposed.
This work tries to address the aforementioned shortcomings of
the existing work in the literature by developing accurate engine-
out NO
x
and tailpipe NO
x
models using DNN. The DNN models
were trained and tested using two different datasets - an engine-
aftertreatment dynamometer dataset and a chassis dynamometer
dataset on two 6.7L heavy-duty bus engines (different model
years) using non-proprietary variables that are available from the
OBD as model inputs. The outline and contributions of this work
are as follows:
Application of DNN to develop engine-out and tailpipe NO
x
emissions models for heavy-duty diesel engines using
physics inspired inputs readily available from the OBD,
while demonstrating high accuracy on both train and test
datasets.
Development of a DNN model for tailpipe NO
x
emissions
using SCR aftertreatment information such as SCR inlet and
outlet temperatures and exhaust mass ow rate that
captures the effect of SCR performance on tailpipe NO
x
emissions.
Analysis and development of holistic error metrics that help
visualize instantaneous as well as total NO
x
emissions
prediction errors of DNN NO
x
models over the DNN
training process.
Interpretability study of models to enhance the physical
understanding of NO
x
emissions estimation using DNN.
Evaluation of model accuracy using minimal number of
relatively importantinput parameters physically affecting
production of NO
x
emissions, thereby illustrating DNN
model interpretation of complex transient NO
x
emissions.
Detailed analysis of a potential application of developed
DNN models to fault detection in SCR aftertreatment
systems.
Organization
The paper is organized as follows. In Section 2 we discuss engine-
out and tailpipe NO
x
formation, our choice of input features, and
the data. Our research methodology is described in Section 3.In
Section 4 we present our results followed by an in depth
discussion of input feature importance and potential
application of developed DNN models in Section 5.We
conclude with nal remarks in Section 6.
2 INPUT FEATURES AND DATA
DESCRIPTION
In this section, the thermophysical and chemical phenomena
affecting production of engine-out and tailpipe NO
x
emissions are
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 8403103
Pillai et al. Modeling NOx Using Deep Learning
explained, which lays the groundwork for the selection of the
input parameters for each models. A detailed description of the
datasets used to train and test the DNN models is also provided.
2.1 Input Feature Selection
In the current study, a DNN model using physics inspired
features as inputs has been developed. Therefore, signicant
engine and vehicle parameters from literature that affect
engine-out and tailpipe NO
x
formation were measured in the
tests conducted to develop the datasets used to train the DNN
models, while also taking into consideration their ease of
availability from vehicle OBD. A brief description of engine-
out NO
x
and tailpipe NO
x
formation has been provided in the
following sections to explain the different inputs selected for
each model.
2.1.1 Engine-Out NOx Formation
NO
x
is composed of nitric oxide (NO) and nitrogen dioxide
(NO
2
). Diesel engine NO formation is described by Three
mechanisms - Thermal NO, Prompt NO and Fuel NO
(Heywood, 2019). Prompt NO is formed in fuel-rich
conditions and is not highly temperature dependent. Fuel NO
formation is dependent on the presence of nitrogen-based
compounds in the fuel. However, the primary mechanism for
diesel NO formation within the combustion chamber is dened
by the Extended Zeldovich Mechanism (Lavoie et al., 1970)
referred to as Thermal NO. Thermal NO formation is due to
oxidation of nitrogen in the air. The principal reactions governing
the formation of Thermal NO are given by
N2+ONO +N(1)
N+O2NO +O(2)
N+OH NO +H.(3)
These reactions ((1),(2),(3)) are highly dependent on the
combustion temperature (>2000 K), in-cylinder oxygen (O
2
)
concentrations, and residence time of the reacting mixture at
peak temperatures and lean air-fuel mixtures (Bowman, 1975).
NO
2
on the other hand is formed due to partial oxidation of NO
further downstream of the cylinder which can be explained by the
following reaction (4) (Merryman and Levy, 1975):
NO +HO2NO2+OH.(4)
Engine-out NO
x
formation is primarily controlled by the
temperature of the burned gas and O
2
concentration in the
combustion chamber. These parameters vary based on
different engine operating and control variables such as intake
air mass ow rate, fuel ow rate, intake manifold temperature and
pressure, engine speed and load. Therefore, these variables were
selected as inputs for modeling engine-out NO
x
. Exhaust gas
recirculation (EGR) is an important engine-out NO
x
control
strategy typically employed on diesel engines. The
introduction of EGR, which is composed primarily of nitrogen
(N
2
), carbon dioxide (CO
2
), and water (H
2
O), displaces air in the
cylinder, and results in lower NO
x
formation. The primary
mechanisms for the decrease in NO
x
formation due to EGR
are the reduction in the mixtures oxygen concentration, and
decrease in the combustion temperatures due to presence of
higher specic heat capacity triatomic molecules. As a result,
EGR mass ow rate was also included as an input to the DNN
when available.
2.1.2 Tailpipe NOx Emissions
Tailpipe NO
x
emissions are highly dependent on the performance
of the SCR. SCR uses ammonia (NH
3
) in the form of aqueous urea
as a NO
x
reduction reagent to convert NO
x
to N
2
and H
2
O.
However, the dominant catalytic reactions require a certain
optimal temperature range (520725 K) (Khair and Majewski,
2006). There are three important SCR reactions given by (Koebel
et al., 2002):
4NO +4NH3+O24N2+6H2O(5)
4NH3+2NO +2NO24N2+6H2O(6)
4NH3+3NO23.5N2+6H2O.(7)
The standard SCR reaction (5) occurs in a NO dominant
environment and requires high temperature. At lower SCR
temperatures, a faster reaction rate conversion (6) takes place
using equimolar amounts of NO and NO
2
. This helps to improve
SCR performance at lower SCR temperatures. However, an excess
of NO
2
results in a slower reaction rate reaction (7). Therefore,
SCR conversion efciency is highly dependent on the ratio of NO
2
to NO entering the SCR.
The SCR performance is greatly inuenced by the residence
time available for reactants in the optimal temperature range,
which is a function of the space velocity. The space velocity of the
reactor is a function of the measured exhaust gas ow rate
through the SCR. The rate of NO
x
removal is also dependent
on the inlet NO
x
concentration, as NO
x
levels determine the rate
of reactions (Heywood, 2019). Therefore, exhaust aftertreatment
variables such as SCR inlet and outlet temperatures, exhaust mass
ow rate and measured engine-out NO
x
were considered as
inputs to the tailpipe NO
x
DNN model. Coolant temperature
was also included as an input to the model to provide information
regarding cold start or hot start conditions for the test cycles.
Other variables affecting SCR performance such as urea injection
quantity and timing, and ammonia (NH
3
) slip were not
considered, as they are not readily available from the OBD
data without proprietary access.
2.2 Data Sources and Dynamometer Test
Cycles
Data was collected from two different sources: engine
dynamometer testing and chassis dynamometer testing. A 6.7L
heavy-duty bus engine (different model years) was tested on an
engine and chassis dynamometer using multiple dynamometer
test cycles representative of urban, highway, idle, transient and
cold start conditions, thereby comprehensively encompassing
known sources of transient NO
x
emissions.
2.2.1 Engine Dynamometer Data and Test Cycles
Engine dynamometer testing was conducted at the United States
EPA, National Vehicle and Fuel Emissions Laboratory (NVFEL),
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 8403104
Pillai et al. Modeling NOx Using Deep Learning
Ann Arbor, on a 6.7L heavy-duty engine (2010) using
certication diesel fuel. Test data included engine parameters
as well as after-treatment parameters as described in Section 2.1.
Both engine-out and tailpipe-out NO
x
emissions were measured
using exhaust gas analyzers at 10 Hz frequency. All parameters
measured using test cell instruments were also recorded at 10 Hz
frequency. The dynamometer test cycles included three separate
runs of a cold Federal Test Procedure (FTP) cycle, a hot FTP cycle
and a Ramped Mode Cycle (RMC) as shown in Figures 1A,B.
These tests encompass a variety of engine operating conditions
including cold-start, hot-start and transient. This dataset will be
referred to as Dataset 1 in this paper. Dataset 1 had a total of
127,223 samples for training the DNN after pre-processing.
In literature, data for testing DNN NO
x
models was found to
be split from the dataset in two different ways: keeping a separate
run of a complete test cycle as test set (Arsie et al., 2013;Fischer,
2013;Zhang et al., 2015;Bellone et al., 2020) or randomly
selecting points from the dataset as test set (Shin et al., 2020;
Lee et al., 2021;Yu et al., 2021). In this study, both methods were
adopted to analyze and compare the effect they had on the model
performance. Subsequently. the dataset was rst randomly
shufed and then split into train, validation and test sets. 75%
of the data was used as train set, of which 25% was used as
validation set. The remaining 25% of the total data was used to
test the model after training and validation. Splitting of the
dataset was also done by using two runs of each test cycle as
train set and one run of each test cycle (unseen by the DNN
models) as test set.
2.2.2 Chassis Dynamometer Data and Test Cycles
Chassis dynamometer testing was conducted on a hybrid bus that
operates on a parallel hybrid architecture using a 6.7L heavy-duty
engine (2011) and a 650V nickel-metal hydride battery using
certication diesel fuel. The tests were conducted at the Heavy-
Duty Chassis Dynamometer Test Facility at the United States
EPA NVFEL, Ann Arbor. The facility is capable of simulating on-
road conditions for transient and loaded conditions with the help
of a road speed modulated vehicle cooling fan with high
precision. Continuous tailpipe exhaust measurements were
made using a heated dilution tunnel and a Horiba MEXA-One
gaseous emissions bench, while engine-out NO
x
measurements
were sampled directly from the exhaust prior to the SCR, both at
FIGURE 1 | Engine and Chassis dynamometer test cycles used to develop datasets for Deep Learning NO
x
Models. (A) Engine Dyno FTP Cycle. (B) Engine Dyno
RMC Cycle. (C) Chassis Dyno Cold Start Super Cycle. (D) Chassis Dyno Hot Start Cycles. (E) Chassis Dyno On Road Cycle. (F) Chassis Dyno Ramp Cycle.
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 8403105
Pillai et al. Modeling NOx Using Deep Learning
10 Hz frequency. A Cold Start Super Cycle (CSSC), a Hot Start
Cycle which was a combination of New York Bus Cycle (NYBC),
Orange County Bus Cycle (OCBC), NREL Transient Cycle, an
On-Road bus cycle and a Ramp Cycle (Figures 1CF) were
successfully run on the chassis dynamometer to generate a
comprehensive dataset encompassing different vehicle
operating conditions. OBD data was collected from the bus
using Controller Area Network (CAN) at 10 Hz frequency.
This included engine and after-treatment parameters similar to
the engine dynamometer tests as described in Section 2.1. Battery
parameters including state of charge, charging and discharging
current were also included as inputs to the engine-out NO
x
model
to incorporate the inuence of hybridization on the NO
x
emissions of the bus (Bagheri et al., 2021). This dataset will be
referred to as Dataset 2 in this paper.
Dataset 2 had a total of 442,623 samples after pre-processing.
As described for Dataset 1, Dataset 2 was also split randomly into
equivalent train, validation and test set ratios. The data was also
split using 16 runs of complete test cycles as train set and 3
separate runs of test cycles (unseen by the DNN models) as test
set. Further details on the train, validation and test splits for both
the datasets has been provided in Table 1.
3 RESEARCH METHODOLOGY
In this section, the research methodology followed in this paper is
described. Specically, we discuss the model and its associated
hyperparameters and the data pre-processing strategy employed.
3.1 Feed Forward Deep Neural Network
An Articial Neural Network (ANN) makes use of representative
data to establish empirical relationships between input features and
some target output (LeCun et al., 2015;Goodfellow et al., 2016).
ANNs have been shown to be adept at establishing complex
relationships without the need of strict assumptions or
mathematical equations, and as a result have had tremendous
success in applications such as image classication (Ciregan et al.,
2012), speech recognition (Amodei et al., 2016), and games (Silver
et al., 2016); see (Bishop, 1994;Abiodun et al., 2018)formore
examples. In its most primitive form, an ANN is a composition of
connected neurons arranged in an input layer, possibly a set of
hidden layers, and an output layer. Neurons in adjacent layers are
connected through edges, or weights. Information ows from the
input layer to the output layer through activation functions at each
neuron that attempt to capture nonlinearity in the input-output
relationship. This process is called Feed-Forward Propagation.
Training an ANN amounts to adjusting the weights, commonly via
the process of Back Propagation(Rumelhart et al., 1986), in order
to minimize the objective or lossfunction which measures the
deviation between the true target output and the predicted output of
the network. One Feed-Forward and one Back-Propagation
constitute one training epochof the ANN.
Deep Learning Neural Networks (DNN) are a multi-layer
manifestation of ANNs (i.e., more than one hidden layer). The
predictive power of DNNs is positive correlated with the volume of
data and the size of the network, i.e., both the height (number of
neurons per layer) and the width (number of layers) (Sun et al.,
2017). An important aspect of DNNs is their capability to learn good
representations of complex phenomena using feature learning
(Bengio, 2012). This enables DNNs to learn nonlinear mappings
of input features to outputs by generating high levelfeatures using
low level(input) training data. These intrinsic characteristics of
DNNs make them suitable for modeling and predicting NO
x
emissions using simple accessible data.
In this study, DNNs are used in a supervised learning regression
task, i.e., the training data has a labelled output and the output (NO
x
emissions) is continuous. The proposed DNNs are trained over
multiple epochs to develop NO
x
emissions models with high
predictive power using the different training datasets (see Section
2.2). The DNNs have multiple hyperparameters(e.g., number of
hidden layers and nodes, batch size, learning rate) that determine the
structure of the neural network and guide the learning process. These
parameters are tuned, using exhaustive grid searches (Feurer and
Hutter, 2019), to give the best possible performance for a given
model, dataset, and computational budget. Prior to training the
DNNs, the available datasets are pre-processed in order to help with
the training (learning), and as a result increase the predictive power
of the DNNs for the given computational budget (Yu et al., 2021); see
Section 3.2 for more details. Figure 2 shows the schematic for the
DNN engine-out (2a) and tailpipe NO
x
(2b) models to describe the
DNN architecture.
3.2 Data Pre-Processing
Transient data was collected from the engine and chassis
dynamometer testing, therefore, it is necessary to eliminate
time delay between the input parameters and the measured
NO
x
emissions before training the DNN models to improve
model performance (Arsie et al., 2013;Fischer, 2013;Johri and
Filipi, 2014;Zhang et al., 2015). For the engine dynamometer
data, the dataset was time-aligned according to 40 Code of
Federal Regulations (CFR) Part §1,065 to account for delays in
exhaust gas transport and instrument responses (EPA, 2021a). An
empirical time constant was derived by cross-correlating the
input parameters with the measured NO
x
emissions for the
chassis dynamometer data for time-alignment of the complete
dataset. The effectiveness of the time alignment method was
demonstrated with a Pearsons correlation coefcient of 0.99.
Further, any data points that had negative values due to
instrument or calibration errors were removed. This ensures that
noisy or unreasonable data does not contaminate the DNN models.
The application of DNN to instantaneous NO
x
prediction is difcult
as it involves the prediction of a continuous variable at every point.
TABLE 1 | Description of datasets (engine and chassis dynamometer. Types of
data splits (random and test cycles) and train/validation/test splits.
Dataset Engine dynamometer Chassis dynamometer
Type of Data split Random Test cycles Random Test cycles
Total Samples 127,223 127,223 442,623 442,623
Train 76,334 64,883 265,574 301,569
Validation 19,083 16,221 66,393 75,392
Test 31,806 46,119 110,656 65,662
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 8403106
Pillai et al. Modeling NOx Using Deep Learning
To mitigate this issue several studies in the literature have used box
plots or median methods to determine and remove outliersin the
data to improve network performance (Donateo and Filomena,
2020;Yu et al., 2021). In this study, it was found that eliminating
outliers also removed a large number of peakNO
x
conditions
which are signicant in predicting different transient and rare
events that occur during vehicle operations. Thus, in order to
promote robustness in the DNN models, in this study, data
associated with outliers (e.g., peakNO
x
conditions) was
included in the training.
Each dataset was divided into train, validation and test sets as
described in Section 2.2. The train set is used to train the models.
The validation set provides an unbiased evaluation of the modelst
on the training samples while tuning the different hyper-parameters
of DNN models. The test set is used to evaluate the nal model to
determine model accuracy and generalization capability.
Feature Scaling is an essential step in data pre-processing for
DNN models. The basis for feature scaling is to transform the data
such that all the inputs have similar distributions, i.e., a common
scale, and equal importance is given to each variable ensuring that no
variable inuences the model solely due to magnitude (Kotsiantis
et al., 2006). Scaling the features helps with the stability, efciency
and robustness of gradient-based optimization algorithms (Wan,
2019). In this study, normalization was used which shifts and
rescales the input values to a range between 0 and 1 (also known
as minmax scaling) given by:
Xnorm XXmin
Xmax Xmin
In all the datasets, min max scaling was t on the train set and
then used to normalize both the validation and test set. This was
done to ensure unbiased testing predictions.
3.3 Architecture of DNN Models
An Intel
®
Core
TM
i7-10750H CPU @ 2.60 GHZ (12 cores) with
16 GB RAM and an NVIDIA GeForce RTX 2060 GPU were used
for computation. Python 3.60 programming language was used to
develop the model with the help of Kerasdeep learning library
using TensorFlow (Abadi et al., 2016) as backend. The Python
library scikit-learn(Pedregosa et al., 2011) was used for pre-
processing data, dataset splitting, hyperparameter grid search and
model evaluation using different metrics.
3.3.1 Hyperparameter Selection and Optimization
In this study, hyperparameter optimization, a very important
ingredient of DNN training (Feurer and Hutter, 2019), is
implemented using a grid search where the search space is
dened by a grid of hyperparameter values. Every point in the
grid which represents a model conguration is then evaluated for
performance using appropriate evaluation metrics. Grid search was
performed using the GridSearchCV function in the scikit-learn
library, which allows to perform cross-validation (Refaeilzadeh et al.,
2009) in order to understand the generalization capability of each
model conguration being tested. The target hyperparameters for
optimization included the number of hidden layers, number of
nodes in each hidden layer, learning rate and batch size. The ranges
of hyperparameters considered for the two datasets are given in
Table 2.Table 3 summarizes the nal architecture and
hyperparameters for each model.
For the hyperparameter optimization, initially a small DNN
network composed of two hidden layers with 20 and 10 nodes,
respectively, was set up to reduce computational burden. Some
hyperparameters were selected based on optimal values in literature
that used DNNs for similar supervised learning regression tasks. The
Adam optimizer (Kingma and Ba, 2014), a stochastic rst-order
diagonally scaled method, was used as the optimization algorithm.
Associated with the optimization algorithm, the learning rate (a
parameter that controls the change in the DNN model weights), as
well as the batch size (the amount of data used in the Forward- and
Backward-Propagations) were also tuned. With respect to the DNN
model, the Rectied Linear Unit (ReLU) activation function (Nair
and Hinton, 2010) was considered appropriate for both the hidden
layers and the output layer. ReLU is a piece-wise linear function
which outputs the input itself if it is positive, but outputs zero if it is
FIGURE 2 | Deep neural network model architecture, inputs/outputs, and activation functions. (A) Engine Out NOx Model. (B) Tailpipe NOx Model.
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 8403107
Pillai et al. Modeling NOx Using Deep Learning
negative. In this application all input values are non-negative, thus,
ReLU is an appropriate candidate for the hidden layers.
Moreover, the number of hidden layers and nodes in each
hidden layer were optimized using a separate grid search using
the optimized learning rate and batch size from the rst grid
search. The different DNN hidden layer congurations for the
grid search were developed using a custom function that utilizes
three parameters as inputs to create different DNN hidden layer
congurations; number of hidden layers, number of neurons in
the rst hidden layer and number of neurons in the last hidden
layer. Based on the number of hidden layers selected, the function
individually selects the number of neurons for rst and last
hidden layers from the ranges provided in Table 2 and
linearly decreases the number of neurons in each layer based
on the number of hidden layers. For example, if we consider 5
hidden layers, 200 neurons in the rst hidden layer and 20
neurons in the last hidden layer, a DNN hidden layer
conguration given by (200,155,110,65,20) is created by the
function. Each network in both the grid searches was run for
200 epochs with 5 fold cross-validation (Stone, 1978)to
determine the optimal batch size, learning rate and hidden
layer conguration based on the optimal mean MSE on the
cross-validation tests. Further tests were then carried out on
the optimal network architecture determined to evaluate the
effect of increasing the width of the network (i.e., number of
nodes in each hidden layer) and number of epochs on the
performance of the model. The number of epochs for training
the network was optimized based on the training and validation
loss curves to ensure that there was no overtting of the model on
the train set. Number of epochs was used as a termination criteria
for the training of all the models. The optimal hyperparameters
based on the results from the grid searches have been summarized
in Table 3.
Learning Rate Decay
Learning rate decay is a mechanism by which the learning rate
(employed by the optimization algorithm) is set and adjusted as
the optimization progresses in help with learning. Specically,
during the early stages of training large learning rates are
employed to allow for large steps, and in order to avoid
spurious local minima. In the latter stages of training, a
smaller, more rened learning rate is employed in order to
obviate the effects of noise, and in order to converge (to a
local minimum). Learning rate decay has been shown
empirically to improve model optimization and generalization
(Smith, 2018;Ge et al., 2019;You et al., 2019). In this study,
learning rate decay was used for all the models.
Dropout
Overparametrized DNNs, i.e., DNNs with large number of
parameter (weights), are prone to overtting to the training
dataset (Sankararaman et al., 2020). Dropout is a
TABLE 2 | Ranges of hyperparameters explored for different models (Engine Dynamometer and Chassis Dynamometer).
Dataset Engine dynamometer Chassis dynamometer
Model Engine-out NO
x
Tailpipe NO
x
Engine-out NO
x
Tailpipe NO
x
Learning Rate [0.01,0.001,0.0001] [0.01,0.001,0.0001] [0.01,0.001,0.0001] [0.01,0.001,0.0001]
Batch Size [100, 500, 1,000, 95,417] [100, 500, 1 000, 95,417] [1,000, 5,000, 10,000, 3,31,967] [1,000,5,000, 10,000, 3,31,967]
Input Layer Nodes 8 5 9 5
Hidden Layers [2,3,4,5,6] [2,3,4,5,6] [2,3,4,5,6] [2,3,4,5,6]
First Hidden Layer Nodes [200,100,50,20] [200,100,50,20] [200,100,50,20] [200,100,50,20]
Last Hidden Layer Nodes [20,15,10,5] [20,15,10,5] [20,15,10,5] [20,15,10,5]
Hidden Layer Activation Function ReLU ReLU ReLU ReLU
Output Layer Nodes 1 1 1 1
Output Layer Activation Function ReLU ReLU ReLU ReLU
Epochs 200 200 200 200
TABLE 3 | Final optimal hyperparameters for engine-out and tailpipe NO
x
models for dataset 1 and 2.
Dataset Engine dynamometer Chassis dynamometer
Model Engine-out NO
x
Tailpipe NO
x
Engine-out NO
x
Tailpipe NO
x
Input Layer Nodes 8 5 9 5
Hidden Layer Nodes [200, 100, 50, 5] [1,000, 500, 250, 100, 5] [1,000, 500, 250, 100, 5] [2,000, 1,000, 500, 250, 100, 5]
Hidden Layer Activation Function ReLU ReLU ReLU ReLU
Output Nodes 1 1 1 1
Output Layer Activation Function ReLU LeakyReLU ReLU LeakyReLU
Learning Rate 0.001 0.001 0.001 0.001
Learning Rate Decay 1/5 every 200 Epochs 1/5 every 200 Epochs 1/10 every 400 Epochs 1/10 every 400 Epochs
Drop Out 0 0.1 0.1 0.1
Batch Size 500 500 1,000 1,000
Epochs 600 600 1,000 1,000
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 8403108
Pillai et al. Modeling NOx Using Deep Learning
regularization technique often employed in DNN training to
mitigate this issue (Srivastava et al., 2014). The technique
involves randomly dropping outnodes along with their
connections from the network during training. Temporarily
deactivatingnodes during training reduces the over-
adaptation of the network weights to the training data and
leads to improvement in network out-of-sample performance
and generalization (Baldi and Sadowski, 2013). The fraction of
nodes that are deactivated at every iteration of training (dropout
rate) is often treated as a hyperparameter. In this study a small
dropout rate of 0.1 was applied to the hidden layers of all models
during training except for the engine-out NO
x
model for
Dataset 1.
3.3.2 Loss Function and Evaluation Metrics
In this study, the following loss function and evaluation metrics
were used.
Mean Squared Error (MSE)
Mean Squared Error (MSE) is the default loss function used in
many DNNs for regression problems. It is calculated via
MSE 1
n
n
i1
yi^
yi

2,
and is the average of the squared difference between the true
output value (y
i
) and the models predicted value (^
yi), also
referred to as the prediction error.
R-Squared (R
2
)
Coefcient of determination or R-Squared (R
2
), dened as
R21n
i1yi^
yi

2
n
i1yi
y

2,
where
yis the mean of the true output value, is a statistical
measure used to determine the goodness of t, i.e., how well the
predicted NO
x
values t with the true NO
x
values. Using R
2
as an
evaluation metric indicates the model performance across
different points in the NO
x
distribution. The higher the R
2
value, the better the model prediction across the NO
x
distribution.
Total NO
x
Error
Another evaluation metric that has been used in the literature to
measure the predictive accuracy of NO
x
emissions models is Total
NO
x
error (%) (Fischer, 2013;Johri and Filipi, 2014;Bellone et al.,
2020). Total NO
x
error, dened as
Total NOxError %
()
Total NOxpredicted Total NOxactual
Total NOxactual
*100,
is the difference between total true and predicted NO
x
emissions
in the dataset. Total cumulative NO
x
is calculated for train,
validation and test datasets for both true and predicted NO
x
values. The percent difference between total true and predicted
NO
x
over each dataset shows if the model has captured various
transient NO
x
conditions (both high and low) across the dataset
effectively and the percentage of total NO
x
emissions captured by
the model over the dataset.
Instantaneous NO
x
error
Since the DNN models developed in this study are used to
predict instantaneous NO
x
emissions, we propose a novel
instantaneous NO
x
error metric that captures the error at
every point in the training (validation, testing) set. The
absolute prediction error and percent absolute prediction
error(%)foreverytrainingpointiscalculatedatevery
epoch of training and is given by
Absolute Prediction Errori|yi^
yi|
Absolute Prediction Errori%
()
|yi^
yi|
yi
*100.
The maximum, minimum and mean absolute prediction error
and percent error is then calculated and reported for each epoch
over the training set. As the DNN learns to predict NO
x
emissions
at every instant (data point) in the training set, these error metrics
should reduce indicating improvement in the models
instantaneous NO
x
prediction capability.
3.3.3 Output Layer Activation Function for Tailpipe
NOx Model
Both engine-out and tailpipe NO
x
models have 1 node in the
output layer as the models are predicting one continuous non-
negative output (engine-out or tailpipe NO
x
emissions). ReLU
was considered as an appropriate candidate for the output
layer for both models. However, an interesting phenomenon
was observed while examining model predictions for tailpipe
NO
x
. Namely, the model predicted tailpipe NO
x
as 0for
many true labels where the NO
x
emissions were smaller than
0.0001 g/s (70% of the training set). This resulted in
underprediction of total cycle NO
x
emissions since many
non-zero values were predicted as zero. Extracting the input
values to the output layer ReLU activation function showed
negative values for smaller true NO
x
labels ( <0.000 1 g/s),
which by the nature of ReLU are output as 0,thereby
increasing total NO
x
prediction error. Thus, other activation
functions like LeakyReLU (Maas et al., 2013) and Exponential
Linear Unit (ELU) (Clevert et al., 2015) were tested for the
output layer. These activation functions are similar to the
ReLU activation function with minor differences (e.g.,
LeakyReLU has a small slope in the negative region;
nonzero gradient when the node is not active). It was found
that using LeakyReLU improved the tailpipe NO
x
model
predictions at lower NO
x
values at the expense of predicting
some negative NO
x
values ( <1%of the dataset). The small
negative slope helps to smoothen the hard threshold that ReLU
has to output 0value and therefore helps the models predict
better at lower NO
x
values. LeakyReLU only outputs negative
values for very small NO
x
true labels ( <105g/s). These
negative values are negligible when compared to the total
training set NO
x
predicted (<0.01%), thereby improving
overall tailpipe NO
x
DNN model performance.
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 8403109
Pillai et al. Modeling NOx Using Deep Learning
4 RESULTS
In this section, the main results for the engine-out NO
x
and
tailpipe NO
x
models are presented. Table 3 summarizes the nal
architecture and hyperparameters for each model. The results are
divided into the following sub-sections: model evaluation metrics
(Section 4.1), error metrics (Section 4.2), and instantaneous
actual vs predicted NO
x
emissions (Section 4.3), on the train,
validation and test sets for each model and dataset.
4.1 Model Evaluation Metrics Results
The evaluations metrics used to measure the prediction accuracy
and overall performance of the models on the train, validation
and test sets are reported in Table 4. Cross-validation (5 fold), a
popular tool used in machine learning to evaluate a model
prediction and generalization capability (Refaeilzadeh et al.,
2009), was employed in this study. The results presented in
Table 4 are the average MSE, MAE, R
2
and MAE (%) over
the maximum NO
x
values over the 5 fold cross-validation for each
model along with their respective 95% condence intervals.
Overall, the condence interval values reported in Table 4
indicate that the models were robust to changes in training
inputs.
The selection of evaluation metrics was directed at analyzing
different aspects of the modelsNO
x
prediction capability. First,
MSE and MAE were chosen to evaluate if the models were
generalizing well to unseen data, i.e., validation and test sets.
For all the models, the training and validation MSE and MAE
values are very close indicating good model generalization. The
test and validation set MSE and MAE values are comparable
indicating model robustness to predicting NO
x
for conditions
unseen by the model when training.
One of the holistic metrics used in the literature to capture
NO
x
, emissions model accuracy is R
2
(Arsie et al., 2013;Bellone
et al., 2020;Shin et al., 2020;Yu et al., 2021). As can be seen from
Table 4,R
2
values on train, validation and test sets are high for
both engine-out and tailpipe models for Dataset 1 and engine-out
model for Dataset 2 even with the inclusion of peak NO
x
,
conditions (Yu et al., 2021). Contrary to (Arsie et al., 2013;
Zhang et al., 2015;Bellone et al., 2020;Shin et al., 2020)that
used ECU variables which are comprehensive but not easily
accessible, the models developed in this study achieved
comparable high model accuracy while utilizing only simple
OBD, parameters as inputs to the models. The aftertreatment
system on the bus used to collect data for Dataset 2 has been
subject to a product recall due to a manufacturing defect (EPA,
2018). Therefore, the data is not completely representative of a
working SCR, system and subsequently the production of
tailpipe NO
x
, emissions. This could explain the relatively
lower R
2
values for the tailpipe NO
x
,modelforthisdataset.
However, the model still captured a signicant portion of the
transient tailpipe NO
x
, emissions as indicated by Test R
2
of
0.9275 shown in Table 4.
Linearity of the models was evaluated using mean absolute
error (percent) with respect to the maximum NO
x
in the dataset.
As can be seen from Table 4, the percent MAE with respect
maximum NO
x
for all the models are well within 12%, which is
comparable to the NO
x
measurement accuracy of NO
x
emissions
analyzers which have linearity of 1% of full-scale NO
x
(Gluck
et al., 2003).
Figure 3 presents the training and validation MSE (loss) and
R
2
curves for each model. These curves help to visualize the
progress of the MSE function and R
2
as the network is trained
over a set number of epochs. The training loss over the epochs
shows how well the model is learning using the given dataset,
while the validation loss shows how well the model is
generalizing to a smaller validation set that is not being
used to train the DNN. For all the models, it can be seen
that MSE decreases as the network learns, while R
2
increases
which indicates improvement in model prediction. The
sudden drops in the loss curves indicate where the learning
rate decay was implemented for each model.
The results of Regression Analysis (R
2
t) for train and test
datasets of the different models are shown in Figure 4. The
models show high degrees of agreement with the measured
engine-out and tailpipe NO
x
emissions demonstrated by high
TABLE 4 | Evaluation metrics for train, validation and test set with 95% condence intervals (all models).
Dataset Engine dynamometer Chassis dynamometer
Model Engine-out NO
x
Tailpipe NO
x
Engine-out NO
x
Tailpipe NO
x
Train MSE (g/s) 5.02E-06 ± 3.03E-07 7.79E-07 ± 2.14E-07 1.91E-05 ± 3.23E-07 4.21E-05 ± 2.07E-06
Val MSE (g/s) 7.35E-06 ± 2.47E-07 1.27E-06 ± 3.29E-07 4.30E-05 ± 8.60E-07 7.20E-05 ± 2.70E-06
Test MSE (g/s) 7.41E-06 ± 1.76E-07 1.43E-06 ± 3.37E-06 4.19E-05 ± 4.86E-07 7.07E-05 ± 9.22E-07
Train MAE (g/s) 1.22E-03 ± 2.41E-05 4.30E-04 ± 1.36E-04 2.48E-03 ± 2.61E-05 3.18E-03 ± 3.56E-05
Val MAE (g/s) 1.25E-03 ± 1.43E-05 4.44E-04 ± 1.32E-04 3.25E-03 ± 2.04E-05 3.94E-03 ± 7.02E-05
Test MAE (g/s) 1.34E-03 ± 1.77E-05 4.51E-04 ± 1.35E-04 3.27E-03 ± 2.20E-05 3.91E-03 ± 3.14E-05
Train R
2
0.998 ± 0.001 0.996 ± 0.001 0.987 ± 0.001 0.956 ± 0.002
Val R
2
0.997 ± 0.001 0.995 ± 0.001 0.971 ± 0.001 0.926 ± 0.003
Test R
2
0.997 ± 0.001 0.994 ± 0.001 0.972 ± 0.001 0.927 ± 0.001
Train MAE (%) 0.512 ± 0.010 0.080 ± 0.025 0.516 ± 0.021 1.487 ± 0.040
Val MAE (%) 0.566 ± 0.006 0.084 ± 0.025 0.950 ± 0.020 2.137 ± 0.222
Test MAE (%) 0.566 ± 0.006 0.085 ± 0.025 0.883 ± 0.028 1.797 ± 0.020
Train Total NO
x
Error (%) 0.086 ± 0.049 1.229 ± 1.250 0.116 ± 0.114 0.151 ± 0.231
Val Total NO
x
Error (%) 0.100 ± 0.058 1.304 ± 1.375 0.155 ± 0.114 0.368 ± 0.350
Test Total NO
x
Error (%) 0.084 ± 0.059 1.214 ± 1.313 0.167 ± 0.109 0.249 ± 0.195
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 84031010
Pillai et al. Modeling NOx Using Deep Learning
R
2
values on both train and test sets. The points are well
distributed on both sides of the regression t line which
indicates normal distribution of prediction errors with a mean
around 0, which is further conrmed by the histogram of errors
shown for all the models. The large scatter of points on either side
of the regression t line in Figures 4G,H can be attributed to the
large number of data points which causes the fewoutliers in the
plot to cover a larger region in the plots.
4.2 Error Metrics
Figure 5 shows the total NO
x
error (%) (Arsie et al., 2013;Johri and
Filipi, 2014;Bellone et al., 2020) for train and validation sets (orange
line) on a log scale. The error was consistent on average over the
training. From Table 4 it can be observed that total NO
x
error (%) for
lower accuracy models (R
2
=0.930.95) is lower than that of higher
accuracy models (R
2
= 0.99). A simple reason for this could be due to
the fact that the lower accuracy models overpredict and underpredict
FIGURE 3 | Evolution of MSE and R
2
curves over training and validation data (All Models). (A) Engine Out NOx Loss and R
2
Dataset 1. (B) Tailpipe NOx Loss and R
2
Dataset 1. (C) Engine out NOx Loss & R
2
Dataset 2. (D) Tailpipe NOx Loss and R
2
Dataset 2
FIGURE 4 | Train and Test R
2
ts with histograms showing distribution of errors (All Models). (A): Engine Out NOx Model R
2
Train Dataset 1. (B): Engine Out NOx
Model R
2
Test Dataset 1. (C): Engine Out NOx Model R
2
Train Dataset 2. (D): Engine Out NOx Model R
2
Test Dataset 2. (E): Tailpipe NOx Model R
2
Train Dataset 1. (F):
Tailpipe NOx Model R
2
Test Dataset 1. (G): Tailpipe NOx Model R
2
Train Dataset 2. (H): Tailpipe NOx Model R
2
Test Dataset 2.
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 84031011
Pillai et al. Modeling NOx Using Deep Learning
instantaneous NO
x
more than the higher accuracy models. The
results from Table 4 show that there is no clear correlation between
R
2
values and the total NO
x
error, i.e., higher R
2
models do not show
lower total NO
x
error. The studies conducted on the models indicate
that the magnitude of overprediction and underprediction in lower
R
2
models, sufciently balance out when the cumulative total true
and predicted NO
x
over the train, validation and test set is calculated,
resulting in a lower total NO
x
error (see histograms in Figure 4). This
suggests that the total NO
x
error metric alone is not a good indicator
of a models instantaneous NO
x
predictive capability.
Therefore, in order to better understand the progression of the
instantaneous prediction error at every epoch, the (maximum,
minimum and mean) absolute prediction error and percent
absolute prediction error over the train set were used as additional
evaluation merits; see Section 3.3.2 for denition. From Figure 5,it
can be seen that all metrics decrease as the model is trained, indicating
improvement in model accuracy. The large maximum percent
absolute prediction errors (of the order 10
7
) can be attributed to a
few very low NO
x
emissions points being signicantly overpredicted,
however, this represents a very small portion of the entire dataset
(<5%). Subsequently, it can be observed that the mean absolute error
(%) is low (in the order of 10
1
) indicating higher overall instantaneous
prediction capability. Also, it is important to consider the scale of the
NO
x
emissions while evaluating percentage absolute error. Very small
errors in predictions can be blown up when calculating absolute error
with respect to the true NO
x
emissions which are at the scale of 1E-04
and lower. These absolute errors however are still small when
compared to the average value of NO
x
emissions in the datasets
(0.030.05 g/s) as can be seen from Figures 5EHand Table 4 which
shows the mean MAE and percent MAE with respect to the
maximum NO
x
over the train, validation and test sets.
4.3 Actual vs Predicted NOx Emissions
Figures 6,7depict the actual NO
x
emissions in red and the
predicted NO
x
emissions in blue. Such visualizations aid in
understanding the predictive capabilities of the models over the
course of training, and clearly highlight points for which the model is
overpredicting or underpredicting NO
x
emissions.
As an example, Figure 6 depicts the improvement in the
predictions of the engine-out NO
x
DNN model for Dataset 1, over
a small section of the train set as the network learns. Figures
6ADshow the actual and predicted NO
x
emissions for the DNN
model at epochs 1, 200, 400 and 600 respectively. It can be
observed that as the network is trained, subsequently the
predictions (dashed blue line) approach the true values of NO
x
emissions (red line). The increasing overlap of the two lines in
Figures 6BDindicates signicant improvement in the model
predictive capabilities over the course of training.
Figure 7 shows the NO
x
prediction results for a portion of the test
set for all the models. Sections of the test set have been enlarged
below each sub-plot to provide more clarity to the prole for
measured and predicted NO
x
emissions. Figure 7B also includes
FIGURE 5 | Progression of total NO
x
error in comparison with maximum, minimum and mean absolute error over training of all DNN NO
x
models. (A) Engine Out
NOx Train Errors (%) Dataset 1. (B) Tailpipe NOx Train Errors (%) Dataset 1. (C) Engine Out NOx Train Errors (%) Dataset 2. (D) Tailpipe NOx Train Errors (%) Dataset 2.
(E) Engine Out NOx Train Errors Dataset 1. (F) Tailpipe NOx Train Errors Dataset 1. (G) Engine Out NOx Train Errors Dataset 2. (H) Tailpipe NOx Train Errors Dataset 2.
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 84031012
Pillai et al. Modeling NOx Using Deep Learning
an enlarged log-scale plot to show the peak tailpipe NO
x
predictions
more clearly. The DNN models are able to capture both dips and
peaks in the engine-out and tailpipe NO
x
emissions effectively as
observed from the enlarged plots. The DNN models were
successfully able to capture high frequency oscillations in
previously unseen test data, while having MAE within 12% of
the full scale of the NO
x
emissions available in the dataset. Overall,
the results are very promising, and the models appear to be suitable
for transient NO
x
emissions estimation in engine-out NO
x
control
and tailpipe NO
x
compliance applications.
5 DISCUSSION
In this section, three different studies conducted have been described
which highlight some important aspects of the application of DNNs
to NO
x
emissions predictions. First, an analysis of the effect of type of
input data split on the accuracy of developed DNN models is
presented. Then, insights from an input feature importance study
are discussed. Finally, the effectiveness of the developed DNN models
to fault-detection in SCR aftertreatment systems is also presented.
5.1 Effect of Training Data on Model
Accuracy
In this section, the effect of model training data for NO
x
prediction on the DNN model accuracy has been analyzed.
DNN models were trained using data split by two different
approaches as described in Section 2.2. The results for this
experiment on each of the models is presented in Figure 8.It
was observed that when the models were trained using randomly
selected training data, they had good accuracy (both R
2
and MSE)
on train, validation and test sets. However, when the models were
trained using complete test cycles and then tested on different
(unseen) complete runs of the same test cycles, the models had
less prediction accuracy on the test set. However, the models are
not overtting on the training data, as R
2
and MSE of the hold-
outvalidation set is comparable to that of the train set, as can be
seen from the striped bars in Figure 8.
One explanation for this phenomenon could be that there are
variations in NO
x
emissions measurements across multiple runs
of the same test cycle, either due to instrument measurement
deviations or accuracy limits or due to the effect of an engine or
aftertreatment parameter that is not included in the input features
for the DNN models - for eg. fuel injection pressure and timing,
urea injection or NH
3
slip. The results from this study suggest that
DNN models trained using randomly selected data, are capable of
learning the different variations in NO
x
measurements that occur
at similar inputs much better than when the model is trained
using whole test cycles. The variation in NO
x
emissions at the
same point in the dataset between the test cycle runs in the train
set and the test set are unknown to the models when the data is
not split randomly. This could be causing the higher error
between the model prediction and the true NO
x
measurement
FIGURE 6 | Improvement of instantaneous NO
x
prediction over DNN training (Engine-Out NO
x
model Dataset 1). (A) Engine Out NOx Training Epoch 1 Dataset 1.
(B) Engine Out NOx Training Epoch 200 Dataset 1. (C) Engine Out NOx Training Epoch 400 Dataset 1. (D) Engine Out NOx Training Epoch 600 Dataset 1.
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 84031013
Pillai et al. Modeling NOx Using Deep Learning
in the test cycle. Therefore, to improve model prediction when
using multiple runs of whole test cycles to train the DNN, it could
be advisable to add more input features to explain the variation in
true NO
x
measurements. Fuel injection strategies affect the
formation of NO
x
emissions (Agarwal et al., 2013). SCR
performance is also affected by NH
3
/NO
x
ratio, while NH3
slip could also affect the effectiveness of engine-out to tailpipe
NO
x
conversion (Girard et al., 2007). Therefore, proprietary ECU
FIGURE 7 | Actual vs Predicted NO
x
emissions for a portion of the test set (All Models). (A) Test Set Engine Out NOx Actual vs Predicted Dataset 1. (B) Test Set
Tailpipe NOx Actual vs Predicted Dataset 1. (C) Test Set Engine Out NOx Actual vs Predicted Dataset 2. (D) Test Set Tailpipe NOx Actual vs Predicted Dataset 2.
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 84031014
Pillai et al. Modeling NOx Using Deep Learning
parameters such as fuel injection pressure and timing, urea
injection timing and quantity and NH
3
slip could be included
as inputs to the DNN models to capture the variation in NO
x
emissions.
5.2 Input Feature Importance Study
DNNs are inherently black-box models due to their multi-layer
nonlinear architecture. DNNs are capable of modelling complex
problems with high accuracy but at the expense of losing
explainability, i.e., how transparent the models predictions
are to a human (Fan et al., 2021). In the application of
predicting NO
x
emissions, especially for engine control and
compliance purposes it is important to discern where and why
the model is predicting incorrectly. Therefore, an attempt has
been made to determine importantinputs to the DNN models
presented in this paper that affect model prediction. This study
does not completely make the DNN models transparent, but tries
to gain some understanding into the inner workings of the DNNs
characterization of various engine and aftertreatment parameters
to the production of NO
x
emissions.
The DNN models make use of input engine or aftertreatment
parameters to learn the complex transient nature of NO
x
emissions. Therefore, by removing an input feature that is
important to the DNN to learn, would result in decrease in
the model accuracy. This was the underlying principle used to
develop an understanding of relative importance of engine or
aftertreatment variables for the DNN models to predict NO
x
emissions. In this study, few (59) input features were used to
train the models, and as a consequence this method was easier to
implement. After the DNN models were trained, the models (with
the same hyperparameters and architecture) were trained again
by removing one input feature at a time to nd the input feature
which when removed reduced the model prediction accuracy,
i.e., R
2
and MSE. As an example, results for the experiments have
been presented for the engine-out NO
x
model using Dataset 1 in
Figure 9.
However, as can be seen from the R
2
and MSE values for both
train and test sets, removal of any single input variable did not
seem to affect the model prediction accuracy. The actual physics
of NO
x
formation and engine operation could provide an
explanation for this phenomenon. NO
x
emissions are formed
due to highly complex chemical and physical phenomena which
are affected by parameters that are highly dependent on each
other. Therefore, even if one variable or input is removed from
the DNN model, information provided by other engine or
aftertreatment variables help the DNN to capture the complex
process of NO
x
formation in diesel engines. The observed
independence could also be because the DNN was sufciently
over-parameterized, i.e., number of parameters in the network
exceeds the training points (~ 106model parameters vs ~ 105
training points) Therefore, the network is able to learn from
existing input features even if one input feature is removed. The
FIGURE 8 | Effect of type of data split on model performance. (A) Engine Out NOx Model Dataset 1. (B) Tailpipe NOx Model Dataset 1. (C) Engine Out NOx Model
Dataset 2. (D) Tailpipe NOx Model Dataset 2.
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 84031015
Pillai et al. Modeling NOx Using Deep Learning
two hypotheses could be responsible in conjunction for the
observed independence of model accuracy on the removal of
single input features.
Further testing was therefore conducted by removing multiple
sets of inputs till a signicant reduction in model prediction
accuracy was achieved to test the models capability to model NO
x
emissions with only the most importantvariables. For the
engine-out NO
x
model, it was observed that just providing
engine speed and torque as inputs to the DNN still resulted in
model accuracy (R
2
) of 0.93. This is consistent with the diesel
engine operationas the engine speed and torque essentially
determine the engine operating conditions which is highly related
to the production of engine-out NO
x
emissions. However, the
signicant reduction in accuracy suggests that the other input
features also contribute to the DNN prediction accuracy on a
smaller scale when compared to engine speed and torque.
Similarly, using chassis dynamometer data which was collected
from a hybrid bus, model accuracy, i.e., R
2
of 0.93 was achieved
using engine speed and torque along with state of charge and
charging current as input features.
For the tailpipe NO
x
model using the engine dynamometer
dataset, utilizing SCR inlet temperature, exhaust mass ow rate
and engine-out NO
x
as inputs resulted in a model prediction
accuracy with R
2
of 0.95 on the test set. However, for the tailpipe
NO
x
model for the hybrid bus, removing even a single input
feature resulted in signicant reduction in model prediction
accuracy as shown in Figure 10. This could be attributed to
incorrect functioning of the SCR system for this hybrid bus (EPA,
2018). Subsequently, the data collected to train this tailpipe NO
x
DNN model, is not a true representation of the effect of SCR
parameters on tailpipe NO
x
emissions. Considering that this
particular DNN model is much deeperand widerthan the
other tailpipe NO
x
DNN model, over-parameterization of the
network was inadequate for the network to completely capture
incorrect operation of the SCR system.
The variable importance study however suggests that the DNN
models are capable of capturing some aspects of the physics of
engine-out and tailpipe NO
x
emissions without the need for any
physical or chemical equations. Important engine and
aftertreatment variables guide the DNN models to predict with
higher accuracy. The study conducted also demonstrates that
utilizing minimal information, i.e., two to four physics inspired
inputs, the DNN models developed in this study are capable of
capturing complex trends of NO
x
emissions in heavy-duty
vehicles as indicated by the R
2
values of 0.92 for engine-out
and 0.95 for tailpipe NO
x
model.
5.3 DNN as a Fault Detection Tool for Engine
and Aftertreatment System
This section presents an example of the application of DNN
models such as the ones developed in this paper for detection of
anomalies or faults in diesel vehicles. If DNN models using
physics inspired inputs are trained using data from
functioning engine and aftertreatment systems, the predictions
of the model can compared with data that is obtained from in-use
vehicles. This can be applied to detect abnormal NO
x
emissions
occurring either due to a faulty engine operation, incorrectly
operating aftertreatment systems or defeat devices. Fault
detection can therefore be performed on an engine-level, as
well as, aftertreatment-level.
As an example, data was collected from an poorly-functioning
aftertreatment (SCR) system for the same engine as the one used
for engine dynamometer testing in this paper. The DNN model
trained using data from a functioning aftertreatment system was
used to predict the tailpipe NO
x
emissions for this engine. The
tailpipe NO
x
emissions predictions for all three engine
dynamometer test cycles (Section 2.2) were then compared
with the faultyaftertreatment system data. In Figure 11A,
the blue dashed line indicates the DNN prediction (using
functioning aftertreatment system data) and the red line
indicates NO
x
emissions measurements collected from the
faulty aftertreatment system. The cumulative total tailpipe NO
x
emissions over the 3 test cycles for the faulty aftertreatment
system was 29.81 g while, the DNN model predicted the expected
total tailpipe NO
x
emissions (if the aftertreatment was
FIGURE 9 | Effect of variable removal on model performance (Engine out NO
x
model Dataset 1). (A) Effect of Variable Removal on R
2
Engine Out NOx Dataset 1.
(B) Effect of Variable Removal on MSEEngine Out NOx Dataset 1.
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 84031016
Pillai et al. Modeling NOx Using Deep Learning
functioning correctly) of 18.6 g, as shown in Figure 11B. The
engine with a faulty aftertreatment system produced 60% higher
tailpipe NO
x
emissions which was successfully detected by the
DNN model. This example demonstrates the capability of
optimized DNN models to detect NO
x
emissions anomalies or
faults in diesel vehicles and their application for testing and
compliance purposes.
5.4 Application of DNN Models to Other
Heavy-Duty Engines and Fuels
This section discusses the application of similar DNN models to
other heavy-duty engines and fuels. Even though the models
developed in this paper have been trained on data from one
particular engine, it is expected that due to the use of physics
inspired features, the models are capable of capturing the
signicant features and trends that affect transient NO
x
emissions in other heavy duty diesel engines. However, from
an instantaneous engine-out NO
x
perspective, since it is a
continuous variable and highly dependent on engine design
and calibration and fuel injection strategies, trained model
accuracy would be reduced when tested directly on other
engines not used to train the models developed in this study.
Also, from a tailpipe NO
x
model perspective, SCR aftertreatment
systems with different catalysts have different NO
x
conversion
dependencies on the inputs used in the DNN models developed in
this study. Therefore, using comprehensive datasets such as the
ones developed in this study for other engines and aftertreatment
systems, and subsequently applying a similar training process as
described in this study should theoretically result in accurate
DNN NO
x
emissions models for different heavy-duty engines.
The current DNN models could also be modied to include other
FIGURE 10 | Effect of variable removal on model performance (tailpipe NO
x
model Dataset 2). (A) Effect of Variable Removal on R
2
Tailpipe NOx Dataset 2. (B)
Effect of Variable Removal on MSETailpipe NOx Dataset 2.
FIGURE 11 | Application of DNN for fault detection in SCR aftertreatment Systems. (A) Tailpipe NOx Emission-faulty after treatment vs DNN model prediction. (B)
Cumulative Tailpipe NOx Emission.
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 84031017
Pillai et al. Modeling NOx Using Deep Learning
inputs that capture the effect of different engine designs and
calibration and SCR catalysts along with comprehensive datasets
for other engines to evaluate model performance on other heavy-
duty engines.
The type of fuel tested to develop the datasets used to train the
DNN models in this study could inuence the performance of the
trained models. The models in this study were trained using data
from engine and chassis dynamometer testing running on
certication diesel fuel. Therefore, the dataset used to train the
DNN models captures the NO
x
emissions trends for certication
diesel fuel. Capturing NO
x
emissions trends for other fuels such as
biodiesel or other higher oxygenated renewable fuels would
require subsequent training and optimization using vehicle
testing data using these fuels. DNN models developed in this
study could be utilized as a base model for initial training using
data from a different fuel and then optimized for NO
x
emissions
prediction. A larger dataset that encompasses NO
x
emissions
trends using different fuels on a single engine can be used to train
similar DNN models with fuel type as an input. The model
performance could then be evaluated to capture the inuence of
fuels on the DNN NO
x
emissions predictions.
6 CONCLUSIONS AND FUTURE WORK
Deep Neural Network (DNN) models were developed using
physics inspired inputs to predict transient heavy-duty diesel
engine-out and tailpipe NO
x
emissions using engine and
aftertreatment variables. The study employed popular and
well-established techniques in machine/deep learning to
develop engine-out and tailpipe NO
x
emissions models with
high predictive power. Based on an in-depth analysis of the
DNN models for predicting NO
x
emissions developed in this
study, the following conclusions can be drawn:
1. DNN models using physics inspired inputs are capable of
effectively characterizing the complex, nonlinear nature of
transient engine-out and tailpipe NO
x
emissions. In this study,
simple and easily accessible OBD parameters (inputs) were
used to develop accurate DNN models. All the models
developed in this study have a mean absolute error
percentage within 12% of the maximum NO
x
measurement, which is comparable to physical NO
x
emissions measurement analyzer accuracy of 1% of full scale.
2. Novel tailpipe NO
x
models developed using SCR aftertreament
variables, such as SCR inlet and outlet temperature, engine-out
NO
x
and exhaust mass ow rate, showed good prediction
accuracy (R
2
= 0.99). However, the DNN tailpipe NO
x
model
developed using data from a faulty SCR system exhibited lower
prediction accuracy (R
2
= 0.92) on the test set.
3. This study analyzed the effect of type of dataset splitting on the
model accuracy. It was shown that randomly splitting the
dataset into train and test sets provides a better understanding
of cycle-to-cycle NO
x
emissions variation to the DNN model
while trainingthereby improving model accuracy on the test
set. If the DNN models are trained using multiple runs of test
cycles as train data, it would be advisable to include more input
features that provide additional information to the DNN about
the cause of disparity in NO
x
emissions for similar test cycles.
4. The feature importance study conducted on the DNN models
showed the robustness of the models to removal of single input
features while training the network. It was also observed that
the DNN models had close understanding of the complex
transient nature of NO
x
emissions as they were trained using
physics inspired input features. Engine-out NO
x
models
showed good correlation with engine operating conditions
like engine speed and torque (R
2
= 0.93), while tailpipe NO
x
models exhibited good accuracy even with just three
aftertreatment variables as inputs (R
2
= 0.95). Interestingly,
the DNN models did not perform equally well when trained
using data from a poorly functioning aftertreatment system
(R
2
= 0.92). This indicated that when DNN models for NO
x
emissions are trained using physics inspired inputs, training
data that is not representative of the physics of NO
x
emissions
formation can lead to relatively poor DNN model
performance.
5. This work demonstrated that DNN NO
x
emissions models can
be very effective tools for fault detection in Selective Catalytic
Reduction (SCR) systems. Cumulative NO
x
predictions from
the DNN model detected that the engine with a faulty
aftertreatment (SCR) system produced 60% more total cycle
NO
x
(g) than the expected NO
x
emissions from a functioning
aftertreatment (SCR) system.
Future work in this domain will involve the application of
similar DNN models to on-road testing data from OBD
information and a Portable Emissions Measurement System
(PEMS). On-road emissions prediction presents an interesting
challenge as environmental variables such as road grade, ambient
temperature, pressure and humidity also affect NO
x
emissions,
but their effects are not necessarily captured in the controlled
environment of laboratory testing. Use of DNN models that are
trained using physics inspired inputs along with real-world
driving effects would help develop models for on-road NO
x
prediction, which should reduce the disparities in NO
x
emissions between on-road and laboratory type tests.
However, measurements from low cost production on-road
sensors are less repeatable than those taken from expensive
instruments used in engine and chassis dynamometer test
cells, and hence it would be challenging to achieve equally
high accuracy models using on-road data. The inuence of
different type of fuels on DNN model NO
x
prediction could
also be explored by training the models on comprehensive
datasets including data from different fuels tested on a heavy-
duty vehicle. More heavy-duty engines and aftertreatment
systems could be incorporated into the DNN models to assess
the models robustness in predicting NO
x
emissions from
different engine sizes and SCR systems. Successful
implementation using comprehensive datasets available from
chassis and engine dynamometer testing regularly conducted
for compliance purposes could result in a database created to
measure cumulative NO
x
emissions over test cycles for different
heavy-duty engines using the DNN models. This would be
important to inform the development of future NO
x
emissions
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 84031018
Pillai et al. Modeling NOx Using Deep Learning
regulations and for validating real-world emissions
measurements against expected performance.
DATA AVAILABILITY STATEMENT
The raw data supporting the conclusions of this article will be
made available by the authors, without undue reservation.
AUTHOR CONTRIBUTIONS
Specic contributions to the paper are as follows: RP: Data
Curation, Formal Analysis, Methodology, Software, Validation,
Investigation and Writing - original draft. VT: Investigation,
Methodology and Validation. ASB: Investigation,
Methodology, Software and Validation. MB: Resources and
Supervision. RS: Resources and Data Curation. TN: Resources,
Project Administration and Funding Acquisition. ALB:
Resources, Supervision, Project administration and Funding
Acquisition.
FUNDING
Funding for this study was provided by Horiba Instruments Inc.
ACKNOWLEDGMENTS
The authors would like to thank Maria Peralta, Advanced Testing
Center Director, United States EPA, NVFEL, Garrett Brown,
Scott Ludlam and Robert Caldwell at the United States EPA,
NVFEL for conducting the chassis dynamometer tests at the
Heavy-Duty Chassis Dynamometer Test Facility and providing
data for this study.
REFERENCES
Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., et al. (2016).
Tensorow: A System for Large-Scale Machine Learning,in Proceeding of the
12th {USENIX} Symposium on Operating Systems Design and Implementation
({OSDI} 16), Savannah, GA, USA, November 2016, 265283.
Abiodun, O. I., Jantan, A., Omolara, A. E., Dada, K. V., Mohamed, N. A., and
Arshad, H. (2018). State-of-the-art in Articial Neural Network Applications: A
Survey. Heliyon 4, e00938. doi:10.1016/j.heliyon.2018.e00938
Agarwal, A. K., Srivastava, D. K., Dhar, A., Maurya, R. K., Shukla, P. C., and Singh,
A. P. (2013). Effect of Fuel Injection Timing and Pressure on Combustion,
Emissions and Performance Characteristics of a Single cylinder Diesel Engine.
Fuel 111, 374383. doi:10.1016/j.fuel.2013.03.016
Amodei, D., Ananthanarayanan, S., Anubhai, R., Bai, J., Battenberg, E., Case, C.,
et al. (2016). Deep Speech 2: End-To-End Speech Recognition in English and
Mandarin,in Proceeding of the International conference on machine learning,
June 2016 (Brookline, MA: PMLR), 173182.
Arsie, I., Cricchio, A., De Cesare, M., Pianese, C., and Sorrentino, M. (2013). A
Methodology to Enhance Design and On-Board Application of Neural Network
Models for Virtual Sensing of Nox Emissions in Automotive Diesel Engines.
SAE Tech. Pap. 6. doi:10.4271/2013-24-0138
Askin, A. C., Barter, G. E., West, T. H., and Manley, D. K. (2015). The Heavy-Duty
Vehicle Future in the united states: A Parametric Analysis of Technology and
Policy Tradeoffs. Energy Policy 81, 113. doi:10.1016/j.enpol.2015.02.005
Bagheri, S., Huang, Y., Walker, P. D., Zhou, J. L., and Surawski, N. C. (2021).
Strategies for Improving the Emission Performance of Hybrid Electric Vehicles.
Sci. Total Environ. 771, 144901. doi:10.1016/j.scitotenv.2020.144901
Baldi, P., and Sadowski, P. J. (2013). Understanding Dropout. Adv. Neural Inf.
Process. Syst. 26, 28142822.
Bellone, M., Faghani, E., and Karayiannidis, Y. (2020). Comparison of Cnn and
Lstm for Modeling Virtual Sensors in an Engine. SAE Tech. Pap. 2, 26322639.
doi:10.4271/2020-01-0735
Bengio, Y. (2012). Deep Learning of Representations for Unsupervised and
Transfer Learning. JMLR: Workshop Conf. Proc. 27, 1727.
Bishop, C. M. (1994). Neural Networks and Their Applications. Rev. scientic Instr.
65, 18031832. doi:10.1063/1.1144830
Boningari, T., and Smirniotis, P. G. (2016). Impact of Nitrogen Oxides on the
Environment and Human Health: Mn-Based Materials for the NO X
Abatement. Curr. Opin. Chem. Eng. 13, 133141. doi:10.1016/j.coche.2016.
09.004
Bowman, C. T. (1975). Kinetics of Pollutant Formation and Destruction in
Combustion. Prog. Energ. Combustion Sci. 1, 3345. doi:10.1016/0360-
1285(75)90005-2
Brown, A. L., Fleming, K. L., and Safford, H. R. (2020). Prospects for a Highly
Electric Road Transportation Sector in the usa. Curr. Sustainable/Renewable
Energ. Rep. 7, 110. doi:10.1007/s40518-020-00155-3
Camporeale, S. M., Ciliberti, P. D., Carlucci, A., and Ingrosso, D. (2017). Dynamic
Validation and Sensitivity Analysis of a Nox Estimation Model Based on in-
cylinder Pressure Measurement. Warrendale, PA: SAE International. doi:10.
4271/2017-24-0131
Ciregan, D., Meier, U., and Schmidhuber, J. (2012). Multi-column Deep Neural
Networks for Image Classication,in 2012 IEEE conference on computer
vision and pattern recognition, Providence, RI, USA, June 2012 (IEEE),
36423649.
Clevert, D.-A., Unterthiner, T., and Hochreiter, S. (2015). Fast and Accurate Deep
Network Learning by Exponential Linear Units (Elus). arXiv preprint arXiv:
1511.07289.
Dahifale, B. S., and Patil, A. S. (2017). Diesel Engine Performance Improvement
for Constant Speed Application Using Cfd,in ASME International Mechanical
Engineering Congress and Exposition (American Society of Mechanical
Engineers), 111. doi:10.1115/imece2017-70012
Donateo, T., and Filomena, R. (2020). Real Time Estimation of Emissions in a
Diesel Vehicle with Neural Networks,.E3S Web of Conferences (EDP Sciences),
197, 0620. doi:10.1051/e3sconf/202019706020
EPA (2018). EPA Announces Largest Voluntary Recall of Medium- and Heavy-Duty
Trucks. Available at: https://archive.epa.gov/epa/newsreleases/epa-announces-
largest-voluntary-recall-medium-and-heavy-duty-trucks.html.
EPA (2021a). CFR Title 40 Part 1065 - Engine Testing Procedures. Legal
Information Institute. Available at: https://www.ecfr.gov/current/title-40/
chapter-I/subchapter-U/part-1065.
EPA (2021b). Regulations for Emissions from Vehicles and Engines. Available at:
https://www.epa.gov/regulations-emissions-vehicles-and-engines/cleaner-
trucks-initiative.
Fan, F. L., Xiong, J., Li, M., and Wang, G. (2021). On Interpretability of Articial
Neural Networks: A Survey. IEEE Trans. Radiat. Plasma Med. Sci. 6 (5),
741760. doi:10.1109/trpms.2021.3066428
Feurer, M., and Hutter, F. (2019). Hyperparameter Optimization,in
Automated Machine Learning (Cham: Springer), 333. doi:10.1007/978-
3-030-05318-5_1
Fischer, M. (2013). Transient Nox Estimation Using Articial Neural Networks.
IFAC Proc. Volumes 46, 101106. doi:10.3182/20130904-4-jp-2042.00006
Ge, R., Kakade, S. M., Kidambi, R., and Netrapalli, P. (2019). The Step Decay
Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure
for Least Squares. Adv. Neural Inf. Process. Syst. 32, 1497714988.
Girard, J., Snow, R., Cavataio, G., and Lambert, C. (2007). The Inuence of
Ammonia to Nox Ratio on Scr Performance. Warrendale, PA: SAE
International. doi:10.4271/2007-01-1581
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 84031019
Pillai et al. Modeling NOx Using Deep Learning
Gluck, S., Glenn, C., Logan, T., Vu, B., Walsh, M., and Williams, P. (2003).
Evaluation of Nox Flue Gas Analyzers for Accuracy and Their Applicability for
Low-Concentration Measurements. J. Air Waste Manag. Assoc. 53, 749758.
doi:10.1080/10473289.2003.10466208
Goodfellow, I., Bengio, Y., and Courville, A. (2016). Deep Learning. MIT press.
Heywood, J. (2019). Internal Combustion Engine Fundamentals. 2nd Edition. New
York: McGraw-Hill.
Johri, R., and Filipi, Z. (2014). Neuro-fuzzy Model Tree Approach to Virtual
Sensing of Transient Diesel Soot and Nox Emissions. Int. J. Engine Res. 15,
918927. doi:10.1177/1468087413492962
Khair, M. K., and Majewski, W. (2006). Diesel Emissions and Their Control.
Warrendale, PA: SAE International.
Kingma, D. P., and Ba, J. (2014). Adam: A Method for Stochastic Optimization.
arXiv preprint arXiv:1412.6980
Koebel, M., Madia, G., and Elsener, M. (2002). Selective Catalytic Reduction of No
and No2 at Low Temperatures. Catal. Today 73, 239247. doi:10.1016/s0920-
5861(02)00006-8
Kotsiantis,S.B.,Kanellopoulos,D.,andPintelas,P.E.(2006).DataPreprocessingfor
Supervised Leaning. Int. J. Comput. Sci. 1, 111117. doi:10.4304/jcp.1.4.30-37
Lavoie, G. A., Heywood, J. B., and Keck, J. C. (1970). Experimental and Theoretical
Study of Nitric Oxide Formation in Internal Combustion Engines. Combustion
Sci. Tech. 1, 313326. doi:10.1080/00102206908952211
LeCun, Y., Bengio, Y., and Hinton, G. (2015). Deep Learning. Nature 521, 436444.
doi:10.1038/nature14539
Lee, S., Lee, Y., Lee, Y., Shin, S., Kim, M., Park, J., et al. (2021). Proposal of a
Methodology for Designing Engine Operating Variables Using Predicted Nox
Emissions Based on Deep Neural Networks. J. Mech. Sci. Technol. 35,
17471756. doi:10.1007/s12206-021-0337-2
Maas, A. L., Hannun, A. Y., and Ng, A. Y. (2013). Rectier Nonlinearities Improve
Neural Network Acoustic Models,.Proc. Icml (Citeseer) (CA 94305 USA:
Computer Science Department, Stanford University), 30.
Mentink, P., Seykens, X., and Escobar Valdivieso, D. (2017). Development and
Application of a Virtual Nox Sensor for Robust Heavy Duty Diesel Engine
Emission Control. SAE Int. J. Engines 10, 12971304. doi:10.4271/2017-01-
0951
Merryman, E. L., and Levy, A. (1975). Nitrogen Oxide Formation in Flames: The
Roles of No2 and Fuel Nitrogen. Symp. (International) Combustion 15,
10731083. doi:10.1016/S0082-0784(75)80372-9
Mobasheri, R., Peng, Z., and Mirsalim, S. M. (2012). Analysis the Effect of
Advanced Injection Strategies on Engine Performance and Pollutant
Emissions in a Heavy Duty Di-diesel Engine by Cfd Modeling. Int.
J. Heat Fluid Flow 33, 5969. doi:10.1016/j.ijheatuidow.2011.10.004
Mohammad, A., Rezaei, R., Hayduk, C., Delebinski, T. O., Shahpouri, S., and
Shahbakhti, M. (2021). Hybrid Physical and Machine Learning-Oriented
Modeling Approach to Predict Emissions in a Diesel Compression Ignition
Engine. SAE Tech. Paper,113. doi:10.4271/2021-01-0496
Nair, V., and Hinton, G. E. (2010). Rectied Linear Units Improve
Restricted Boltzmann Machines,in Proceedings of the 27th
International Conference on International Conference on Machine
Learning, June 2010, 807814.
Panneer Selvam, H., Shekhar, S., and Northrop, W. F. (2021). Prediction of Nox
Emissions from Compression Ignition Engines Using Ensemble Learning-Based
Models with Physical Interpretability. SAE Technical Paper. doi:10.4271/2021-
24-0082
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. (2011).
Scikit-learn: Machine Learning in python. J. machine Learn. Res. 12, 28252830.
Provataris, S. A., Savva, N. S., Chountalas, T. D., and Hountalas, D. T. (2017).
Prediction of nox emissions for high speed di diesel engines using a semi-
empirical, two-zone model. Energ. Convers. Manag. 153, 659670. doi:10.1016/
j.enconman.2017.10.007
Refaeilzadeh, P., Tang, L., and Liu, H. (2009). Cross-validation. Encyclopedia
database Syst. 5, 532538. doi:10.1007/978-0-387-39940-9_565
Rumelhart, D. E., Hinton, G. E., and Williams, R. J. (1986). Learning
Representations by Back-Propagating Errors. nature 323, 533536. doi:10.
1038/323533a0
Sankararaman, K. A., De, S., Xu, Z., Huang, W. R., and Goldstein, T. (2020). The
Impact of Neural Network Overparameterization on Gradient Confusion and
Stochastic Gradient Descent,in International Conference on Machine Learning
(Brookline, MA: PMLR), 84698479.
Shin, S., Lee, Y., Kim, M., Park, J., Lee, S., and Min, K. (2020). Deep Neural Network
Model with Bayesian Hyperparameter Optimization for Prediction of Nox at
Transient Conditions in a Diesel Engine. Eng. Appl. Artif. Intelligence 94,
103761. doi:10.1016/j.engappai.2020.103761
Silver, D., Huang, A., Maddison, C. J., Guez, A., Sifre, L., Van Den Driessche, G.,
et al. (2016). Mastering the Game of Go with Deep Neural Networks and Tree
Search. nature 529, 484489. doi:10.1038/nature16961
Smith, L. N. (2018). A Disciplined Approach to Neural Network Hyper-Parameters:
Part 1learning Rate, Batch Size, Momentum, and Weight Decay. arXiv preprint
arXiv:1803.09820.
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., and Salakhutdinov, R.
(2014). Dropout: A Simple Way to Prevent Neural Networks from Overtting.
J. Machine Learn. Res. 15, 19291958.
Stone, M. (1978). Cross-validation:a Review2. Ser. Stat. 9, 127139. doi:10.1080/
02331887808801414
Sun, C., Shrivastava, A., Singh, S., and Gupta, A. (2017). Revisiting Unreasonable
Effectiveness of Data in Deep Learning Era,in Proceedings of the IEEE
international conference on computer vision, Venice, Italy, Oct. 2017
(IEEE), 843852. doi:10.1109/iccv.2017.97
Wan, X. (2019). Inuence of Feature Scaling on Convergence of Gradient Iterative
Algorithm. J. Phys. Conf. Ser. 1213, 032021. doi:10.1088/1742-6596/1213/3/
032021
Winkler, S., Anderson, J., Garza, L., Ruona, W., Vogt, R., and Wallington, T.
(2018). Vehicle Criteria Pollutant (Pm, No X, Co, Hcs) Emissions: How Low
Should We Go. npj Clim. Atmos. Sci. 1, 15. doi:10.1038/s41612-018-0037-5
You, K., Long, M., Wang, J., and Jordan, M. I. (2019). How Does Learning Rate
Decay Help Modern Neural Networks. arXiv:1908.01878 (cs, stat).
Yu, Y., Wang, Y., Li, J., Fu, M., Shah, A. N., and He, C. (2021). A Novel Deep
Learning Approach to Predict the Instantaneous NOEmissions from Diesel
Engine. IEEE Access 9, 1100211013. doi:10.1109/ACCESS.2021.3050165
Zhang, Q., Pennycott, A., Burke, R., Akehurst, S., and Brace, C. (2015). Predicting
the Nitrogen Oxides Emissions of a Diesel Engine Using Neural Networks.
Warrendale, PA: SAE International. doi:10.4271/2015-01-1626
Conict of Interest: TN was employed by Horiba Instruments Inc. This study
received funding from Horiba Instruments Inc. The funder was involved in
resources, project administration and funding acquisition.
The remaining authors declare that the research was conducted in the absence of
any commercial or nancial relationships that could be construed as a potential
conict of interest.
Publishers Note: All claims expressed in this article are solely those of the authors
and do not necessarily represent those of their afliated organizations, or those of
the publisher, the editors, and the reviewers. Any product that may be evaluated in
this article, or claim that may be made by its manufacturer, is not guaranteed or
endorsed by the publisher.
Copyright © 2022 Pillai, Triantopoulos, Berahas, Brusstar, Sun, Nevius and
Boehman. This is an open-access article distributed under the terms of the
Creative Commons Attribution License (CC BY). The use, distribution or
reproduction in other forums is permitted, provided the original author(s) and
the copyright owner(s) are credited and that the original publication in this journal is
cited, in accordance with accepted academic practice. No use, distribution or
reproduction is permitted which does not comply with these terms.
Frontiers in Mechanical Engineering | www.frontiersin.org March 2022 | Volume 8 | Article 84031020
Pillai et al. Modeling NOx Using Deep Learning
... In light of remarkable advances in deep learning and the evolution of intelligent transportation systems, a multitude of data-driven emission models have emerged [37]- [39]. However, these models are often limited by restricted datasets that do not encompass the diversity of vehicles and scenarios. ...
... However, employing ReLU led to an underestimation of emissions, particularly for N O x , as it frequently predicted low N O x values as zero. The same phenomenon is observed in [37]. As a remedy, a linear activation function is employed for the final layer, preventing the last layer from having negative outputs. ...
... In [31] 2022, a Deep Neural Network (DNN) was employed to predict the NOX output levels from heavy-duty vehicles in two states: engine-out and tailpipe emissions. The proposed model in this study was tested and verified using a dataset obtained through dynamometer measurements. ...
Article
One of the viable solutions for utilizing natural gas in diesel engines involves adopting dual-fuel technology. This study converted the MT440C Compression Ignition (CI) engine into a dual-fuel engine, combining Diesel and Compressed Natural Gas (CNG) as its primary fuel. This investigation aims to assess the impact of natural gas as the primary fuel and diesel fuel as the spark ignition source on a 4-cylinder CI engine to reduce diesel fuel consumption. To address the challenge of accurately predicting Nitrogen Oxide (NOX) emissions from a dual-fuel Compression Ignition Engine (utilizing Diesel and CNG) under transient operating conditions, a NOX reduction prediction model is proposed based on the Deep Convolutional Neural Network (DCNN) architecture. Convolutional Neural Networks (CNNs) represent a more intricate class of neural networks than conventional artificial neural networks. The primary advantage of CNNs, owing to their deep architecture, lies in their capability to discern and extract distinct and heterogeneous features at various levels of abstraction. The experimental data utilized in this study were gathered from the Engine Research Center of Tabriz Motorsazan Company in Iran. Experiments were conducted under stable conditions at engine speeds of 1200, 1400, 1600, 1800, and 2000 revolutions per minute (rpm). Each experimental run was repeated three times to ensure comprehensive data collection. To effectively predict NOX emissions from the dual-fuel engine during training, the proposed DCNN initially extracts features from multidimensional data and subsequently employs these features to establish the relationships between different time steps. The experimental results demonstrate that the proposed DCNN model exhibits superior accuracy, convergence, and robustness in predicting transient NOX emissions from the dual-fuel engine. This is evident through a root mean square error (RMSE) of 21.70 and a fitting coefficient (R2) of 0.997, affirming the high precision and effectiveness of the proposed DCNN model in managing multidimensional time-series datasets.
... By surpassing the challenges of physical modeling, they offer fresh perspectives on the multidimensional aspects of these problems. Rinav et al. [31] used DNN to estimate nitrogen oxide (NOx) emissions in heavy-duty vehicles. The approach used variables from two datasets, an engine dynamometer and a chassis dynamometer, to predict NOx emissions. ...
Article
Full-text available
Machine learning models have been used to precisely forecast emissions from diesel engines, specifically examining the impact of various fuel types (HVO10, HVO 30, HVO40, HVO50) on the accuracy of emission forecasts. The research has revealed that models with different numbers of perceptrons had greater initial error rates, which subsequently reached a stable state after further training. Additionally, the research has revealed that augmenting the proportion of Hydrogenated Vegetable Oil (HVO) resulted in the enhanced precision of emission predictions. The use of visual data representations, such as histograms and scatter plots, yielded significant insights into the model’s versatility across different fuel types. The discovery of these results is vital for enhancing engine performance and fulfilling environmental regulations. This study highlights the capacity of machine learning in monitoring the environment and controlling engines and proposes further investigation into enhancing models and making real-time predictive adjustments. The novelty of the research is based on the determination of the input interface (a sufficient amount of input parameters, including chemical as well as technical), which characterizes the different regimes of the diesel engine. The novelty of the methodology is based on the selection of a suitable ANN type and architecture, which allows us to predict the required parameters for a wide range of input intervals (different types of mixtures consisting of HVO and pure diesel, different loads, different RPMs, etc.).
... Previous research for analysis and estimation emissions in diesel engines such as carbon monoxide, carbon dioxide, nitrogen oxides, hydrocarbon, and particulate matter based on machine learning and AI techniques have proved the power of AI in emissions data investigations [40][41][42]. These studies have motivated the use of artificial intelligence techniques such as ANN, ANFIS, SVM, RF, and Deep Neural Networks (DNN) [43] in analyzing and estimating of emissions for diesel engines. ...
Chapter
Full-text available
The use of artificial intelligence in different parts of human life is becoming inevitable and it is expected that in the near future, the range of artificial intelligence applications will include all service, industrial, research, and educational activities. Providing a solution or performance enhancement in research and industrial activities, considering that these activities contain lots of dependent parameters with formulated/non-formulated correlations, is always a challenge for researchers. The recent approach of researchers in using statistical data and applying techniques based on artificial intelligence is a promising solution that provides the desired answers more quickly and accurately. The automobile industry and internal combustion engines have also benefited from the advantages of artificial intelligence in order to improve their performance and efficiency. Among the most important developments and achievements of this approach, we can mention real-time modeling, optimization and intelligent control, new fuel combinations, fault detection systems, and self-driving vehicles. Therefore, in this chapter, the recent research and industrial achievements of diesel engines due to the use of artificial intelligence techniques will be discussed.
... The figure also shows that all input variables had a noticeable effect on NOx concentration. Compared to the maximum difference of 3.04 ppm from the results of previous studies (Yu et al. 2021;Pillai et al. 2022;Wang et al. 2022), the differences of the present study are much smaller (100 ppm and 17 ppm, respectively). ...
Article
Full-text available
The reduction of various nitrogen oxide (NOx) emissions from diesel engines is an important environmental issue due to their negative impact on air quality and public health. Selective catalytic reduction (SCR) has emerged as an effective technology to mitigate NOx emissions, but predicting the performance of SCR systems remains a challenge due to the complex chemistry involved. In this study, we propose using DNN models to predict NOx emission reductions in SCR systems. Four types of datasets were created; each consisted of five variables as inputs. We evaluated the models using experimental data collected from a diesel engine equipped with an SCR system. Our results indicated that the deep neural network (DNN) model produces precise estimates for exhaust gas temperature, NOx concentration, and De-NOx efficiency. Moreover, inclusion of additional input features, such as engine speed and temperature, improved the prediction accuracy of the DNN model. The mean absolute error (MAE) values for these parameters were 3.1 °C, 3.04 ppm, and 3.65%, respectively. Furthermore, the R-squared coefficient of determination values for the estimates were 0.912, 0.983, and 0.905, respectively. Overall, this study demonstrates the potential of using DNNs to accurately predict NOx emissions from diesel engines and provides insights into the impact of input features on the performance of the model.
Article
Diesel NO X emission control is the main control content of diesel engines. Real-time high-precision NO X emission prediction model is the basic guarantee for (1) Diesel combustion management, (2) SCR after-treatment control, (3) OBD system. Existing models that rely exclusively on physical processes or data-driven artificial intelligence models fail to meet real-time requirements, while MAP look-up table-based models are poorly adapted and require extensive experiments for calibration. In this study, a new semi-empirical model for NO X emission prediction was developed, and the variables of the model were identified as exhaust temperature, engine speed, injected fuel mass flow rate, air intake mass flow rate, and exhaust oxygen mass flow rate, taking into account the mechanism of NO X generation in diesel engines as well as the requirement of real-time and direct measurability of the variables for on-line control. The influence of each variable on NO X was analyzed, the model structure of the NO X prediction model was determined, and the parameters of the prediction model were identified by least squares. The model is applied to real operating vehicles for validation and compared with a previous semiempirical model taken from the literature, and the results show that the overall accuracy of the proposed model is improved. Finally, the effect of variable uncertainty on the proposed model was assessed and it was found that the relative uncertainty in predicting NO X emissions ranged between 0 and 8% for sensor accuracies of [Formula: see text] and [Formula: see text] for exhaust temperature and intake air mass flow, respectively, suggesting that the sensor accuracy has a significant effect on the uncertainty of the model.
Conference Paper
div class="section abstract"> As global regulations on automotive tailpipe emissions become increasingly stringent, developing precise tailpipe emissions models has garnered significant attention to fulfill onboard monitoring requirements without some drawbacks associated with traditional sensor-based systems. Within the European Union, there is consideration of mandating real-time measurement of emission constituents to enable driver warnings in cases where constituent standards are exceeded. Presently, available technology renders this approach cost-prohibitive and technologically challenging, with most sensor suppliers either unable to meet the demand or unwilling to justify the development costs associated with sensor commercialization. Efforts to circumvent the sensor-based approach through first principle models, incorporating thermokinetics, have proven to be both computationally expensive and lacking in accuracy during transient operations. We propose a data-driven solution based on DL (deep learning) to create virtual sensors capable of accurately estimating instantaneous emissions comparable to fast gas analyzers as an alternative to these approaches. To construct such DL models, a highly accurate dataset is essential for training, validation, and testing. This level of precision was achieved by utilizing a PEMS (portable emissions measurement system) to analyze real-world exhaust stream constituents, complemented by the logging of critical powertrain variables. The data recorded by the PEMS comprises a comprehensive inventory of THC (total hydrocarbons), CO (carbon monoxide), and NO (nitrogen oxide) concentrations in the tailpipe, correlated with engine speed, air intake charge, ignition timing, catalyst temperatures, and other key powertrain signals. During the collection of emissions and powertrain characteristics, test vehicles were driven over diverse city and highway routes, encompassing various ambient conditions, to create an extensive dataset conducive to training. The generated datasets exclude cold-start events, which are subject to rigorous scrutiny in vehicle certification efforts. Furthermore, the model relies on the closed-loop operation of the fuel control system, which is often not the case during cold start conditions. The trained networks exhibit good accuracy, as R2 and error metrics demonstrate. The resulting data-driven model can be integrated into production vehicles as an independent virtual measuring module or with OBM (onboard monitoring). </div
Article
Full-text available
Accurate and stable prediction of NOx emissions from diesel vehicles plays a crucial role in the establishment of virtual NOx sensors and the development and design of diesel engines. This paper presents a method for estimating transient NOx emissions by complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) and a long- and short-term memory neural network (LSTM). First, the CEEMDAN algorithm is used to reduce the non-stationarity and volatility of the transient NOx emission data to obtain multiple subseries with different frequencies. Secondly, a predictive model is developed for each subsequence using an LSTM neural network. Finally, the results of each subsequence prediction are summed to obtain the final prediction. The proposed model uses NOx emission data generated by an EU IV diesel bus during real road driving. The results show that (1) The use of CEEMDAN can effectively improve the smoothness of NOx transient emission data, as well as facilitate more effective extraction of internal characteristics and variations of the raw data. (2) LSTM has better learning and prediction capability for transient changes in NOx emissions. (3) The results of CEEMDAN-LSTM for RMSE, R2, MAE and NRMSE are 46.11,0.98, 29.82 and 2.71, respectively, which are better than the other model with improved prediction performance.
Conference Paper
Full-text available
Several studies in literature have shown how real-world emissions strongly depend on driving condition, driving style, ambient temperature and humidity, etc. so that they are significantly different from the values measured on test benches over standard driving cycles. This concern, together with the so-called Diesel-gate, has caused the introduction in Europe of an innovative procedure for the registration of vehicle based on real driving emissions (RDE) measured with a portable emission measurement system (PEMS). PEMS devices are bulky and very expensive, therefore they cannot be extensively for an actual real time monitoring of emissions. To solve this problem, the present work proposes a Neural Network model based on the interpolation of the time-histories of driving conditions (speed, altitude, ambient temperature, humidity and pressure) and emissions measured on a diesel start-and-stop vehicle while performing a series of RDE tests. Two different approaches are proposed. The first one calculates the emissions on the basis of the vehicle motion (speed and altitude profile, ambient conditions). The second one models the engine block using as input the ambient conditions, the load and the rpm of the engine as derived from the OBD-II scanner. The output of both models are the flow rates and cumulated values of CO2 and NOx. Note that the inputs of the two models are signal that can easily obtained on-board without additional sensors.
Article
Full-text available
Purpose of Review This review explores the transportation implications of key scenarios for deep decarbonization. While electrification of transportation is not the only action that will be needed to reach climate-change goals, it is one necessary step. This review provides an overview of technologies available for electrification of light-, medium-, and heavy-duty vehicles, the infrastructure required to support electrification, and policy options to accelerate the transition to a cleaner transportation future. Recent Findings Climate studies have shown that as greenhouse gas (GHG) emissions continue rising, aggressive action is needed to achieve deep decarbonization in the USA and worldwide. Transportation is the largest source of GHG emissions in the USA, and one of the largest worldwide. Experts agree that electrifying road transportation is essential to avoiding the worst impacts of climate change. Recent advances in electric vehicle (EV) technology are making electrification viable. Automakers are quickly increasing the number and diversity of commercially available EV models, and consumer awareness of EV technology is slowly growing. Studies show that policymakers can effectively accelerate transportation electrification through tax incentives, regulations, and other policy tools. Summary All credible pathways for achieving deep decarbonization rely on complete or near-complete electrification of passenger vehicles. Advances in EV technology have made this goal feasible from a technical standpoint, though much more needs to be done to achieve rapid electrification in practice. Prospects for electrifying medium- and heavy-duty vehicles are improving, but the high mileage and heavy load requirements of these vehicles present technical challenges. Moreover, achieving widespread electrification of road transportation will require considerable expansion of EV infrastructure (e.g., charging networks), as well as upgrades to existing electric grids. Policymakers can support rapid electrification through actions including tax rebates and other incentives that encourage consumers to purchase EVs, regulations that prioritize electrification of vehicles serving transportation network companies (TNCs), and increased public investment in EV infrastructure and research.
Article
The process used by engine manufacturers for the development of a new engine includes the planning and conceptual design phases, followed by the detailed design phase, in which the design target specifications are met. In the conceptual design phase, a rough specification of the target engine is presented to facilitate a detailed design and the additional cost of modification is reduced exponentially. In the conceptual design phase, however, not only is there no real engine. but there are also no 1D and 3D models present, so it is impossible to test and simulate them. Therefore, at this stage, a model that can predict emission and performance only according to the specifications and operating conditions of the engine would be very useful. Previous studies developed an EGR prediction model that can be used in the 0-D NOx prediction using a deep learning method. In this study, a NOx prediction model with high accuracy using only the operating conditions as input variables, without ECU data, was developed using deep neural networks. The developed model has high accuracy with an R-square of 0.988. The feature of this model is that all the input parameters for the deep neural network come from the operating conditions of the engine. Therefore, this model can be used in the early stages of the development of new engines when testing and simulation cannot be performed because they do not exist. The designer can set the range of the operating conditions such that they do not exceed the NOx limits at the specific operating point (specific rpm and BMEP). This variable operating design methodology is expected to be useful in the development of new engines for automobile manufacturers with various engine data.
Article
Deep learning as represented by the artificial deep neural networks (DNNs) has achieved great success recently in many important areas that deal with text, images, videos, graphs, and so on. However, the black-box nature of DNNs has become one of the primary obstacles for their wide adoption in mission-critical applications such as medical diagnosis and therapy. Because of the huge potentials of deep learning, increasing the interpretability of deep neural networks has recently attracted much research attention. In this paper, we propose a simple but comprehensive taxonomy for interpretability, systematically review recent studies in improving interpretability of neural networks, describe applications of interpretability in medicine, and discuss possible future research directions of interpretability, such as in relation to fuzzy logic and brain science.
Article
Low emission vehicle technologies need widespread adoption in the transport sector to overcome its significant decarbonisation challenges. Hybrid Electric Vehicles (HEVs) represent an intermediate technology between pure electric vehicles and internal combustion engines that have proven capability in reducing petroleum consumption. HEV customers often cite improved fuel economy as a major benefit from adopting this technology; however, outstanding questions remain regarding their respective emission levels. Through an extensive literature study, we show that several issues remain with HEV emissions performance which stem from frequent high-power cold starts, engine calibration issues and inefficient operating conditions for catalytic converters. HEVs have more NOx, HC, CO and particle number emissions compared to conventional vehicles by up to 21.0, 5.8, 9.0 and 23.3 times, respectively. Improved engine control algorithms, after-treatment design and thermal design of three-way catalysts emerge as research priorities for improving the emissions performance of HEVs.
Article
Owing to increasing interest in the environment, particularly on air quality, regulations in the automobile industry have become stricter. Test cycles have been substituted to simulate real driving conditions, and they offer opportunities for researchers to satisfy regulations and predict emissions using models. The objective of this study is to develop a deep neural network (DNN) model, optimize its hyperparameters using the Bayesian optimization method, and use hidden-node determination logic to predict engine-out NOx emissions by using the worldwide harmonized light vehicles test procedure (WLTP) of diesel engines. A DNN network learns the internal relationships between inputs and target outputs even though they are complicated. However, the hyperparameters of DNNs are typically determined by researchers before training, and they affected the accuracy of the model. In this study, the hyperparameters of the DNN model such as the number of hidden layers, number of nodes in each hidden layer, learning rate, learning rate decay, and batch size are automatically optimized using the Bayesian optimization method. Some logical equations are combined with the number of nodes in the first hidden layer and the number of hidden layers to realize the model’s structure instead of using the number of hidden nodes in each hidden layer. Compared with grid search and random sampling, the Bayesian optimization method is a promising solution to optimize hyperparameters. In addition, a hidden-node determination logic further improved the accuracy of the model. The accuracy of the optimized model is indicated by an R² value of 0.9675 with 14 input features. The result of cycle prediction shows that the mean absolute errors are approximately 16–17 ppm for four WLTP cycles, which are 1.6% of the maximum NOx value. These results indicate that the accuracy of the model is comparable to that of a physical NOx measurement device whose linearity is 1% of the full scale (5,000 ppm).
Conference Paper
div class="section abstract"> The automotive industry makes extensive use of virtual models to increase efficiency during the development stage. The complexity of such virtual models increases with the complexity of the process that they describe, and thus new methods for their development are constantly evaluated. Among many others, data-driven techniques and machine learning offer promising results, creating deep neural networks that map complex input-output relations. This work aims at comparing the performance of two different neural network architectures for the estimation of the engine state and emissions (flow fuel, NOx and soot). More specifically, Convolutional Neural Network (CNN) and Long-Short Term Memory (LSTM) will be evaluated in terms of performance, using different techniques to increase the model generalization. During the learning stage data from different engine cycles are fed to the neural networks. In order to evaluate the generalization of the model, the networks are tested over new, previously unseen, engine cycles. Results show that our models over-perform other state-of-the-art models, the best performance was found for the LSTM model with 2.40%, 2.80% and 18.19% error for flow fuel, NOx and soot sensor respectively. </div