PreprintPDF Available
Preprints and early-stage research may not have been peer reviewed yet.

Abstract and Figures

Complex optimal design and control processes often require repeated evaluations of expensive objective functions and consist of large design spaces. Data-driven surrogates such as neural networks and Gaussian processes provide an attractive alternative to simulations and are utilized frequently to represent these objective functions in optimization. However, pure data-driven models, due to a lack of adherence to basic physics laws and constraints, are often poor at generalizing and extrapolating. This is particularly the case, when training occurs over sparse high-fidelity datasets. A class of Physics-infused machine learning (PIML) models integrate ML models with low-fidelity partial physics models to improve generalization performance while retaining computational efficiency. This paper presents two potential approaches for Physics infused modelling of aircraft aerodynamics which incorporate Artificial Neural Networks with a low-fidelity Vortex Lattice Method model with blown wing effects (BLOFI) to improve prediction performance while also keeping the computational cost tractable. This paper also develops an end-to-end auto differentiable open-source framework that enables efficient training of such hybrid models. These two PIML modelling approaches are then used to predict the aerodynamic coefficients of a 6 rotor eVTOL aircraft given its control parameters and flight conditions. The models are trained on a sparse high-fidelity dataset generated using a CHARM model. The trained models are then compared against the vanilla low-fidelity model and a standard pure data-driven ANN. Our results show that one of the proposed architectures outperforms all the other models at a nominal increase in run time. These results are promising and pave way for PIML frameworks which can generalize over different aircraft and configurations thereby significantly reducing costs of design and control.
Content may be subject to copyright.
Physics-Infused Machine Learning Based Prediction
of VTOL Aerodynamics with Sparse Datasets
Manaswin Oddiraju , Divyang Amin , Michael Piedmonte, Souma Chowdhury
Abstract
Complex optimal design and control processes often require repeated evaluations of expensive objective functions and consist
of large design spaces. Data-driven surrogate models such as neural networks and Gaussian processes provide an attractive
alternative to expensive simulations and are utilized frequently to represent these objective functions in optimization. However,
pure data-driven models, due to a lack of adherence to basic physics laws and constraints, are often poor at generalizing
and extrapolating. This is particularly the case, when training occurs over sparse high-fidelity datasets. A class of Physics-
infused machine learning (PIML) models integrate ML models with low-fidelity partial physics models to improve generalization
performance while retaining computational efficiency. This paper presents two potential approaches for Physics infused modelling
of aircraft aerodynamics which incorporate Artificial Neural Networks with a low-fidelity Vortex Lattice Method model with blown
wing effects (BLOFI) to improve prediction performance while also keeping the computational cost tractable. This paper also
develops an end-to-end auto differentiable open-source framework that enables efficient training of such hybrid models. These two
PIML modelling approaches are then used to predict the aerodynamic coefficients of a 6 rotor eVTOL aircraft given its control
parameters and flight conditions. The models are trained on a sparse high-fidelity dataset generated using a CHARM model. The
trained models are then compared against the vanilla low-fidelity model and a standard pure data-driven ANN. Our results show
that one of the proposed architecture outperforms all the other models and at a nominal increase in execution time. These results
are promising and pave way to PIML frameworks which are able generalize over different aircraft and configurations thereby
significantly reducing costs of design and control.
I. INTRODUCTION
Real world optimal design and control problems consist of optimizations with large design spaces and computationally
expensive objective functions and therefore require efficient and accurate models of complex physical systems. There are
different proposed approaches to render these problems computationally feasible, such as the use of surrogate models, reduced
order modelling methods and multi-fidelity optimization techniques. The use of data-driven surrogate models for optimization
[1] is a popular approach to reduce the computational burden and render the process tractable. However, using pure data
driven models poses additional challenges in the form of poor explainability and usually these models require large amounts of
data either from expensive high-fidelity analysis or experiments in order to achieve sufficient accuracy. For many engineering
design problems however fast running physics-based models are available, but they usually trade-off accuracy for performance.
Therefore, in this paper, we propose two Physics Informed Machine Learning (PIML) models which combine data driven
Artificial Neural Networks (ANNs) with low-fidelity physics models and augment the performance of the physics models
while only needing a sparse high-fidelity dataset. We also develop an end-to-end auto-differentiable physics framework to
enable efficient training of such hybrid architectures and apply them to model the aerodynamics of a 6 rotor eVTOL aircraft.
The rest of this section briefly covers existing techniques for augmenting low-fidelity physics models using high-fidelity data,
physics informed machine learning architectures before stating our research objectives.
Conventional methods for enhancing low-fidelity computational models with high-fidelity or experimental data encompass
techniques like Bayesian Inference [2], [3], Gradient-Based and Gradient-Free Optimization [4], [5], among others. The primary
goal of these techniques is to determine the optimal parameters of the low-fidelity model in order to improve its overall
performance. However, due to the complexity of the physical phenomena often modeled by these techniques, identifying a
static set of parameters for a dataset might be inadequate and may not fully capitalize on the potential accuracy improvements
offered by a sparse high-fidelity dataset. On the other hand, data-driven models, due to their near-instantaneous run times,
are being utilized frequently for speculating the behavior of complex systems in domains such as mechanical systems [6],
[7], robotics [8]–[10], and energy forecasting [11]. Models such as Neural Networks, Gaussian Processes etc. are showing
competitive prediction accuracy on some problems in this domain. However, they underperform at extrapolating and generalizing
[12], [13], especially when trained with small or sparse data sets [14]. This can be attributed to a lack of adherence to basic
laws of physics. Additionally, they also exhibit challenging-to-interpret black-box behavior and sensitivity to noisy data [15].
In lieu of these problems, significant research efforts are being directed towards ”hybrid models” or ”physics-infused” models.
These models generally integrate data-driven and low-fidelity physics models in order to make predictions computationally
feasible and accurate.
There are different classes of physics-infused neural networks; although almost all of them make use of a data-driven model
and a physics model, they differ in the architectures. Hybrid ML architectures can broadly be classified into serial[16]–[19]
and parallel [20]–[22] architectures. Serial architectures typically have the data-driven models set in sequence with the partial
physics model or used to tune the partial physics model parameters, while parallel architectures usually contain additive or
This work was presented at the AIAA Aviation 2023 Forum.
Copyright @ Souma Chowdhury
1
arXiv:2307.03286v1 [cs.CE] 6 Jul 2023
multiplicative ensembles of partial physics and data-driven ML models [23]. Several such hybrid PIML architectures have
been reported in the literature in the past few years [24]–[36], spanning over a wide range of applications such as in modeling
dynamic systems, cyber-physical systems, robotic systems, flow systems and materials behavior, among others. The OPTMA
model [37], is a physics-infused machine learning model which combines an artificial neural network with a partial physics
model in order to make predictions. Earlier applications of this framework on acoustics problems have shown that it generalizes
well and can even extrapolate successfully. This architecture primarily works by using the data-driven model -in this case the
transfer network; to map the original inputs to the inputs of the partial physics model so as to make its output match that of the
high-fidelity model. Hence, the training process is just the transfer network learning this mapping. And since the intermediate
parameters are open and interpretable to domain experts (as they are inputs to partial physics models), the OPTMA model is
less of a black box and more understandable when compared to pure data-driven models.
In all of these sequential hybrid-ML models however, the presence of an external partial physics model increases the
complexity and cost of the training process. Previously[38], this hurdle was overcome by programming custom loss functions
which include the partial physics in PyTorch[39] to enable backpropagation. However, this approach may not be feasible in
all domains as PyTorch is not optimal for general purpose scientific computing (especially numerical methods). Therefore, in
our goal to make OPTMA more user-friendly and computationally tractable, we use Google JAX [40] to auto-differentiate
the partial physics model. JAX is capable of forward and backward mode auto-differentiation and works on codes written in
python and is also easy to integrate with the transfer network written in PyTorch. This is a much more generalizable and
scalable approach to creating computationally efficient PIML frameworks.
During the process of aircraft design, and eventually the aircraft controller design, engineers often encounter the earlier stated
predicament of selecting appropriate methods to model the aircraft. They may need to balance the benefits and drawbacks of
using a method that is highly accurate, albeit costly and time-consuming, against a faster and more economical approach that
might not provide the same level of precision.The complexity of this problem is magnified when engineers are tasked with
designing more challenging-to-control aircraft than traditional fixed-wing varieties. A prime example is the Vertical Takeoff &
Landing (VTOL) aircraft. As the aircraft designs become more complex, the modeling demands can escalate exponentially. In
the case of VTOL, it is essential to model the aircraft dynamics across all phases: from hover to transition, and finally into
cruise. Such aircraft also exhibit more intricate aerodynamics, including phenomena like blown-wing effects. There is typically
a greater number of degrees of freedom, which further escalates the complexity of designing robust controllers for VTOL
aircraft. Given the numerous benefits and strengths of physics-infused modeling, and the necessity for fast accurate models
for aircraft design and control, it is evident that PIML models are particularly well-suited for modeling the aerodynamics of
an aircraft.
Therefore, in this paper, we propose two physics informed modelling approaches to augment a low-fidelity VLM model
with a sparse high-fidelity dataset. One of the approaches, called PIML-A is an extension of the OPTMA architecture. The
other architecture, named PIML-B is a comparatively simpler ensemble model that learns the error in the low-fidelity physics
models. The architectures are aimed at improving the accuracy of the low-fidelity physics models while only using a sparse
high-fidelity dataset to keep training costs low. Our research objectives are as follows:
1) Create a Vortex Lattice Method (VLM) model with auto-differentiation capabilities.
2) Develop different PIML architectures seeking to provide better accuracy / cost trade-offs than pure Low-Fidelity and
pure data-driven models.
3) Apply the above architectures to model the control inputs to force and moment coefficients of a eVTOL aircraft.
4) Compare the performance and computational cost of the PIML models with the baseline low-fidelity and data-driven
models.
The remainder of this paper is laid out as follows: SectionII contains a description of the two PIML models framework and
their components in detail. SectionIII explains our case study and the models that we are using before moving on to SectionIV
which contains our results and discussion. Finally, we state our conclusions in SectionV.
II. PHYSICS IN FU SE D MACH IN E LEARNING ARCHITECTURES
Fig.1 shows the two proposed PIML frameworks as well as the vanilla low-fidelity physics model (L). In this paper, all the
models shown are used to predict the airframe force and moment coefficients for a given set of control and environmental
parameters. The PIML models amalgamate low-fidelity physics models with Artificial Neural Networks (ANNs) to enhance
predictions, yet they vary based on the specific role of the ANNs. As depicted in the framework diagram 1, the low-fidelity
model comprises of a propeller solver and the Vortex Lattice Method (VLM) model. The prop solver computes induced
velocities which are then fed into the VLM, which in turn calculates the force and moment coefficients on the airframe.
The PIML-A model incorporates ANN layers to achieve three primary objectives: 1) To supersede the prop solver and
directly estimate the axial and tangential induced velocities of the propellers, 2) to modify the flight conditions in order to
enhance the predictions of the VLM model, and 3) to adjust the output of the VLM to reduce discrepancies with respect to the
high-fidelity data. Given that all layers are trained concurrently, the PIML-A model learns the mapping from flight conditions
2
Induced
Velocies
VLM Model
Prop Solver
Inputs
Inputs
Low-Fidelity Outputs
Low-Fidelity Model
(a)
PIML -A
Transfer Layers
VLM Model
PIML Predicted Output
Transfer Parameters Correcon Layers
Inputs
(b)
Prop Solver
Inputs
PIML -B
Induced
Velocies
VLM Model
PIML Predicted Output
Correcon Layers
Inputs
Predicted Correcon
of Induced Velocies
(c)
ANN Predicted Output
Inputs
Pure Ar�ficial Neural Network
(d)
Fig. 1: Framework Diagrams of a) Low-Fidelity Physics b) PIML-A c) PIML-B and d) Purely Data-Driven ANN
and control inputs to the transfer parameters. Simultaneously, it learns the output errors of the VLM and adjusts them to align
more closely with the high-fidelity data. The reasoning behind this model is that part of the errors in the VLM model can be
mitigated by shifting the inputs into different domains, and the errors that remain, not resolved by this input modification, will
be addressed by the correction layers on the output side.
In contrast, the PIML-B model does not adjust any inputs to the original low-fidelity model and retains the prop solver in its
structure. Within this design, the sole objective is to identify the error in the induced velocities computed by the prop solver
and rectify these parameters. This model is more closely aligned with the low-fidelity physics model and aims to enhance
modeling accuracy by refining the outputs of the prop solver.
The following subsection focuses on the training processes of these PIML models, specifically the computation of gradients
and the implementation of backpropagation through the low-fidelity physics models.
3
A. PIML Model Training
The training of neural networks (i.e optimizing the weights of the neural network to minimize loss) is performed using
gradient descent and therefore requires the gradient of the loss function(L) w.r.t to the weights(W). The following equations
show the integration of backpropagation gradients through an auto-differentiable physics model. This feature enables the training
of our PIML architectures using standard ANN training techniques and using standard ANN software libraries.
L=YiY
i2(MSE Loss)
L=1
n
n
i=1
(YiY
i)2
∂L
∂W =1
n
n
i=0
2 (Y
iYi)∂Y
i
∂W
Where Yi

Ground Truth
=ϕ(Xi),Y
i

Prediction
=G(Xi, W )
(1)
Grepresents the sequential PIML model and ϕrepresents the high-fidelity physics model.The terms Xi,Yi,Y
irepresent the
input, high-fidelity output and the OPTMA prediction of the ith training sample respectively.
From the definition of Y
i:
Y
i=G(Xi, W )
∂Y
i
∂W = G (Xi, W )
∂W
As Gis the OPTMA model, we have:
G(Xi, W ) = Φ (S(Xi, W ))
=G(Xi, W )
∂W =Φ (S(Xi, W ))
∂W =Φ(U)
∂U

Partial Physics
Auto-Differentiation
×S(Xi, W )
∂W

Backpropagation
(2)
Here, Sand Φrepresent the neural network and the partial physics models respectively. Urepresents the intermediate parameter
vector which is the output of the transfer network and input to the partial physics model.
III. CAS E STU DY: MODELLING AERODYNAM IC S OF A TILT-ROTOR E VTOL AIRCRAFT
The Aerobeacon, shown in fig.2, is a 6 rotor aircraft with 4 hover rotors in a quad configuration and two tilt-rotors at the
edges of the wings. The aircraft has no rudder and instead makes use of differential thrust between the tilt rotors for yaw
control. The only movable control surface on the aircraft apart from the tiltrotor is an elevator. Since this is our initial attempt
at modelling this aircraft, we chose to fix the hover rotor RPMs at 5000 and only vary the other control inputs listed in Tale.I.
Table I: Aircraft Control Parameter Bounds
Parameters Bounds
Speed (v) [0 m/s ,45 m/s]
Angle of Attack (α) [-15, 15]
Starboard Propeller
RPM (ωstar)
[4000 , 10000]
Port Propeller RPM
(ωport)
[4000 , 10000]
Starboard Propeller
Angle (θstar)
[0, 110]
Port Propeller Angle
(θport)
[0, 110]
Elevator Deflection
(θelev)
[-15, 15]
Fig. 2: Aerobeacon aircraft design configuration from Flighthouse
Engineering
4
A. Modelling Aircraft Flight Dynamics
Table II displays the inputs and outputs of all the models and also the transfer parameters corresponding to the PIML-A
model. The transfer parameters are specifically selected such that they have an impact on the output accuracy of the model, as
well as to allow greater flexibility to the embedded ANN layers to affect the final outputs. In both the proposed architectures,
we try to improve the low-fidelity model by either augmenting or replacing the prop solver. The PIML-A model in particular
also tries to alter the outputs of the VLM model to improve accuracy. This model is an extension of the OPTMA architecture
discussed in the Introduction.
Table II: Modeling Parameters
Inputs (All Models) Transfer Parameters (PIML-A Model) Outputs (All Models)
Speed (v)
Angle of Attack (α)
Starboard Propeller RPM
(ωstar)
Port Propeller RPM (ωport)
Starboard Propeller Angle
(θstar)
Port Propeller Angle (θport)
Elevator Deflection (θelev)
Speed(v)
Angle of Attack(α)
Elevator Deflection(θ
elev)
va
star
vt
star
va
port
vt
port
va
hover
vt
hover
CL
CD
Cl
Cm
B. Low Fidelity Model
The lower fidelity flight dynamics model uses Bechamo LLC’s developed VLM plus propeller effects low-fidelity tool,
BLOFI. This tool was developed initially to model the NASA Tiltwing aircraft and was validated against the X-57 data [41].
The validation plots can be seen in fig.9. This was done in order to prove that we are able to capture the blown wing effects.
BLOFI was created by using an improvised VLM that accounts for some blown wing effects caused by the propellers upstream
of the the lifting surfaces. A base VLM [42] is implemented and is modified by accounting for induced velocities caused by
propellers’ wash upstream of the control points on the lifting surfaces. The properties for each propeller are calculated using
Blade Element Momentum Theory [43]. It should also be noted that propeller slipstream contraction [44] is accounted for
when including the induced velocities from the propellers in the VLM setup.
A set of flight conditions is created by making a grid of every combination of freestream velocity, sideslip and angle of
attack. At each of these flight conditions the aerodynamic forces and moments, including the propeller thrusts and torques,
are calculated, using the aforementioned VLM implementation. This is done for every expected RPM setting, propeller tilt
angle and tail deflection. The combination of aerodynamic forces and moments and propeller thrusts and torques is used to
determine the overall forces and moments acting on the aircraft.
CHARM[45] is a commercial tool developed by Continuum Dynamics, Inc. for simulating the aerodynamics and dynamics
of rotorcraft using partical methods. Flighthouse Engineering LLC. provided their Aerobeacon aircraft design, parameters,
and constructed the CHARM model. They ran CHARM test cases for a collection of randomly generated input samples. The
data set derived from the CHARM model serves as the high-fidelity data for the research presented in this paper. The VLM
configuration is shown in fig.3 and the CHARM higher fidelity output visualization is shown in fig.4.
C. Sparse High-Fidelity Dataset
In order to generate the high fidelity dataset, we used Latin Hypercube Sampling and created a dataset of 100 points across
the parameters and bounds listed in Tab.I. These 100 samples were then input to the CHARM high fidelity model and the
dataset was generated. Out of the 100 initial samples, only 87 were usable as 11 cases failed to converge on the CHARM
model and the remaining 2 cases showcased abnormally high values of the force and moment coefficients and were removed
from the dataset. While generating the dataset, we fixed the RPMs of the hover quad rotors to 5000 RPM. This final dataset of
87 samples was then randomly divided into a training set containing 70 samples and a validation set containing 17 samples.
All the performance results shown in this paper are run on this validation dataset consisting of samples completely unseen by
the model.
D. Prior Analysis of BLOFI Performance on Other Data-Sets
BLOFI was developed at Bechamo LLC and in order to prove its usefulness in comparison to other available VLM solvers
that do not account for blown wing effects. It has been previously validated using CFD data[41] generated by NASA for the
X-57. Just as a flat plate model was created for the Aerobeacon with the appropriate propellers for the developments of this
paper, a flat plate model with the corresponding propellers was earlier created for the X-57 as part of the validation study
5
Table III: Comparison of PIML, Low-Fidelity and Pure ANN Models
PIML-A PIML-B Low-Fidelity ANN
Execution Time (s) 0.32 0.53 0.338 0.0002
Training Time (min) 23.46 23.18 - 0.04
#Hidden Layers 6 6 - 4
#Nodes per Layer 200 200 - 150
on BLOFI. In figure9 we are able to see that BLOFI can capture significant blown-wing effects, by comparing the power-on
and power-off plots. The increase in CLand CDat every angle of attack can be observed. More importantly, the inclusion
of induced velocities in the VLM solve also lets us observe the increased flap effectiveness. Errors are between 5%and 10%
for the range of angles of attack for which this validation study was conducted. It is expected that with a PIML approach,
not only can we close down this gap (error), but more readily extend the low/medium fidelity solver such as BLOFI to work
on a wider variety of aircraft configurations with little to no manual tuning and manual model development. To support his
premise, the next section presents the results of our PIML architectures with the example of Aerobeacon configuration, where
we had more control over the high-fidelity sampling enabling ease of PIML training for this paper.
IV. RES ULT S AN D DISCUSSION
A. Baselines For Comparison
For testing the performance of the two proposed PIML models, we compared them against the vanilla low-fidelity model(L)
shown in fig.1a and a pure data-driven ANN (A). Table III lists the parameters of the stand-alone ANN and the ANNs used as
part of the PIML frameworks. All the data-driven and PIML models were trained on the same training dataset of 70 high-fidelity
samples and validated on a validation dataset consisting of 17 samples. As the training data was sparse, a pure ANN with
similar size of the ANNs used for the PIML models tended to overfit very quickly. Therefore, the size of the pure data-driven
ANN is lower as compared to the ANNs used in the PIML models.
B. Modelling Performance
Figure.5 shows the convergence history for both the PIML models. From the figure we see that PIML-A has a much smoother
training history as compared to the PIML-B model. The PIML-B model seems to be sensitive to corrections in induced velocity.
Further training with a lower learning rate or decaying the learning rate may resolve this issue and improve performance of
the PIML-B model. It may also be the case that the VLM is very sensitive to the induced velocity inputs or that there is an
unknown bias in the validation dataset. This could’ve happened due to the random splitting of data into train and test samples.
Further work with varying test and train datasets similar to K-fold cross-validation are necessary to understand if there’s any
data bias.
Figure.6 shows the prediction error of the models in all of the 4 outputs. From the figure, we see that the PIML-A model
performs the best. The pure data-driven ANN and the PIML-B model exhibit similar performance. Relative to the amount
Fig. 3: Mesh model of the Aerobeacon aircraft. The
propellers and their direction of rotation are represented
by directed circles. The two rotors positioned at the wing
extremities are capable of tilting.
Fig. 4: CHARM visualized output of particle method to
obtain a higher-fidelity model at discrete points
6
0 20 40 60 80 100 120 140
0
0.05
0.1
0.15
0.2
Train Loss
Test Loss
Epoch
MSE Loss
(a) PIML-A
0 20 40 60 80 100 120 140
0
0.05
0.1
0.15
0.2
Train Loss
Test Loss
Epoch
MSE Loss
(b) PIML-B
Fig. 5: Convergence history of PIML Models
M
o
d
e
l
(a) Prediction Error in CL
Low_Fidelity PIML A PIML B ANN
0
0.2
0.4
0.6
0.8
1
M
o
d
e
l
RMSE (Normalized)
(b) Prediction Error in CD
Low_Fidelity PIML A PIML B ANN
0
0.2
0.4
0.6
0.8
1
M
o
d
e
l
RMSE (Normalized)
(c) Prediction Error in Cl
M
o
d
e
l
(d) Prediction Error in Cm
Fig. 6: Prediction Error Compared to High Fidelity Data
of data available, the ANN performs very well and can prove to be a possible alternative in cases with a larger sample set
and no additional requirements on interpretability. he fact that the high-fidelity samples were generated using Latin hypercube
sampling, ensuring an even domain coverage, could also be a contributing factor to the ANN’s strong performance. Among
all the modeled outputs, PIML-A appears to excel in predicting CD. This could be attributed to the transfer parameters having
a greater impact on CDas compared to the other outputs. Where things begin to get interesting is when we see the effect of
the learnt output correction layers in PIML-A as shown in fig.7. From the figure, it is easily seen that the output layers of
the VLM model have a very strong negative correction on the CLthat the VLM outputs. As we also see from fig.6, we see
that the VLM model has the highest prediction error in CL. Combined, these two plots convey that the transfer parameters
7
C_L C_D C_l C_m
−1.5
−1
−0.5
0
0.5
1
Model
Normalized Correction to VLM Outputs
Fig. 7: Correction of VLM Outputs by the Correction Layers
in PIML-A
PIML-B
−0.6
−0.4
−0.2
0
0.2
0.4
0.6
Model
Normalized Correction to Induced Velocities
Fig. 8: Correction of Propeller Induced Velocities by the
Correction Layers in PIML-B
do not influence CLas much as they do the other outputs or that just shifting the inputs is not sufficient to improve the CL
predictions of the VLM model. The correction for other outputs predicted by the PIML model is evenly distributed between
positive and negative and this seems to be reasonable given the performance of the vanilla low-fidelity models on all these
outputs is about the same.
The PIML-B model on the other hand performs on par with the low-fidelity VLM model. This outcome might be explained
by an earlier hypothesis suggesting that if the induced velocities indeed have a minor impact on CL, then regardless of what
the correction network learns, it won’t be able to enhance the predictions for CL. However, since the training loss is calculated
across all outputs, the overall loss remains substantial, causing the network to struggle in learning the transfer mapping necessary
to improve predictions on the other inputs. It may be the case that when different models are trained to learn the various outputs
of this modelling problem we might see improved performance by the PIML-B architecture on outputs other than CL, but as
that is not feasible especially once the number of outputs gets large enough to support design / control frameworks, PIML-A
model might be better suited for this problem.
V. CONCLUSION
The paper presents two distinct architectures of physics-informed models, referred to as PIML-A and PIML-B. These models
integrate Artificial Neural Networks with a low-fidelity BLOFI model with the aim of enhancing the prediction performance
of the aerodynamic coefficients of a six-rotor eVTOL aircraft. To facilitate the training of both models, an end-to-end auto-
differentiable physics framework was established using Google JAX. The models were then trained using a sparse dataset
derived from a high-fidelity CHARM model. When comparing prediction performance, it was found that the PIML-A model
achieved a higher degree of accuracy in comparison to the PIML-B model, the basic low-fidelity model, and a baseline purely
data-driven model. A comprehensive analysis of the results demonstrated that the PIML-A model’s method of simultaneous
input-shifting and output-correction yielded superior results, given that one of the inputs might not be highly sensitive to the
chosen transfer parameters. To conclusively determine the optimal architecture for this modeling problem, additional testing
using cross-validation and hyperparameter optimization is needed. However, the initial results indicate that the PIML-A model
holds significant promise. Looking forward, the BLOFI low-fidelity model, when augmented with PIML, could potentially be
employed to generalize across diverse aircraft designs and configurations, thereby proving to be a valuable tool in the domains
of aircraft design and control.
ACK NOW LE DG EM EN TS
This material is based upon work funded by Bechamo LLC’s (with sub-contract to University at Buffalo) NASA Phase II
Small Business Innovation Research (SBIR) Award No. 80NSSC22CA046. The authors would also like to thank Flighthouse
8
Engineering LLC for providing the aircraft design and the high-fidelity data used in this paper.
REFERENCES
[1] T. Chatterjee, S. Chakraborty, and R. Chowdhury, “A critical review of surrogate assisted robust design optimization,”
Archives of Computational Methods in Engineering, vol. 26, no. 1, pp. 245–274, 2019.
[2] P. Bisaillon, R. Sandhu, C. Pettit, M. Khalil, D. Poirel, C. Manohar, and A. Sarkar, “Combined selection of the dynamic
model and modeling error in nonlinear aeroelastic systems using bayesian inference,” Journal of Sound and Vibration,
vol. 522, p. 116 418, 2022.
[3] E. A. Morelli and V. Klein, “Accuracy of aerodynamic model parameters estimated from flight test data, Journal of
Guidance, Control, and Dynamics, vol. 20, no. 1, pp. 74–80, 1997.
[4] A. Mohamad, J. Karimi, and A. Naderi, “Dynamic aerodynamic parameter estimation using a dynamic particle swarm
optimization algorithm for rolling airframes,” Journal of the Brazilian Society of Mechanical Sciences and Engineering,
vol. 42, pp. 1–16, 2020.
[5] E. Morelli, S. Derry, and M. Smith, “Aerodynamic parameter estimation for the x-43a (hyper-x) from flight data, in
AIAA atmospheric flight mechanics conference and exhibit, 2005, p. 5921.
[6] M. Bataineh, T. Marler, K. Abdel-Malek, and J. Arora, “Neural network for dynamic human motion prediction,” Expert
Systems with Applications, vol. 48, pp. 26–34, 2016.
[7] G. Garimella, J. Funke, C. Wang, and M. Kobilarov, “Neural network modeling for steering control of an autonomous
vehicle, in Intelligent Robots and Systems (IROS), 2017 IEEE/RSJ International Conference on, IEEE, 2017, pp. 2609–
2615.
[8] M. Pfeiffer, M. Schaeuble, J. Nieto, R. Siegwart, and C. Cadena, “From perception to decision: A data-driven approach
to end-to-end motion planning for autonomous ground robots,” in 2017 ieee international conference on robotics and
automation (icra), IEEE, 2017, pp. 1527–1533.
[9] L. Shao, F. Ferreira, M. Jorda, V. Nambiar, J. Luo, E. Solowjow, J. A. Ojea, O. Khatib, and J. Bohg, “Unigrasp: Learning
a unified model to grasp with n-fingered robotic hands,” arXiv preprint arXiv:1910.10900, 2019.
[10] A. Rajeswaran, V. Kumar, A. Gupta, G. Vezzani, J. Schulman, E. Todorov, and S. Levine, “Learning complex dexterous
manipulation with deep reinforcement learning and demonstrations,” arXiv preprint arXiv:1709.10087, 2017.
[11] P. Ghassemi, K. Zhu, and S. Chowdhury, “Optimal surrogate and neural network modeling for day-ahead forecasting of
the hourly energy consumption of university buildings,” in International Design Engineering Technical Conferences and
Computers and Information in Engineering Conference, American Society of Mechanical Engineers, vol. 58134, 2017,
V02BT03A026.
[12] P. J. Haley and D. Soloway, “Extrapolation limitations of multilayer feedforward neural networks, in Neural Networks,
1992. IJCNN., International Joint Conference on, IEEE, vol. 4, 1992, pp. 25–30.
[13] R. M. Neal, Bayesian learning for neural networks. Springer Science & Business Media, 2012, vol. 118.
[14] D. Solomatine, L. M. See, and R. Abrahart, “Data-driven modelling: Concepts, approaches and experiences, in Practical
hydroinformatics, Springer, 2009, pp. 17–30.
[15] D. Karimi, H. Dou, S. K. Warfield, and A. Gholipour, “Deep learning with noisy labels: Exploring techniques and
remedies in medical image analysis,” Medical Image Analysis, vol. 65, p. 101 759, 2020.
[16] K. S. Narendra and K. Parthasarathy, “Identification and control of dynamical systems using neural networks, IEEE
Transactions on neural networks, vol. 1, no. 1, pp. 4–27, 1990.
[17] C.-C. Young, W.-C. Liu, and M.-C. Wu, “A physically based and machine learning hybrid approach for accurate rainfall-
runoff modeling during extreme typhoon events,” Applied Soft Computing, vol. 53, pp. 205–216, 2017.
[18] V. Nourani, M. T. Alami, and M. H. Aminfar, “A combined neural-wavelet model for prediction of ligvanchai watershed
precipitation,” Engineering Applications of Artificial Intelligence, vol. 22, no. 3, pp. 466–472, 2009.
[19] S. K. Singh, R. Yang, A. Behjat, R. Rai, S. Chowdhury, and I. Matei, “Pi-lstm: Physics-infused long short-term memory
network, in 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA), IEEE, 2019,
pp. 34–41.
[20] K. Javed, “A robust & reliable data-driven prognostics approach based on extreme learning machine and fuzzy cluster-
ing.,” Ph.D. dissertation, Universit´
e de Franche-Comt´
e, 2014.
[21] S. Cheng and M. Pecht, “A fusion prognostics method for remaining useful life prediction of electronic products,” in
Automation Science and Engineering, 2009. CASE 2009. IEEE International Conference on, IEEE, 2009, pp. 102–107.
[22] A. Karpatne, W. Watkins, J. Read, and V. Kumar, “Physics-guided neural networks (pgnn): An application in lake
temperature modeling,” arXiv preprint arXiv:1710.11431, 2017.
[23] F. MANGILI, “Development of advanced computational methods for prognostics and health management in energy
components and systems,” 2013.
9
[24] B. Kapusuzoglu and S. Mahadevan, “Physics-informed and hybrid machine learning in additive manufacturing: Appli-
cation to fused filament fabrication, Jom, vol. 72, no. 12, pp. 4695–4705, 2020.
[25] B. Freeman, Y. Tang, Y. Huang, and J. VanZwieten, “Physics-informed turbulence intensity infusion: A new hybrid
approach for marine current turbine rotor blade fault detection, Ocean Engineering, vol. 254, p. 111 299, 2022.
[26] A. Rajagopal, A. P. Leynes, N. Dwork, J. E. Scholey, T. A. Hope, and P. E. Larson, “Physics-driven deep learning for
pet/mri,” arXiv preprint arXiv:2206.06788, 2022.
[27] K. Ankobea-Ansah and C. M. Hall, “A hybrid physics-based and stochastic neural network model structure for diesel
engine combustion events, Vehicles, vol. 4, no. 1, pp. 259–296, 2022.
[28] A. Maier, H. K¨
ostler, M. Heisig, P. Krauss, and S. H. Yang, “Known operator learning and hybrid machine learning in
medical imaging—a review of the past, the present, and the future, Progress in Biomedical Engineering, 2022.
[29] D. A. Rufa, H. E. B. Macdonald, J. Fass, M. Wieder, P. B. Grinaway, A. E. Roitberg, O. Isayev, and J. D. Chodera, “To-
wards chemical accuracy for alchemical free energy calculations with hybrid physics-based machine learning/molecular
mechanics potentials,” BioRxiv, 2020.
[30] A. Rai and M. Mitra, “A hybrid physics-assisted machine-learning-based damage detection using lamb wave, S¯
adhan¯
a,
vol. 46, no. 2, pp. 1–11, 2021.
[31] I. Matei, C. Zeng, S. Chowdhury, R. Rai, and J. de Kleer, “Controlling draft interactions between quadcopter unmanned
aerial vehicles with physics-aware modeling, Journal of Intelligent & Robotic Systems, vol. 101, no. 1, pp. 1–21, 2021.
[32] Z. Zhang, R. Rai, S. Chowdhury, and D. Doermann, “Midphynet: Memorized infusion of decomposed physics in neural
networks to model dynamic systems, Neurocomputing, vol. 428, pp. 116–129, 2021.
[33] X. Chen, Z. Yao, S. Xu, A. S. McLeod, S. N. Gilbert Corder, Y. Zhao, M. Tsuneto, H. A. Bechtel, M. C. Martin,
G. L. Carr, et al., “Hybrid machine learning for scanning near-field optical spectroscopy, ACS Photonics, vol. 8, no. 10,
pp. 2987–2996, 2021.
[34] H.-C. Choi, C. Deng, and I. Hwang, “Hybrid machine learning and estimation-based flight trajectory prediction in
terminal airspace,” IEEE Access, vol. 9, pp. 151 186–151 197, 2021.
[35] D. Machalek, J. Tuttle, K. Andersson, and K. M. Powell, “Dynamic energy system modeling using hybrid physics-based
and machine learning encoder-decoder models, Energy and AI, p. 100 172, 2022.
[36] Y. Lai, K. Pan, Y. Cen, J. Yang, C. Cai, P. Yin, and S. Park, “An intelligent system for reflow oven temperature settings
based on hybrid physics-machine learning model, Soldering & Surface Mount Technology, 2022.
[37] A. Behjat, C. Zeng, R. Rai, I. Matei, D. Doermann, and S. Chowdhury, “A physics-aware learning architecture with
input transfer networks for predictive modeling, Applied Soft Computing, vol. 96, p. 106 665, 2020.
[38] R. Iqbal, A. Behjat, R. Adlakha, J. Callanan, M. Nouh, and S. Chowdhury, “Efficient training of transfer mapping in
physics-infused machine learning models of uav acoustic field, in AIAA SCITECH 2022 Forum, 2022, p. 0384.
[39] A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, A.
Desmaison, A. Kopf, E. Yang, Z. DeVito, M. Raison, A. Tejani, S. Chilamkurthy, B. Steiner, L. Fang, J. Bai, and
S. Chintala, “Pytorch: An imperative style, high-performance deep learning library, in Advances in Neural Information
Processing Systems 32, Curran Associates, Inc., 2019, pp. 8024–8035. [Online]. Available: http: / /papers.neurips . cc /
paper/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf.
[40] J. Bradbury, R. Frostig, P. Hawkins, M. J. Johnson, C. Leary, D. Maclaurin, G. Necula, A. Paszke, J. VanderPlas,
S. Wanderman-Milne, and Q. Zhang, JAX: Composable transformations of Python+NumPy programs, version 0.3.13,
2018. [Online]. Available: http://github.com/google/jax.
[41] J. C. Duensing, J. A. Housman, D. Maldonado, C. C. Kiris, and S. Y. Yoo, Jun. 2019. [Online]. Available: https:
//ntrs.nasa.gov/api/citations/20190028345/downloads/20190028345.pdf.
[42] M. Drela, Flight Vehicle Aerodynamics. MIT Press, 2014.
[43] M. Drela. “Qprop formulation.” (), [Online]. Available: https://web.mit.edu/drela/Public/web/qprop/qprop theory.pdf.
[44] L. Veldhuis, “Propeller wing aerodynamic interference, Ph.D. dissertation, 2028.
[45] C. D. Inc, Charm. [Online]. Available: https://continuum-dynamics.com/product/.
10
APPENDIX
Fig. 9: BLOFI was verified against X-57 CFD data.
11
ResearchGate has not been able to resolve any citations for this publication.
Article
Full-text available
Estimation of combustion phasing and power production is essential to ensuring proper combustion and load control. However, archetypal control-oriented physics-based combustion models can become computationally expensive if highly accurate predictive capabilities are achieved. Artificial neural network (ANN) models, on the other hand, may provide superior predictive and computational capabilities. However, using classical ANNs for model-based prediction and control can be challenging, since their heuristic and deterministic black-box nature may make them intractable or create instabilities. In this paper, a hybridized modeling framework that leverages the advantages of both physics-based and stochastic neural network modeling approaches is utilized to capture CA50 (the timing when 50% of the fuel energy has been released) along with indicated mean effective pressure (IMEP). The performance of the hybridized framework is compared to a classical ANN and a physics-based-only framework in a stochastic environment. To ensure high robustness and low computational burden in the hybrid framework, the CA50 input parameters along with IMEP are captured with a Bayesian regularized ANN (BRANN) and then integrated into an overall physics-based 0D Wiebe model. The outputs of the hybridized CA50 and IMEP models are then successively fine-tuned with BRANN transfer learning models (TLMs). The study shows that in the presence of a Gaussian-distributed model uncertainty, the proposed hybridized model framework can achieve an RMSE of 1.3 × 10⁻⁵ CAD and 4.37 kPa with a 45.4 and 3.6 s total model runtime for CA50 and IMEP, respectively, for over 200 steady-state engine operating conditions. As such, this model framework may be a useful tool for real-time combustion control where in-cylinder feedback is limited.
Article
Full-text available
In this article, we perform a review of the state-of-the-art of hybrid machine learning in medical imaging. We start with a short summary of the general developments of the past in machine learning and how general and specialized approaches have been in competition in the past decades. A particular focus will be the theoretical and experimental evidence pro and contra hybrid modelling. Next, we inspect several new developments regarding hybrid machine learning with a particular focus on so-called known operator learning and how hybrid approaches gain more and more momentum across essentially all applications in medical imaging and medical image analysis. As we will point out by numerous examples, hybrid models are taking over in image reconstruction and analysis. Even domains such as physical simulation and scanner and acquisition design are being addressed using machine learning grey box modelling approaches. Towards the end of the article, we will investigate a few future directions and point out relevant areas in which hybrid modelling, meta learning, and other domains will likely be able to drive the state-of-the-art ahead.
Article
Full-text available
Purpose This paper aims to provide the proper preset temperatures of the convection reflow oven when reflowing a printed circuit board (PCB) assembly with varied sizes of components simultaneously. Design/methodology/approach In this study, computational fluid dynamics modeling is used to simulate the reflow soldering process. The training data provided to the machine learning (ML) model is generated from a programmed system based on the physics model. Support vector regression and an artificial neural network are used to validate the accuracy of ML models. Findings Integrated physical and ML models synergistically can accurately predict reflow profiles of solder joints and alleviate the expense of repeated trials. Using this system, the reflow oven temperature settings to achieve the desired reflow profile can be obtained at substantially reduced computation cost. Practical implications The prediction of the reflow profile subjected to varied temperature settings of the reflow oven is beneficial to process engineers when reflowing bulky components. The study of reflowing a new PCB assembly can be started at the early stage of board design with no need for a physical profiling board prototype. Originality/value This study provides a smart solution to determine the optimal preset temperatures of the reflow oven, which is usually relied on experience. The hybrid physics–ML model providing accurate prediction with the significantly reduced expense is used in this application for the first time.
Article
Full-text available
For air traffic management, trajectory prediction plays an important role as the predicted trajectory information is used in crucial tasks for the safety and efficiency of air traffic operations, such as conflict detection and resolution, scheduling, and sequencing. In this paper, we propose a framework for trajectory prediction in terminal airspace by combining a machine learning-based method and a physics-based estimation method. A trajectory prediction model based on machine learning is trained from historical surveillance data to represent the collective behavior of a set of flight trajectories, from which the data-driven prediction can be obtained as the expected future behavior of an incoming flight. A physics-based estimation algorithm called Residual-Mean Interacting Multiple Models (RM-IMM) then incorporates the machine learning prediction as a pseudo-measurement to account for the current motion of the aircraft. The proposed framework is tested, with real air traffic surveillance data, by predicting the future state information of the flights for real-time air traffic control applications. The results show that the proposed framework produces a greatly improved prediction accuracy compared to the two existing machine learning-based algorithms.
Article
Three model configurations are presented for multi-step time series predictions of the heat absorbed by the water and steam in a thermal power plant. The models predict over horizons of 2, 4, and 6 steps into the future, where each step is a 5-minute increment. The evaluated models are a pure machine learning model, a novel hybrid machine learning and physics-based model, and the hybrid model with an incomplete dataset. The hybrid model deconstructs the machine learning into individual boiler heat absorption units: economizer, water wall, superheater, and reheater. Each configuration uses a gated recurrent unit (GRU) or a GRU-based encoder–decoder as the deep learning architecture. Mean squared error is used to evaluate the models compared to target values. The encoder–decoder architecture is over 11% more accurate than the GRU only models. The hybrid model with the incomplete dataset highlights the importance of the manipulated variables to the system. The hybrid model, compared to the pure machine learning model, is over 10% more accurate on average over 20 iterations of each model. Automatic differentiation is applied to the hybrid model to perform a local sensitivity analysis to identify the most impactful of the 72 manipulated variables on the heat absorbed in the boiler. The models and sensitivity analyses are used in a discussion about optimizing the thermal power plant.
Article
Ocean current turbines (OCT) convert the kinetic energy housed within the earth’s ocean currents into electricity. However, OCT technologies used to harvest this energy are still at an early stage in development due to technical and economic challenges stemming from the high operation and maintenance costs associated with limited geographical location access and harsh operating environments. In an effort to alleviate reliability concerns associated with marine electricity generation, this paper proposes a novel physics-guided rotor blade imbalance fault detection framework that combines non-intrusively acquired fault features obtained from the turbine’s electrical power signal with environmental condition data to enhance the fault detection capabilities. The combination of these two data sources paved the way for the development of a physics-informed neural network that ensures the classifications made by our framework are scientifically consistent with the underlining hydro-kinematic rotor dynamics of the OCT. The effectiveness of our framework is validated on simulation data produced by an in-house high-fidelity numerical simulation platform that includes temporally and spatially dynamic oceanic operating environment models. Test results demonstrate a Type-I error rate of 5.00% and a Type-II error rate of 2.92%.
Conference Paper
View Video Presentation: https://doi.org/10.2514/6.2022-0384.vid Physics-Infused Machine Learning (PIML) architectures aim at integrating machine learning with computationally-efficient, low-fidelity (partial) physics models, leading to improved generalizability, extrapolability, and robustness to noise, compared to pure data-driven approximation models. End-uses of PIML include, but are not limited to, model-based optimization and model-predictive control. Recently a new PIML architecture was reported by the same authors, known as Opportunistic Physics-mining Transfer Mapping Architecture or OPTMA, which transfers the original inputs into latent features using a transfer neural network; the partial physics model then uses the latent features to generate the final output that is as close as possible to the high-fidelity output. While gradient-free solvers and back-propagation with supervised learning (where optimum labels are pre-generated) have been used to train OPTMA, that approach remains computationally inefficient, and challenging to generalize across different problems or exploit state-of-the-art machine learning architectures. This paper aims to alleviate these issues by infusing the partial physics model inside the neural network, as described via tensors in the popular machine learning framework, PyTorch. Such a description also naturally allows auto-differentiation of the partial physics model, thereby enabling the use of efficient back-propagation methods to train the transfer network. The benefits of the upgraded OPTMA architecture with Automatic Differentiation (OPTMA-Net) is demonstrated by applying it to the problem of modeling the sound pressure field created by a hovering unmanned aerial vehicle (UAV). Ground truth data for this problem has been obtained from an indoor UAV noise measurement setup. Here, the partial physics model is based on the interference of acoustic pressure waves generated by an arbitrary number of acoustic monopole sources. Case studies show that OPTMA-Net provides generalization performance close to, and extrapolation performance that is 4 times better than, those given by a pure data-driven model.
Article
We report a Bayesian framework for concurrent selection of physics-based models and (modelling) error models. We investigate the use of colored noise to capture the mismatch between the predictions of calibrated models and observational data that cannot be explained by measurement error alone within the context of Bayesian estimation for stochastic ordinary differential equations. Proposed models are characterized by the average data-fit, a measure of how well a model fits the measurements, and the model complexity measured using the Kullback–Leibler divergence. The use of a more complex error models increases the average data-fit but also increases the complexity of the combined model, possibly over-fitting the data. Bayesian model selection is used to find the optimal physical model as well as the optimal error model. The optimal model is defined using the evidence, where the average data-fit is balanced by the complexity of the model. The effect of colored noise process is illustrated using a nonlinear aeroelastic oscillator representing a rigid NACA0012 airfoil undergoing limit cycle oscillations due to complex fluid–structure interactions. Several quasi-steady and unsteady aerodynamic models are proposed with colored noise or white noise for the model error. The use of colored noise improves the predictive capabilities of simpler models.
Article
The underlying physics behind an experimental observation often lacks a simple analytical description. This is especially the case for scanning probe microscopy techniques, where the interaction between the probe and the sample is nontrivial. Realistic modeling to include the exact details of the probe is widely acknowledged as a challenge. Due to various complexity constraints, the probe is often only approximated in a simplified geometry, leading to a source for modeling inconsistencies. On the other hand, a well-trained artificial neural network based on real data can grasp the hidden correlation between the signal and the sample properties, circumventing the explicit probe modeling process. In this work we show that, via a combination of model calculation and experimental data acquisition, a physics-infused hybrid neural network can predict the probe-sample interaction in the widely used scattering-type scanning near-field optical microscope. This hybrid network provides a long-sought solution for accurate extraction of material properties from tip-specific raw data. The methodology can be extended to other scanning probe microscopy techniques as well as other data-oriented physical problems in general.