Content uploaded by Manaswin Oddiraju
Author content
All content in this area was uploaded by Manaswin Oddiraju on Jul 17, 2023
Content may be subject to copyright.
Physics-Infused Machine Learning Based Prediction
of VTOL Aerodynamics with Sparse Datasets
Manaswin Oddiraju , Divyang Amin , Michael Piedmonte, Souma Chowdhury
Abstract
Complex optimal design and control processes often require repeated evaluations of expensive objective functions and consist
of large design spaces. Data-driven surrogate models such as neural networks and Gaussian processes provide an attractive
alternative to expensive simulations and are utilized frequently to represent these objective functions in optimization. However,
pure data-driven models, due to a lack of adherence to basic physics laws and constraints, are often poor at generalizing
and extrapolating. This is particularly the case, when training occurs over sparse high-fidelity datasets. A class of Physics-
infused machine learning (PIML) models integrate ML models with low-fidelity partial physics models to improve generalization
performance while retaining computational efficiency. This paper presents two potential approaches for Physics infused modelling
of aircraft aerodynamics which incorporate Artificial Neural Networks with a low-fidelity Vortex Lattice Method model with blown
wing effects (BLOFI) to improve prediction performance while also keeping the computational cost tractable. This paper also
develops an end-to-end auto differentiable open-source framework that enables efficient training of such hybrid models. These two
PIML modelling approaches are then used to predict the aerodynamic coefficients of a 6 rotor eVTOL aircraft given its control
parameters and flight conditions. The models are trained on a sparse high-fidelity dataset generated using a CHARM model. The
trained models are then compared against the vanilla low-fidelity model and a standard pure data-driven ANN. Our results show
that one of the proposed architecture outperforms all the other models and at a nominal increase in execution time. These results
are promising and pave way to PIML frameworks which are able generalize over different aircraft and configurations thereby
significantly reducing costs of design and control.
I. INTRODUCTION
Real world optimal design and control problems consist of optimizations with large design spaces and computationally
expensive objective functions and therefore require efficient and accurate models of complex physical systems. There are
different proposed approaches to render these problems computationally feasible, such as the use of surrogate models, reduced
order modelling methods and multi-fidelity optimization techniques. The use of data-driven surrogate models for optimization
[1] is a popular approach to reduce the computational burden and render the process tractable. However, using pure data
driven models poses additional challenges in the form of poor explainability and usually these models require large amounts of
data either from expensive high-fidelity analysis or experiments in order to achieve sufficient accuracy. For many engineering
design problems however fast running physics-based models are available, but they usually trade-off accuracy for performance.
Therefore, in this paper, we propose two Physics Informed Machine Learning (PIML) models which combine data driven
Artificial Neural Networks (ANNs) with low-fidelity physics models and augment the performance of the physics models
while only needing a sparse high-fidelity dataset. We also develop an end-to-end auto-differentiable physics framework to
enable efficient training of such hybrid architectures and apply them to model the aerodynamics of a 6 rotor eVTOL aircraft.
The rest of this section briefly covers existing techniques for augmenting low-fidelity physics models using high-fidelity data,
physics informed machine learning architectures before stating our research objectives.
Conventional methods for enhancing low-fidelity computational models with high-fidelity or experimental data encompass
techniques like Bayesian Inference [2], [3], Gradient-Based and Gradient-Free Optimization [4], [5], among others. The primary
goal of these techniques is to determine the optimal parameters of the low-fidelity model in order to improve its overall
performance. However, due to the complexity of the physical phenomena often modeled by these techniques, identifying a
static set of parameters for a dataset might be inadequate and may not fully capitalize on the potential accuracy improvements
offered by a sparse high-fidelity dataset. On the other hand, data-driven models, due to their near-instantaneous run times,
are being utilized frequently for speculating the behavior of complex systems in domains such as mechanical systems [6],
[7], robotics [8]–[10], and energy forecasting [11]. Models such as Neural Networks, Gaussian Processes etc. are showing
competitive prediction accuracy on some problems in this domain. However, they underperform at extrapolating and generalizing
[12], [13], especially when trained with small or sparse data sets [14]. This can be attributed to a lack of adherence to basic
laws of physics. Additionally, they also exhibit challenging-to-interpret black-box behavior and sensitivity to noisy data [15].
In lieu of these problems, significant research efforts are being directed towards ”hybrid models” or ”physics-infused” models.
These models generally integrate data-driven and low-fidelity physics models in order to make predictions computationally
feasible and accurate.
There are different classes of physics-infused neural networks; although almost all of them make use of a data-driven model
and a physics model, they differ in the architectures. Hybrid ML architectures can broadly be classified into serial[16]–[19]
and parallel [20]–[22] architectures. Serial architectures typically have the data-driven models set in sequence with the partial
physics model or used to tune the partial physics model parameters, while parallel architectures usually contain additive or
This work was presented at the AIAA Aviation 2023 Forum.
Copyright @ Souma Chowdhury
1
arXiv:2307.03286v1 [cs.CE] 6 Jul 2023
multiplicative ensembles of partial physics and data-driven ML models [23]. Several such hybrid PIML architectures have
been reported in the literature in the past few years [24]–[36], spanning over a wide range of applications such as in modeling
dynamic systems, cyber-physical systems, robotic systems, flow systems and materials behavior, among others. The OPTMA
model [37], is a physics-infused machine learning model which combines an artificial neural network with a partial physics
model in order to make predictions. Earlier applications of this framework on acoustics problems have shown that it generalizes
well and can even extrapolate successfully. This architecture primarily works by using the data-driven model -in this case the
transfer network; to map the original inputs to the inputs of the partial physics model so as to make its output match that of the
high-fidelity model. Hence, the training process is just the transfer network learning this mapping. And since the intermediate
parameters are open and interpretable to domain experts (as they are inputs to partial physics models), the OPTMA model is
less of a black box and more understandable when compared to pure data-driven models.
In all of these sequential hybrid-ML models however, the presence of an external partial physics model increases the
complexity and cost of the training process. Previously[38], this hurdle was overcome by programming custom loss functions
which include the partial physics in PyTorch[39] to enable backpropagation. However, this approach may not be feasible in
all domains as PyTorch is not optimal for general purpose scientific computing (especially numerical methods). Therefore, in
our goal to make OPTMA more user-friendly and computationally tractable, we use Google JAX [40] to auto-differentiate
the partial physics model. JAX is capable of forward and backward mode auto-differentiation and works on codes written in
python and is also easy to integrate with the transfer network written in PyTorch. This is a much more generalizable and
scalable approach to creating computationally efficient PIML frameworks.
During the process of aircraft design, and eventually the aircraft controller design, engineers often encounter the earlier stated
predicament of selecting appropriate methods to model the aircraft. They may need to balance the benefits and drawbacks of
using a method that is highly accurate, albeit costly and time-consuming, against a faster and more economical approach that
might not provide the same level of precision.The complexity of this problem is magnified when engineers are tasked with
designing more challenging-to-control aircraft than traditional fixed-wing varieties. A prime example is the Vertical Takeoff &
Landing (VTOL) aircraft. As the aircraft designs become more complex, the modeling demands can escalate exponentially. In
the case of VTOL, it is essential to model the aircraft dynamics across all phases: from hover to transition, and finally into
cruise. Such aircraft also exhibit more intricate aerodynamics, including phenomena like blown-wing effects. There is typically
a greater number of degrees of freedom, which further escalates the complexity of designing robust controllers for VTOL
aircraft. Given the numerous benefits and strengths of physics-infused modeling, and the necessity for fast accurate models
for aircraft design and control, it is evident that PIML models are particularly well-suited for modeling the aerodynamics of
an aircraft.
Therefore, in this paper, we propose two physics informed modelling approaches to augment a low-fidelity VLM model
with a sparse high-fidelity dataset. One of the approaches, called PIML-A is an extension of the OPTMA architecture. The
other architecture, named PIML-B is a comparatively simpler ensemble model that learns the error in the low-fidelity physics
models. The architectures are aimed at improving the accuracy of the low-fidelity physics models while only using a sparse
high-fidelity dataset to keep training costs low. Our research objectives are as follows:
1) Create a Vortex Lattice Method (VLM) model with auto-differentiation capabilities.
2) Develop different PIML architectures seeking to provide better accuracy / cost trade-offs than pure Low-Fidelity and
pure data-driven models.
3) Apply the above architectures to model the control inputs to force and moment coefficients of a eVTOL aircraft.
4) Compare the performance and computational cost of the PIML models with the baseline low-fidelity and data-driven
models.
The remainder of this paper is laid out as follows: SectionII contains a description of the two PIML models framework and
their components in detail. SectionIII explains our case study and the models that we are using before moving on to SectionIV
which contains our results and discussion. Finally, we state our conclusions in SectionV.
II. PHYSICS IN FU SE D MACH IN E LEARNING ARCHITECTURES
Fig.1 shows the two proposed PIML frameworks as well as the vanilla low-fidelity physics model (L). In this paper, all the
models shown are used to predict the airframe force and moment coefficients for a given set of control and environmental
parameters. The PIML models amalgamate low-fidelity physics models with Artificial Neural Networks (ANNs) to enhance
predictions, yet they vary based on the specific role of the ANNs. As depicted in the framework diagram 1, the low-fidelity
model comprises of a propeller solver and the Vortex Lattice Method (VLM) model. The prop solver computes induced
velocities which are then fed into the VLM, which in turn calculates the force and moment coefficients on the airframe.
The PIML-A model incorporates ANN layers to achieve three primary objectives: 1) To supersede the prop solver and
directly estimate the axial and tangential induced velocities of the propellers, 2) to modify the flight conditions in order to
enhance the predictions of the VLM model, and 3) to adjust the output of the VLM to reduce discrepancies with respect to the
high-fidelity data. Given that all layers are trained concurrently, the PIML-A model learns the mapping from flight conditions
2
Induced
Veloci�es
VLM Model
Prop Solver
Inputs
Inputs
Low-Fidelity Outputs
Low-Fidelity Model
(a)
PIML -A
Transfer Layers
VLM Model
PIML Predicted Output
Transfer Parameters Correc�on Layers
Inputs
(b)
Prop Solver
Inputs
PIML -B
Induced
Veloci�es
VLM Model
PIML Predicted Output
Correc�on Layers
Inputs
Predicted Correc�on
of Induced Veloci�es
(c)
ANN Predicted Output
Inputs
Pure Ar�ficial Neural Network
(d)
Fig. 1: Framework Diagrams of a) Low-Fidelity Physics b) PIML-A c) PIML-B and d) Purely Data-Driven ANN
and control inputs to the transfer parameters. Simultaneously, it learns the output errors of the VLM and adjusts them to align
more closely with the high-fidelity data. The reasoning behind this model is that part of the errors in the VLM model can be
mitigated by shifting the inputs into different domains, and the errors that remain, not resolved by this input modification, will
be addressed by the correction layers on the output side.
In contrast, the PIML-B model does not adjust any inputs to the original low-fidelity model and retains the prop solver in its
structure. Within this design, the sole objective is to identify the error in the induced velocities computed by the prop solver
and rectify these parameters. This model is more closely aligned with the low-fidelity physics model and aims to enhance
modeling accuracy by refining the outputs of the prop solver.
The following subsection focuses on the training processes of these PIML models, specifically the computation of gradients
and the implementation of backpropagation through the low-fidelity physics models.
3
A. PIML Model Training
The training of neural networks (i.e optimizing the weights of the neural network to minimize loss) is performed using
gradient descent and therefore requires the gradient of the loss function(L) w.r.t to the weights(W). The following equations
show the integration of backpropagation gradients through an auto-differentiable physics model. This feature enables the training
of our PIML architectures using standard ANN training techniques and using standard ANN software libraries.
L=∥Yi−Y′
i∥2(MSE Loss)
L=1
n
n
i=1
(Yi−Y′
i)2
∴
∂L
∂W =1
n
n
i=0
2 (Y′
i−Yi)∂Y ′
i
∂W
Where Yi
Ground Truth
=ϕ(Xi),Y′
i
Prediction
=G(Xi, W )
(1)
Grepresents the sequential PIML model and ϕrepresents the high-fidelity physics model.The terms Xi,Yi,Y′
irepresent the
input, high-fidelity output and the OPTMA prediction of the ith training sample respectively.
From the definition of Y′
i:
Y′
i=G(Xi, W )
∂Y ′
i
∂W =∂ G (Xi, W )
∂W
As Gis the OPTMA model, we have:
G(Xi, W ) = Φ (S(Xi, W ))
=⇒∂G(Xi, W )
∂W =∂Φ (S(Xi, W ))
∂W =∂Φ(U)
∂U
Partial Physics
Auto-Differentiation
×∂S(Xi, W )
∂W
Backpropagation
(2)
Here, Sand Φrepresent the neural network and the partial physics models respectively. Urepresents the intermediate parameter
vector which is the output of the transfer network and input to the partial physics model.
III. CAS E STU DY: MODELLING AERODYNAM IC S OF A TILT-ROTOR E VTOL AIRCRAFT
The Aerobeacon, shown in fig.2, is a 6 rotor aircraft with 4 hover rotors in a quad configuration and two tilt-rotors at the
edges of the wings. The aircraft has no rudder and instead makes use of differential thrust between the tilt rotors for yaw
control. The only movable control surface on the aircraft apart from the tiltrotor is an elevator. Since this is our initial attempt
at modelling this aircraft, we chose to fix the hover rotor RPMs at 5000 and only vary the other control inputs listed in Tale.I.
Table I: Aircraft Control Parameter Bounds
Parameters Bounds
Speed (v) [0 m/s ,45 m/s]
Angle of Attack (α) [-15◦, 15◦]
Starboard Propeller
RPM (ωstar)
[4000 , 10000]
Port Propeller RPM
(ωport)
[4000 , 10000]
Starboard Propeller
Angle (θstar)
[0◦, 110◦]
Port Propeller Angle
(θport)
[0◦, 110◦]
Elevator Deflection
(θelev)
[-15◦, 15◦]
Fig. 2: Aerobeacon aircraft design configuration from Flighthouse
Engineering
4
A. Modelling Aircraft Flight Dynamics
Table II displays the inputs and outputs of all the models and also the transfer parameters corresponding to the PIML-A
model. The transfer parameters are specifically selected such that they have an impact on the output accuracy of the model, as
well as to allow greater flexibility to the embedded ANN layers to affect the final outputs. In both the proposed architectures,
we try to improve the low-fidelity model by either augmenting or replacing the prop solver. The PIML-A model in particular
also tries to alter the outputs of the VLM model to improve accuracy. This model is an extension of the OPTMA architecture
discussed in the Introduction.
Table II: Modeling Parameters
Inputs (All Models) Transfer Parameters (PIML-A Model) Outputs (All Models)
Speed (v)
Angle of Attack (α)
Starboard Propeller RPM
(ωstar)
Port Propeller RPM (ωport)
Starboard Propeller Angle
(θstar)
Port Propeller Angle (θport)
Elevator Deflection (θelev)
Speed(v′)
Angle of Attack(α′)
Elevator Deflection(θ′
elev)
−→
va
star
−→
vt
star
−→
va
port
−→
vt
port
−→
va
hover
−→
vt
hover
CL
CD
Cl
Cm
B. Low Fidelity Model
The lower fidelity flight dynamics model uses Bechamo LLC’s developed VLM plus propeller effects low-fidelity tool,
BLOFI. This tool was developed initially to model the NASA Tiltwing aircraft and was validated against the X-57 data [41].
The validation plots can be seen in fig.9. This was done in order to prove that we are able to capture the blown wing effects.
BLOFI was created by using an improvised VLM that accounts for some blown wing effects caused by the propellers upstream
of the the lifting surfaces. A base VLM [42] is implemented and is modified by accounting for induced velocities caused by
propellers’ wash upstream of the control points on the lifting surfaces. The properties for each propeller are calculated using
Blade Element Momentum Theory [43]. It should also be noted that propeller slipstream contraction [44] is accounted for
when including the induced velocities from the propellers in the VLM setup.
A set of flight conditions is created by making a grid of every combination of freestream velocity, sideslip and angle of
attack. At each of these flight conditions the aerodynamic forces and moments, including the propeller thrusts and torques,
are calculated, using the aforementioned VLM implementation. This is done for every expected RPM setting, propeller tilt
angle and tail deflection. The combination of aerodynamic forces and moments and propeller thrusts and torques is used to
determine the overall forces and moments acting on the aircraft.
CHARM[45] is a commercial tool developed by Continuum Dynamics, Inc. for simulating the aerodynamics and dynamics
of rotorcraft using partical methods. Flighthouse Engineering LLC. provided their Aerobeacon aircraft design, parameters,
and constructed the CHARM model. They ran CHARM test cases for a collection of randomly generated input samples. The
data set derived from the CHARM model serves as the high-fidelity data for the research presented in this paper. The VLM
configuration is shown in fig.3 and the CHARM higher fidelity output visualization is shown in fig.4.
C. Sparse High-Fidelity Dataset
In order to generate the high fidelity dataset, we used Latin Hypercube Sampling and created a dataset of 100 points across
the parameters and bounds listed in Tab.I. These 100 samples were then input to the CHARM high fidelity model and the
dataset was generated. Out of the 100 initial samples, only 87 were usable as 11 cases failed to converge on the CHARM
model and the remaining 2 cases showcased abnormally high values of the force and moment coefficients and were removed
from the dataset. While generating the dataset, we fixed the RPMs of the hover quad rotors to 5000 RPM. This final dataset of
87 samples was then randomly divided into a training set containing 70 samples and a validation set containing 17 samples.
All the performance results shown in this paper are run on this validation dataset consisting of samples completely unseen by
the model.
D. Prior Analysis of BLOFI Performance on Other Data-Sets
BLOFI was developed at Bechamo LLC and in order to prove its usefulness in comparison to other available VLM solvers
that do not account for blown wing effects. It has been previously validated using CFD data[41] generated by NASA for the
X-57. Just as a flat plate model was created for the Aerobeacon with the appropriate propellers for the developments of this
paper, a flat plate model with the corresponding propellers was earlier created for the X-57 as part of the validation study
5
Table III: Comparison of PIML, Low-Fidelity and Pure ANN Models
PIML-A PIML-B Low-Fidelity ANN
Execution Time (s) 0.32 0.53 0.338 0.0002
Training Time (min) 23.46 23.18 - 0.04
#Hidden Layers 6 6 - 4
#Nodes per Layer 200 200 - 150
on BLOFI. In figure9 we are able to see that BLOFI can capture significant blown-wing effects, by comparing the power-on
and power-off plots. The increase in CLand CDat every angle of attack can be observed. More importantly, the inclusion
of induced velocities in the VLM solve also lets us observe the increased flap effectiveness. Errors are between 5%and 10%
for the range of angles of attack for which this validation study was conducted. It is expected that with a PIML approach,
not only can we close down this gap (error), but more readily extend the low/medium fidelity solver such as BLOFI to work
on a wider variety of aircraft configurations with little to no manual tuning and manual model development. To support his
premise, the next section presents the results of our PIML architectures with the example of Aerobeacon configuration, where
we had more control over the high-fidelity sampling – enabling ease of PIML training for this paper.
IV. RES ULT S AN D DISCUSSION
A. Baselines For Comparison
For testing the performance of the two proposed PIML models, we compared them against the vanilla low-fidelity model(L)
shown in fig.1a and a pure data-driven ANN (A). Table III lists the parameters of the stand-alone ANN and the ANNs used as
part of the PIML frameworks. All the data-driven and PIML models were trained on the same training dataset of 70 high-fidelity
samples and validated on a validation dataset consisting of 17 samples. As the training data was sparse, a pure ANN with
similar size of the ANNs used for the PIML models tended to overfit very quickly. Therefore, the size of the pure data-driven
ANN is lower as compared to the ANNs used in the PIML models.
B. Modelling Performance
Figure.5 shows the convergence history for both the PIML models. From the figure we see that PIML-A has a much smoother
training history as compared to the PIML-B model. The PIML-B model seems to be sensitive to corrections in induced velocity.
Further training with a lower learning rate or decaying the learning rate may resolve this issue and improve performance of
the PIML-B model. It may also be the case that the VLM is very sensitive to the induced velocity inputs or that there is an
unknown bias in the validation dataset. This could’ve happened due to the random splitting of data into train and test samples.
Further work with varying test and train datasets similar to K-fold cross-validation are necessary to understand if there’s any
data bias.
Figure.6 shows the prediction error of the models in all of the 4 outputs. From the figure, we see that the PIML-A model
performs the best. The pure data-driven ANN and the PIML-B model exhibit similar performance. Relative to the amount
Fig. 3: Mesh model of the Aerobeacon aircraft. The
propellers and their direction of rotation are represented
by directed circles. The two rotors positioned at the wing
extremities are capable of tilting.
Fig. 4: CHARM visualized output of particle method to
obtain a higher-fidelity model at discrete points
6
0 20 40 60 80 100 120 140
0
0.05
0.1
0.15
0.2
Train Loss
Test Loss
Epoch
MSE Loss
(a) PIML-A
0 20 40 60 80 100 120 140
0
0.05
0.1
0.15
0.2
Train Loss
Test Loss
Epoch
MSE Loss
(b) PIML-B
Fig. 5: Convergence history of PIML Models
Low_Fidelity PIML A PIML B ANN
0
0.2
0.4
0.6
0.8
1
M
o
d
e
l
RMSE (Normalized)
(a) Prediction Error in CL
Low_Fidelity PIML A PIML B ANN
0
0.2
0.4
0.6
0.8
1
M
o
d
e
l
RMSE (Normalized)
(b) Prediction Error in CD
Low_Fidelity PIML A PIML B ANN
0
0.2
0.4
0.6
0.8
1
M
o
d
e
l
RMSE (Normalized)
(c) Prediction Error in Cl
Low_Fidelity PIML A PIML B ANN
0
0.2
0.4
0.6
0.8
1
M
o
d
e
l
RMSE (Normalized)
(d) Prediction Error in Cm
Fig. 6: Prediction Error Compared to High Fidelity Data
of data available, the ANN performs very well and can prove to be a possible alternative in cases with a larger sample set
and no additional requirements on interpretability. he fact that the high-fidelity samples were generated using Latin hypercube
sampling, ensuring an even domain coverage, could also be a contributing factor to the ANN’s strong performance. Among
all the modeled outputs, PIML-A appears to excel in predicting CD. This could be attributed to the transfer parameters having
a greater impact on CDas compared to the other outputs. Where things begin to get interesting is when we see the effect of
the learnt output correction layers in PIML-A as shown in fig.7. From the figure, it is easily seen that the output layers of
the VLM model have a very strong negative correction on the CLthat the VLM outputs. As we also see from fig.6, we see
that the VLM model has the highest prediction error in CL. Combined, these two plots convey that the transfer parameters
7
C_L C_D C_l C_m
−1.5
−1
−0.5
0
0.5
1
Model
Normalized Correction to VLM Outputs
Fig. 7: Correction of VLM Outputs by the Correction Layers
in PIML-A
PIML-B
−0.6
−0.4
−0.2
0
0.2
0.4
0.6
Model
Normalized Correction to Induced Velocities
Fig. 8: Correction of Propeller Induced Velocities by the
Correction Layers in PIML-B
do not influence CLas much as they do the other outputs or that just shifting the inputs is not sufficient to improve the CL
predictions of the VLM model. The correction for other outputs predicted by the PIML model is evenly distributed between
positive and negative and this seems to be reasonable given the performance of the vanilla low-fidelity models on all these
outputs is about the same.
The PIML-B model on the other hand performs on par with the low-fidelity VLM model. This outcome might be explained
by an earlier hypothesis suggesting that if the induced velocities indeed have a minor impact on CL, then regardless of what
the correction network learns, it won’t be able to enhance the predictions for CL. However, since the training loss is calculated
across all outputs, the overall loss remains substantial, causing the network to struggle in learning the transfer mapping necessary
to improve predictions on the other inputs. It may be the case that when different models are trained to learn the various outputs
of this modelling problem we might see improved performance by the PIML-B architecture on outputs other than CL, but as
that is not feasible especially once the number of outputs gets large enough to support design / control frameworks, PIML-A
model might be better suited for this problem.
V. CONCLUSION
The paper presents two distinct architectures of physics-informed models, referred to as PIML-A and PIML-B. These models
integrate Artificial Neural Networks with a low-fidelity BLOFI model with the aim of enhancing the prediction performance
of the aerodynamic coefficients of a six-rotor eVTOL aircraft. To facilitate the training of both models, an end-to-end auto-
differentiable physics framework was established using Google JAX. The models were then trained using a sparse dataset
derived from a high-fidelity CHARM model. When comparing prediction performance, it was found that the PIML-A model
achieved a higher degree of accuracy in comparison to the PIML-B model, the basic low-fidelity model, and a baseline purely
data-driven model. A comprehensive analysis of the results demonstrated that the PIML-A model’s method of simultaneous
input-shifting and output-correction yielded superior results, given that one of the inputs might not be highly sensitive to the
chosen transfer parameters. To conclusively determine the optimal architecture for this modeling problem, additional testing
using cross-validation and hyperparameter optimization is needed. However, the initial results indicate that the PIML-A model
holds significant promise. Looking forward, the BLOFI low-fidelity model, when augmented with PIML, could potentially be
employed to generalize across diverse aircraft designs and configurations, thereby proving to be a valuable tool in the domains
of aircraft design and control.
ACK NOW LE DG EM EN TS
This material is based upon work funded by Bechamo LLC’s (with sub-contract to University at Buffalo) NASA Phase II
Small Business Innovation Research (SBIR) Award No. 80NSSC22CA046. The authors would also like to thank Flighthouse
8
Engineering LLC for providing the aircraft design and the high-fidelity data used in this paper.
REFERENCES
[1] T. Chatterjee, S. Chakraborty, and R. Chowdhury, “A critical review of surrogate assisted robust design optimization,”
Archives of Computational Methods in Engineering, vol. 26, no. 1, pp. 245–274, 2019.
[2] P. Bisaillon, R. Sandhu, C. Pettit, M. Khalil, D. Poirel, C. Manohar, and A. Sarkar, “Combined selection of the dynamic
model and modeling error in nonlinear aeroelastic systems using bayesian inference,” Journal of Sound and Vibration,
vol. 522, p. 116 418, 2022.
[3] E. A. Morelli and V. Klein, “Accuracy of aerodynamic model parameters estimated from flight test data,” Journal of
Guidance, Control, and Dynamics, vol. 20, no. 1, pp. 74–80, 1997.
[4] A. Mohamad, J. Karimi, and A. Naderi, “Dynamic aerodynamic parameter estimation using a dynamic particle swarm
optimization algorithm for rolling airframes,” Journal of the Brazilian Society of Mechanical Sciences and Engineering,
vol. 42, pp. 1–16, 2020.
[5] E. Morelli, S. Derry, and M. Smith, “Aerodynamic parameter estimation for the x-43a (hyper-x) from flight data,” in
AIAA atmospheric flight mechanics conference and exhibit, 2005, p. 5921.
[6] M. Bataineh, T. Marler, K. Abdel-Malek, and J. Arora, “Neural network for dynamic human motion prediction,” Expert
Systems with Applications, vol. 48, pp. 26–34, 2016.
[7] G. Garimella, J. Funke, C. Wang, and M. Kobilarov, “Neural network modeling for steering control of an autonomous
vehicle,” in Intelligent Robots and Systems (IROS), 2017 IEEE/RSJ International Conference on, IEEE, 2017, pp. 2609–
2615.
[8] M. Pfeiffer, M. Schaeuble, J. Nieto, R. Siegwart, and C. Cadena, “From perception to decision: A data-driven approach
to end-to-end motion planning for autonomous ground robots,” in 2017 ieee international conference on robotics and
automation (icra), IEEE, 2017, pp. 1527–1533.
[9] L. Shao, F. Ferreira, M. Jorda, V. Nambiar, J. Luo, E. Solowjow, J. A. Ojea, O. Khatib, and J. Bohg, “Unigrasp: Learning
a unified model to grasp with n-fingered robotic hands,” arXiv preprint arXiv:1910.10900, 2019.
[10] A. Rajeswaran, V. Kumar, A. Gupta, G. Vezzani, J. Schulman, E. Todorov, and S. Levine, “Learning complex dexterous
manipulation with deep reinforcement learning and demonstrations,” arXiv preprint arXiv:1709.10087, 2017.
[11] P. Ghassemi, K. Zhu, and S. Chowdhury, “Optimal surrogate and neural network modeling for day-ahead forecasting of
the hourly energy consumption of university buildings,” in International Design Engineering Technical Conferences and
Computers and Information in Engineering Conference, American Society of Mechanical Engineers, vol. 58134, 2017,
V02BT03A026.
[12] P. J. Haley and D. Soloway, “Extrapolation limitations of multilayer feedforward neural networks,” in Neural Networks,
1992. IJCNN., International Joint Conference on, IEEE, vol. 4, 1992, pp. 25–30.
[13] R. M. Neal, Bayesian learning for neural networks. Springer Science & Business Media, 2012, vol. 118.
[14] D. Solomatine, L. M. See, and R. Abrahart, “Data-driven modelling: Concepts, approaches and experiences,” in Practical
hydroinformatics, Springer, 2009, pp. 17–30.
[15] D. Karimi, H. Dou, S. K. Warfield, and A. Gholipour, “Deep learning with noisy labels: Exploring techniques and
remedies in medical image analysis,” Medical Image Analysis, vol. 65, p. 101 759, 2020.
[16] K. S. Narendra and K. Parthasarathy, “Identification and control of dynamical systems using neural networks,” IEEE
Transactions on neural networks, vol. 1, no. 1, pp. 4–27, 1990.
[17] C.-C. Young, W.-C. Liu, and M.-C. Wu, “A physically based and machine learning hybrid approach for accurate rainfall-
runoff modeling during extreme typhoon events,” Applied Soft Computing, vol. 53, pp. 205–216, 2017.
[18] V. Nourani, M. T. Alami, and M. H. Aminfar, “A combined neural-wavelet model for prediction of ligvanchai watershed
precipitation,” Engineering Applications of Artificial Intelligence, vol. 22, no. 3, pp. 466–472, 2009.
[19] S. K. Singh, R. Yang, A. Behjat, R. Rai, S. Chowdhury, and I. Matei, “Pi-lstm: Physics-infused long short-term memory
network,” in 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA), IEEE, 2019,
pp. 34–41.
[20] K. Javed, “A robust & reliable data-driven prognostics approach based on extreme learning machine and fuzzy cluster-
ing.,” Ph.D. dissertation, Universit´
e de Franche-Comt´
e, 2014.
[21] S. Cheng and M. Pecht, “A fusion prognostics method for remaining useful life prediction of electronic products,” in
Automation Science and Engineering, 2009. CASE 2009. IEEE International Conference on, IEEE, 2009, pp. 102–107.
[22] A. Karpatne, W. Watkins, J. Read, and V. Kumar, “Physics-guided neural networks (pgnn): An application in lake
temperature modeling,” arXiv preprint arXiv:1710.11431, 2017.
[23] F. MANGILI, “Development of advanced computational methods for prognostics and health management in energy
components and systems,” 2013.
9
[24] B. Kapusuzoglu and S. Mahadevan, “Physics-informed and hybrid machine learning in additive manufacturing: Appli-
cation to fused filament fabrication,” Jom, vol. 72, no. 12, pp. 4695–4705, 2020.
[25] B. Freeman, Y. Tang, Y. Huang, and J. VanZwieten, “Physics-informed turbulence intensity infusion: A new hybrid
approach for marine current turbine rotor blade fault detection,” Ocean Engineering, vol. 254, p. 111 299, 2022.
[26] A. Rajagopal, A. P. Leynes, N. Dwork, J. E. Scholey, T. A. Hope, and P. E. Larson, “Physics-driven deep learning for
pet/mri,” arXiv preprint arXiv:2206.06788, 2022.
[27] K. Ankobea-Ansah and C. M. Hall, “A hybrid physics-based and stochastic neural network model structure for diesel
engine combustion events,” Vehicles, vol. 4, no. 1, pp. 259–296, 2022.
[28] A. Maier, H. K¨
ostler, M. Heisig, P. Krauss, and S. H. Yang, “Known operator learning and hybrid machine learning in
medical imaging—a review of the past, the present, and the future,” Progress in Biomedical Engineering, 2022.
[29] D. A. Rufa, H. E. B. Macdonald, J. Fass, M. Wieder, P. B. Grinaway, A. E. Roitberg, O. Isayev, and J. D. Chodera, “To-
wards chemical accuracy for alchemical free energy calculations with hybrid physics-based machine learning/molecular
mechanics potentials,” BioRxiv, 2020.
[30] A. Rai and M. Mitra, “A hybrid physics-assisted machine-learning-based damage detection using lamb wave,” S¯
adhan¯
a,
vol. 46, no. 2, pp. 1–11, 2021.
[31] I. Matei, C. Zeng, S. Chowdhury, R. Rai, and J. de Kleer, “Controlling draft interactions between quadcopter unmanned
aerial vehicles with physics-aware modeling,” Journal of Intelligent & Robotic Systems, vol. 101, no. 1, pp. 1–21, 2021.
[32] Z. Zhang, R. Rai, S. Chowdhury, and D. Doermann, “Midphynet: Memorized infusion of decomposed physics in neural
networks to model dynamic systems,” Neurocomputing, vol. 428, pp. 116–129, 2021.
[33] X. Chen, Z. Yao, S. Xu, A. S. McLeod, S. N. Gilbert Corder, Y. Zhao, M. Tsuneto, H. A. Bechtel, M. C. Martin,
G. L. Carr, et al., “Hybrid machine learning for scanning near-field optical spectroscopy,” ACS Photonics, vol. 8, no. 10,
pp. 2987–2996, 2021.
[34] H.-C. Choi, C. Deng, and I. Hwang, “Hybrid machine learning and estimation-based flight trajectory prediction in
terminal airspace,” IEEE Access, vol. 9, pp. 151 186–151 197, 2021.
[35] D. Machalek, J. Tuttle, K. Andersson, and K. M. Powell, “Dynamic energy system modeling using hybrid physics-based
and machine learning encoder-decoder models,” Energy and AI, p. 100 172, 2022.
[36] Y. Lai, K. Pan, Y. Cen, J. Yang, C. Cai, P. Yin, and S. Park, “An intelligent system for reflow oven temperature settings
based on hybrid physics-machine learning model,” Soldering & Surface Mount Technology, 2022.
[37] A. Behjat, C. Zeng, R. Rai, I. Matei, D. Doermann, and S. Chowdhury, “A physics-aware learning architecture with
input transfer networks for predictive modeling,” Applied Soft Computing, vol. 96, p. 106 665, 2020.
[38] R. Iqbal, A. Behjat, R. Adlakha, J. Callanan, M. Nouh, and S. Chowdhury, “Efficient training of transfer mapping in
physics-infused machine learning models of uav acoustic field,” in AIAA SCITECH 2022 Forum, 2022, p. 0384.
[39] A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, A.
Desmaison, A. Kopf, E. Yang, Z. DeVito, M. Raison, A. Tejani, S. Chilamkurthy, B. Steiner, L. Fang, J. Bai, and
S. Chintala, “Pytorch: An imperative style, high-performance deep learning library,” in Advances in Neural Information
Processing Systems 32, Curran Associates, Inc., 2019, pp. 8024–8035. [Online]. Available: http: / /papers.neurips . cc /
paper/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf.
[40] J. Bradbury, R. Frostig, P. Hawkins, M. J. Johnson, C. Leary, D. Maclaurin, G. Necula, A. Paszke, J. VanderPlas,
S. Wanderman-Milne, and Q. Zhang, JAX: Composable transformations of Python+NumPy programs, version 0.3.13,
2018. [Online]. Available: http://github.com/google/jax.
[41] J. C. Duensing, J. A. Housman, D. Maldonado, C. C. Kiris, and S. Y. Yoo, Jun. 2019. [Online]. Available: https:
//ntrs.nasa.gov/api/citations/20190028345/downloads/20190028345.pdf.
[42] M. Drela, Flight Vehicle Aerodynamics. MIT Press, 2014.
[43] M. Drela. “Qprop formulation.” (), [Online]. Available: https://web.mit.edu/drela/Public/web/qprop/qprop theory.pdf.
[44] L. Veldhuis, “Propeller wing aerodynamic interference,” Ph.D. dissertation, 2028.
[45] C. D. Inc, Charm. [Online]. Available: https://continuum-dynamics.com/product/.
10
APPENDIX
Fig. 9: BLOFI was verified against X-57 CFD data.
11