Annie S. Wu’s research while affiliated with University of Central Florida and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (128)


Figure 2. Effect of Varying Velocity Magnitudes (v 1 and v 2 ) on the Measurement of the CMB Temperature
Figure 5. Schematics of the Orbit and the Sensors' Offset for the Simulation
RMSE of Velocity for Each Component (v x , v y , v z ) at Different T F Values
Model Parameters for the Evaluation of the CMB-Based Velocity Predictions
Evaluation Results for Velocity Estimation Using Regression Models with Bootstrap 95% CI
Orbit Determination through Cosmic Microwave Background Radiation
  • Preprint
  • File available

April 2025

·

4 Reads

·

·

Annie S. Wu

·

[...]

·

Paulo Costa

This research explores the use of Cosmic Microwave Background (CMB) radiation as a reference signal for Initial Orbit Determination (IOD). By leveraging the unique properties of CMB, this study introduces a novel method for estimating spacecraft velocity and position with minimal reliance on pre-existing environmental data, offering significant advantages for space missions independent of Earth-specific conditions. Using Machine Learning (ML) regression models, this approach demonstrates the capability to determine velocity from CMB signals and subsequently determine the satellite's position. The results indicate that CMB has the potential to enhance the autonomy and flexibility of spacecraft operations.

Download

Enhancing BVR Air Combat Agent Development With Attention-Driven Reinforcement Learning

January 2025

·

12 Reads

IEEE Access

This study explores the use of Reinforcement Learning (RL) to develop autonomous agents for Beyond Visual Range (BVR) air combat, addressing the challenges of dynamic and uncertain adversarial scenarios. We propose a novel approach that introduces a task-based layer, leveraging domain expertise to optimize decision-making and training efficiency. By integrating multi-head attention mechanisms into the policy model and employing an improved DQN algorithm, agents dynamically select context-aware tasks, enabling the learning of efficient emergent behaviors for variable engagement conditions. Evaluations in single- and multi-agent BVR scenarios against adversaries with diverse tactical characteristics demonstrate superior training efficiency and enhanced agent capabilities compared to leading RL algorithms commonly applied in similar domains, including PPO, DDPG, and SAC. A robustness study underscores the critical role of diverse enemy selection in the RL process, showing that adversaries with variable tactical behaviors are essential for developing robust agents. This work advances RL methodologies for autonomous BVR air combat and provides insights applicable to other problems with challenging adversarial scenarios.


Optimized Prediction of Weapon Effectiveness in BVR Air Combat Scenarios Using Enhanced Regression Models

January 2025

·

61 Reads

·

1 Citation

IEEE Access

This study investigates high-performance models for predicting the Weapon Engagement Zone (WEZ) in beyond-visual-range (BVR) air combat scenarios. Accurate WEZ predictions are crucial for decision-making in air combat, and high-performance solutions are essential for developing and deploying autonomous systems. To optimize the model training process, we introduce novel feature engineering and data augmentation strategies, achieving a 70% improvement in the Mean Absolute Error (MAE) of WEZ predictions. A comparison of various regression methods highlights the potential of polynomial-based alternatives when fully utilized. In our evaluations, Polynomial Regression (PR) with higher interaction degrees outperforms more complex machine learning models in prediction accuracy and computational efficiency. For instance, Lasso regression, a PR method with regularization, achieves results that are 33% better and 2.1 times faster than the best artificial neural network-based solution. Our results challenge common assumptions in the literature about the complexity and feasibility of higher-order PR solutions, suggesting that they can be a compelling alternative for various challenges across domains. This study also provides a new open dataset to facilitate further research and advancements in this field.


Figure 1. Visualization of BVR 2x2 Air Combat simulations in B-ACE. Snapshot of evaluation in a MARL experiment (left). Multiple simulation running in parallel in the Experiment Mode (right).
Figure 2. Example of the prediction generated by the WEZ model created based on the default B-ACE Missile performance, evaluated across different altitudes and aspects among agents.
Figure 3. Mean Reward for MARL training in a BVR 2x2 Air Combat Scenario using Multi-Agent PPO and Multi-Agent DDPG
B-ACE: An Open Lightweight Beyond Visual Range Air Combat Simulation Environment for Multi-Agent Reinforcement Learning

December 2024

·

404 Reads

·

1 Citation

This paper introduces B-ACE (Beyond Visual Range (BVR)-Air Combat Environment), an open-source simulation framework leveraging the Godot game engine to evaluate Multi-Agent Reinforcement Learning (MARL) for military research and development. Traditional military simulations are often restricted, limiting research discussions and comparisons among different groups. B-ACE addresses this by providing an open, accessible environment that can be easily shared and extended within the research community, ensuring reproducibility and flexibility for further studies. Our approach capitalizes on Godot's high performance and script-based development, offering a cost-effective and customizable solution for creating air combat scenarios. This integration allows rapid prototyping and evaluation of autonomous agent behaviors using existing reinforcement learning frameworks. In the developed scenario, agents should learn to engage in BVR Air Combat, defend itself and a position against enemy aircraft. Using integration with state-of-the-art MARL algorithms, we explore advanced techniques in autonomous agent development within complex Beyond Visual Range (BVR) air combat scenarios. The environment simulates key aspects of air combat, including radar detection, weapons engagement, and tactical maneuvers. While not overly realistic, B-ACE provides a valuable testbed for prototyping and evaluating AI development approaches. Through three study cases, we demonstrate B-ACE's capability to support research in BVR air combat scenarios, including the generation of weapon efficiency models, optimizing baseline behaviors, and training agents using MARL. ABOUT THE AUTHORS Andre R. Kuroswiski, M.S., is a Lieutenant Colonel in the Brazilian Air Force. As a fighter pilot and electrical engineer, he became a researcher at the Institute for Advanced Studies (IEAv) in 2018. Since then, he has been working on simulation development to support autonomous agents research and military scenario analysis. Currently, he is a visiting researcher at the University of Central Florida, focusing on enhancing deep reinforcement learning techniques to develop cooperative air combat agents.



Figure 1: Attention Task-based action selection for MARL.
Figure 2: Attention-driven task-based solution Mean Rewards for MPE Single Spread and SISL Pursuit.
Figure 3: MPE Single Spread scalability with 95% CI.
MPE Simple Spread and SISL Pursuit Task-Based Actions definition and explanation predator-prey game in a discrete setting, where agents work together to trap the prey with limited views and random prey moves. We develop extended versions of the base environ- ments, including the Task Generator and the Task to Action Converter, to process observations, generate tasks, and pro- duce valid actions the policy selection. Table 1 detail the tasks for the Single Spread and Pursuit, including embedded knowledge, action conversion, and selected features.
Attention-Driven Multi-Agent Reinforcement Learning: Enhancing Decisions with Expertise-Informed Tasks

May 2024

·

47 Reads

The International FLAIRS Conference Proceedings

In this paper, we introduce an alternative approach to enhancing Multi-Agent Reinforcement Learning (MARL) through the integration of domain knowledge and attention-based policy mechanisms. Our methodology focuses on the incorporation of domain-specific expertise into the learning process, which simplifies the development of collaborative behaviors. This approach aims to reduce the complexity and learning overhead typically associated with MARL by enabling agents to concentrate on essential aspects of complex tasks, thus optimizing the learning curve. The utilization of attention mechanisms plays a key role in our model. It allows for the effective processing of dynamic context data and nuanced agent interactions, leading to more refined decision-making. Applied in standard MARL scenarios, such as the Stanford Intelligent Systems Laboratory (SISL) Pursuit and Multi-Particle Environments (MPE) Simple Spread, our method has been shown to improve both learning efficiency and the effectiveness of collaborative behaviors. The results indicate that our attention-based approach can be a viable approach for improving the efficiency of MARL training process, integrating domain-specific knowledge at the action level.


Independent variables measuring the rate of change of glucose in the 30 minutes before a hypoglycemic event, R.
Machine learning prediction of severity and duration of hypoglycemic events in type 1 diabetes patients

May 2024

·

20 Reads

The International FLAIRS Conference Proceedings

We compare the performance of machine learning methods for building predictive models to estimate the expected characteristics of hypoglycemic or low blood glucose events in type 1 diabetes patients. We hypothesize that the rate of change of blood glucose ahead of a hypoglycemic event may affect the severity and duration of the event and investigate the utility of machine learning methods on using blood glucose rate of change, in combination with other physiological and demographic factors, to predict the minimum glucose value and the duration of a hypoglycemic event. This work compares the performance of six state-of-the-art methods on prediction accuracy and feature selection. Results find that XGBoost delivers the best performance in all cases. Examination of the XGBoost feature importance scores show that glucose rate of change is the most used feature in the models generated by XGBoost.


Genetic algorithm feature selection resilient to increasing amounts of data imputation

The International FLAIRS Conference Proceedings

This paper investigates the robustness of a genetic algorithm (GA) in feature selection across a dataset with increasing imputed missing values. Feature selection can be beneficial in predictive modeling to reduce computational costs and potentially improve performance. Beyond these benefits, it also enables a clearer understanding of the algorithm's decision-making processes. In the context of real-world datasets that can contain missing values, feature selection becomes more challenging.A robust feature selection algorithm should be able to identify the key features despite missing data values. We investigate the effectiveness of this approach against two other feature selection algorithms on a dataset with increasingly imputed values to determine whether it can sustain good performance with only the selected features.Our results reveal that compared to the other two methods, the features selected by GA resulted in better classification performance across different imputation rates and methods.



Security Awareness in Smart Homes and Internet of Things Networks through Swarm-Based Cybersecurity Penetration Testing

September 2023

·

197 Reads

·

3 Citations

Internet of Things (IoT) devices are common in today’s computer networks. These devices can be computationally powerful, yet prone to cybersecurity exploitation. To remedy these growing security weaknesses, this work proposes a new artificial intelligence method that makes these IoT networks safer through the use of autonomous, swarm-based cybersecurity penetration testing. In this work, the introduced Particle Swarm Optimization (PSO) penetration testing technique is compared against traditional linear and queue-based approaches to find vulnerabilities in smart homes and IoT networks. To evaluate the effectiveness of the PSO approach, a network simulator is used to simulate smart home networks of two scales: a small, home network and a large, commercial-sized network. These experiments demonstrate that the swarm-based algorithms detect vulnerabilities significantly faster than the linear algorithms. The presented findings support the case that autonomous and swarm-based penetration testing in a network could be used to render more secure IoT networks in the future. This approach can affect private households with smart home networks, settings within the Industrial Internet of Things (IIoT), and military environments.


Citations (63)


... Additionally, we include higher-level variables representing dynamic calculations for missile effectiveness. These are identified as RMax (maximum range at which a missile can succeed if the enemy takes no evasive action) and NEZ (No Escape Zone, the range at which a successful launch is highly probable) [38], [39]. These calculations are vital for BVR air combat, simplifying the learning process by providing direct information about offensive and defensive conditions. ...

Reference:

Enhancing BVR Air Combat Agent Development With Attention-Driven Reinforcement Learning
Optimized Prediction of Weapon Effectiveness in BVR Air Combat Scenarios Using Enhanced Regression Models

IEEE Access

... In our method, actions are represented by the tasks, however, we also need to define the raw actions since they represent the actual interactions with the simulation environment. The raw actions consist of continuous control variables, including heading, flight level, desired g-force, and missile firing [40]. These inputs allow agents to make precise adjustments to the aircraft's heading, altitude, and g-force, as well as to determine the optimal moment for missile firing. ...

B-ACE: An Open Lightweight Beyond Visual Range Air Combat Simulation Environment for Multi-Agent Reinforcement Learning

... Computer networks today inherit devices commonly known as Internet of Things (IoT) devices. IoT devices are characterized as objects that are connected to the internet [4]. The IoT is fostering innovation across every sector, from smart homes that provide convenience and energy efficiency to industrial settings that optimize operations through predictive maintenance [5]. ...

Security Awareness in Smart Homes and Internet of Things Networks through Swarm-Based Cybersecurity Penetration Testing

... Zhao and Gu proposed an adaptive PID method for car suspensions, where a radial basis function neural network is used to fine-tune the PID parameters, improving ride quality and suspension control 21 . Kebari et al. optimized PID parameter values based on real-time task demand and the cumulative sum of previous demands, providing a more responsive control system 22 . Similarly, Gupta et al. employed a hybrid swarm intelligence algorithm to adjust PID gains for stabilizing the active magnetic bearing (AMB) system under unstable conditions 23 . ...

Pid-Inspired Modifications in Response Threshold Models In Swarm Intelligent Systems
  • Citing Conference Paper
  • July 2023

... LEO utilizes Gaussian mutation [79] as the operator for EA. Self-adaptation is the ability of a GA's ability to modify its algorithm while solving a specific problem [80]. It has been demonstrated that the Gaussian mutation operator is the most effective and popular option for self-adaptation in GA. ...

Genetic algorithms with self-adaptation for predictive classification of Medicare standardized payments for physical therapists
  • Citing Article
  • May 2023

Expert Systems with Applications

... Masalah yang ada pada penelitian ini yaitu dataset yang kecil dan terdapat nilai yang kosong pada umumnya jika dilakukan penghapusan data maka akan terjadi penurunan kinerja akurasi yang disebabkan oleh pengurangan jumlah data maka solusi yang dilakukan proses imputasi missing value menggunakan Mean dan KNN Imputation (Martinez et al., 2022). Dalam penelitian ini menggunakan metode klasifikasi meliputi metode Random Forest, KNN dan Naive Bayes kemudian untuk dataset yang digunakan menggunakan 2 dataset data public dari repository UCI (Dinh et al., 2021). ...

Effects of imputation strategy on genetic algorithms and neural networks on a binary classification problem
  • Citing Conference Paper
  • July 2022

... e swarm parameters are displayed in Table 5. e swarm members of the four groups are S 1 ∼ S 2 and A 1 ∼ A 4 , S 1 ∼ S 3 and A 1 ∼ A 6 , S 1 ∼ S 4 and A 1 ∼ A 8 , and S 1 ∼ S 5 and A 1 ∼ A 10 , respectively. e method proposed in this paper, together with binary particle swarm optimization (BPSO) [37], binary artificial fish swarm algorithm (BASFA) [38,39], and genetic algorithm (GA) [5,40], respectively, runs the above grouping experiments 50 times independently, and the average track cost (ATC) and the average total calculation time (ATCT) are counted. ATC is the average value of the UAV trajectory where all targets have been destroyed. ...

Analysis of Evolved Response Thresholds for Decentralized Dynamic Task Allocation
  • Citing Article
  • May 2022

ACM Transactions on Evolutionary Learning and Optimization

... Distinct locomotion techniques can be employed to enable users to explore VE. Teleporting, joysticks, and arm swings are practical techniques for small physical spaces since they rely on hand-controller usage and do not require the user to physically move [16,21]. However, they significantly deviate from natural human locomotion, disrupting immersion and presence in VR, unlike natural walking, which closely mirrors real-world locomotion [20,30]. ...

Research Trends in Virtual Reality Locomotion Techniques
  • Citing Conference Paper
  • March 2022

... While the feasibility is demonstrated, the research acknowledges the need for further refinement and extensive investigations before practical application. This approach reflects the intersection of behavioral analysis and medical diagnostics, addressing a crucial aspect of road safety [25]. ...

Detection of Driver Health Condition by Monitoring Driving Behavior through Machine Learning from Observation
  • Citing Article
  • April 2022

Expert Systems with Applications

... LSTMs seem to have taken over the PCGML (Procedural Content Generation based on Machine Learning) experiments, thanks to the readily available implementations in Python and C#, as well as their recurrent nature that caters for generation of diverse and (theoretically) infinite content [48]. For instance, Savery and Weinberg [52] used LSTMs to synthesise musical scores based on image and video analysis, and Botoni et al. [5] to create NPCs with more depth in terms of dialogue and style. ...

Character Depth and Sentence Diversification in Automated Narrative Generation