Guoyong Wang’s research while affiliated with Nanjing University of Information Science and Technology and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (7)


The system model diagram.
The overall architecture of the MP-QGRD.
Simulation results of node velocity variation: (a) packet delivery ratio; (b) end-to-end delay; (c) energy consumption in communication; (d) routing overhead.
Simulation results of changes in the number of nodes: (a) packet delivery ratio; (b) end-to-end delay; (c) energy consumption in communication; (d) routing overhead.
Geographic Routing Decision Method for Flying Ad Hoc Networks Based on Mobile Prediction
  • Article
  • Full-text available

April 2025

·

15 Reads

Guoyong Wang

·

Mengfei Fan

·

Saiwei Jia

·

[...]

·

Lin Wang

Flying ad hoc networks (FANETs) have highly dynamic and energy-limited characteristics. Compared with traditional mobile ad hoc networks, their nodes move faster and their topology changes more frequently. Therefore, the design of routing protocols faces greater challenges. The existing routing schemes rely on frequent and fixed-interval Hello transmissions, which exacerbates network load and leads to high communication energy consumption and outdated location information. MP-QGRD combined with the extended Kalman filter (EKF) is used for node position prediction, and the Hello packet transmission interval is dynamically adjusted to optimize neighbor discovery. At the same time, reinforcement learning methods are used to comprehensively consider link stability, energy consumption, and communication distance for routing decisions. The simulation results show that compared to QMR, QGeo, and GPSR, MP-QGRD has an increased packet delivery rate, end-to-end latency, and communication energy consumption by 10%, 30%, and 15%, respectively.

Download

Stochastic Zeroth-Order Multi-Gradient Algorithm for Multi-Objective Optimization

February 2025

·

14 Reads

Multi-objective optimization (MOO) has become an important method in machine learning, which involves solving multiple competing objective problems simultaneously. Nowadays, many MOO algorithms assume that gradient information is easily available and use this information to optimize functions. However, when encountering situations where gradients are not available, such as black-box functions or non-differentiable functions, these algorithms become ineffective. In this paper, we propose a zeroth-order MOO algorithm named SZMG (stochastic zeroth-order multi-gradient algorithm), which approximates the gradient of functions by finite difference methods. Meanwhile, to avoid conflicting gradients between functions and reduce stochastic multi-gradient direction bias caused by stochastic gradients, an SGD-type method is adopted to acquire weight parameters. Under the non-convex setting and mild assumptions, the convergence rate is established for the SZMG algorithm. Simulation results demonstrate the effectiveness of the SZMG algorithm.


Adaptive temporal-difference learning via deep neural network function approximation: a non-asymptotic analysis

January 2025

·

13 Reads

·

1 Citation

Complex & Intelligent Systems

Although deep reinforcement learning has achieved notable practical achievements, its theoretical foundations have been scarcely explored until recent times. Nonetheless, the rate of convergence for current neural temporal-difference (TD) learning algorithms is constrained, largely due to their high sensitivity to stepsize choices. In order to mitigate this issue, we propose an adaptive neural TD algorithm (AdaBNTD) inspired by the superior performance of adaptive gradient techniques in training deep neural networks. Simultaneously, we derive non-asymptotic bounds for AdaBNTD within the Markovian observation framework. In particular, AdaBNTD is capable of converging to the global optimum of the mean square projection Bellman error (MSPBE) with a convergence rate of O(1/K){{\mathcal {O}}}(1/\sqrt{K}), where K denotes the iteration count. Besides, the effectiveness AdaBNTD is also verified through several reinforcement learning benchmark domains.


Fig. 7 Measured output and controlled output
Simultaneous fault detection and control for a class of nonlinear uncertain systems with limited communication

October 2023

·

26 Reads

This paper investigates the simultaneous fault detection and control(SFDC) problem for a class of uncertain nonlinear networked control systems with limited communication. In order to improve the utilization of network resources, a novel dynamic event-triggered mechanism is employed affected by the Bernoulli package dropouts phenomenon, which is usually appearing in practical systems. Then, a fault detection filter and dynamic output feedback controller module is applied to achieve the required fault detection and control objectives simultaneously. The linear matrix inequality-based approach is provided to obtain the designed filter and controller parameters so that the closed-loop system is robustly stochastically stable and meets the required performance. Finally, two practical examples are adopted to show the effectiveness of the developed scheme.


Multi-Depot Heterogeneous Vehicle Routing Optimization for Hazardous Materials Transportation

January 2023

·

81 Reads

·

6 Citations

IEEE Access

This paper considers a multi-depot heterogeneous vehicle routing problem (MDHVRP) with time windows, which is very crucial for hazardous materials transportation. For this reason, we formalize this problem as a multi-objective MDHVRP optimization model, where the actual load dependent risk of hazardous materials transportation is considered. To solve the optimization problem, we propose a hybrid multi-objective evolutionary algorithm (HMOEA) and a two-stage algorithm (TSA). In addition, we verify the performance of the proposed algorithms by experiments on the modified Solomon’s VRPTW examples. In the experiment, it can be seen from the distribution of Pareto solution sets and the convergence distribution of IGD values that HMOEA is significantly superior to the other three algorithms in searching for Pareto solutions, as well as in the convergence and diversity of the algorithm. HMOEA and TSA were compared, and the minimum cost obtained by TSA was 13.38% lower than HMOEA, while the minimum risk was 81.69% higher than HMOEA. The advantages of each algorithm in finding solutions in reality were analyzed. A comparison was made between multi-depots heterogeneous VRP and multi-depots homogeneous VRP in the C101 instance, and the results showed that scheduling heterogeneous vehicles would reduce risk and cost.


Framework of the proposed AESPF‐GZSL method
ZSL Classification accuracy with α on three datasets
GZSL Classification accuracies with α on AWA
GZSL Classification accuracies with α on CUB
GZSL Classification accuracies with α on SUN
Auto‐encode the synthesis pseudo features for generalized zero‐shot learning

September 2022

·

23 Reads

Abstract Zero‐shot learning (ZSL) is to identify target categories without labeled data, in which semantic information is used to transfer knowledge from some seen categories. In the existing Generalized Zero‐Shot Learning (GZSL) methods, domains shift problem always appeared during generating feature stage. In order to solve this problem, a new method to Auto‐Encode the Synthesis Pseudo Features for the GZSL task (AESPF‐GZSL) is proposed in this manuscript. Specifically, the AESPF‐GZSL method trains the generated features under the semantic auto‐encoder framework and exploits attention mechanism to train the generated features again. Then, the generated features are input to the classifier. The proposed method is performed on three benchmark data sets referred as to AWA, CUB and SUN. The experimental results show that the proposed method achieves the state‐of‐the art classifier accuracy both in ZSL and GZSL settings. In ZSL setting, the classification accuracy of our method is superior to the compared algorithms, improved by 0.40% in AWA and 0.30% in SUN, respectively. And in GZSL setting, the classification accuracy of the method is superior to the comparison algorithm 0.41% in Harmonic mean on AWA, and 1.01%, 0.62%, and 1.05% in training data set, testing data set, and harmonic average on SUN.


Stochastic Adaptive Forwarding Strategy Based on Deep Reinforcement Learning for Secure Mobile Video Communications in NDN

April 2021

·

119 Reads

·

9 Citations

Named Data Networking (NDN) can effectively deal with the rapid development of mobile video services. For NDN, selecting a suitable forwarding interface according to the current network status can improve the efficiency of mobile video communication and can also avoid attacks to improve communication security. For this reason, we propose a stochastic adaptive forwarding strategy based on deep reinforcement learning (SAF-DRL) for secure mobile video communications in NDN. For each available forwarding interface, we introduce the twin delayed deep deterministic policy gradient algorithm to obtain a more robust forwarding strategy. Moreover, we conduct various numerical experiments to validate the performance of SAF-DRL. Compared with BR, RFA, SAF, and AFSndn forwarding strategies, the results show that SAF-DRL can reduce the delivery time and the average number of lost packets to improve the performance of NDN.

Citations (3)


... Deriving the values of all fuzzy parameters of an AE-FDNN is challenging because the effects of these parameters on the output may offset each other [15,28]. Consequently, the following steps are conducted: ...

Reference:

Flexible and objective diagnosis of type II diabetes by using a fuzzy deep learning ensemble approach
Adaptive temporal-difference learning via deep neural network function approximation: a non-asymptotic analysis

Complex & Intelligent Systems

... Furthermore, [35] addressed the time-dependent fleet size and mixed MDVRP, introducing a mathematical model with generic and problem-specific valid inequalities along with a powerful meta-heuristic for solving large instances of the problem. Finally, [36] focused on hazardous material transportation, introducing multi-objective optimization models and algorithms such as the Hybrid Multi-Objective Evolutionary Algorithm (HMOEA) and Two-Stage Algorithm (TSA) to optimize vehicle routing while considering various factors such as time windows, actual load, and depot stock. By providing references to the articles, the VRP variants addressed, and the methods employed, a summary of the relevant literature is presented in Table 1. ...

Multi-Depot Heterogeneous Vehicle Routing Optimization for Hazardous Materials Transportation

IEEE Access

... Several proposed methods shown in (de Sena et al., 2020;Fu et al., 2017;Gong et al., 2016;Hao et al., 2021;Kerrouche et al., 2016;Lehman et al., 2016;Lei et al., 2015;Mekinda & Muscariello, 2016;Posch et al., 2017;Ren et al., 2019;Yao et al., 2017;Ye et al., 2017; are considered as adaptive forwarding for their ability to estimate network conditions and adapt accordingly. ...

Stochastic Adaptive Forwarding Strategy Based on Deep Reinforcement Learning for Secure Mobile Video Communications in NDN