Lei Liu’s research while affiliated with University of Science and Technology of China and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (48)


VQLTI: Long-Term Tropical Cyclone Intensity Forecasting with Physical Constraints
  • Article

April 2025

·

5 Reads

Proceedings of the AAAI Conference on Artificial Intelligence

Xinyu Wang

·

Lei Liu

·

Kang Chen

·

[...]

·

Tropical cyclone (TC) intensity forecasting is crucial for early disaster warning and emergency decision-making. Numerous researchers have explored deep-learning methods to address computational and post-processing issues in operational forecasting. Regrettably, they exhibit subpar long-term forecasting capabilities. We use two strategies to enhance long-term forecasting. (1) By enhancing the matching between TC intensity and spatial information, we can improve long-term forecasting performance. (2) Incorporating physical knowledge and physical constraints can help mitigate the accumulation of forecasting errors. To achieve the above strategies, we propose the VQLTI framework. VQLTI transfers the TC intensity information to a discrete latent space while retaining the spatial information differences, using large-scale spatial meteorological data as conditions. Furthermore, we leverage the forecast from the weather prediction model FengWu to provide additional physical knowledge for VQLTI. Additionally, we calculate the potential intensity (PI) to impose physical constraints on the latent variables. In the global long-term TC intensity forecasting, VQLTI achieves state-of-the-art results for the 24h to 120h, with the MSW (Maximum Sustained Wind) forecast error reduced by 35.65%-42.51% compared to ECMWF-IFS.





The pipeline of our method for quantitative assessment in premature infants (red circle to indicate the calculation area within the ROI)
Association between retinal vascular fractal dimensions and retinopathy of prematurity: an AI-assisted retrospective case-control study
  • Article
  • Publisher preview available

March 2025

·

3 Reads

International Ophthalmology

Purpose The main objective of this study was to analyze the fractal dimensions (D(f)) of retinal vasculature in premature infants with retinopathy of prematurity (ROP) and determine their correlation with ROP severity. Methods We conducted a single-center retrospective case-control study involving 641 premature patients with ROP (641 eyes) and 684 normal preterm infants (684 eyes) matched for corrected gestational age (CGA). Computer-assisted techniques were used to quantify peripapillary retinal vascular D(f), vessel tortuosity (VT), and vessel width (VW). Results Compared to the normal preterm groups, patients with ROP exhibited a significant increase in retinal vascular D(f) by 0.0061 (P = 0.0002). Subgroup analyses revealed a significant association between increasing ROP severity and increased retinal vascular D(f) (P < 0.05). Multivariable-adjusted ordered logistic regression models demonstrated that retinal vascular D(f) (aOR: 3.307, P < 0.0001) was significantly independent and associated with ROP severity. For every 0.1 increase in D(f), the probability of ROP requiring intervention increased by 33.07%. Multiple linear regression models indicated a significant positive correlation between D(f) and VT, as well as VW around the optic disc (P < 0.0001). For every 1 (10⁴ cm⁻³) increase in VT, D(f) increased by 0.0010. Similarly, for every 1 (μm) increase in VW, D(f) increased by 0.0025. Conclusions Our findings suggest that increased D(f) in retinal vessels is a pathological characteristic of ROP. This increase may be attributed to the curvature and width of the retinal vasculature in infants with ROP. Quantitative measurement of retinal vascular D(f) could serve as a valuable vascular indicator for assessing the severity of ROP.

View access options

Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation

March 2025

·

5 Reads

Multimodal autoregressive (AR) models, based on next-token prediction and transformer architecture, have demonstrated remarkable capabilities in various multimodal tasks including text-to-image (T2I) generation. Despite their strong performance in general T2I tasks, our research reveals that these models initially struggle with subject-driven image generation compared to dominant diffusion models. To address this limitation, we introduce Proxy-Tuning, leveraging diffusion models to enhance AR models' capabilities in subject-specific image generation. Our method reveals a striking weak-to-strong phenomenon: fine-tuned AR models consistently outperform their diffusion model supervisors in both subject fidelity and prompt adherence. We analyze this performance shift and identify scenarios where AR models excel, particularly in multi-subject compositions and contextual understanding. This work not only demonstrates impressive results in subject-driven AR image generation, but also unveils the potential of weak-to-strong generalization in the image generation domain, contributing to a deeper understanding of different architectures' strengths and limitations.



VQLTI: Long-Term Tropical Cyclone Intensity Forecasting with Physical Constraints

January 2025

·

6 Reads

Tropical cyclone (TC) intensity forecasting is crucial for early disaster warning and emergency decision-making. Numerous researchers have explored deep-learning methods to address computational and post-processing issues in operational forecasting. Regrettably, they exhibit subpar long-term forecasting capabilities. We use two strategies to enhance long-term forecasting. (1) By enhancing the matching between TC intensity and spatial information, we can improve long-term forecasting performance. (2) Incorporating physical knowledge and physical constraints can help mitigate the accumulation of forecasting errors. To achieve the above strategies, we propose the VQLTI framework. VQLTI transfers the TC intensity information to a discrete latent space while retaining the spatial information differences, using large-scale spatial meteorological data as conditions. Furthermore, we leverage the forecast from the weather prediction model FengWu to provide additional physical knowledge for VQLTI. Additionally, we calculate the potential intensity (PI) to impose physical constraints on the latent variables. In the global long-term TC intensity forecasting, VQLTI achieves state-of-the-art results for the 24h to 120h, with the MSW (Maximum Sustained Wind) forecast error reduced by 35.65%-42.51% compared to ECMWF-IFS.



Adaptive Confidence Multi-View Learning

January 2025

·

1 Read

IEEE Transactions on Multimedia

Multi-view hashing is a crucial technology for multimedia retrieval because it transforms heterogeneous data from many viewpoints into binary hash codes. However, the existing approaches focus mostly on the complementarity among multiple views while being without confidence fusion. Furthermore, redundant noise is present in the single-view data in real-world application contexts. We present an innovative Adaptive Confidence Multi-View Learning (ACMVL) method to perform confidence fusion and remove extraneous noise. Initially, a confidence network is constructed to eliminate noise data and extract useful information from various single-view features. Moreover, an adaptive confidence multi-view network is utilized to quantify the confidence of each view and further fuse multiple view features using a weighted summation. Here, we propose an Automatic View Confidence Metric (AVCM) as a score for evaluating the confidence of views. Finally, to improve the semantic representation of the fused feature, a dilation network is created. Based on ACMVL, we introduce a novel Adaptive Confidence Multi-View Hashing (ACMVH) method. To our knowledge, we are the pioneers in using confidence learning for multimedia retrieval. Comprehensive experiments on three publicly available datasets demonstrate that our ACMVH outperforms the state-of-the-art methods (maximum improvement of 3.24%3.24\% on mAP). The original code is available at https://github.com/HackerHyper/ACMVH .


Citations (25)


... Accurate weather forecasts enable airlines to enhance aviation safety [1], support agriculture in optimizing crop management [2], and assist the energy industry in managing production and mitigating weather-related risks [3]. Moreover, reliable forecasts are crucial for early warning systems, aiding in the preparation for natural disasters and extreme weather events, thereby safeguarding lives and property [4,5]. ...

Reference:

WEATHER-5K: A Large-scale Global Station Weather Dataset Towards Comprehensive Time-series Forecasting Benchmark
Global Tropical Cyclone Intensity Forecasting with Multi-modal Multi-scale Causal Autoregressive Model
  • Citing Conference Paper
  • April 2025

... However, the inherent randomness and instability self-attention mechanism of Vision Transformer to fully exploit the complex nonlinear relationships among input data, thereby improving the accuracy of ultra-short-term wind power forecasting. Liu et al. [33] proposed an interpretable Transformer model integrating decoupled feature-temporal self-attention and variable-attention networks and enhanced wind power prediction accuracy through multi-task learning. ...

Interpretable feature-temporal transformer for short-term wind power forecasting with multivariate time series
  • Citing Article
  • November 2024

Applied Energy

... Recently, the characterization of model uncertainty has received extensive attention (Zheng et al. 2023;Zhu et al. 2024), especially in safety-critical fields such as medical diagnosis and self-driving, where understanding the reliability and confidence of algorithmic decisions is crucial (Pedersen et al. 2017). A series of uncertainty-based learning has been proposed, including Bayesian neural networks (BNN) (Blundell et al. 2015), confidence learning (Han et al. 2022b), and evidence-based learning (Audun 2001). ...

Adaptive Confidence Multi-View Hashing for Multimedia Retrieval
  • Citing Conference Paper
  • April 2024

... Physical triggers [18,43,54,57] are real-world objects that activate the backdoor, posing serious risks in environments like autonomous driving, where they blend seamlessly and are hard to distinguish from benign objects. Lastly, word triggers [3,13,55] in natural language models are specific words, either inherent or injected, that activate a backdoor. Their complexity and the need for deep linguistic analysis make detection particularly challenging. ...

Efficient Trigger Word Insertion
  • Citing Conference Paper
  • December 2023

... Several frequency-based attacks (Jia et al., 2022;Luo et al., 2022) are proposed to evade frequency-based detectors and keep adversarial perturbations imperceptible. Recently, backdoor attacks (Sun et al., 2023), attribute variation-based attacks (Meng et al., 2023) and audio-based attacks (Panariello et al., 2023) have been applied to face forgery detection. On the contrary, only a few works (Hussain et al., 2021;Neekhar et al., 2021) try to prevent detectors from adversarial attacks. ...

Real is not True: Backdoor Attacks Against Deepfake Detection
  • Citing Conference Paper
  • December 2023

... The entire process of AI-based blood vessel feature analysis closely follows the methodology outlined in the paper published by Liu et al. [28], as shown in Fig. 1. The key steps are as follows: (1) Extraction of the optic disc and blood vessels from the fundus image. ...

Vascular features around the optic disc in familial exudative vitreoretinopathy: findings and their relationship to disease severity

BMC Ophthalmology

... Yu et al. [28] proposed a new model for the optimization of attention mechanisms based on BiGRU and RF-WOA-VMD. Liu et al. [29] eliminated high wind speed and low power as well as low wind speed and high power data, which combined encoder-decoder (ED), BiGRU, and feature-temporal attention (FT-Attention) to predict wind power with good prediction results. ...

Ultra-short-term wind power forecasting based on deep Bayesian model with uncertainty
  • Citing Article
  • March 2023

Renewable Energy

... In t year, Heitmar et al. studied the effect of endothelial microparticles on retinal SO betes and cardiovascular disease [30]. In 2023, Chen et al. researched the idiopa retinal membrane based on the retinal SO2 and vascular morphological chara [31]. ...

Screening of idiopathic epiretinal membrane using fundus images combined with blood oxygen saturation and vascular morphological features

International Ophthalmology

... For instance, Zacny et al. [33] found that the percussive approaches were other alternatives to increase the drilling effectiveness by reducing the soil adhesion to the drilling tool on the Moon. It was observed in recent studies that the bio-inspired strategies could develop highly efficient, low cost and sustainable designs in geotechnical engineering [1,[22][23][24]37] and have attracted much attention in the current drilling tool designs on the earth [3,4,31,36]. The clam-inspired drilling design might be one of the most popular concepts in bio-inspired drilling design in recent years. ...

A Novel and High-efficiency Razor Clam Inspired Burrowing Robot that Utilizes Localized Fluidization Mechanism
  • Citing Conference Paper
  • October 2021