Yujun Li’s research while affiliated with Shandong University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (70)


Semantic matters: A constrained approach for zero-shot video action recognition
  • Article

June 2025

·

1 Read

Pattern Recognition

Zhenzhen Quan

·

Jialei Chen

·

Daisuke Deguchi

·

[...]

·


Privacy-Preserving Sequential Recommendation with Collaborative Confusion

December 2024

·

12 Reads

ACM Transactions on Information Systems

Sequential recommendation has attracted a lot of attention from both academia and industry, however the privacy risks associated with gathering and transferring users’ personal interaction data are often underestimated or ignored. Existing privacy-preserving studies are mainly applied to traditional collaborative filtering or matrix factorization rather than sequential recommendation. Moreover, these studies are mostly based on differential privacy or federated learning, which often lead to significant performance degradation, or have high requirements for communication. In this work, we address privacy-preserving from a different perspective. Unlike existing research, we capture collaborative signals of neighbor interaction sequences and directly inject indistinguishable items into the target sequence before the recommendation process begins, thereby increasing the perplexity of the target sequence. Even if the target interaction sequence is obtained by attackers, it is difficult to discern which ones are the actual user interaction records. To achieve this goal, we introduce a novel sequential recommender system called CLOUD ( C o L laborative-c O nfusion seq U ential recommen D er), which incorporates a collaborative confusion mechanism to modify the raw interaction sequences before conducting recommendation. Specifically, CLOUD first calculates the similarity between the target interaction sequence and other neighbor sequences to find similar sequences. Then, CLOUD considers the shared representation of the target sequence and similar sequences to determine the operation to be performed: keep, delete, or insert. A copy mechanism is designed to make items from similar sequences have a higher probability to be inserted into the target sequence. Finally, the modified sequence is used to train the recommender and predict the next item. We conduct extensive experiments on three benchmark datasets. The experimental results show that CLOUD achieves a maximum modification rate of 66.57% on interaction sequences, and obtains over 99% recommendation accuracy compared to the state-of-the-art sequential recommendation methods. This proves that CLOUD can effectively protect user privacy at minimal recommendation performance cost, which provides a new solution for privacy-preserving for sequential recommendation. Our implementation is available at https://github.com/weiwang0927/CLOUD .



MSSA: Multispectral Semantic Alignment for Cross-Modality Infrared-RGB Person Reidentification

December 2024

·

1 Read

·

1 Citation

IEEE Transactions on Computational Social Systems

The widespread deployment of dual-camera systems has laid a solid foundation for practical applications of infrared (IR)-RGB cross-modality person reidentification (ReID). However, the inherent modality differences between RGB and IR images cause significant intra-class variances in the feature space for individuals of the same identity. Current methods typically employ various network architectures for the image style transfer or extracting modality-invariant features, yet they overlook the information extraction from the most fundamental spectral semantic features. Based on the existing approaches, we propose a multi-spectral semantic alignment (MSSA) architecture aimed at aligning fine-grained spectral semantic features across both intra-modality and inter-modality perspectives. Through modality center semantic alignment (MCSA) learning, we comprehensively mitigate differences in identity features of different modalities. Moreover, to attenuate the discriminative information unique to a single modality, we introduce the modality reliability intensification (MRI) loss to enhance the reliability of identity information. Finally, to tackle the challenge that inter-modality intra-class disparities surpass inter-modality inter-class differences, we leverage the dynamic discriminative center (DDC) loss to further bolster the discriminability of reliable information. Through an extensive experiments conducted on SYSU-MM01, RegDB, and LLCM datasets, we demonstrate the substantial advantages of the proposed MSSA over other state-of-the-art methods.





Adaptive Single-Hop and Relay-Selection-Based Double-Hop D2D Transmissions Underlaying Large-Scale Cellular Networks

October 2024

·

8 Reads

IEEE Transactions on Vehicular Technology

Device-to-device (D2D) communication enables direct data transmissions between devices, thereby filling in the blind zones of base stations (BSs) and alleviating their load pressures. In this work, we propose an adaptive single-hop and relay-selection-based double-hop D2D transmission scheme underlaying cellular networks with full-frequency reuse. When the communication distance between two D2D users is shorter than a threshold d0d_{0} , the single-hop D2D link can be established. However, when the distance between two D2D users is longer than d0d_{0} but shorter than 2d02d_{0} , and there exists at least one relay node in the overlapping coverage areas of both users, the double-hop D2D transmission is performed. Based on the stochastic geometry theory, we properly model the random distributions of D2D users, relay nodes, cellular users, and BSs, as well as the aggregate interference for each type of communication mode. We propose to divide the overlapping area of the coverage regions of two D2D users into small grids, and the relay selection priority is assigned to each grid according to the selection criterion. The griding of overlapping areas can greatly simplify the analysis of successful relaying by considering the possible locations of the selected relay. We also present two benchmark schemes, namely cellular only, cellular with D2D single-hop. Numerical results show that our proposed scheme can significantly improve the area throughput compared with the benchmark schemes.


Fig. 2: Schematic diagram of the model structure
Fig. 3: Specific structure of the extractive summarization model
Fig. 4: Specific structure of the generative summarization model
Fig. 6: Model performance for the number of candidate summaries on the AED dataset
Fig. 7: AED Dataset Margin β vs. ROUGE-1

+7

An improved approach for generating summaries of judicial instruments
  • Preprint
  • File available

September 2024

·

8 Reads

generation technology can assist administrative law enforcement officers in grasping the core summary of administrative cases more quickly, which is one of the research focuses in the applied research of administrative justice. How to cope with the phenomenon of long-distance dependence on judicial documents and realize high-precision summary generation is an urgent problem to be solved at present. We propose a segmented judicial instrument summary generation model to solve the long-distance dependency problem. The model adopts a two-segment structure combining an extractive summary model and a generative summary model. Among them, the extractive summarization model consists of two parts: the BERT model and the document-level sentence encoder of the Transformer structure, which is used to initially process the judicial documents, extract the key legal information in the original documents, and compose the preliminary summaries, which are used as the training corpus for the generative summarization model. In addition, we introduce the contrast learning idea to alleviate the exposure bias problem in the generative model. The generative summary model takes T5-PEGASUS as the main body, generates pseudo-optimal summaries and multiple candidate summaries, and constructs contrast loss based on contrast learning to mitigate the exposure bias problem in the main model. Finally, the effectiveness of the algorithm is verified through experiments on the CAIL 2020 dataset (Challenge of AI in Law 2020) and the self-labelled AED summary dataset dedicated to administrative enforcement documents (Administrative Enforcement Documents Dataset).

Download


Citations (45)


... Although there are no definitive early signs for the diagnosis of CAD, image-processing systems have emerged as powerful tools for prediction and noninvasive 20,[37][38][39][40] . Given that machine learning (ML) algorithms have already shown significant potential in medical imaging and disease prediction, their importance and application in advancing modern medicine and improving healthcare services continue to grow [41][42][43][44] . ...

Reference:

Predicting coronary artery occlusion risk from noninvasive images by combining CFD-FSI, cGAN and CNN
CMAR: A Pipeline for Cross-Modal Alignment and 3D Reconstruction of Coronary Arteries Based on Key Bifurcation Vessel Measurements

IEEE Transactions on Instrumentation and Measurement

... The advantage of multi-stream models is that they are able to capture the complex relationships of multiple data sources and effectively integrate information from different domains, thereby improving the performance and generalisation of the model. Such models have a wide range of applications in many domains, including computer vision [45] and natural language processing [12]. Multi-stream network models are also widely used in language tasks [29,35] and have good potential for DOA tasks. ...

ASMGCN: Attention-Based Semantic-Guided Multistream Graph Convolution Network for Skeleton Action Recognition
  • Citing Article
  • June 2024

IEEE Sensors Journal

... The achieved results in terms of fairness, energy and spectral efficiencies outperform the OMA schemes. In Zhai et al. (2024b), a multi-phase adaptive transmission is proposed to improve the success probability of the transmitted message to the destination UE. Further, based on the non-linear EH model, the study adopts a differential evolution algorithm to optimize the power allocation coefficients and the transmission time slot duration. ...

Nonlinear energy harvesting based alternate cooperative nonorthogonal multiple access with adaptive interference cancellation
  • Citing Article
  • February 2024

Computer Networks

... Feng et al further advanced the field with multistage integration networks 20 and separable attention modules 21 to restore target-contrast images by leveraging auxiliary contrast images through cross-contrast feature exploration. Moreover, transformer-enabled frameworks 22,23,24,25,26,27 have been used to enhance joint feature space learning, providing a significant enhancement in multi-resolution SR performance. However, all these SRR methods typically require extensive highresolution ground truth datasets for training, which are scarce or not available at mesoscale (< 0.5mm) resolution. ...

Multi-Contrast Complementary Learning for Accelerated MR Imaging
  • Citing Article
  • December 2023

IEEE Journal of Biomedical and Health Informatics

... Chlapanis et al. (2024) introduce Legal Argument Reasoning (LAR), a novel task designed to evaluate the legal reasoning capabilities of LLMs. Deng et al. (2023) develop a syllogistic reasoning dataset for legal judgment analysis, manually curating 11,239 criminal cases that cover 4 criminal elements, 80 charges, and 124 articles. Joshi et al. (2024) propose the Indian Legal Text Understanding and Reasoning Benchmark (IL-TUR), which includes monolingual (English, Hindi) and multilingual (9 Indian languages) tasks to assess the understanding and reasoning capabilities of Indian legal documents. ...

Syllogistic Reasoning for Legal Judgment Analysis
  • Citing Conference Paper
  • January 2023

... Several works [34]- [37] about deep learning-based fall detection with the KD approach are reviewed as follows. Most literatures [34]- [37] only consider the single-stage KD approach. ...

SMTDKD: A Semantic-Aware Multimodal Transformer Fusion Decoupled Knowledge Distillation Method for Action Recognition
  • Citing Article
  • January 2023

IEEE Sensors Journal

... The dense correspondences between cross-modality images were utilized to extract discriminative pixel-wise local features while training by Park H et al. [27]. Chen et al. [28] utilize changes in image appearance from multiple spectra to guide the network in finding modality-invariant commonalities during training which improves efficiency in the use of computational resources. Huang N et al. [29] design a feature extraction strategy to extract the modality-shared appearance features and modality-invariant relation features in shared different multi-dimensional feature spaces, and fuse them into the final features which have more robust and abundant modality-invariant relation information. ...

MSIF: multi-spectrum image fusion method for cross-modality person re-identification

International Journal of Machine Learning and Cybernetics

... In recent years, the demand for medical video segmentation has significantly increased, leading to promising results from researchers in this field [20,25,34,38]. This paper is the first to explore using ultrasound video for AoP measurements. ...

Refined Feature-based Multi-frame and Multi-scale Fusing Gate Network for Accurate Segmentation of Plaques in Ultrasound Videos

Computers in Biology and Medicine

... To obtain the AU features, we first used a micro-expression spotting network of our previous work based on attention and 1D convolutional sliding window [31] to extract microexpression clips in the input video. Then these micro-expressions clips were then processed with a micro-expression action unit recognition model based on dynamic graphs and spatial pyramid [32], which is also our previous work. ...

Micro-expression spotting network based on attention and one-dimensional convolutional sliding window

Multimedia Systems

... Tensor-based data analysis techniques have significant advantages in data representation and processing, facilitating the correlation analysis of intertwined heterogeneous data [4]. Meanwhile, some tensorbased analysis methods have been proposed, such as tensorbased data fusion [5], tensor-based multi-clustering [6], tensorbased multi-modal prediction [7], and tensor-based accurate recommendation [8]. These methods play crucial roles in various fields such as intelligent transportation, online education, and social networking [9]. ...

User-Context Collaboration and Tensor Factorization for GNN-Based Social Recommendation
  • Citing Article
  • November 2023

IEEE Transactions on Network Science and Engineering