Changjun Jiang’s research while affiliated with Tongji University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (392)


Domain-wise knowledge decoupling for personalized federated learning via Radon transform
  • Article

March 2025

·

1 Read

Neurocomputing

Zihao Lu

·

·

Changjun Jiang

Figure 3: Visualization of financial time series data with different movement patterns.
Figure 4: Hexbin plots illustrating the normalized density values of the low-dimensional feature spaces generated by PCA, applied to stock features such as open price, close price, high price, low price, and trading volume for FinTSB, alongside four different time-sliced stock data.
Figure 5: The pipeline of FinTSB with four integral modules.
Figure 6: Cumulative return of typical methods over the whole year of 2024 in the CSI 300.
Figure 7: Inference efficiency comparison under FinTSB.

+1

FinTSB: A Comprehensive and Practical Benchmark for Financial Time Series Forecasting
  • Preprint
  • File available

February 2025

·

11 Reads

Yifan Hu

·

·

Peiyuan Liu

·

[...]

·

Changjun Jiang

Financial time series (FinTS) record the behavior of human-brain-augmented decision-making, capturing valuable historical information that can be leveraged for profitable investment strategies. Not surprisingly, this area has attracted considerable attention from researchers, who have proposed a wide range of methods based on various backbones. However, the evaluation of the area often exhibits three systemic limitations: 1. Failure to account for the full spectrum of stock movement patterns observed in dynamic financial markets. (Diversity Gap), 2. The absence of unified assessment protocols undermines the validity of cross-study performance comparisons. (Standardization Deficit), and 3. Neglect of critical market structure factors, resulting in inflated performance metrics that lack practical applicability. (Real-World Mismatch). Addressing these limitations, we propose FinTSB, a comprehensive and practical benchmark for financial time series forecasting (FinTSF). To increase the variety, we categorize movement patterns into four specific parts, tokenize and pre-process the data, and assess the data quality based on some sequence characteristics. To eliminate biases due to different evaluation settings, we standardize the metrics across three dimensions and build a user-friendly, lightweight pipeline incorporating methods from various backbones. To accurately simulate real-world trading scenarios and facilitate practical implementation, we extensively model various regulatory constraints, including transaction fees, among others. Finally, we conduct extensive experiments on FinTSB, highlighting key insights to guide model selection under varying market conditions. Overall, FinTSB provides researchers with a novel and comprehensive platform for improving and evaluating FinTSF methods. The code is available at https://github.com/TongjiFinLab/FinTSBenchmark.

Download

Figure 5: The performance of various prediction methods in forecasting the next three days' settlement prices for six futures contracts is evaluated based on the MSE of predicted returns. For the first three futures contracts, additional news information was incorporated into the predictions using LLMs, i.e. Deepseek and our proposed method.
The results of questionnaires completed by three types of LLM agents, compared with the original human experiment results reported by Kahneman [28]. Options A or B indicate the majority choice made by agents or human in the questionnaire. Bold options represent choices made by LLM agents that align with human choices. The 'Question Prospect' column distinguishes between 'Positive' (gaining money) and 'Negative' (losing money) scenarios in monetary decision-making.
Advanced Digital Simulation for Financial Market Dynamics: A Case of Commodity Futures

February 2025

·

3 Reads

After decades of evolution, the financial system has increasingly deviated from an idealized framework based on theorems. It necessitates accurate projections of complex market dynamics and human behavioral patterns. With the development of data science and machine intelligence, researchers are trying to digitalize and automate market prediction. However, existing methodologies struggle to represent the diversity of individuals and are regardless of the domino effects of interactions on market dynamics, leading to the poor performance facing abnormal market conditions where non-quantitative information dominates the market. To alleviate these disadvantages requires the introduction of knowledge about how non-quantitative information, like news and policy, affects market dynamics. This study investigates overcoming these challenges through rehearsing potential market trends based on the financial large language model agents whose behaviors are aligned with their cognition and analyses in markets. We propose a hierarchical knowledge architecture for financial large language model agents, integrating fine-tuned language models and specialized generators optimized for trading scenarios. For financial market, we develop an advanced interactive behavioral simulation system that enables users to configure agents and automate market simulations. In this work, we take commodity futures as an example to research the effectiveness of our methodologies. Our real-world case simulation succeeds in rehearsing abnormal market dynamics under geopolitical events and reaches an average accuracy of 3.4% across various points in time after the event on predicting futures price. Experimental results demonstrate our method effectively leverages diverse information to simulate behaviors and their impact on market dynamics through systematic interaction.


Graph neural networks for financial fraud detection: a review

January 2025

·

115 Reads

·

2 Citations

Frontiers of Computer Science (electronic)

The landscape of financial transactions has grown increasingly complex due to the expansion of global economic integration and advancements in information technology. This complexity poses greater challenges in detecting and managing financial fraud. This review explores the role of Graph Neural Networks (GNNs) in addressing these challenges by proposing a unified framework that categorizes existing GNN methodologies applied to financial fraud detection. Specifically, by examining a series of detailed research questions, this review delves into the suitability of GNNs for financial fraud detection, their deployment in real-world scenarios, and the design considerations that enhance their effectiveness. This review reveals that GNNs are exceptionally adept at capturing complex relational patterns and dynamics within financial networks, significantly outperforming traditional fraud detection methods. Unlike previous surveys that often overlook the specific potentials of GNNs or address them only superficially, our review provides a comprehensive, structured analysis, distinctly focusing on the multifaceted applications and deployments of GNNs in financial fraud detection. This review not only highlights the potential of GNNs to improve fraud detection mechanisms but also identifies current gaps and outlines future research directions to enhance their deployment in financial systems. Through a structured review of over 100 studies, this review paper contributes to the understanding of GNN applications in financial fraud detection, offering insights into their adaptability and potential integration strategies.


Multi-View Graph-Based Hierarchical Representation Learning for Money Laundering Group Detection

January 2025

·

11 Reads

IEEE Transactions on Information Forensics and Security

Anti-money laundering (AML) is crucial to maintaining national financial security. Contemporary AML methods focus on homogeneous mining or unitary money laundering pattern. These methods ignore a characteristic of gang operation in money laundering. Thus, in this paper, we propose a multi-view graph based hierarchical representation learning method, named MG-HRL, to mine organized money laundering groups. In particular, we extract multi-level representations of transaction subgraphs, including transaction features, user features, structural features, and high-order association features from multiple observational perspectives. To learn the correlation between users, we model transaction networks as heterogeneous information networks (HINs) and design six meta-paths related to money laundering scenarios to mine correlations among users. Combining with correlation representations of users, we propose a heterogeneous hypergraph representation learning method to learn high-order representations of transaction subgraphs. Through hierarchical representation learning, the MG-HRL achieves full exploration of money laundering groups. Finally, we conduct experiments on two public transaction datasets. The result shows that MG-HRL method performs better than other state-of-the-art baselines.


Preferential Selective-Aware Graph Neural Network for Preventing Attacks in Interbank Credit Rating

December 2024

·

7 Reads

IEEE Transactions on Neural Networks and Learning Systems

Accurately assessing and forecasting bank credit ratings at an early stage is vitally important for a healthy financial environment and sustainable economic development. However, the evaluation process faces challenges due to individual attacks on the rating model. Some participants may provide manipulated information in an attempt to undermine the rating model and secure higher scores, further complicating the evaluation process. Therefore, we propose a novel approach called the preferential selective-aware graph neural network (PSAGNN) model to simultaneously defend against feature and structural nontarget poisoning attacks on Interbank credit ratings. In particular, the model establishes a phased optimization approach combined with biased perturbation and explores the Interbank preferences and scale-free nature of networks, to adaptively prioritize the poisoning training data and simulate a clean graph. Finally, we apply a weighted penalty on the opposition function to optimize the model so that the model can distinguish between attackers. Extensive experiments on our newly collected Interbank quarter dataset and case studies demonstrate the superior performance of our proposed approach in preventing credit rating attacks compared to state-of-the-art baselines.


Distributed Kalman Filtering Under Two-Bitrate Periodic Coding Strategies

December 2024

·

9 Reads

IEEE Transactions on Automatic Control

This paper is concerned with the problem of distributed Kalman filtering over sensor networks under two-bitrate periodic coding strategies. Initially, the optimal estimates for sensor individuals are acquired using the conventional Kalman filter. Subsequently, the information pair, consisting of the local estimate and the corresponding covariance, is exchanged among their immediate neighbors to achieve cooperative estimation. Due to the constrained network bandwidth, a vector/matrix quantization approach is formulated to quantize the information pair. The output of this quantization establishes a conservative bound for the actual covariance. A two-bitrate periodic coding strategy is proposed, where the encoded bits of the quantizer outputs are divided into two separate parts, namely the most significant and least significant bits, following a periodic transmission principle. It is demonstrated that the estimation preserves a consistency property over the sensor networks as the reported error covariance always serves as an upper bound for the actual error covariance. It is shown that the mean-square estimation errors are bounded when certain conditions regarding collective observability and network connectivity are satisfied. Finally, the effectiveness of the proposed algorithm is verified through a numerical example.


Adaptive Knowledge Recomposition for Personalized Federated Learning via Discrete Wavelet Transform

November 2024

·

3 Reads

IEEE Internet of Things Journal

Heterogeneous data silos hinder the application of deep learning in the Internet of Things. As a dealing scheme, personalized federated learning (pFL) distributedly customizes multiple local models for these silos. Most pFL methods directly use a global model to assist local model optimization, ignoring the performance drop caused by irrelevant or misleading global-model knowledge. To address this, we propose an adaptive knowledge recomposition approach (FedAKR), which refines relevant and correctly-leading knowledge from the global model and recomposes it into the local model to promote personalization. Specifically, FedAKR provides a discrete wavelet transform-based method to recompose different kinds of knowledge in the same representation space. Facilitated by this common space, we introduce an enriched local optimization objective to establish a causal relationship between refined global-model knowledge and recomposed local-model knowledge. The relationship guides effective and efficient knowledge refinement, thereby promoting personalization. Besides, we provide the theoretical proof of convergence for our novel pFL approach. Extensive experiments demonstrate that FedAKR achieves interpretable improvements, higher performance over twelve state-of-the-art methods, and the potential to further integrate pre-trained large models.


Service Workflow Activity Input/Output Parameters Recommendation Method by Combining Transformer and Weighted HITS

November 2024

IEEE Transactions on Services Computing

Each activity in the service workflow interacts with services as required to meet complex business needs and quickly adapt to market changes. The design of each activity's input/output interface parameters influences whether it can successfully map to appropriate and interactive services. In practice, suitable activity interface parameters should possess 3-features: realism, relevance, and compatibility, as popular parameters originating from the real world and closely related to activity semantics are apt to match user-expected services. However, existing research requires expert specification or ontology-based inference, resulting in outdated, inconsistent parameters that lack necessary elements, making it challenging to match expected services. Therefore, we propose an automated method combining Transformer and weighted HITS to recommend interface parameters with 3-features on activity function requirement. It filters similar Endpoints (EPs) based on the activity's semantics by supervised Transformer-based learning of multidomain APIs and unsupervised EPs matching. Next, a nodeweighted heterogeneous graph is built based on similar EPs and their interface parameter relationships. We then apply a nodeweighted HITS to explore mutual gain relationships within the graph and calculate parameter compatibilities. Finally, a topk non-redundant compatible parameter list and corresponding different formats are recommended for the activity. The method's effectiveness and efficiency are verified using a real API service dataset from RapidAPI


Graph Neural Networks for Financial Fraud Detection: A Review

October 2024

·

46 Reads

The landscape of financial transactions has grown increasingly complex due to the expansion of global economic integration and advancements in information technology. This complexity poses greater challenges in detecting and managing financial fraud. This review explores the role of Graph Neural Networks (GNNs) in addressing these challenges by proposing a unified framework that categorizes existing GNN methodologies applied to financial fraud detection. Specifically, by examining a series of detailed research questions, this review delves into the suitability of GNNs for financial fraud detection, their deployment in real-world scenarios, and the design considerations that enhance their effectiveness. This review reveals that GNNs are exceptionally adept at capturing complex relational patterns and dynamics within financial networks, significantly outperforming traditional fraud detection methods. Unlike previous surveys that often overlook the specific potentials of GNNs or address them only superficially, our review provides a comprehensive, structured analysis, distinctly focusing on the multifaceted applications and deployments of GNNs in financial fraud detection. This review not only highlights the potential of GNNs to improve fraud detection mechanisms but also identifies current gaps and outlines future research directions to enhance their deployment in financial systems. Through a structured review of over 100 studies, this review paper contributes to the understanding of GNN applications in financial fraud detection, offering insights into their adaptability and potential integration strategies.


Citations (63)


... We propose an AI-driven approach that analyzes gait signals-capturing differences in movement patterns between actual human walking and artificially generated motion from automatic walking devices as detected by smartphone sensors. While most existing studies on gait analysis have focused on human activity recognition (HAR) [2][3][4][5] or disease diagnosis [6][7][8], anomaly detection in digital platforms has primarily been applied to financial fraud detection, identifying suspicious transactions and behavioral anomalies [9]. However, these methods are not designed to detect motion-based fraud in M2E applications. ...

Reference:

Real Steps or Not: Auto-Walker Detection in Move-to-Earn Applications
Graph neural networks for financial fraud detection: a review
  • Citing Article
  • January 2025

Frontiers of Computer Science (electronic)

... Chen et al. [25] proposed a Privacy Passport, a formal privacy protection mechanism for the inter-domain data sharing. Advances in big data and cross-domain collaboration have greatly promoted the development of traditional privacypreserving methods, where challenges such as model parameter sharing, privacy constraints and communication costs arise. ...

Privacy Passport: Privacy-Preserving Cross-Domain Data Sharing
  • Citing Article
  • January 2024

IEEE Transactions on Information Forensics and Security

... Rights reserved. multi-dimensional data processing and dynamic modeling (Li et al. 2024b). The ordinal mixed-data sampling (OR-Mix) model effectively integrates heterogeneous data types (continuous, categorical, and ordinal) for corporate credit rating predictions, demonstrating improved accuracy and interpretability compared to traditional models (Goldmann et al. 2024). ...

HFTCRNet: Hierarchical Fusion Transformer for Interbank Credit Rating and Risk Assessment
  • Citing Article
  • October 2024

IEEE Transactions on Neural Networks and Learning Systems

... Nonetheless, the mechanism cannot directly identify and filter out problematic samples. Motivated by the knowledge distillation paradigm [72], [73], Lu et al. [74] proposed to employ conditional generators on each client to generate proxy samples for ensemble distillation. To address the loss of fidelity in synthetic data, the authors connected a filter to the head of each local model whose outputs are then modulated based on the reference labels. ...

Data-Free Knowledge Filtering and Distillation in Federated Learning
  • Citing Article
  • January 2024

IEEE Transactions on Big Data

... To create this artifact, we followed the process of DSRM (Peffers et al., 2007), with this paper covering four "designimplement-evaluate" cycles. Starting with the problem identification our study contributes to descriptive knowledge concerning the problem space by identifying data scarcity in combination with the inability to share data due to privacy protection as a major hurdle for financial institutions, validating the existing research on cross-organizational fraud detection collaboration within financial services (Abdul Salam et al., 2024;Kong et al., 2024). During the exploration of the solution, space synthetic data sharing was identified as an underexplored solution to tackle data scarcity in financial fraud detection extending the literature on cross-organizational collaboration in the field (Chatterjee et al., 2024). ...

ASIA: A Federated Boosting Tree Model Against Sequence Inference Attacks in Financial Networks
  • Citing Article
  • January 2024

IEEE Transactions on Information Forensics and Security

... For example, sampling-based methods selectively extract graph structures to improve computational efficiency and representation capabilities [30,71,75]. Similarly, several studies focus on defining adaptive neighborhoods for aggregation at each vertex by utilizing learnable mechanisms that are governed by the ground truth values of the downstream task [22,50]. Attentionbased GNNs [6,60] adjust edge weights adaptively by learning attention coefficients that reflect the relative importance of neighboring vertices during the learning process. ...

Graph Convolutional Networks With Adaptive Neighborhood Awareness
  • Citing Article
  • April 2024

IEEE Transactions on Pattern Analysis and Machine Intelligence

... Through in-depth analysis of historical data, market trends and consumer behavior, big data technology can effectively predict future market demand fluctuations (Wang et al., 2020). Research has shown that big data can help companies accurately predict product demand and thus optimize production and inventory management by analyzing sales records (Tseng et al., 2021), seasonal demand changes (Jiang et al., 2024), and consumer feedback from social media (Hu, 2021;Zhao, 2023). Such demand forecasting not only reduces inventory backlogs but also improves the efficiency of capital utilization, which contributes to the rational allocation of funds in the supply chain and avoids the risk of idle or shortage funds (Zhao, 2022;Zhou, 2021). ...

Guest Editorial: Special Issue on Big Data and Computational Social Intelligence for Guaranteed Financial Security
  • Citing Article
  • April 2024

IEEE Transactions on Computational Social Systems

... Research on continual learning is mostly concerned with image recognition [31]. Few studies present fraud detection as the core problem [18,71,23,28]. It often only appears as one of many data sets to which continual learning methods are applied [61,46,22]. ...

Pre-trained Online Contrastive Learning for Insurance Fraud Detection
  • Citing Article
  • March 2024

Proceedings of the AAAI Conference on Artificial Intelligence