Farimah Poursafaei’s research while affiliated with McGill University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (17)


Figure 4: Different setting for evaluation of future link prediction include between deployed, streaming and live-update setting. UTG framework is designed for the streaming setting.
Dataset statistics. Dataset # Nodes # Edges # Unique Edges Surprise Time Granularity # Snapshots
Test MRR comparison for snapshot and event-based methods on DTDG datasets, results reported from 5 runs. Top three models are coloured by First, Second, Third.
Test inference time comparison for snapshot and event based methods on DTDG datasets, we report the average result from 5 runs. Top three models are coloured by First, Second, Third.
Test inference time comparison for snapshot and event based methods on CTDG datasets, results reported from 5 runs. Top three models are coloured by First, Second, Third.
UTG: Towards a Unified View of Snapshot and Event Based Models for Temporal Graphs
  • Preprint
  • File available

July 2024

·

17 Reads

Shenyang Huang

·

Farimah Poursafaei

·

·

[...]

·

Emanuele Rossi

Temporal graphs have gained increasing importance due to their ability to model dynamically evolving relationships. These graphs can be represented through either a stream of edge events or a sequence of graph snapshots. Until now, the development of machine learning methods for both types has occurred largely in isolation, resulting in limited experimental comparison and theoretical crosspollination between the two. In this paper, we introduce Unified Temporal Graph (UTG), a framework that unifies snapshot-based and event-based machine learning models under a single umbrella, enabling models developed for one representation to be applied effectively to datasets of the other. We also propose a novel UTG training procedure to boost the performance of snapshot-based models in the streaming setting. We comprehensively evaluate both snapshot and event-based models across both types of temporal graphs on the temporal link prediction task. Our main findings are threefold: first, when combined with UTG training, snapshotbased models can perform competitively with event-based models such as TGN and GraphMixer even on event datasets. Second, snapshot-based models are at least an order of magnitude faster than most event-based models during inference. Third, while event-based methods such as NAT and DyGFormer outperforms snapshotbased methods on both types of temporal graphs, this is because they leverage joint neighborhood structural features thus emphasizing the potential to incorporate these features into snapshot-based models as well. These findings highlight the importance of comparing model architectures independent of the data format and suggest the potential of combining the efficiency of snapshot-based models with the performance of event-based models in the future.

Download

Towards Neural Scaling Laws for Foundation Models on Temporal Graphs

June 2024

·

58 Reads

The field of temporal graph learning aims to learn from evolving network data to forecast future interactions. Given a collection of observed temporal graphs, is it possible to predict the evolution of an unseen network from the same domain? To answer this question, we first present the Temporal Graph Scaling (TGS) dataset, a large collection of temporal graphs consisting of eighty-four ERC20 token transaction networks collected from 2017 to 2023. Next, we evaluate the transferability of Temporal Graph Neural Networks (TGNNs) for the temporal graph property prediction task by pre-training on a collection of up to sixty-four token transaction networks and then evaluating the downstream performance on twenty unseen token networks. We find that the neural scaling law observed in NLP and Computer Vision also applies in temporal graph learning, where pre-training on greater number of networks leads to improved downstream performance. To the best of our knowledge, this is the first empirical demonstration of the transferability of temporal graphs learning. On downstream token networks, the largest pre-trained model outperforms single model TGNNs on thirteen unseen test networks. Therefore, we believe that this is a promising first step towards building foundation models for temporal graphs.


TGB 2.0: A Benchmark for Learning on Temporal Knowledge Graphs and Heterogeneous Graphs

June 2024

·

14 Reads

Multi-relational temporal graphs are powerful tools for modeling real-world data, capturing the evolving and interconnected nature of entities over time. Recently, many novel models are proposed for ML on such graphs intensifying the need for robust evaluation and standardized benchmark datasets. However, the availability of such resources remains scarce and evaluation faces added complexity due to reproducibility issues in experimental protocols. To address these challenges, we introduce Temporal Graph Benchmark 2.0 (TGB 2.0), a novel benchmarking framework tailored for evaluating methods for predicting future links on Temporal Knowledge Graphs and Temporal Heterogeneous Graphs with a focus on large-scale datasets, extending the Temporal Graph Benchmark. TGB 2.0 facilitates comprehensive evaluations by presenting eight novel datasets spanning five domains with up to 53 million edges. TGB 2.0 datasets are significantly larger than existing datasets in terms of number of nodes, edges, or timestamps. In addition, TGB 2.0 provides a reproducible and realistic evaluation pipeline for multi-relational temporal graphs. Through extensive experimentation, we observe that 1) leveraging edge-type information is crucial to obtain high performance, 2) simple heuristic baselines are often competitive with more complex methods, 3) most methods fail to run on our largest datasets, highlighting the need for research on more scalable methods.


Model Hyperparameters.
Temporal Graph Rewiring with Expander Graphs

June 2024

·

31 Reads

Evolving relations in real-world networks are often modelled by temporal graphs. Graph rewiring techniques have been utilised on Graph Neural Networks (GNNs) to improve expressiveness and increase model performance. In this work, we propose Temporal Graph Rewiring (TGR), the first approach for graph rewiring on temporal graphs. TGR enables communication between temporally distant nodes in a continuous time dynamic graph by utilising expander graph propagation to construct a message passing highway for message passing between distant nodes. Expander graphs are suitable candidates for rewiring as they help overcome the oversquashing problem often observed in GNNs. On the public tgbl-wiki benchmark, we show that TGR improves the performance of a widely used TGN model by a significant margin. Our code repository is accessible at https://anonymous.4open.science/r/TGR-254C.




Towards Temporal Edge Regression: A Case Study on Agriculture Trade Between Nations

August 2023

·

42 Reads

Recently, Graph Neural Networks (GNNs) have shown promising performance in tasks on dynamic graphs such as node classification, link prediction and graph regression. However, few work has studied the temporal edge regression task which has important real-world applications. In this paper, we explore the application of GNNs to edge regression tasks in both static and dynamic settings, focusing on predicting food and agriculture trade values between nations. We introduce three simple yet strong baselines and comprehensively evaluate one static and three dynamic GNN models using the UN Trade dataset. Our experimental results reveal that the baselines exhibit remarkably strong performance across various settings, highlighting the inadequacy of existing GNNs. We also find that TGN outperforms other GNN models, suggesting TGN is a more appropriate choice for edge regression tasks. Moreover, we note that the proportion of negative edges in the training samples significantly affects the test performance. The companion source code can be found at: https://github.com/scylj1/GNN_Edge_Regression.


Temporal Graph Benchmark for Machine Learning on Temporal Graphs

July 2023

·

27 Reads

·

1 Citation

We present the Temporal Graph Benchmark (TGB), a collection of challenging and diverse benchmark datasets for realistic, reproducible, and robust evaluation of machine learning models on temporal graphs. TGB datasets are of large scale, spanning years in duration, incorporate both node and edge-level prediction tasks and cover a diverse set of domains including social, trade, transaction, and transportation networks. For both tasks, we design evaluation protocols based on realistic use-cases. We extensively benchmark each dataset and find that the performance of common models can vary drastically across datasets. In addition, on dynamic node property prediction tasks, we show that simple methods often achieve superior performance compared to existing temporal graph models. We believe that these findings open up opportunities for future research on temporal graphs. Finally, TGB provides an automated machine learning pipeline for reproducible and accessible temporal graph research, including data loading, experiment setup and performance evaluation. TGB will be maintained and updated on a regular basis and welcomes community feedback. TGB datasets, data loaders, example codes, evaluation setup, and leaderboards are publicly available at https://tgb.complexdatalab.com/ .



Towards Improved Illicit Node Detection with Positive-Unlabelled Learning

March 2023

·

13 Reads

Detecting illicit nodes on blockchain networks is a valuable task for strengthening future regulation. Recent machine learning-based methods proposed to tackle the tasks are using some blockchain transaction datasets with a small portion of samples labeled positive and the rest unlabelled (PU). Albeit the assumption that a random sample of unlabeled nodes are normal nodes is used in some works, we discuss that the label mechanism assumption for the hidden positive labels and its effect on the evaluation metrics is worth considering. We further explore that PU classifiers dealing with potential hidden positive labels can have improved performance compared to regular machine learning models. We test the PU classifiers with a list of graph representation learning methods for obtaining different feature distributions for the same data to have more reliable results.


Citations (7)


... In this context, temporal network theory, network science metrics, and temporal graph mining tools can be leveraged to enhance the interpretability of TGL models. For instance, two works [40,11] analyze the temporal edge re-occurrence in the TGB datasets, highlighting that deep learning models may vary their performance due to the level of re-occurrence/novelty of the edges in the datasets, depending on their memorization or inductive reasoning capabilities. Dileo et al. [8] tested wellknown network science heuristics for link prediction on TGB. ...

Reference:

Network Science Meets AI: A Converging Frontier
Temporal Graph Analysis with TGX
  • Citing Conference Paper
  • March 2024

... DNLP focuses on predicting future connections by utilizing historical data, making it possible to understand how relationships develop and change over time. For instance, DNLP can be used to forecast new friendships in social networks based on previous interactions or to predict future collaborations among researchers by analyzing past co-authorship patterns [2,3]. ...

Exhaustive Evaluation of Dynamic Link Prediction
  • Citing Conference Paper
  • December 2023

... Following the approaches of previous studies [6,23,25,27,40,43], we proposed a random-walk-based approach for its parallelism ability across nodes, offering improved efficiency potentials. We first incorporated the strategy of Unbiased Update proposed by Sajjad et al. [36] to adjust node representations without full graph retraining by trimming random walks at affected nodes and then continuing the walks, considering the graph's updated structure. ...

Towards Improved Illicit Node Detection with Positive-Unlabelled Learning
  • Citing Conference Paper
  • May 2023

... In intrusion detection, SigTran can identify temporal patterns in attack behaviors, offering a more dynamic and responsive detection mechanism [31]. Similarly, TGBASE [32] leverages temporal graph structures to enhance anomaly detection by considering the temporal evolution of data, which is critical in identifying sophisticated and evolving cyber threats. Incorporating temporal graph methodologies into our FL-based IDSs can provide a more robust framework, enhancing the detection of time-based anomalies and evolving threats, thereby complementing our current focus on model accuracy and robustness. ...

A Strong Node Classification Baseline for Temporal Graphs
  • Citing Chapter
  • January 2022

... Due to its high transaction volume and anonymity, Ethereum has been a prime target for fraudulent activities [6,9,11,12]. Numerous studies have focused on identifying phishing accounts to safeguard users from scams [1,3,4,7,13,14]. However, with the emergence of Ethereum, web3 scams have also emerged, where certain services engage in covert malicious activities within the blockchain. ...

SigTran: Signature Vectors for Detecting Illicit Activities in Blockchain Transaction Networks
  • Citing Conference Paper
  • May 2021

Lecture Notes in Computer Science

... I LLICIT activities, including money laundering, phishing, and fraud, have cast a shadow over the advancements made in cryptocurrencies and the accompanying advantages they offer, as highlighted in a study on detecting such activities on the blockchain network [1]. Due to the substantial volume of sensitive data they handle, these technologies are vulnerable to a range of malicious actions, attacks, and security threats that pose risks to the availability and integrity of information and services. ...

Detecting Malicious Ethereum Entities via Application of Machine Learning Classification
  • Citing Conference Paper
  • September 2020