Gabriele Ciravegna

Gabriele Ciravegna
  • Master of Computer Engineering
  • PhD Student at University of Florence

About

60
Publications
5,922
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
390
Citations
Introduction
Gabriele Ciravegna currently works as a PhD student at SAILAB - Siena Artificial Intelligence Laboratory. Gabriele is mainly interested in Artificial Neural Networks and Artificial Intelligence. Currently, he is working on overcoming intrinsic limits of deep learning, above all understandability and robustness.
Current institution
University of Florence
Current position
  • PhD Student

Publications

Publications (60)
Preprint
Full-text available
We formalize a novel modeling framework for achieving interpretability in deep learning, anchored in the principle of inference equivariance. While the direct verification of interpretability scales exponentially with the number of variables of the system, we show that this complexity can be mitigated by treating interpretability as a Markovian pro...
Article
Full-text available
The field of voice analysis has experienced significant transformations, evolving from basicperceptual assessments to the incorporation of advanced digital signal processing and computational tools. This progression has facilitated a deeper understanding of the complex dynamics of vocal function, particularly through the use of acoustic voice analy...
Article
Full-text available
Resistance spot welding is widely adopted in manufacturing and is characterized by high reliability and simple automation in the production line. The detection of defective welds is a difficult task that requires either destructive or expensive and slow non-destructive testing (e.g., ultrasound). The robots performing the welding automatically coll...
Article
Machine Learning (ML) heavily relies on optimization techniques built upon gradient descent. Numerous gradient-based update methods have been proposed in the scientific literature, particularly in the context of neural networks, and have gained widespread adoption as optimizers in ML software libraries. This paper introduces a novel perspective by...
Preprint
Full-text available
The lack of transparency in the decision-making processes of deep learning systems presents a significant challenge in modern artificial intelligence (AI), as it impairs users' ability to rely on and verify these systems. To address this challenge, Concept Bottleneck Models (CBMs) have made significant progress by incorporating human-interpretable...
Preprint
Full-text available
Despite their success, Large-Language Models (LLMs) still face criticism as their lack of interpretability limits their controllability and reliability. Traditional post-hoc interpretation methods, based on attention and gradient-based analysis, offer limited insight into the model's decision-making processes. In the image field, Concept-based mode...
Article
Full-text available
Deep learning has been recently used to extract the relevant features for representing input data also in the unsupervised setting. However, state-of-the-art techniques focus mostly on algorithmic efficiency and accuracy rather than mimicking the input manifold. On the contrary, competitive learning is a powerful tool for replicating the input dist...
Chapter
Machine Learning (ML) strongly relies on optimization procedures that are based on gradient descent. Several gradient-based update schemes have been proposed in the scientific literature, especially in the context of neural networks, that have become common optimizers in software libraries for ML. In this paper, we re-frame gradient-based update st...
Chapter
The opaque reasoning of Graph Neural Networks induces a lack of human trust. Existing graph network explainers attempt to address this issue by providing post-hoc explanations, however, they fail to make the model itself more interpretable. To fill this gap, we introduce the Concept Distillation Module, the first differentiable concept-distillation...
Chapter
The deployment of Deep Learning (DL) models is still precluded in those contexts where the amount of supervised data is limited. To answer this issue, active learning strategies aim at minimizing the amount of labelled data required to train a DL model. Most active strategies are based on uncertain sample selection, and even often restricted to sam...
Preprint
The design of interpretable deep learning models working in relational domains poses an open challenge: interpretable deep learning methods, such as Concept-Based Models (CBMs), are not designed to solve relational problems, while relational models are not as interpretable as CBMs. To address this problem, we propose Relational Concept-Based Models...
Chapter
Deep clustering is a branch of deep learning, in which the dimensionality reduction capabilities of deep networks are exploited for clustering data. In this sense, the deep part of clustering has to be considered more as a preprocessing step than a perfectly integrated module of the neural network. This paper proposes the idea of dual neural networ...
Chapter
The opaqueness of deep neural networks hinders their employment in safety-critical applications. This issue is driving the research community to focus on eXplainable Artificial Intelligence (XAI) techniques. XAI algorithms can be categorized into two types: those that explain the predictions of black-box models and those that create interpretable m...
Preprint
Full-text available
Deep learning methods are highly accurate, yet their opaque decision process prevents them from earning full human trust. Concept-based models aim to address this issue by learning tasks based on a set of human-understandable concepts. However, state-of-the-art concept-based models rely on high-dimensional concept embedding representations which la...
Article
Full-text available
The large and still increasing popularity of deep learning clashes with a major limit of neural network architectures, that consists in their lack of capability in providing human-understandable motivations of their decisions. In situations in which the machine is expected to support the decision of human experts, providing a comprehensible explana...
Chapter
The generation of aesthetically pleasing graph layouts is the main purpose of Graph Drawing techniques. Recent contributions delved into the usage of Gradient-descent (GD) based schemes to optimize differentiable loss functions, built to measure the graph layout adherence to given layout characteristics. However, some properties cannot be easily ex...
Preprint
Full-text available
Recently, Logic Explained Networks (LENs) have been proposed as explainable-by-design neural models providing logic explanations for their predictions. However, these models have only been applied to vision and tabular data, and they mostly favour the generation of global explanations, while local ones tend to be noisy and verbose. For these reason...
Preprint
Full-text available
Deploying AI-powered systems requires trustworthy models supporting effective human interactions, going beyond raw prediction accuracy. Concept bottleneck models promote trustworthiness by conditioning classification tasks on an intermediate level of human-like concepts. This enables human interventions which can correct mispredicted concepts to im...
Preprint
Full-text available
The opaque reasoning of Graph Neural Networks induces a lack of human trust. Existing graph network explainers attempt to address this issue by providing post-hoc explanations, however, they fail to make the model itself more interpretable. To fill this gap, we introduce the Concept Encoder Module, the first differentiable concept-discovery approac...
Article
Full-text available
Graph drawing techniques have been developed in the last few years with the purpose of producing esthetically pleasing node-link layouts. Recently, the employment of differentiable loss functions has paved the road to the massive usage of gradient descent and related optimization algorithms. In this article, we propose a novel framework for the dev...
Article
Full-text available
Explainable artificial intelligence has rapidly emerged since lawmakers have started requiring interpretable models for safety-critical domains. Concept-based neural networks have arisen as explainable-by-design methods as they leverage human-understandable symbols (i.e. concepts) to predict class memberships. However, most of these approaches focu...
Article
Full-text available
The widespread adoption of encryption in computer network traffic is increasing the difficulty of analyzing such traffic for security purposes. The data set presented in this data article is composed of network statistics computed on captures of TCP flows, originated by executing various network stress and web crawling tools, along with statistics...
Article
Full-text available
Adversarial attacks on machine learning-based classifiers, along with defense mechanisms, have been widely studied in the context of single-label classification problems. In this paper, we shift the attention to multi-label classification, where the availability of domain knowledge on the relationships among the considered classes may offer a natur...
Article
Full-text available
Due to the current high availability of omics, data-driven biology has greatly expanded, and several papers have reviewed state-of-the-art technologies. Nowadays, two main types of investigation are available for a multi-omics dataset: extraction of relevant features for a meaningful biological interpretation and clustering of the samples. In the l...
Article
This paper presents an approach that leverages classical machine learning techniques to identify the tools from the packets sniffed, both for clear-text and encrypted traffic. This research aims to overcome the limitations to security monitoring systems posed by the widespread adoption of encrypted communications. By training three distinct classif...
Preprint
Full-text available
In the last few years, Deep Learning models have become increasingly popular. However, their deployment is still precluded in those contexts where the amount of supervised data is limited and manual labelling expensive. Active learning strategies aim at solving this problem by requiring supervision only on few unlabelled samples, which improve the...
Preprint
Full-text available
Graph Drawing techniques have been developed in the last few years with the purpose of producing aesthetically pleasing node-link layouts. Recently, the employment of differentiable loss functions has paved the road to the massive usage of Gradient Descent and related optimization algorithms. In this paper, we propose a novel framework for the deve...
Preprint
Full-text available
The large and still increasing popularity of deep learning clashes with a major limit of neural network architectures, that consists in their lack of capability in providing human-understandable motivations of their decisions. In situations in which the machine is expected to support the decision of human experts, providing a comprehensible explana...
Conference Paper
Full-text available
Topological learning is a wide research area aiming at uncovering the mutual spatial relationships between the elements of a set. Some of the most common and oldest approaches involve the use of unsupervised competitive neural networks. However, these methods are not based on gradient optimization which has been proven to provide striking results i...
Preprint
Full-text available
Explainable artificial intelligence has rapidly emerged since lawmakers have started requiring interpretable models for safety-critical domains. Concept-based neural networks have arisen as explainable-by-design methods as they leverage human-understandable symbols (i.e. concepts) to predict class memberships. However, most of these approaches focu...
Preprint
Full-text available
LENs is a Python module integrating a variety of state-of-the-art approaches to provide logic explanations from neural networks. This package focuses on bringing these methods to non-specialists. It has minimal dependencies and it is distributed under the Apache 2.0 licence allowing both academic and commercial use. Source code and documentation ca...
Chapter
In the field of machine learning, coresets are defined as subsets of the training set that can be used to obtain a good approximation of the behavior that a given algorithm would have on the whole training set. Advantages of using coresets instead of the training set include improving training speed and allowing for a better human understanding of...
Chapter
Full-text available
In recent years, due to the high availability of omic data, data driven biology has greatly expanded. However, the analysis of different data sources is still an open challenge. A few multi-omic approaches have been proposed in literature. However, none of them take into consideration the intrinsic topology of each omic. In this work, an unsupervis...
Preprint
Full-text available
Deep learning has been widely used for supervised learning and classification/regression problems. Recently, a novel area of research has applied this paradigm to unsupervised tasks; indeed, a gradient-based approach extracts, efficiently and autonomously, the relevant features for handling input data. However, state-of-the-art techniques focus mos...
Preprint
Full-text available
Topological learning is a wide research area aiming at uncovering the mutual spatial relationships between the elements of a set. Some of the most common and oldest approaches involve the use of unsupervised competitive neural networks. However, these methods are not based on gradient optimization which has been proven to provide striking results i...
Conference Paper
Full-text available
Deep neural networks are usually considered black-boxes due to their complex internal architecture, that cannot straightforwardly provide human-understandable explanations on how they behave. Indeed, Deep Learning is still viewed with skepticism in those real-world domains in which incorrect predictions may produce critical effects. This is one of...
Preprint
Full-text available
Adversarial attacks on machine learning-based classifiers, along with defense mechanisms, have been widely studied in the context of single-label classification problems. In this paper, we shift the attention to multi-label classification, where the availability of domain knowledge on the relationships among the considered classes may offer a natur...
Chapter
The most popular approach to explain cancer is based on the discovery of oncogenes and tumor suppressor genes as a preliminary step in estimating their impact on altered pathways. The present paper proposes a pipeline which aims at detecting “weak” or “indirect” functions impacted by Copy Number Variations (CNVs) of cancer-related genes, integratin...
Article
Full-text available
In the last few years we have seen a remarkable progress from the cultivation of the idea of expressing domain knowledge by the mathematical notion of constraint. However, the progress has mostly involved the process of providing consistent solutions with a given set of constraints, whereas learning “new” constraints, that express new knowledge, is...
Chapter
In the field of artificial intelligence, agents learn how to take decisions by fitting their parameters on a set of samples called training set. Similarly, a core set is a subset of the training samples such that, if an agent exploits this set to fit its parameters instead of the whole training set, then the quality of the inferences does not chang...
Chapter
Full-text available
In pattern recognition, neural networks can be used not only for the classification task, but also for feature selection and other intermediate steps. This paper addresses the 3D face recognition problem in order to select the most meaningful geometric descriptors. At this aim, the classification results are directly integrated in a biclustering pr...
Article
Full-text available
Hierarchical clustering is an important tool for extracting information from data in a multi-resolution way. It is more meaningful if driven by data, as in the case of divisive algorithms, which split data until no more division is allowed. However, they have the drawback of the splitting threshold setting. The neural networks can address this prob...
Conference Paper
Full-text available
Non-stationary topological representation can be addressed in two ways, according to the application: life-long modeling or by forgetting the past. Life-long learning requires neural networks equipped with a tool for judging if a neuron has to be created for tracking the input distribution. It is always implemented as an isotropic criterion (a hype...
Chapter
Full-text available
Cancer is a large family of genetic diseases that involve abnormal cell growth. Genetic mutations can vary from one patient to another. Therefore personalized precision is required to increase the reliability of prognostic predictions and the benefit of therapeutic decisions. The most important issues concerning gene analysis are strong noise, high...
Chapter
Full-text available
Cancer is a large family of genetic diseases that involve abnormal cell growth. Genetic mutations can vary from one patient to another. Therefore personalized precision is required to increase the reliability of prognostic predictions and the benefit of therapeutic decisions. The most important issues concerning gene analysis are strong noise, high...
Chapter
Full-text available
The analysis of complex systems, such as cancer resistance to drugs, require flexible algorithms but also simple models, as they will be used by biologists in order to get insights on the underlying phenomenon. Exploiting the availability of the largest collection of Patient-Derived Xenografts from metastatic colorectal cancer annotated for respons...
Chapter
Full-text available
Exploiting the availability of the largest collection of Patient-Derived Xenografts from metastatic colorectal cancer annotated for response to therapies, this manuscript aims to characterize the biological phenomenon from a mathematical point of view. In particular, we design an experiment in order to investigate how genes interact with each other...
Conference Paper
Full-text available
In pattern recognition, neural networks can be used not only for the classification task, but also for feature selection and other intermediate steps. This paper addresses the 3D face recognition problem in order to select the most meaningful geometric descriptors. At this aim, the classification results are directly integrated in a biclustering pr...
Conference Paper
Full-text available
Clustering in high dimensional spaces is a very difficult task. Dealing with DNA microarrays is even more difficult because gene subsets are coregulated and coexpressed only under specific conditions. Biclusterng addresses the problem of finding such submanifolds by exploiting both gene and condition (tissue) clustering. The paper proposes a selfor...
Presentation
Full-text available
Presentation of the conference papers: - "Supervised Gene Identification in Colorectal Cancer" - "‘Unsupervised Gene Identification in Colorectal Cancer"

Network

Cited By