Valentin Iovene’s research while affiliated with University of Paris-Saclay and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (9)


Overview of the NeuroLangQA algorithm. Step numbers refer to those described in Algorithm 1
Resulting thresholded brain image from the NeuroLang use case showing the activations related to spatial attention obtained through the resolution of a query under the open world assumption
Resulting thresholded brain image from the NeuroLang use case showing that foci in the amygdala are most probably reported if a study includes the word “emotion”. As expected, the main area shown corresponds to the amygdala (Mesulam, 1998)
Resulting thresholded brain image from the NeuroLang use case showing the activations related to pain and its related terms derived from the IOBC ontology (noxious and nociceptive). Dorsal anterior cingulate cortex (x = 0) and parietal regions are be active in articles mentioning pain and related words. agreeing with current knowledge in pain location (Lieberman & Eisenberger, 2015)
Comparison of results between ALE (Turkeltaub et al., 2002; Laird et al., 2005) and Modified ALE (Eickhoff et al., 2009) a shows a more accurate selection of voxels than b concerning the expected results for an auditory stimulus modality. This is because the modified version of ALE allows us to weigh each voxel according to the number of subjects that participated in the experiment. b is unable to capture the variance and relies on each experiment present in the BrainMap database with the same weight, leading to noisier results
Scalable Query Answering Under Uncertainty to Neuroscientific Ontological Knowledge: The NeuroLang Approach
  • Article
  • Publisher preview available

November 2022

·

50 Reads

·

1 Citation

Neuroinformatics

Gaston E. Zanitti

·

Yamil Soto

·

Valentin Iovene

·

[...]

·

Researchers in neuroscience have a growing number of datasets available to study the brain, which is made possible by recent technological advances. Given the extent to which the brain has been studied, there is also available ontological knowledge encoding the current state of the art regarding its different areas, activation patterns, keywords associated with studies, etc. Furthermore, there is inherent uncertainty associated with brain scans arising from the mapping between voxels—3D pixels—and actual points in different individual brains. Unfortunately, there is currently no unifying framework for accessing such collections of rich heterogeneous data under uncertainty, making it necessary for researchers to rely on ad hoc tools. In particular, one major weakness of current tools that attempt to address this task is that only very limited propositional query languages have been developed. In this paper we present NeuroLang, a probabilistic language based on first-order logic with existential rules, probabilistic uncertainty, ontologies integration under the open world assumption, and built-in mechanisms to guarantee tractable query answering over very large datasets. NeuroLang’s primary objective is to provide a unified framework to seamlessly integrate heterogeneous data, such as ontologies, and map fine-grained cognitive domains to brain regions through a set of formal criteria, promoting shareable and highly reproducible research. After presenting the language and its general query answering architecture, we discuss real-world use cases showing how NeuroLang can be applied to practical scenarios.

View access options

Meta-analysis of the functional neuroimaging literature with probabilistic logic programming

November 2022

·

172 Reads

·

1 Citation

Inferring reliable brain-behavior associations requires synthesizing evidence from thousands of functional neuroimaging studies through meta-analysis. However, existing meta-analysis tools are limited to investigating simple neuroscience concepts and expressing a restricted range of questions. Here, we expand the scope of neuroimaging meta-analysis by designing NeuroLang: a domain-specific language to express and test hypotheses using probabilistic first-order logic programming. By leveraging formalisms found at the crossroads of artificial intelligence and knowledge representation, NeuroLang provides the expressivity to address a larger repertoire of hypotheses in a meta-analysis, while seamlessly modeling the uncertainty inherent to neuroimaging data. We demonstrate the language’s capabilities in conducting comprehensive neuroimaging meta-analysis through use-case examples that address questions of structure-function associations. Specifically, we infer the specific functional roles of three canonical brain networks, support the role of the visual word-form area in visuospatial attention, and investigate the heterogeneous organization of the frontoparietal control network.


Figure 3. Coactivation patterns along the principal LPFC gradient. The coactivation patterns of quintile bins along the principal gradient in the LPFC capture a unimodal-to-transmodal spatial layout in brain network connectivity. (A) Coactivation patterns along the principal gradient in the left and right LPFC. Each brain map shows the regions that have a least three times the odds of being reported active given activation in a quintile bin relative to being active when activation is not reported in the quintile bin. Note that cerebellar and sub-cortical regions, although included in the analysis, are not shown in the figures. (B) Bar plots showing the number of regions from each network that overlaps with the coactivation pattern of each quintile bin. The data shown here suggests that the dorsal attention (green) and sensorimotor networks (blue) coactivate with the caudal bins (i.e. bins 1 and 2) more than with more rostral bins. On the other hand, the default mode network coactivates more with the rostral bins (i.e. bins 4 and 5) than with caudal bins.
Figure 6. Schematic overview of our analysis pipeline. (A) Inputs and outputs of NeuroLang. Inputs are represented using blue arrows and include: Peak activations and topics from the Neurosynth dataset, the lateral PFC mask, and the 1024 regions from the DiFuMo atlases are represented in a unifying framework within NeuroLang. Two examples of outputs are shown here and represented using red arrows. (B) The main steps of the meta-analysis carried out in this study. (1) Spatial smoothing with 10 mm kernel around each peak. (2) The binary activation map of each study is projected onto 1024 functional regions. Varying shades of red signify that regions have different probabilities of being reported by a study depending on the location of voxels within each region. (3) The meta-analytic connectivity matrix encodes the log-odds ratios of coactivation between each region in the LPFC and every region in the brain. (4) A similarity matrix encodes the degree of correspondence between LPFC regions in their meta-analytic connectivity profiles, estimated by the eta-squared similarity metric. (5) The principal gradient of meta-analytic connectivity in each hemisphere is then derived from the similarity matrix using diffusion embedding. (6) Coactivation patterns of successive quintile gradient bins are inferred (7) Specific topic associations along the principal gradient are inferred using segregation queries. (8) Finally, a gradient-based meta-analysis of hemispheric asymmetries is performed.
Functional gradients in the human lateral prefrontal cortex revealed by a comprehensive coordinate-based meta-analysis

September 2022

·

217 Reads

·

10 Citations

eLife

The lateral prefrontal cortex (LPFC) of humans enables flexible goal-directed behavior. However, its functional organization remains actively debated after decades of research. Moreover, recent efforts aiming to map the LPFC through meta-analysis are limited, either in scope or in the inferred specificity of structure-function associations. These limitations are in part due to the limited expressiveness of commonly-used data analysis tools, which restricts the breadth and complexity of questions that can be expressed in a meta-analysis. Here, we adopt NeuroLang, a novel approach to more expressive meta-analysis based on probabilistic first-order logic programming, to infer the organizing principles of the LPFC from 14,371 neuroimaging studies. Our findings reveal a rostrocaudal and a dorsoventral gradient, respectively explaining the most and second most variance in meta-analytic connectivity across the LPFC. Moreover, we identify a unimodal-to-transmodal spectrum of coactivation patterns along with a concrete-to-abstract axis of structure-function associations extending from caudal to rostral regions of the LPFC. Finally, we infer inter-hemispheric asymmetries along the principal rostrocaudal gradient, identifying hemisphere-specific associations with topics of language, memory, response inhibition, and sensory processing. Overall, this study provides a comprehensive meta-analytic mapping of the LPFC, grounding future hypothesis generation on a quantitative overview of past findings.


Meta-Analysis of the Functional Neuroimaging Literature with Probabilistic Logic Programming

February 2022

·

44 Reads

·

1 Citation

Inferring reliable brain-behavior associations requires synthesizing evidence from thousands of functional neuroimaging studies through meta-analysis. However, existing meta-analysis tools are limited to investigating simple neuroscience concepts and expressing a restricted range of questions. Here, we expand the scope of neuroimaging meta-analysis by designing NeuroLang: a domain-specific language to express and test hypotheses using probabilistic first-order logic programming. By leveraging formalisms found at the crossroads of artificial intelligence and knowledge representation, NeuroLang provides the expressivity to address a larger repertoire of hypotheses in a meta-analysis, while seamlessly modelling the uncertainty inherent to neuroimaging data. We demonstrate the language’s capabilities in conducting comprehensive neuroimaging meta-analysis through use-case examples that address questions of structure-function associations. Specifically, we infer the specific functional roles of three canonical brain networks, support the role of the visual word-form area in visuospatial attention, and investigate the heterogeneous organization of the fronto-parietal control network.


Scalable Query Answering under Uncertainty to Neuroscientific Ontological Knowledge: The NeuroLang Approach

February 2022

·

15 Reads

Researchers in neuroscience have a growing number of datasets available to study the brain, which is made possible by recent technological advances. Given the extent to which the brain has been studied, there is also available ontological knowledge encoding the current state of the art regarding its different areas, activation patterns, key words associated with studies, etc. Furthermore, there is an inherent uncertainty associated with brain scans arising from the mapping between voxels -- 3D pixels -- and actual points in different individual brains. Unfortunately, there is currently no unifying framework for accessing such collections of rich heterogeneous data under uncertainty, making it necessary for researchers to rely on ad hoc tools. In particular, one major weakness of current tools that attempt to address this kind of task is that only very limited propositional query languages have been developed. In this paper, we present NeuroLang, an ontology language with existential rules, probabilistic uncertainty, and built-in mechanisms to guarantee tractable query answering over very large datasets. After presenting the language and its general query answering architecture, we discuss real-world use cases showing how NeuroLang can be applied to practical scenarios for which current tools are inadequate.


Functional gradients in the human lateral prefrontal cortex revealed by a comprehensive coordinate-based meta-analysis

January 2022

·

26 Reads

The human lateral prefrontal cortex (LPFC) enables flexible goal-directed behavior. Yet, its organizing principles remain actively debated despite decades of research. Meta-analysis efforts to map the LPFC have either been restricted in scope or suffered from limited expressivity in meta-analysis tools. The latter short-coming hinders the complexity of questions that can be expressed in a meta-analysis and hence limits the specificity of structure-function associations. Here, we adopt NeuroLang, a novel approach to meta-analysis based on first-order probabilistic logic programming, to infer the organizing principles of the LPFC with greater specificity from 14,371 neuroimaging publications. Our results reveal a rostrocaudal and a dorsoventral gradient, respectively explaining the most and second-most variance in whole-brain meta-analytic connectivity in the LPFC. Moreover, we find a cross-study agreement on a spectrum of increasing abstraction from caudal to rostral LPFC both in specific network connectivity and structure-function associations that supports a domain-general role for the mid-LPFC. Furthermore, meta-analyzing inter-hemispheric asymmetries along the rostrocaudal gradient reveals specific associations with topics of language, memory, response inhibition, and error processing. Overall, we provide a comprehensive mapping of the organizing principles of task-dependent activity in the LPFC, grounding future hypothesis generation on a quantitative overview of past findings.


Answering meta-analytic questions on heterogeneous and uncertain neuroscientific data with probabilistic logic programming

November 2021

·

17 Reads

·

1 Citation

This thesis contributes to the development of a probabilistic logic programming language specific to the domain of cognitive neuroscience, coined NeuroLang, and presents some of its applications to the meta-analysis of the functional brain mapping literature. By relying on logic formalisms such as datalog, and their probabilistic extensions, we show how NeuroLang makes it possible to combine uncertain and heterogeneous data to formulate rich meta-analytic hypotheses. We encode the Neurosynth database into a NeuroLang program and formulate probabilistic logic queries resulting in term-association brain maps and coactivation brain maps similar to those obtained with existing tools, and highlighting existing brain networks. We prove the correctness of our model by using the joint probability distribution defined by the Bayesian network translation of probabilistic logic programs, showing that queries lead to the same estimations as Neurosynth. Then, we show that modeling term-to-study associations probabilistically based on term frequency-document inverse frequency (TF-IDF) measures results in better accuracy on simulated data, and a better consistency on real data, for two-term conjunctive queries on smaller sample sizes. Finally, we use NeuroLang to formulate and test concrete functional brain mapping hypotheses, reproducing past results. By solving segregation logic queries combining the Neurosynth database, topic models, and the data-driven functional atlas DiFuMo, we find supporting evidence of the existence of an heterogeneous organisation of the frontoparietal control network (FPCN), and find supporting evidence that the subregion of the fusiform gyrus called visual word form area (VWFA) is recruited within attentional tasks, on top of language-related cognitive tasks.


Complex Coordinate-Based Meta-Analysis with Probabilistic Programming

May 2021

·

10 Reads

·

1 Citation

Proceedings of the AAAI Conference on Artificial Intelligence

With the growing number of published functional magnetic resonance imaging (fMRI) studies, meta-analysis databases and models have become an integral part of brain mapping research. Coordinate-based meta-analysis (CBMA) databases are built by extracting both coordinates of reported peak activations and term associations using natural language processing techniques from neuroimaging studies. Solving term-based queries on these databases makes it possible to obtain statistical maps of the brain related to specific cognitive processes. However, existing tools for analysing CBMA data are limited in their expressivity to propositional logic, restricting the variety of their queries. Moreover, with tools like Neurosynth, term-based queries on multiple terms often lead to power failure, because too few studies from the database contribute to the statistical estimations. We design a probabilistic domain-specific language (DSL) standing on Datalog and one of its probabilistic extensions, CP-Logic, for expressing and solving complex logic-based queries. We show how CBMA databases can be encoded as probabilistic programs. Using the joint distribution of their Bayesian network translation, we show that solutions of queries on these programs compute the right probability distributions of voxel activations. We explain how recent lifted query processing algorithms make it possible to scale to the size of large neuroimaging data, where knowledge compilation techniques fail to solve queries fast enough for practical applications. Finally, we introduce a method for relating studies to terms probabilistically, leading to better solutions for two-term conjunctive queries (CQs) on smaller databases. We demonstrate results for two-term CQs, both on simulated meta-analysis databases and on the widely used Neurosynth database.


Complex Coordinate-Based Meta-Analysis with Probabilistic Programming

December 2020

·

11 Reads

With the growing number of published functional magnetic resonance imaging (fMRI) studies, meta-analysis databases and models have become an integral part of brain mapping research. Coordinate-based meta-analysis (CBMA) databases are built by automatically extracting both coordinates of reported peak activations and term associations using natural language processing (NLP) techniques. Solving term-based queries on these databases make it possible to obtain statistical maps of the brain related to specific cognitive processes. However, with tools like Neurosynth, only singleterm queries lead to statistically reliable results. When solving richer queries, too few studies from the database contribute to the statistical estimations. We design a probabilistic domain-specific language (DSL) standing on Datalog and one of its probabilistic extensions, CP-Logic, for expressing and solving rich logic-based queries. We encode a CBMA database into a probabilistic program. Using the joint distribution of its Bayesian network translation, we show that solutions of queries on this program compute the right probability distributions of voxel activations. We explain how recent lifted query processing algorithms make it possible to scale to the size of large neuroimaging data, where state of the art knowledge compilation (KC) techniques fail to solve queries fast enough for practical applications. Finally, we introduce a method for relating studies to terms probabilistically, leading to better solutions for conjunctive queries on smaller databases. We demonstrate results for two-term conjunctive queries, both on simulated meta-analysis databases and on the widely-used Neurosynth database.

Citations (6)


... In particular, we use the NeuroLang (https://github.com/NeuroLang/NeuroLang; Wassermann et al., 2022) library to perform all meta-analysis steps and the BrainSpace library (https://github.com/MICA-MNI/BrainSpace; Vos de Wael et al., 2022) to estimate low-dimensional embeddings of meta-analytic connectivity patterns in the LPFC (Vos de Wael et al., 2020). ...

Reference:

Functional gradients in the human lateral prefrontal cortex revealed by a comprehensive coordinate-based meta-analysis
Meta-Analysis of the Functional Neuroimaging Literature with Probabilistic Logic Programming

... In this study, we pushed the limits of meta-analysis, as we leveraged the full power of NeuroLang's representation and query evaluation engines. These contributions have been reviewed at Science Advances, are still under revision by our team [Iov+21], and were presented in chapter 5. ...

Meta-analysis of the functional neuroimaging literature with probabilistic logic programming

... We might speculate that while a mechanism of inter-hemispheric inhibition is strongly represented within the motor cortex to subserve accurate execution of movements and motor learning [52], this process would not be represented within the PFC likewise. Indeed, although some cognitive and affective processes have been shown to be preferentially lateralised in the two hemispheres, a mechanisms of inter-hemispheric inhibition may be less central in the cerebello-prefrontal network [53,54]. Future studies are needed to compare functional connectivity between the cerebellum and the motor and prefrontal cortex together with the relative peculiarities. ...

Functional gradients in the human lateral prefrontal cortex revealed by a comprehensive coordinate-based meta-analysis

eLife

... Moreover, we incorporate data-driven topic models, learned and openly shared by Neurosynth 19 , within a TopicAssociation probabilistic table, containing one row (t, s, P) for each uncertain association between a topic t and a study s. In probabilistic logic, we write TopicAssociation(t, s)::P to state 'study s has a probability P of being associated with topic t' 20 . This data representation process is illustrated in Fig. 1. ...

Complex Coordinate-Based Meta-Analysis with Probabilistic Programming
  • Citing Article
  • May 2021

Proceedings of the AAAI Conference on Artificial Intelligence

... com/NeuroLang/NeuroLang; Wassermann et al., 2022. In-depth details on NeuroLang are forund in Abdallah et al., 2022 andIovene, 2021. All code was developed based on open-source, publicly available software packages. ...

Answering meta-analytic questions on heterogeneous and uncertain neuroscientific data with probabilistic logic programming
  • Citing Thesis
  • November 2021