Jennifer W. Evans’s research while affiliated with National Institute of Mental Health, National Institutes of Health and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (46)


242. Treatment-Resistant Depression Working Memory fMRI Brain Activity After IV Ketamine
  • Article

May 2025

Biological Psychiatry

Adam Fijtman

·

Níall Lally

·

Gregory Jones

·

[...]

·

Carlos Zarate

Figure 1. Results reporting examples, showing a single slice of task-based FMRI data (see Chen, Pine, et al., 2022). Each neuroimaging panel shows the same axial slice in MNI template space at z = 36S (image left = subject left), with thresholding is applied at voxelwise p = 0.001 and cluster size = 40 voxels (FWE = 5%). The data used for both overlay coloration and thresholding are the Z-score statistics. Panel A displays FMRI results using conventional strict (or opaque) thresholding, and shows one cluster in the right intraparietal sulcus. Panel B displays the same results with transparent thresholding (suprathreshold regions are opaque and outlined; subthreshold regions fade as the statistic decreases), revealing relevant context in the subthreshold regions that are hidden in A. Panel C shows a classic example from Anscombe (1973) of the risks of over-reducing data, here for a simple scatterplot. Panel D shows how the same considerations apply to neuroimaging: each dataset would have very different
Figure 5. Example images of transparent thresholding from various software implementations (and see Figs. 6 and 7 for more examples). Descriptions of the data and software usage are provided in the Supplements.
Figure 6. Example images of transparent thresholding from various software implementations (and see Figs. 5 and 7 for more examples). Descriptions of the data and software usage are provided in the Supplements.
Go Figure: Transparency in neuroscience images preserves context and clarifies interpretation
  • Preprint
  • File available

April 2025

·

182 Reads

Visualizations are vital for communicating scientific results. Historically, neuroimaging figures have only depicted regions that surpass a given statistical threshold. This practice substantially biases interpretation of the results and subsequent meta-analyses, particularly towards non-reproducibility. Here we advocate for a "transparent thresholding" approach that not only highlights statistically significant regions but also includes subthreshold locations, which provide key experimental context. This balances the dual needs of distilling modeling results and enabling informed interpretations for modern neuroimaging. We present four examples that demonstrate the many benefits of transparent thresholding, including: removing ambiguity, decreasing hypersensitivity to non-physiological features, catching potential artifacts, improving cross-study comparisons, reducing non-reproducibility biases, and clarifying interpretations. We also demonstrate the many software packages that implement transparent thresholding, several of which were added or streamlined recently as part of this work. A point-counterpoint discussion addresses issues with thresholding raised in real conversations with researchers in the field. We hope that by showing how transparent thresholding can drastically improve the interpretation (and reproducibility) of neuroimaging findings, more researchers will adopt this method.

Download

Figure 1. Results reporting examples, showing a single slice of task-based FMRI data (see Chen, Pine, et al., 2022). Each neuroimaging panel shows the same axial slice in MNI template space at z = 36S (image left = subject left), with thresholding is applied at voxelwise p = 0.001 and cluster size = 40 voxels (FWE = 5%). The data used for both overlay coloration and thresholding are the Z-score statistics. Panel A displays FMRI results using conventional strict (or opaque) thresholding, and shows one cluster in the right intraparietal sulcus. Panel B displays the same results with transparent thresholding (suprathreshold regions are opaque and outlined; subthreshold regions fade as the statistic decreases), revealing relevant context in the subthreshold regions that are hidden in A. Panel C shows a classic example from Anscombe (1973) of the risks of over-reducing data, here for a simple scatterplot. Panel D shows how the same considerations apply to neuroimaging: each dataset would have very different
Figure 5. Example images of transparent thresholding from various software implementations (and see Figs. 6 and 7 for more examples). Descriptions of the data and software usage are provided in the Supplements.
Figure 6. Example images of transparent thresholding from various software implementations (and see Figs. 5 and 7 for more examples). Descriptions of the data and software usage are provided in the Supplements.
Go Figure: Transparency in neuroscience images preserves context and clarifies interpretation

April 2025

·

78 Reads

Visualizations are vital for communicating scientific results. Historically, neuroimaging figures have only depicted regions that surpass a given statistical threshold. This practice substantially biases interpretation of the results and subsequent meta-analyses, particularly towards non-reproducibility. Here we advocate for a “transparent thresholding” approach that not only highlights statistically significant regions but also includes subthreshold locations, which provide key experimental context. This balances the dual needs of distilling modeling results and enabling informed interpretations for modern neuroimaging. We present four examples that demonstrate the many benefits of transparent thresholding, including: removing ambiguity, decreasing hypersensitivity to non-physiological features, catching potential artifacts, improving cross-study comparisons, reducing non-reproducibility biases, and clarifying interpretations. We also demonstrate the many software packages that implement transparent thresholding, several of which were added or streamlined recently as part of this work. A point-counterpoint discussion addresses issues with thresholding raised in real conversations with researchers in the field. We hope that by showing how transparent thresholding can drastically improve the interpretation (and reproducibility) of neuroimaging findings, more researchers will adopt this method.


Dissecting heterogeneity in cortical thickness abnormalities in major depressive disorder: a large-scale ENIGMA MDD normative modelling study

March 2025

·

102 Reads

Importance: Major depressive disorder (MDD) is highly heterogeneous, with marked individual differences in clinical presentation and neurobiology, which may obscure identification of structural brain abnormalities in MDD. To explore this, we used normative modeling to index regional patterns of variability in cortical thickness (CT) across individual patients. Objective: To use normative modeling in a large dataset from the ENIGMA MDD consortium to obtain individualised CT deviations from the norm (relative to age, sex and site) and examine the relationship between these deviations and clinical characteristics. Design, setting, and participants: A normative model adjusting for age, sex and site effects was trained on 35 CT measures from FreeSurfer parcellation of 3,181 healthy controls (HC) from 34 sites (40 scanners). Individualised z-score deviations from this norm for each CT measure were calculated for a test set of 2,119 HC and 3,645 individuals with MDD. For each individual, each CT z-score was classified as being within the normal range (95% of individuals) or within the extreme range (2.5% of individuals with the thinnest or thickest cortices). Main outcome measures: Z-score deviations of CT measures of MDD individuals as estimated from a normative model based on HC. Results: Z-score distributions of CT measures were largely overlapping between MDD and HC (minimum 92%, range 92-98%), with overall thinner cortices in MDD. 34.5% of MDD individuals, and 30% of HC individuals, showed an extreme deviation in at least one region, and these deviations were widely distributed across the brain. There was high heterogeneity in the spatial location of CT deviations across individuals with MDD: a maximum of 12% of individuals with MDD showed an extreme deviation in the same location. Extreme negative CT deviations were associated with having an earlier onset of depression and more severe depressive symptoms in the MDD group, and with higher BMI across MDD and HC groups. Extreme positive deviations were associated with being remitted, of not taking antidepressants and less severe symptoms. Conclusions and relevance: Our study illustrates a large heterogeneity in the spatial location of CT abnormalities across patients with MDD and confirms a substantial overlap of CT measures with HC. We also demonstrate that individualised extreme deviations can identify protective factors and individuals with a more severe clinical picture.


Figure 1: Proposed conceptualization levels and implementation of classification procedure. Left: Higher classification performance in MDD vs HC classification task can be achieved by implementing deep ML models, such as DenseNet, in comparison to a shallow ML model, for example, SVM. Furthermore, the analysis of integrated morphometric features can provide a more detailed description of cortical organization than separated features, leading to better differentiability of MDD from HC. The application of ComBat may improve the generalizability of results as site-related differences are removed. Right: Cortical sulcal depth, curvature, and thickness are first projected into the 2D grid and then transformed into 2D images using OMT projection. We split the data into 10 CV folds according to age and sex (Splitting by Age/Sex) and according to the site belonging (Splitting by Site). After the residualization step, where the age and sex effect are regressed out linearly, we train and test SVM and DenseNet on the diagnosis classification.
Figure 2: MDD vs HC classification performance of SVM and DenseNet applied to vertexwise cortical features. Balanced accuracy for both classification models when trained on all features integrated with and without ComBat harmonization for both splitting strategies and when trained on single features. Error bars represent standard deviation.
Participating sites. The total number of subjects, number of MDD patients and number of HCs, as well as their mean age (in years) and sex (number and % of females) is presented.
Classification of Major Depressive Disorder Using Vertex-Wise Brain Sulcal Depth, Curvature, and Thickness with a Deep and a Shallow Learning Model

January 2025

·

183 Reads

Major depressive disorder (MDD) is a complex psychiatric disorder that affects the lives of hundreds of millions of individuals around the globe. Even today, researchers debate if morphological alterations in the brain are linked to MDD, likely due to the heterogeneity of this disorder. The application of deep learning tools to neuroimaging data, capable of capturing complex non-linear patterns, has the potential to provide diagnostic and predictive biomarkers for MDD. However, previous attempts to demarcate MDD patients and healthy controls (HC) based on segmented cortical features via linear machine learning approaches have reported low accuracies. In this study, we used globally representative data from the ENIGMA-MDD working group containing 7,012 participants from 30 sites (N=2,772 MDD and N=4,240 HC), which allows a comprehensive analysis with generalizable results. Based on the hypothesis that integration of vertex-wise cortical features can improve classification performance, we evaluated the classification of a DenseNet and a Support Vector Machine (SVM), with the expectation that the former would outperform the latter. As we analyzed a multi-site sample, we additionally applied the ComBat harmonization tool to remove potential nuisance effects of site. We found that both classifiers exhibited close to chance performance (balanced accuracy DenseNet: 51%; SVM: 53%), when estimated on unseen sites. Slightly higher classification performance (balanced accuracy DenseNet: 58%; SVM: 55%) was found when the cross-validation folds contained subjects from all sites, indicating site effect. In conclusion, the integration of vertex-wise morphometric features and the use of the non-linear classifier did not lead to the differentiability between MDD and HC. Our results support the notion that MDD classification on this combination of features and classifiers is unfeasible. Future studies are needed to determine whether more sophisticated integration of information from other MRI modalities such as fMRI and DWI will lead to a higher performance in this diagnostic task.


Transdiagnostic alterations in white matter microstructure associated with suicidal thoughts and behaviours in the ENIGMA Suicidal Thoughts and Behaviours consortium

November 2024

·

228 Reads

Previous studies have suggested that alterations in white matter (WM) microstructure are implicated in suicidal thoughts and behaviours (STBs). However, findings of diffusion tensor imaging (DTI) studies have been inconsistent. In this large-scale mega-analysis conducted by the ENIGMA Suicidal Thoughts and Behaviours (ENIGMA-STB) consortium, we examined WM alterations associated with STBs. Data processing was standardised across sites, and resulting WM microstructure measures (fractional anisotropy, axial diffusivity, mean diffusivity and radial diffusivity) for 25 WM tracts were pooled across 40 cohorts. We compared these measures among individuals with a psychiatric diagnosis and lifetime history of suicide attempt (n=652; mean age=35.4, sd=14.7; female=71.8%), individuals with a psychiatric diagnosis but no STB (i.e., clinical controls; n=1871; mean age=34, sd=14.8; female=59.8%), and individuals with no mental disorder diagnosis and no STB (i.e., healthy controls; n=642; mean age=29.6, sd=13.1; female=62.9%). We also compared these measures among individuals with recent suicidal ideation (n=714; mean age=36.3, sd=15.3; female=66.1%), clinical controls (n=1184; mean age=36.8, sd=15.6; female=63.1%), and healthy controls (n=1240; mean age= 31.6, sd=15.5; female=61.0%). We found subtle but statistically significant effects, such as lower fractional anisotropy associated with a history of suicide attempt, over and above the effect of psychiatric diagnoses. These effects were strongest in the corona radiata, thalamic radiation, fornix/stria terminalis, corpus callosum and superior longitudinal fasciculus. Effect sizes were small (Cohens d < 0.25). Recent suicidal ideation was not associated with alterations in WM microstructure. This large-scale coordinated mega-analysis revealed subtle regional and global alterations in WM microstructure in individuals with a history of suicide attempt. Longitudinal studies are needed to confirm whether these alterations are a risk factor for suicidal behaviour.




Study design illustrating timing of 3T and 7T scans with respect to the infusions.
Box-whisker plots illustrating the (A) hippocampal (top) and (B) whole amygdalar (bottom) volumes at 3T for healthy volunteers (HVs) (red) and individuals with treatment-resistant depression (TRD) (blue) at baseline, acute, and interim scans after ketamine and placebo infusions. Dots indicate volume for individual participants; the boxplot illustrates the mean and quartiles of the distributions.
Subfield volume differences from baseline at acute and interim scans after ketamine and placebo infusions for (A) hippocampus and (B) amygdala at 3T. Abbreviations: HATA hippocampal-amygdaloid transition region; GC ML DG granule cells in the molecular layer of the dentate gyrus; CA1-4 cornu ammonis.
Bland-Altman⁴³ plot of total gray matter measured at baseline between 3 and 7T scans within the same individuals for bilateral whole amygdala and hippocampus. Each dot represents the difference between 3 and 7T total gray matter for an individual plotted against the mean of total gray matter across both field strengths for that same individual for the region specified in the plot title. The red lines are plotted at one standard deviation of the differences and the blue line represents the overall mean of the differences. The dots are coloured by the age of the participant.
Hippocampal volume changes after (R,S)-ketamine administration in patients with major depressive disorder and healthy volunteers

February 2024

·

39 Reads

·

5 Citations

The hippocampus and amygdala have been implicated in the pathophysiology and treatment of major depressive disorder (MDD). Preclinical models suggest that stress-related changes in these regions can be reversed by antidepressants, including ketamine. Clinical studies have identified reduced volumes in MDD that are thought to be potentiated by early life stress and worsened by repeated depressive episodes. This study used 3T and 7T structural magnetic resonance imaging data to examine longitudinal changes in hippocampal and amygdalar subfield volumes associated with ketamine treatment. Data were drawn from a previous double-blind, placebo-controlled, crossover trial of healthy volunteers (HVs) unmedicated individuals with treatment-resistant depression (TRD) (3T: 18 HV, 26 TRD, 7T: 17 HV, 30 TRD) who were scanned at baseline and twice following either a 40 min IV ketamine (0.5 mg/kg) or saline infusion (acute: 1–2 days, interim: 9–10 days post infusion). No baseline differences were noted between the two groups. At 10 days post-infusion, a slight increase was observed between ketamine and placebo scans in whole left amygdalar volume in individuals with TRD. No other differences were found between individuals with TRD and HVs at either field strength. These findings shed light on the timing of ketamine’s effects on cortical structures.


Feature weights of support vector machines (SVM) with the linear kernel. To assess the decision-making of SVM to differentiate subjects with major depressive disorder (MDD) from healthy controls (HC), we investigate the importance of the structural brain features by looking at the corresponding feature weights for the regional cortical surface areas, cortical thicknesses and subcortical volumes. The horizontal bars indicate the 95% confidence interval calculated using percentile method via bootstrapping.
The most informative features for classification including regional cortical surface areas, thicknesses and subcortical volumes, trained on the whole data set without and with ComBat harmonization. Increased and decreased feature weight values in the major depressive disorder (MDD) group are represented by red and blue colormap, respectively.
Detailed analysis pipeline. Initial data from all cohorts is split into training and test sets according to splitting strategies (Splitting by Age/Sex and Splitting by Site) after removing subjects with more than 75% missing data and data imputation steps. The corresponding training folds are then residualized directly to remove ICV, age and sex related effects and fed to the classification algorithms. In case of harmonization by ComBat, the residualization step takes place after the harmonization step is conducted. If training folds were harmonized by ComBat, the test fold was harmonized as well by using ComBat estimates from the training folds. Next, the test fold was residualized by using estimates obtained from the training folds. We estimated classification performance on the residualized test fold. This routine was performed iteratively for each combination of training and test folds.
Multi-site benchmark classification of major depressive disorder using machine learning on cortical and subcortical measures

January 2024

·

276 Reads

·

22 Citations

Machine learning (ML) techniques have gained popularity in the neuroimaging field due to their potential for classifying neuropsychiatric disorders. However, the diagnostic predictive power of the existing algorithms has been limited by small sample sizes, lack of representativeness, data leakage, and/or overfitting. Here, we overcome these limitations with the largest multi-site sample size to date (N = 5365) to provide a generalizable ML classification benchmark of major depressive disorder (MDD) using shallow linear and non-linear models. Leveraging brain measures from standardized ENIGMA analysis pipelines in FreeSurfer, we were able to classify MDD versus healthy controls (HC) with a balanced accuracy of around 62%. But after harmonizing the data, e.g., using ComBat, the balanced accuracy dropped to approximately 52%. Accuracy results close to random chance levels were also observed in stratified groups according to age of onset, antidepressant use, number of episodes and sex. Future studies incorporating higher dimensional brain imaging/phenotype features, and/or using more advanced machine and deep learning methods may yield more encouraging prospects.


Citations (21)


... 28 Furthermore, a study reported increased amygdalar volumes in patients suffering from treatment-resistant depression 10 days after ketamine infusion, which highlights the importance of the timing of measurements, revealing different adaptions in the brain in temporal orders. 29 Hence, it can be assumed that neurobiological adaptions follow certain sequential processes evoking short and long-term adaptions in the human brain. On the other hand, it was shown that ketamine does not only increase certain brain volumes, but chronic ketamine (mis-)use may lead to lower gray matter volumes. ...

Reference:

Acute effects of intranasal esketamine application on thalamic structures in healthy individuals
Hippocampal volume changes after (R,S)-ketamine administration in patients with major depressive disorder and healthy volunteers

... Indeed, comparing our results to existing classification rs-fMRI studies, our ML test performance is aligned with or surpasses studies using similar sample sizes (Bondi et al., 2023). Furthermore, our approach remains competitive against large-scale efforts, such as the ENIGMA consortium study by Belov et al., which reported balanced test accuracies between 52% and 63% using only structural data (Belov et al., 2024). ...

Multi-site benchmark classification of major depressive disorder using machine learning on cortical and subcortical measures

... In patients with MDD or TRD, ketamine treatments have been shown to produce sustained changes in connectivity that differ from baseline (Figure 1e). Notable findings include increased connectivity between the FPN-limbic system (Vasavada et al. 2021), DMN-limbic system (Alexander et al. 2023), and FPN-DMN (Gärtner et al. 2019), as well as decreased connectivity between the DMN-limbic system (Abdallah et al. 2017) and DMN-SN (Chen et al. 2019). These alterations could disrupt the cycle of maladaptive thoughts, thereby enhancing patients' control over their emotional responses. ...

Preliminary evidence that ketamine alters anterior cingulate resting-state functional connectivity in depressed individuals

Translational Psychiatry

... Определенные надежды связываются с такими инструментальными методами, как функциональная магнитнорезонансная томография (МРТ) и позитронноэмиссионная томография (ПЭТ) головного мозга, однако и они обладают диагностической специфичностью и являются весьма дорогостоящими, требующими высокой квалификации сотрудников и специально оборудованных помещений. К тому же они не обладают возможностями определения и квалификации характеристик динамических, текущих процессов ввиду низкого временного разрешения данных методов исследований, а поэтому они малодоступны для широкого применения [4,5]. ...

Review: The use of functional magnetic resonance imaging (fMRI) in clinical trials and experimental research studies for depression

Frontiers in Neuroimaging

... At present, research exploring ketamine's direct psychological mechanisms using task-based fMRI is limited. 122 Our synthesized evidence for ketamine's antidepressive neurocognitive mechanisms requires validation via paradigms that directly test unique human higher-order cognitive processes. It is also worth highlighting that depression is a highly heterogeneous disorder. ...

Functional MRI markers for treatment-resistant depression: Insights and challenges
  • Citing Chapter
  • June 2023

Progress in Brain Research

... 46 Significant changes in the spectral pattern of Glu H4 were observed across the TE range of 68-88 ms (Figure 4) due to both the strong scalar coupling between the two geminal H4 protons of Glu and the weak couplings between the H3 and H4 protons of Glu. 47 At TE = 68 ms, a substantial positive peak appeared at the upfield end of the Glu H4 signal. This positive peak dominated the overlap between the Glu H4 signal and the positive GABA H2 signal, resulting in a negative correlation coefficient with a large magnitude (r = −0.71). ...

Roles of Strong Scalar Couplings in Maximizing Glutamate, Glutamine and Glutathione Pseudo Singlets at 7 Tesla

Frontiers in Physics

... Biological samples and neuroimaging data may be other predictors to increase accuracy. Neuroimaging studies have revealed structural differences in brain regions linked to suicidal tendencies [86][87][88] . Potential biomarkers include abnormalities in serotonin signaling and inflammatory markers like IL-1β and IL-6 levels 89,90 . ...

Brain Correlates of Suicide Attempt in 18,925 Participants Across 18 International Cohorts
  • Citing Article
  • March 2021

Biological Psychiatry

... 23,30,[33][34][35][36][37] In clinical settings, imaging studies showed that depressed patients display altered activity of the NAc, 34,38 and a recent study, which demonstrated that depressed subjects displayed clinical improvement after a single administration of ketamine, found that this amelioration was paralleled by normalization of fronto-striatal connectivity evaluated by resting-state functional magnetic resonance imaging. 39 Despite this evidence, a circuit and synaptic-level dissection of whether ketamine acts against anhedonia by acting on the NAc is lacking. Moreover, given that stress determines opposing physiological changes in MSN subtypes, such a dissection needs to take these findings into account and tackle the question with cellular resolution. ...

Ketamine modulates fronto-striatal circuitry in depressed and healthy individuals

Molecular Psychiatry

... In accordance with modern concepts, any human activity and behavior are maintained by dynamically organized and spatially distributed functional neuroanatomical systems that are characterized by their anatomical composition, the level of functional activity of their elements, and connections between them. The modern literature also contains a large amount of neuroimaging data that describes both the activity of such systems in a healthy brain and their reorganization in psychiatric diseases [2][3][4]. Based on accumulated data, numerous models of the reorganization of brain work during the development of mental disorders have been created. From the literature, one can see the discrepancy between deep scientific ideas about pathological reorganization of the brain mechanisms and the practical application of this knowledge. ...

ENIGMA MDD: seven years of global neuroimaging studies of major depression through worldwide data sharing

Translational Psychiatry

... These components have distinct evolu tionary origins 10 and are regulated by different genetic and cellular processes. 11,12 A recent metaanalysis with 18 925 adult participants (6448 with depression, of whom 694 had a his tory of suicide attempt) indicated that on average, people with depression who had attempted suicide had a lower left inferior parietal lobe surface area than people with depres sion who had not attempted suicide, 13 but the 2 groups showed no differences in cortical thickness. In addition to identifying differences in cortical surface area, the same metaanalysis found that smaller volumes of the bilateral thalami and right pallidum characterized people who had attempted suicide compared to clinical controls. ...

Brain correlates of suicide attempt in 18,925 participants across 18 international cohorts