Lawrence O. Hall’s research while affiliated with University of South Florida and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (303)


Fig. 1. Illustration of the workflow for image classification with GPT-4o.
Comparison of accuracy and ground-truth annotation time
Active Prompt Tuning Enables Gpt-40 To Do Efficient Classification Of Microscopy Images
  • Preprint
  • File available

November 2024

·

30 Reads

Abhiram Kandiyana

·

·

·

[...]

·

Traditional deep learning-based methods for classifying cellular features in microscopy images require time- and labor-intensive processes for training models. Among the current limitations are major time commitments from domain experts for accurate ground truth preparation; and the need for a large amount of input image data. We previously proposed a solution that overcomes these challenges using OpenAI's GPT-4(V) model on a pilot dataset (Iba-1 immuno-stained tissue sections from 11 mouse brains). Results on the pilot dataset were equivalent in accuracy and with a substantial improvement in throughput efficiency compared to the baseline using a traditional Convolutional Neural Net (CNN)-based approach. The present study builds upon this framework using a second unique and substantially larger dataset of microscopy images. Our current approach uses a newer and faster model, GPT-4o, along with improved prompts. It was evaluated on a microscopy image dataset captured at low (10x) magnification from cresyl-violet-stained sections through the cerebellum of a total of 18 mouse brains (9 Lurcher mice, 9 wild-type controls). We used our approach to classify these images either as a control group or Lurcher mutant. Using 6 mice in the prompt set the results were correct classification for 11 out of the 12 mice (92%) with 96% higher efficiency, reduced image requirements, and lower demands on time and effort of domain experts compared to the baseline method (snapshot ensemble of CNN models). These results confirm that our approach is effective across multiple datasets from different brain regions and magnifications, with minimal overhead.

Download



Applications of Automatic Unbiased Stereology to Neural Tissue

June 2024

·

14 Reads

Stereology methods are the current best practice for state-of-the-art quantification of stained biostructures in tissue sections. Over the past six decades, stereological applications to bioscience research have added knowledge about neurostructural changes in normal aging, age-related neurodegeneration (Alzheimer’s, Parkinson’s, ALS), oncology, toxic exposures, learning and memory behavior, a range of mental illnesses, and drug discovery to name a few. In current practice, however, all computer-assisted stereology systems require time-consuming manual data collection by well-trained experts, which is prone to inter-rater error due to the variation in training, experience, motivation, and fatigue. Recent work shows these limitations can be overcome by novel developments in artificial intelligence (AI)-based deep learning (DL) as shown in other domains for solving similar problems. In this chapter, we review basic stereology concepts for avoiding bias, tissue processing, and tissue sampling for quantifying total cell number in tissue sections and show how DL approaches can automatically collect stereology data with comparable accuracy but superior precision (reproducibility) and throughput efficiency as manual stereology methods. Finally, we identify the major challenges to automation using DL approaches and review novel applications to the quantification of stained biostructures using unbiased methods.


Comparison of Methods for Counting Neurons and Neuron Profiles in Brain Sections

June 2024

·

19 Reads

Advances in high-resolution microscopy and cell staining techniques allow for quantifying well-stained cells and other microstructures in anatomically defined regions of interest (ROIs) in ex vivo brains from humans and animal models of human disease. The current best practice is the optical disector, an unbiased stereology method for counting individual cells or other tissue deposits while manually focusing a thin focal plane through a known z-axis volume (optical disector). By eliminating all known sources of methodological bias, the optical disector method ensures that the ex vivo count of the total number of neurons (∑NCells) converges on true in vivo values. A current limitation with all computer-assisted methods is the time, labor, and effort for counting (clicking) on a hundred or more cells as the counting process repeats at 100–200 x-y locations through each ROI. Faster data collection methods include semiquantitative approaches for sampling and counting 2D neuron profiles (∑NProf), though these methods can include unknown and unknowable amounts of systematic error (bias). Here we compare accuracy, reproducibility, and efficiency of counting total number of NeuN-immunostained neurons using the manual optical disector (gold standard) versus three techniques for counts of NeuN profiles. All counts of ∑NCells and ∑NProf were done to the same sampling intensity in the same high-power (100×) stacks of z-axis images (i.e., disector stacks) collected in a systematic-random manner through the entire neocortex (NCTX) in six mouse brains. The findings showed no statistical differences between ∑NCells by the gold standard and ∑NProf counts using three methods (fully manual, semiautomatic, fully automatic). Reproducibility (inter-rater error) by two similarly trained data collectors ranged from 0% for fully automatic counts of ∑NProf to ~5% for ∑NCells counts by gold standard with average data collection times between <1 min per case to ~12 min per case, respectively. Notably, all three methods for counting ∑NProf required a significant amount of unsupervised time, i.e., from 30 to 90 min per case, for preprocessing image stacks prior to data collection. Estimates of ∑NCells by the manual optical disector required no preprocessing. In summary, the manual optical disector remains the best practice due to the avoidance of all known sources of methodological bias, low inter-rater error, and moderate-to-high throughput. These results provide a baseline for comparisons with other methods for quantifying stained cells in tissue sections, including novel deep learning approaches for automatic counts of well-stained cells and other microstructures in the brain.


Synergizing Deep Learning-Enabled Preprocessing and Human–AI Integration for Efficient Automatic Ground Truth Generation

April 2024

·

34 Reads

The progress of incorporating deep learning in the field of medical image interpretation has been greatly hindered due to the tremendous cost and time associated with generating ground truth for supervised machine learning, alongside concerns about the inconsistent quality of images acquired. Active learning offers a potential solution to these problems of expanding dataset ground truth by algorithmically choosing the most informative samples for ground truth labeling. Still, this effort incurs the costs of human labeling, which needs minimization. Furthermore, automatic labeling approaches employing active learning often exhibit overfitting tendencies while selecting samples closely aligned with the training set distribution and excluding out-of-distribution samples, which could potentially improve the model’s effectiveness. We propose that the majority of out-of-distribution instances can be attributed to inconsistent cross images. Since the FDA approved the first whole-slide image system for medical diagnosis in 2017, whole-slide images have provided enriched critical information to advance the field of automated histopathology. Here, we exemplify the benefits of a novel deep learning strategy that utilizes high-resolution whole-slide microscopic images. We quantitatively assess and visually highlight the inconsistencies within the whole-slide image dataset employed in this study. Accordingly, we introduce a deep learning-based preprocessing algorithm designed to normalize unknown samples to the training set distribution, effectively mitigating the overfitting issue. Consequently, our approach significantly increases the amount of automatic region-of-interest ground truth labeling on high-resolution whole-slide images using active deep learning. We accept 92% of the automatic labels generated for our unlabeled data cohort, expanding the labeled dataset by 845%. Additionally, we demonstrate expert time savings of 96% relative to manual expert ground-truth labeling.



A novel deep learning-based method for automatic stereology of microglia cells from low magnification images

February 2024

·

149 Reads

·

3 Citations

Neurotoxicology and Teratology

Microglial cells mediate diverse homeostatic, inflammatory, and immune processes during normal development and in response to cytotoxic challenges. During these functional activities, microglial cells undergo distinct numerical and morphological changes in different tissue volumes in both rodent and human brains. However, it remains unclear how these cytostructural changes in microglia correlate with region-specific neurochemical functions. To better understand these relationships, neuroscientists need accurate, reproducible, and efficient methods for quantifying microglial cell number and morphologies in histological sections. To address this deficit, we developed a novel deep learning (DL)-based classification, stereology approach that links the appearance of Iba1 immunostained microglial cells at low magnification (20×) with the total number of cells in the same brain region based on unbiased stereology counts as ground truth. Once DL models are trained, total microglial cell numbers in specific regions of interest can be estimated and treatment groups predicted in a high-throughput manner (<1 min) using only low-power images from test cases, without the need for time and labor-intensive stereology counts or morphology ratings in test cases. Results for this DL-based automatic stereology approach on two datasets (total 39 mouse brains) showed >90% accuracy, 100% percent repeatability (Test-Retest) and 60× greater efficiency than manual stereology (<1 min vs. ∼ 60 min) using the same tissue sections. Ongoing and future work includes use of this DL-based approach to establish clear neurodegeneration profiles in age-related human neurological diseases and related animal models.


GPU Memory allocation (%) per LLM fine-tuned using AdamW 8bits and 32bits optimizer.
The results of prompting the fine-tuned GPT-NeoXT-chat-20B models with two instructions provided in Section X. We show the output for each fine-tuned model using both AdamW 8bits and 32bits.
The fine-tuned time and test time in seconds for GPT-NeoX-20B and Llama2-7B models for seven runs.
The fine-tuned time time and test time in seconds for GPT-NeoXT-chat-20B and Llama2-chat-7B models for seven runs.
Repeatability of Fine-Tuning Large Language Models Illustrated Using QLoRA

January 2024

·

172 Reads

·

5 Citations

IEEE Access

Large language models (LLMs) have shown progress and promise in diverse applications ranging from the medical field to chat bots. Developing LLMs requires a large corpus of data and significant computation resources to achieve efficient learning. Foundation models (in particular LLMs) serve as the basis for fine-tuning on a new corpus of data. Since the original foundation models contain a very large number of parameters, fine-tuning them can be quite challenging. Development of the low-rank adaption technique (LoRA) for fine-tuning, and the quantized version of LoRA, also known as QLoRA, allows for fine-tuning of LLMs on a new smaller corpus of data. This paper focuses on the repeatability of fine-tuning four LLMs using QLoRA. We have fine-tuned them for seven trials each under the same hardware and software settings. We also validated our study for the repeatability (stability) issue by fine-tuning LLMs on two public datasets. For each trial, each LLM was fine-tuned on a subset of the dataset and tested on a holdout test set. Fine-tuning and inference were done on a single GPU. Our study shows that fine-tuning of LLMs with the QLoRA method is not repeatable (not stable), such that different fine-tuned runs result in different performance on the holdout test set.


COMPARISON OF STEREOLOGY METHODS FOR ASSESSING AGE-RELATED EFFECTS ON IMMUNOSTAINED BRAIN CELLS

December 2023

·

41 Reads

Innovation in Aging

The primary benefit of stereology methods is quantification of well-stained biological objects in tissue sections with the ability to adjust sampling intensity to achieve desired levels of precision. The advent of hand-crafted algorithms and artificial intelligence-based deep learning (DL) provides an opportunity for more standardized collection of stereology data with enhanced efficiency and higher reproducibility compared to state-of-the-art manual stereology. We contrasted and compared the performance of four manual, semi-automatic, and fully automatic approaches for generating data for total number of Neu-N immunostained neurons in neocortex (NCTX) in the mouse brain. The gold standard for these studies was manual counts using the state-of-the-art optical fractionator method on 3-D reconstructed serial z-axis image stacks through a known tissue volume (disector stacks). To allow for direct methodological comparisons on the same images, disector stacks were automatically converted into extended depth of field (EDF) images in which all neurons in the disector stack were imaged at each cell’s maximal plane of focus. Total number of Neu-N neurons on the same EDF images were counted by a fully automatic hand-crafted method [automatic segmentation algorithm (ASA)] and a semi-automatic method [ASA counts manually corrected for false positives and negatives]. All comparison counts were done using unbiased frames and counting rules with total counts of NeuN-immunostained neurons by the optical fractionator method. The results were comparable across methods with wide variations in throughput efficiency and inter-rater agreement. These results are discussed with respect to applications to experimental studies of brain aging, neuroinflammation and neurodegenerative disease.


Citations (73)


... To solve this problem, several parameter-efficient fine-tuning methods have emerged, such as P-Tuning, LoRA, and QLoRA. Among them, LoRA is suitable for scenarios that require fast fine-tuning and have limitations on storage and computational resources; QLoRA, an extension of LoRA, employs quantization techniques to further reduce the model's computational demands, thereby significantly lowering training costs; and P-Tuning is more general, with outstanding performance in dealing with complex questions and tasks, which is more suitable for the research tasks in this paper [24][25][26]. In particular, ChatGLM officially provides the P-Tuning v2 fine-tuning scheme, which provides strong support for efficient fine-tuning of model parameters. ...

Reference:

Privacy-Preserving Information Extraction for Ethical Case Studies in Machine Learning Using ChatGLM-LtMP
Repeatability of Fine-Tuning Large Language Models Illustrated Using QLoRA

IEEE Access

... Our previous work used the then state-of-the-art VLM model, GPT-4(Vision), for the classification of Iba-1 immunostained microglia cells in the hippocampus of tissue sections through brains of mice treated with either a powerful neurotoxin (tri-methyl tin, TMT) or saline [11]. A pilot study on a subset of data from the current study compared GPT-4(Vision) and GPT-4o. ...

Active Prompting of Vision Language Models for Human-in-the-loop Classification and Explanation of Microscopy Images
  • Citing Conference Paper
  • June 2024

... We categorized microglia into the following morphological phenotypes: rod-type, round, non-ramified, ramified, and amoeboid. Many morphological phenotypes of microglia have been identified in published studies using morphometric analysis methods, including those based on machine learning, such as homeostatic, activated, reactive, hy-pertrophic, dystrophic, non-ramified, branched, hyperbranched, spiny, bushy, inflamed, rod-type, amoeboid, large amoeboid, Hitter cells, "fried egg", bipolar, honeycomb-shaped, and jellyfish [6,44,47,[50][51][52][53][54][55][56][57]. To date, there is no uniform classification of microglial morphological phenotypes, which makes it difficult to compare the results of different studies. ...

A novel deep learning-based method for automatic stereology of microglia cells from low magnification images
  • Citing Article
  • February 2024

Neurotoxicology and Teratology

... Accurate assessments of cellular damage are essential for evaluating brain aging, neurotoxicology, and the efficacy of potential treatments for neurological conditions such as Parkinson's, Huntington's, and Alzheimer's diseases. Previous work shows deep learning architectures can be trained to automatically quantify cell loss in defined regions of interest (ROIs) using disector-based multiple-input multiple-output (MIMO) frameworks [1,2]. This approach shows comparable accuracy compared to manual stereology counts by humans (>90%) with improved throughput efficiency and less supervised time from experts. ...

MIMO YOLO - A Multiple Input Multiple Output Model for Automatic Cell Counting
  • Citing Conference Paper
  • June 2023

... 59 Some studies have employed image segmentation to help diagnose prostate cancer. 9,14,22,62,80,83,84 Object detection: It is the process of recognizing and localizing objects inside an image by assigning bounding boxes to relevant regions. 75 Several studies apply object detection to prostate cancer diagnosis. ...

Unsupervised Prostate Cancer Histopathology Image Segmentation via Meta-Learning
  • Citing Conference Paper
  • June 2023

... In the digital era, the landscape of information sharing has been revolutionized (Iamnitchi et al. 2023), necessitating a nuanced understanding of "information cascade", which sheds light on how information diffuses through social networks (Bikhchandani, Hirshleifer, and Welch 1992;Jalili and Perc 2017). Cascade prediction, a vital task in this field, concentrates on delineating the trajectories of information (Gong et al. 2023;Guille et al. 2013). ...

Modeling information diffusion in social media: data-driven observations

Frontiers in Big Data

... The baseline scores presented by the authors focus on distinguishing primary Gleason patterns in individual glands or small patches of prostate whole slide images (WSI). The study demonstrates that a CNN deep learning (DL) model can accurately discriminate malignant patterns from benign tissue [25]. ...

Classifying Malignancy in Prostate Glandular Structures from Biopsy Scans with Deep Learning

... To determine sensible value ranges and scaling steps for IF parameters, we relied on the examples found in literature [3]. To select parameters and their ranges for XGBoost, we also relied on the examples found in the literature [5], [30], [6]. Some parameter values were adjusted based on the empirical findings. ...

Simulating New and Old Twitter User Activity with XGBoost and Probabilistic Hybrid Models
  • Citing Conference Paper
  • December 2022

... Deep learning models [6] offer a promising avenue to enhance this process and support cytologists to manage their increasing workload by classifying cells as healthy or unhealthy. In medical imaging, extensive efforts have been dedicated to develop deep learning methods for various tasks in cell analysis, including segmentation [7,8], detection [9,10], and classification [11,12]. However, Pap smear cell classification remains challenging due to the limited number of publicy available dataset [13,14,15], the presence of images unsuitable for evaluation (e.g., artifacts, poor resolution) and the class imbalance, where unhealthy cells are significantly outnumbered by healthy ones, as illustrated by Figure 2. To address this, the PS3C Challenge introduced the APACC dataset [5] to facilitate the development and evaluation of algorithms capable of classifying pep smell images. ...

A Review of Nuclei Detection and Segmentation on Microscopy Images Using Deep Learning With Applications to Unbiased Stereology Counting
  • Citing Article
  • November 2022

IEEE Transactions on Neural Networks and Learning Systems

... It performs more realistic qualification in image detection with lesions of the network performance [9]. Various research group proposed a different image processing and image processing and image processing to identify the gastrointestinal tract diseases in using wireless endoscopic images, but it takes large number of times to predict, so using pretrained deep learning model [10] to classify the diseases of alimentary tract infection from images of wireless endoscopic visuals. In this study make some following contributions. ...

Reduction of Video Capsule Endoscopy Reading Times Using Deep Learning with Small Data