Shilpa Gite

Shilpa Gite
  • Mtech(IT),PhD in Computer Science(Deep Learning)
  • Professor (Associate) at Symbiosis International University

About

156
Publications
109,368
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,839
Citations
Introduction
I have completed my PhD in deep learning for driver assistance. I would like to collaborate now in other research problems as well. My research interest areas are Machine learning, deep learning, computer vision etc.
Current institution
Symbiosis International University
Current position
  • Professor (Associate)
Additional affiliations
December 2020 - present
Symbiosis centre for Applied AI
Position
  • Research Faculty
July 2012 - present
Symbiosis Institute of Technology
Position
  • Professor (Associate)
Education
August 2013 - November 2019
Symbiosis International University
Field of study
  • Deep learning

Publications

Publications (156)
Article
Full-text available
The development of generative architectures has resulted in numerous novel deep-learning models that generate images using text inputs. However, humans naturally use speech for visualization prompts. Therefore, this paper proposes an architecture that integrates speech prompts as input to image-generation Generative Adver-sarial Networks (GANs) mod...
Article
Full-text available
Growing threats in public spaces have forced people to question personal security, making technology more relevant, especially in speech recognition. This paper proposes a security safety system by considering keyword and negative emotion detection to solve this problem. It detects the wake-up word "ON" whenever it is spoken with negative emotion....
Article
Full-text available
Accurate, reliable and transparent crop yield prediction is crucial for informed decision-making by governments, farmers, and businesses regarding food security as well as agricultural business and management. Deep learning (DL) methods, particularly Long Short-Term Memory networks, have emerged as one of the most widely used architectures in yield...
Article
Full-text available
The timely and reliable prediction of crop yields on a larger scale is crucial for ensuring a stable food supply and food security. In the last few years, many studies have demonstrated that deep learning can offer reliable solutions for crop yield prediction. However, a key challenge in applying deep-learning models to crop yield prediction is the...
Article
Full-text available
This article presents a thorough examination of the progress and limitations in the application of Natural Language Processing (NLP) and Machine Learning (ML), particularly Deep Learning (DL), in the healthcare industry. This paper examines the progress and limitations in the utilisation of Natural Language Processing (NLP) and Machine Learning (ML...
Article
Recent advancements in artificial intelligence (AI) have increased interest in intelligent transportation systems, particularly autonomous vehicles. Safe navigation in traffic-heavy environments requires accurate road scene segmentation, yet traditional computer vision methods struggle with complex scenarios. This study emphasizes the role of deep...
Article
Cerebral palsy (CP) is a group of disorders that alters patients’ muscle coordination, posture, and movement, resulting in a wide range of deformities. Cerebral palsy can be caused by various factors, both prenatal and postnatal, such as infections or injuries that damage different parts of the brain. As brain plasticity is more prevalent during ch...
Article
This study addresses the challenges associated with landslide prediction due to limited geotechnical data and high costs. A comprehensive approach is proposed to address these challenges. It integrates soil mechanics computations, rainfall infiltration analysis and synthetic aperture radar (SAR) time-series data to develop a transparent and unified...
Chapter
Public safety is a massive concern for governments and civilians alike. Measures taken to ensure public safety range from urban and safe city planning at the government level to self-defence programs for the public. Among these measures, baggage checking and surveillance systems are already in place. The motivation for this paper is to enhance thes...
Chapter
The early diagnosis of skin cancer has significantly improved with the use of computer-aided techniques and deep learning (DL) models. However, existing methods often struggle with issues of interpretability and adaptability, which are crucial for clinical application. To address these limitations, we employed a Multi-Task Learning (MTL) approach t...
Chapter
As the sensor technology and reliability of the obstacle detection techniques advances automated driving is going to become the most pivotal technology that will be the birth of the next revolution in mobility industry. The impact of autonomous systems will no longer be limited to automobile industry, but to the mankind. Controversies, discussions,...
Chapter
The human body can undergo changes in blood cell structure and characteristics when affected by contaminants. Examining the tiny images of blood cells helps identify potential infections or irregularities within the body, aiming to detect diseases. Accurately segmenting these cells significantly enhances disease detection, making it more precise an...
Chapter
Skin lesions are a severe disease and the most predominant type of cancer worldwide. It is becoming more prevalent in modern society, with rising cases every year. The World Health Organization (WHO) claims melanoma is from the most severe forms of skin cancer, affecting well over 100,000 people worldwide each year. While early-stage lesions are fr...
Data
This study presents a dataset consisting of 268 retinal images from 179 individuals, including 133 left-eye and 135 right-eye images, collected from Natasha Eye Care and Research Institute in Pune, Maharashtra, India. The images were captured using a nonmydriatic Optical Coherence Tomography Angiography (OCTA) device, specifically the Optovue Avant...
Preprint
Full-text available
This study presents a dataset consisting of 268 retinal images from 179 individuals, including 133 left-eye and 135 right-eye images, collected from Natasha Eye Care and Research Institute in Pune, Maharashtra, India. The images were captured using a nonmydriatic Optical Coherence Tomography Angiography (OCTA) device, specifically the Optovue Avant...
Article
Detecting psychological disorders, particularly depression, is a complex and critical task within the realm of mental health assessment. This research explores a novel approach to improve the identification of psychological distresses, such as depression, by addressing the subjectivity, complexity, and biasness inherent in traditional diagnostic te...
Conference Paper
Full-text available
Increased production in agriculture is anticipated with localized management that is founded on several analyses of the crops and soil. The following issues arise with this method: it takes a long time because there are a lot of measurement locations, different sensors are needed for different measures and the sun and wind might cause inaccurate da...
Article
The prevalence of diabetic retinopathy (DR) among the geriatric population poses significant challenges for early detection and management. Optical Coherence Tomography Angiography (OCTA) combined with Deep Learning presents a promising avenue for improving diagnostic accuracy in this vulnerable demographic. In this method, we propose an innovative...
Article
Full-text available
Recently, Artificial Intelligence (AI) has seen significant progress, especially in Natural Language Processing (NLP) and Conversational AI, making response generation more efficient. This advancement, combined with increased availability of conversational data, has greatly improved conversational bots, thus enhancing their effectiveness and scope....
Article
Full-text available
Large language models (LLMs) have transformed open-domain abstractive summarization, delivering coherent and precise summaries. However, their adaptability to user knowledge levels is largely unexplored. This study investigates LLMs’ efficacy in tailoring summaries to user familiarity. We assess various LLM architectures across different familiarit...
Article
Full-text available
Vehicular traffic significantly contributes to economic growth but generates frictional noise that impacts urban environments negatively. Road traffic is a primary noise source, causing annoyance and interference. Traditional regression models predict two-dimensional (2D) noise maps, but this study explores the impact and visualization of noise usi...
Article
Full-text available
This study explores the fusion of artificial intelligence (AI) and machine learning (ML) methods within anti–money laundering (AML) frameworks using data from the US Treasury’s Financial Crimes Enforcement Network (FinCEN). ML and deep learning (DL) algorithms—such as random forest classifier, elastic net regressor, least absolute shrinkage and sel...
Article
Full-text available
This study aims to compare deep learning explainability (DLE) with explainable artificial intelligence and causal artificial intelligence (Causal AI) for fraud detection, emphasizing their distinct methodologies and potential to address critical challenges, particularly in finance. An empirical evaluation was conducted using the Bank Account Fraud...
Article
Full-text available
The study focuses on intelligent driving, emphasizing the importance of recognizing nearby vehicles and estimating their positions using visual input from a single image. It employs transfer learning techniques, integrating deep convolutional networks’ features into a modified CenterNet model for six-degrees-of-freedom (6DoF) vehicle position estim...
Article
This paper introduces an airborne object dataset comprising 22,516 images categorizing four classes of airborne objects: airplanes, helicopters, drones, and birds. The dataset was compiled from YouTube-8 M, Anti-UAV, and Ahmed Mohsen's dataset hosted on Roboflow. Videos were sourced from the first two platforms and converted into individual frames,...
Article
Introduction: We aimed to develop machine learning (ML) algorithms for the automated prediction of postoperative ureteroscopy outcomes for pediatric kidney stones based on preoperative characteristics. Materials and Methods: Data from pediatric patients who underwent ureteroscopy for stone treatment by a single experienced surgeon, between 2010 and...
Article
Full-text available
Automated mathematical problem-solving represents a unique intersection of natural language processing (NLP) and mathematical reasoning, posing significant challenges in semantic comprehension and logical deduction. This survey paper explores the domain of mathematical word problems (MWPs), focusing on the nuanced integration of linguistic understa...
Article
Background and objective The integration of machine learning (ML) in health care has garnered significant attention because of its unprecedented opportunities to enhance patient care and outcomes. In this study, we trained ML algorithms for automated prediction of outcomes of ureteroscopic laser lithotripsy (URSL) on the basis of preoperative chara...
Chapter
This paper explores the domain of story generation and presents a novel approach that uses Generative Adversarial Networks (GANs), Recurrent Neural Networks (RNNs), and Long Short-Term Memory (LSTM) networks. The objective is to generate realistic and engaging stories for children. The traditional language models are proficient in maintaining gramm...
Article
Full-text available
In leukemia diagnosis, automating the process of decision-making can reduce the impact of individual pathologists' expertise. While deep learning models have demonstrated promise in disease diagnosis, combining them can yield superior results. This research introduces an ensemble model that merges two pre-trained deep learning models, namely, VGG-1...
Article
Full-text available
The field of natural language processing (NLP) and conversational artificial intelligence (AI) has one ingenious application in the psychological space. Depression and anxiety are two major issues that the world is facing, with close to 41% of adults reporting these symptoms in the United States alone, as of December 2020. It has also been observed...
Article
Full-text available
This article presents a Multimodal database consisting of 222 images of 76 people wherein 111 are OCTA images and 111 are color fundus images taken at the Natasha Eye Care and Research Institute of Pune Maharashtra, India. Nonmydriatic fundus images were acquired using a confocal SLO widefield fundus imaging Eidon machine. Nonmydriatic OCTA images...
Article
Full-text available
Liveness face detection is essential for modern biometric systems, ensuring that input data is genuine and not derived from a false image or video. Liveness face detection in today’s biometric systems will ensure that input comes from a real, live person rather than a manipulated image or video. The novelty of this study lies in combining deep lear...
Article
Implementing a virtual reality skateboard which would be further expanded into the Metaverse, a cyberspace where multiple people can come together and interact virtually. Further implementation of this VR experience into the Metaverse will be conceptualized in the research paper published along with the project. The motive behind the project was to...
Article
Full-text available
Infection of leukemia in humans causes many complications in its later stages. It impairs bone marrow’s ability to produce blood. Morphological diagnosis of human blood cells is a well-known and well-proven technique for diagnosis in this case. The binary classification is employed to distinguish between normal and leukemiainfected cells. In additi...
Preprint
Full-text available
In the realm of biometrics, face recognition (F.R.) is one of the most exciting new developments. In the past decade, computer vision and artificial intelligence advancements have improved face recognition systems by several orders of magnitude. Many attacks can be launched against these systems, such as the low-cost and low-effort Presentation att...
Article
Full-text available
Audio Emotion Recognition (AER) is an important factor for Human Emotion Analysis with or without any visual aiding components. Such audio has different modular parameters, such as rhythm, tone, and pitch. However, emotions are highly complex, and the way they get delivered to human ears with preconceived emotions are then instantly understood by h...
Article
Full-text available
Pulmonary Embolism (PE) occurs when blood clots travel to the lungs from different parts of the body. It is amongst the most lethal cardio-respiratory diseases after stroke and heart attack. It occurs due to injury or inactivity due to Deep Vein Thrombosis (DVT). Over the last decade, the PE mortality rate has increased by 23%. Moreover, Vein Throm...
Article
Full-text available
Pose estimation of human activity recognition has been a keen area of interest in augmented reality experiences, gaming and robotics, animations, behavioral analysis, and more. One such exciting variant of pose estimation in the field of health and science is yoga pose estimation. This paper explores yoga pose estimation using deep learning network...
Article
Full-text available
Lyme disease diagnosis poses a significant challenge, with blood tests exhibiting an alarming inaccuracy rate of nearly 60% in detecting early-stage infections. As a result, there is an urgent need for improved diagnostic methods that can offer more accurate detection outcomes. To address this pressing issue, our study focuses on harnessing the pot...
Chapter
Digital recordings are now affordable, quick to make, and straightforward to upload on web platforms because of the widespread availability of video recording technology in smartphones and other digital devices. The truth of films published on social media cannot be trusted. With the latest tools used for video editing tools, movies may now be read...
Article
Full-text available
Sign language is a form of communication where people use bodily gestures, particularly those of hands and arms. This method of communication is put into motion when spoken communication is unattainable or disfavored. There are very few people who can translate sign language and readily understand them. It would be convenient for the hearing-impair...
Article
Full-text available
Reliable and timely crop-yield prediction and crop mapping are crucial for food security and decision making in the food industry and in agro-environmental management. The global coverage, rich spectral and spatial information and repetitive nature of remote sensing (RS) data have made them effective tools for mapping crop extent and predicting yie...
Article
Full-text available
People’s mental conditions are often reflected in their social media activity due to the internet's anonymity. Psychiatric issues are often detected through such activities and can be addressed in their early stages, potentially preventing the consequences of unattended mental disorders like depression and anxiety. In this paper, the authors have i...
Article
Full-text available
Feature selection and feature extraction have always been of utmost importance owing to their capability to remove redundant and irrelevant features, reduce the vector space size, control the computational time, and improve performance for more accurate classification tasks, especially in text categorization. These feature engineering techniques ca...
Article
Full-text available
Biometrics has been evolving as an exciting yet challenging area in the last decade. Though face recognition is one of the most promising biometrics techniques, it is vulnerable to spoofing threats. Many researchers focus on face liveness detection to protect biometric authentication systems from spoofing attacks with printed photos, video replays,...
Article
Full-text available
Contactless verification is possible with iris biometric identification, which helps prevent infections like COVID-19 from spreading. Biometric systems have grown unsteady and dangerous as a result of spoofing assaults employing contact lenses, replayed the video, and print attacks. The work demonstrates an iris liveness detection approach by utili...
Article
Full-text available
The intelligent transportation system, especially autonomous vehicles, has seen a lot of interest among researchers owing to the tremendous work in modern artificial intelligence (AI) techniques, especially deep neural learning. As a result of increased road accidents over the last few decades, significant industries are moving to design and develo...
Article
Full-text available
Diabetic retinopathy occurs due to long-term diabetes with changing blood glucose levels and has become the most common cause of vision loss worldwide. It has become a severe problem among the working-age group that needs to be solved early to avoid vision loss in the future. Artificial intelligence-based technologies have been utilized to detect a...
Article
Full-text available
With the advances in brain imaging, magnetic resonance imaging (MRI) is evolving as a popular radiological tool in clinical diagnosis. Deep learning (DL) methods can detect abnormalities in brain images without an extensive manual feature extraction process. Generative adversarial network (GAN)-synthesized images have many applications in this fiel...
Article
Full-text available
In this era of free and open-access satellite and spatial data, modern innovations in cloud computing and machine-learning algorithms (MLAs) are transforming how Earth-observation (EO) datasets are utilized for geological mapping. This study aims to exploit the potentialities of the Google Earth Engine (GEE) cloud platform using powerful MLAs. The...
Article
Full-text available
Speech recognition systems have become a unique human-computer interaction (HCI) family. Speech is one of the most naturally developed human abilities; speech signal processing opens up a transparent and hand-free computation experience. This paper aims to present a retrospective yet modern approach to the world of speech recognition systems. The d...
Article
Full-text available
Human ideas and sentiments are mirrored in facial expressions. They give the spectator a plethora of social cues, such as the viewer’s focus of attention, intention, motivation, and mood, which can help develop better interactive solutions in online platforms. This could be helpful for children while teaching them, which could help in cultivating a...
Article
Machine learning (ML) models have been extensively used in several geological applications. Owing to the increase in model complexity, interpreting the outputs becomes quite challenging. Shapley additive explanation (SHAP) measures the importance of each input attribute on the model’s output. This study implemented SHAP to estimate earthquake proba...
Article
Full-text available
COVID-19 patients require effective diagnostic methods, which are currently in short supply. In this study, we explained how to accurately identify the lung regions on the X-ray scans of such people’s lungs. Images from X-rays or CT scans are critical in the healthcare business. Image data categorization and segmentation algorithms have been develo...
Article
Full-text available
Machine learning (ML) has emerged as a critical enabling tool in the sciences and industry in recent years. Today’s machine learning algorithms can achieve outstanding performance on an expanding variety of complex tasks–thanks to advancements in technique, the availability of enormous databases, and improved computing power. Deep learning models a...
Article
Full-text available
In the recent decade, comprehensive research has been carried out in terms of promising biometrics modalities regarding humans’ physical features for person recognition. This work focuses on iris characteristics and traits for person identification and iris liveness detection. This study used five pre-trained networks, including VGG-16, Inceptionv3...
Chapter
Predicting rainfall is essential for assessing the impact of climatic and hydrological changes over a specific region, predicting natural disasters or day-to-day life. It is one of the most prominent, complex, and essential weather forecasting and meteorology tasks. In this chapter, long short-term memory network (LSTM), artificial neural network (...
Article
Full-text available
In the last decade, the proactive diagnosis of diseases with artificial intelligence and its aligned technologies has been an exciting and fruitful area. One of the areas in medical care where constant monitoring is required is cardiovascular diseases. Arrhythmia, one of the cardiovascular diseases, is generally diagnosed by doctors using Electroca...
Article
Full-text available
Epileptic seizures occur due to brain abnormalities that can indirectly affect patient’s health. It occurs abruptly without any symptoms and thus increases the mortality rate of humans. Almost 1% of world’s population suffers from epileptic seizures. Prediction of seizures before the beginning of onset is beneficial for preventing seizures by medic...
Article
Full-text available
With the advances in technology, assistive medical systems are emerging with rapid growth and helping healthcare professionals. The proactive diagnosis of diseases with artificial intelligence (AI) and its aligned technologies has been an exciting research area in the last decade. Doctors usually detect tuberculosis (TB) by checking the lungs’ X-ra...
Chapter
Any contamination in the human body can prompt changes in blood cell morphology and various parameters of cells. The minuscule images of blood cells are examined for recognizing the contamination inside the body with an expectation of maladies and variations from the norm. Appropriate segmentation of these cells makes the detection of a disease pro...
Article
Full-text available
Learning human languages is a difficult task for a computer. However, Deep Learning (DL) techniques have enhanced performance significantly for almost all-natural language processing (NLP) tasks. Unfortunately, these models cannot be generalized for all the NLP tasks with similar performance. NLU (Natural Language Understanding) is a subset of NLP...
Article
Full-text available
Iris biometric identification allows for contactless authentication, which helps to avoid the transmission of diseases like COVID-19. Biometric systems become unstable and hazardous due to spoofing attacks involving contact lenses, replayed video, cadaver iris, synthetic Iris, and printed iris. This work demonstrates the iris presentation attacks d...
Article
Full-text available
Facial expressions are mirrors of human thoughts and feelings. It provides a wealth of social cues to the viewer, including the focus of attention, intention, motivation, and emotion. It is regarded as a potent tool of silent communication. Analysis of these expressions gives a significantly more profound insight into human behavior. AI-based Facia...
Article
Full-text available
Iris biometric detection provides contactless authentication, preventing the spread of COVID-19-like contagious diseases. However, these systems are prone to spoofing attacks attempted with the help of contact lenses, replayed video, and print attacks, making them vulnerable and unsafe. This paper proposes the iris liveness detection (ILD) method t...
Article
Full-text available
Automated human pose estimation is evolving as an exciting research area in human activity detection. It includes sophisticated applications such as malpractice detection in the examination, distracted driving, gesture detection, etc., and requires robust and reliable pose estimation techniques. These applications help to map the attention of the u...
Article
Full-text available
Biometrics is progressively becoming vital due to vulnerabilities of traditional security systems leading to frequent security breaches. Biometrics is an automated device that studies human beings’ physiological and behavioral features for their unique classification. Iris-based authentication offers stronger, unique, and contactless identification...
Article
Full-text available
Intelligent vehicle technology has made tremendous progress due to Artificial Intelligence (AI) techniques. Accurate behavior prediction of surrounding traffic actors is essential for the safe and secure navigation of the intelligent vehicle. Minor misbehavior of these vehicles on the busy roads may lead to an accident. Due to this, there is a need...
Article
Full-text available
Neural Style Transfer (NST) is a class of software algorithms that allows us to transform scenes, change/edit the environment of a media with the help of a Neural Network. NST finds use in image and video editing software allowing image stylization based on a general model, unlike traditional methods. This made NST a trending topic in the entertain...

Network

Cited By