• Home
  • Vajira Thambawita
Vajira Thambawita

Vajira Thambawita
  • PhD
  • Research Scientist at Simula Metropolitan Center for Digital Engineering

I am looking for research studies generating synthetic data for the medical domain to tackle the data deficiency problem

About

138
Publications
56,728
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
2,669
Citations
Current institution
Simula Metropolitan Center for Digital Engineering
Current position
  • Research Scientist
Additional affiliations
January 2022 - March 2022
SimulaMet
Position
  • PostDoc Position
March 2022 - present
SimulaMet
Position
  • Research Scientist
January 2022 - present
SimulaMet
Position
  • PostDoc Position
Description
  • Machine learning in medical domain.
Education
March 2007 - March 2011
University of Peradeniya
Field of study
  • Computer Engineering

Publications

Publications (138)
Article
Full-text available
Background Wearables with integrated electrocardiogram (ECG) acquisition have made single-lead ECGs widely accessible to patients and consumers. However, the 12-lead ECG remains the gold standard for most clinical cardiac assessments. In this study, we developed a neural network to reconstruct 12-lead ECGs from single-lead and dual-lead ECGs, and e...
Article
Full-text available
Accurate embryo morphology assessment is essential in assisted reproductive technology for selecting the most viable embryo. Artificial intelligence has the potential to enhance this process. However, the limited availability of embryo data presents challenges for training deep learning models. To address this, we trained two generative models usin...
Article
Full-text available
This paper explores advancements in real-time talking-head generation, focusing on overcoming challenges in Audio Feature Extraction (AFE), which often introduces latency and limits responsiveness in real-time applications. To address these issues, we propose and implement a fully integrated system that replaces conventional AFE models with OpenAI’...
Article
Full-text available
In time-series data analysis, identifying anomalies is crucial for maintaining data integrity and ensuring accurate analyses and decision-making. Anomalies can compromise data quality and operational efficiency. The complexity of time-series data, with its temporal dependencies and potential non-stationarity, makes anomaly detection challenging but...
Preprint
Full-text available
Accurate embryo morphology assessment is essential in assisted reproductive technology for selecting the most viable embryo. Artificial intelligence has the potential to enhance this process. However, the limited availability of embryo data presents challenges for training deep learning models. To address this, we trained two generative models usin...
Preprint
Full-text available
This paper examines the integration of real-time talking-head generation for interviewer training, focusing on overcoming challenges in Audio Feature Extraction (AFE), which often introduces latency and limits responsiveness in real-time applications. To address these issues, we propose and implement a fully integrated system that replaces conventi...
Article
Objective: Evaluate popular explanation methods using heatmap visualizations to explain the predictions of deep neural networks for electrocardiogram (ECG) analysis and provide recommendations for selection of explanations methods. Materials and methods: A residual deep neural network was trained on ECGs to predict intervals and amplitudes. Nine...
Conference Paper
Full-text available
We introduce Kvasir-VQA, an extended dataset derived from the HyperKvasir and Kvasir-Instrument datasets, augmented with question-and-answer annotations to facilitate advanced machine learning tasks in Gastrointestinal (GI) diagnostics. This dataset comprises 6,500 annotated images spanning various GI tract conditions and surgical instruments, and...
Article
Rapid development in deep learning for image analysis inspired studies to focus on predicting cardiovascular risk using retinal fundus images. This scoping review aimed to identify and describe studies using retinal fundus images and deep learning to predict cardiovascular risk markers and diseases. We searched MEDLINE and Embase on 17 November 202...
Preprint
Full-text available
We introduce Kvasir-VQA, an extended dataset derived from the HyperKvasir and Kvasir-Instrument datasets, augmented with question-and-answer annotations to facilitate advanced machine learning tasks in Gastrointestinal (GI) diagnostics. This dataset comprises 6,500 annotated images spanning various GI tract conditions and surgical instruments, and...
Preprint
Full-text available
Background: Cardiovascular risk prediction models based on sociodemographic factors and traditional clinical measurements have received significant attention. With rapid development in deep learning for image analysis in the last decade and the well-known association between micro- and macrovascular complications, some recent studies focused on the...
Chapter
Full-text available
The ImageCLEF evaluation campaign was integrated with CLEF (Conference and Labs of the Evaluation Forum) for more than 20 years and represents a Multimedia Retrieval challenge aimed at evaluating the technologies for annotation, indexing, and retrieval of multimodal data. Thus, it provides information access to large data collections in usage scena...
Article
Full-text available
Polyps are well-known cancer precursors identified by colonoscopy. However, variability in their size, appearance, and location makes the detection of polyps challenging. Moreover, colonoscopy surveillance and removal of polyps are highly operator-dependent procedures and occur in a highly complex organ topology. There exists a high missed detectio...
Article
Full-text available
Cardiac arrest is a sudden loss of heart function with serious consequences. In developing countries, healthcare professionals use clinical documentation to track patient information. These data are used to predict the development of cardiac arrest. We published a dataset through open access to advance the research domain. While using this dataset,...
Article
Full-text available
Cells in living organisms are dynamic compartments that continuously respond to changes in their environment to maintain physiological homeostasis. While basal autophagy exists in cells to aid in the regular turnover of intracellular material, autophagy is also a critical cellular response to stress, such as nutritional depletion. Conversely, the d...
Chapter
Full-text available
This paper presents an overview of the ImageCLEF 2023 lab, which was organized in the frame of the Conference and Labs of the Evaluation Forum – CLEF Labs 2023. ImageCLEF is an ongoing evaluation event that started in 2003 and that encourage the evaluation of the technologies for annotation, indexing and retrieval of multimodal data with the goal o...
Preprint
Full-text available
Automatic analysis of colonoscopy images has been an active field of research motivated by the importance of early detection of precancerous polyps. However, detecting polyps during the live examination can be challenging due to various factors such as variation of skills and experience among the endoscopists, lack of attentiveness, and fatigue lea...
Article
Full-text available
Deep neural networks are complex machine learning models that have shown promising results in analyzing high-dimensional data such as those collected from medical examinations. Such models have the potential to provide fast and accurate medical diagnoses. However, the high complexity makes deep neural networks and their predictions difficult to und...
Article
Full-text available
A manual assessment of sperm motility requires microscopy observation, which is challenging due to the fast-moving spermatozoa in the field of view. To obtain correct results, manual evaluation requires extensive training. Therefore, computer-aided sperm analysis (CASA) has become increasingly used in clinics. Despite this, more data is needed to t...
Preprint
Full-text available
In order to take advantage of AI solutions in endoscopy diagnostics, we must overcome the issue of limited annotations. These limitations are caused by the high privacy concerns in the medical field and the requirement of getting aid from experts for the time-consuming and costly medical data annotation process. In computer vision, image synthesis...
Chapter
Full-text available
Public multimedia datasets can enhance knowledge discovery and model development as more researchers have the opportunity to contribute to exploring them. However, as these datasets become larger and more multimodal, besides analysis, efficient storage and sharing can become a challenge. Furthermore, there are inherent privacy risks when publishing...
Chapter
Head and neck cancers are the fifth most common cancer worldwide, and recently, analysis of Positron Emission Tomography (PET) and Computed Tomography (CT) images has been proposed to identify patients with a prognosis. Even though the results look promising, more research is needed to further validate and improve the results. This paper presents t...
Chapter
Full-text available
In this paper, we provide an overview of the upcoming ImageCLEF campaign. ImageCLEF is part of the CLEF Conference and Labs of the Evaluation Forum since 2003. ImageCLEF, the Multimedia Retrieval task in CLEF, is an ongoing evaluation initiative that promotes the evaluation of technologies for annotation, indexing, and retrieval of multimodal data...
Article
Full-text available
The interest in video anomaly detection systems that can detect different types of anomalies, such as violent behaviours in surveillance videos, has gained traction in recent years. The current approaches employ deep learning to perform anomaly detection in videos, but this approach has multiple problems. For example, deep learning in general has i...
Preprint
Full-text available
Nowadays, most people have a smartphone that can track their everyday activities. Furthermore, a significant number of people wear advanced smartwatches to track several vital biomarkers in addition to activity data. However, it is still unclear how these data can actually be used to improve certain aspects of people's lives. One of the key challen...
Chapter
Full-text available
The in vitro fertilization procedure called intracytoplasmic sperm injection can be used to help fertilize an egg by injecting a single sperm cell directly into the cytoplasm of the egg. In order to evaluate, refine and improve the method in the fertility clinic, the procedure is usually observed at the clinic. Alternatively, a video of the procedu...
Preprint
Full-text available
The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in...
Preprint
Full-text available
Manually analyzing spermatozoa is a tremendous task for biologists due to the many fast-moving spermatozoa, causing inconsistencies in the quality of the assessments. Therefore, computer-assisted sperm analysis (CASA) has become a popular solution. Despite this, more data is needed to train supervised machine learning approaches in order to improve...
Preprint
Full-text available
Head and neck cancers are the fifth most common cancer worldwide, and recently, analysis of Positron Emission Tomography (PET) and Computed Tomography (CT) images has been proposed to identify patients with a prognosis. Even though the results look promising, more research is needed to further validate and improve the results. This paper presents t...
Conference Paper
Early identification of a polyp in the lower gastrointestinal (GI) tract can lead to prevention of life-threatening colorectal cancer. Developing computer-aided diagnosis (CAD) systems to detect polyps can improve detection accuracy and efficiency and save the time of the domain experts called endoscopists. Lack of annotated data is a common challe...
Preprint
Full-text available
Cardiac arrest is a sudden loss of heart function with serious consequences. In developing countries, healthcare professionals use clinical documentation to track patient information. These data can be used to predict developing cardiac arrests. We are publishing the data set through open access to advance the research domain. Based on this data se...
Article
Study question Can real-time deep learning model track hundreds of spermatozoa simultaneously? Summary answer The state-of-the-art deep learning detection model YOLOv5 shows possibilities of multi-sperm tracking with high sensitivity and precision. What is known already Computer-aided sperm analysis (CASA) systems can be used for the evaluation o...
Article
Full-text available
Study question Can deep learning be used to detect and track spermatozoa and the different parts of an ICSI procedure? Summary answer Deep learning can be used as a tool to assist and organize the contents of an ICSI procedure. What is known already Sperm tracking has been a topic of research and practice for many years, especially in the context...
Article
Full-text available
Study question Can tracking cell division and predicting human embryo cleavage stages be automated in time-lapse videos (TLV) using AI object detection methods? Summary answer We developed software predicting blastomere count and tracking cell cleavages up until 4-5 stage. The software employs object detection technique called YOLOv5 to detect cel...
Article
Full-text available
When responding to allegations of child sexual, physical, and psychological abuse, Child Protection Service (CPS) workers and police personnel need to elicit detailed and accurate accounts of the abuse to assist in decision-making and prosecution. Current research emphasizes the importance of the interviewer’s ability to follow empirically based gu...
Preprint
Full-text available
Generalizability is seen as one of the major challenges in deep learning, in particular in the domain of medical imaging, where a change of hospital or in imaging routines can lead to a complete failure of a model. To tackle this, we introduce Consistency Training, a training procedure and alternative to data augmentation based on maximizing models...
Preprint
Full-text available
The interest for video anomaly detection systems has gained traction for the past few years. The current approaches use deep learning to perform anomaly detection in videos, but this approach has multiple problems. For starters, deep learning in general has issues with noise, concept drift, explainability, and training data volumes. Additionally, a...
Preprint
Full-text available
Early identification of a polyp in the lower gastrointestinal (GI) tract can lead to prevention of life-threatening colorectal cancer. Developing computer-aided diagnosis (CAD) systems to detect polyps can improve detection accuracy and efficiency and save the time of the domain experts called endoscopists. Lack of annotated data is a common challe...
Article
Introduction Artificial intelligence (AI) is a tool thought to revolutionize the field of reproductive medicine in the years to come. Specifically, machine learning (ML), which is a subset of AI methods used to detect patterns and make predictions based on large datasets, has been used in ART to predict implantation outcome, embryo transfer strateg...
Article
Full-text available
Analyzing medical data to find abnormalities is a time-consuming and costly task, particularly for rare abnormalities, requiring tremendous efforts from medical experts. Therefore, artificial intelligence has become a popular tool for the automatic processing of medical data, acting as a supportive tool for doctors. However, the machine learning mo...
Article
Full-text available
Clinicians and software developers need to understand how proposed machine learning (ML) models could improve patient care. No single metric captures all the desirable properties of a model, which is why several metrics are typically reported to summarize a model’s performance. Unfortunately, these measures are not easily understandable by many cli...
Preprint
Full-text available
Polyps are well-known cancer precursors identified by colonoscopy. However, variability in their size, location, and surface largely affect identification, localisation, and characterisation. Moreover, colonoscopic surveillance and removal of polyps (referred to as polypectomy ) are highly operator-dependent procedures. There exist a high missed de...
Preprint
Full-text available
Soccer has a considerable market share of the global sports industry, and the interest in viewing videos from soccer games continues to grow. In this respect, it is important to provide game summaries and highlights of the main game events. However, annotating and producing events and summaries often require expensive equipment and a lot of tedious...
Article
Full-text available
Widely used traditional supervised deep learning methods require a large number of training samples but often fail to generalize on unseen datasets. Therefore, a more general application of any trained model is quite limited for medical imaging for clinical practice. Using separately trained models for each unique lesion category or a unique patien...
Chapter
Full-text available
The holy grail in endoscopy examinations has for a long time been assisted diagnosis using Artificial Intelligence (AI). Recent developments in computer hardware are now enabling technology to equip clinicians with promising tools for computer-assisted diagnosis (CAD) systems. However, creating viable models or architectures, training them, and ass...
Chapter
Full-text available
Contact tracing applications generally rely on Bluetooth data. This type of data works well to determine whether a contact occurred (smartphones were close to each other) but cannot offer the contextual information GPS data can offer. Did the contact happen on a bus? In a building? And of which type? Are some places recurrent contact locations? By...
Article
Full-text available
Detecting events in videos is a complex task, and many different approaches, aimed at a large variety of use-cases, have been proposed in the literature. Most approaches, however, are unimodal and only consider the visual information in the videos. This paper presents and evaluates different approaches based on neural networks where we combine visu...
Thesis
Full-text available
Recent advancements in technology have made artificial intelligence (AI) a popular tool in the medical domain, especially machine learning (ML) methods, which is a subset of AI. In this context, a goal is to research and develop generalizable and well-performing ML models to be used as the main component in computer-aided diagnosis (CAD) systems. H...
Article
Full-text available
The current gold standard for extracting highlight clips from soccer games is the use of manual annotations and clippings, where human operators define the start and end of an event and trim away the unwanted scenes. This is a tedious, time-consuming, and expensive task, to the extent of being rendered infeasible for use in lower league games. In t...
Article
Full-text available
Recent trials have evaluated the efficacy of deep convolutional neural network (CNN)-based AI systems to improve lesion detection and characterization in endoscopy. Impressive results are achieved, but many medical studies use a very small image resolution to save computing resources at the cost of losing details. Today, no conventions between reso...
Article
Full-text available
Recent global developments underscore the prominent role big data have in modern medical science. But privacy issues constitute a prevalent problem for collecting and sharing data between researchers. However, synthetic data generated to represent real data carrying similar information and distribution may alleviate the privacy issue. In this study...
Article
Full-text available
MedAI: Transparency in Medical Image Segmentation is a challenge held for the first time at the Nordic AI Meet that focuses on medical image segmentation and transparency in machine learning (ML)-based systems. We propose three tasks to meet specific gastrointestinal image segmentation challenges collected from experts within the field, including t...
Article
Full-text available
Study question Can artificial intelligence (AI) algorithms identify spermatozoa in a semen sample without using training data annotated by professionals? Summary answer Unsupervised AI methods can discriminate the spermatozoon from other cells and debris. These unsupervised methods may have a potential for several applications in reproductive medi...
Preprint
Full-text available
Detection of colon polyps has become a trending topic in the intersecting fields of machine learning and gastrointestinal endoscopy. The focus has mainly been on per-frame classification. More recently, polyp segmentation has gained attention in the medical community. Segmentation has the advantage of being more accurate than per-frame classificati...
Preprint
Full-text available
Processing medical data to find abnormalities is a time-consuming and costly task, requiring tremendous efforts from medical experts. Therefore, Ai has become a popular tool for the automatic processing of medical data, acting as a supportive tool for doctors. AI tools highly depend on data for training the models. However, there are several constr...
Conference Paper
Full-text available
Detection of colon polyps has become a trending topic in the intersecting fields of machine learning and gastrointestinal endoscopy. The focus has mainly been on per-frame classification. More recently, polyp segmentation has gained attention in the medical community. Segmentation has the advantage of being more accurate than per-frame classificati...
Preprint
Full-text available
Classical supervised methods commonly used often suffer from the requirement of an abudant number of training samples and are unable to generalize on unseen datasets. As a result, the broader application of any trained model is very limited in clinical settings. However, few-shot approaches can minimize the need for enormous reliable ground truth l...
Article
Developing systems for the automatic detection of events in video is a task which has gained attention in many areas including sports. More specifically, event detection for soccer videos has been studied widely in the literature. However, there are still a number of shortcomings in the state-of-the-art such as high latency, making it challenging t...
Article
Full-text available
Artificial intelligence (AI) is predicted to have profound effects on the future of video capsule endoscopy (VCE) technology. The potential lies in improving anomaly detection while reducing manual labour. Existing work demonstrates the promising benefits of AI-based computer-assisted diagnosis systems for VCE. They also show great potential for im...
Article
Full-text available
Deep learning-based tools may annotate and interpret medical data more quickly, consistently, and accurately than medical doctors. However, as medical doctors are ultimately responsible for clinical decision-making, any deep learning-based prediction should be accompanied by an explanation that a human can understand. We present an approach called...
Preprint
Full-text available
Big data is needed to implement personalized medicine, but privacy issues are a prevalent problem for collecting data and sharing them between researchers. A solution is synthetic data generated to represent real dataset carrying similar information. Here, we present generative adversarial networks (GANs) capable of generating realistic synthetic D...
Preprint
Full-text available
Clinicians and model developers need to understand how proposed machine learning (ML) models could improve patient care. In fact, no single metric captures all the desirable properties of a model and several metrics are typically reported to summarize a model's performance. Unfortunately, these measures are not easily understandable by many clinici...
Chapter
Full-text available
The EndoTect challenge at the International Conference on Pattern Recognition 2020 aims to motivate the development of algorithms that aid medical experts in finding anomalies that commonly occur in the gastrointestinal tract. Using HyperKvasir, a large dataset containing images taken from several endoscopies, the participants competed in three tas...
Article
Full-text available
Gastrointestinal endoscopy has been an active field of research motivated by the large number of highly lethal GI cancers. Early Gastrointestinal (GI) cancer precursors are often missed during the endoscopic surveillance. The high missed rate of such abnormalities during endoscopy is thus a critical bottleneck. Lack of attentiveness due to tiring p...
Chapter
In this paper, we present HTAD: A Home Tasks Activities Dataset. The dataset contains wrist-accelerometer and audio data from people performing at-home tasks such as sweeping, brushing teeth, washing hands, or watching TV. These activities represent a subset of activities that are needed to be able to live independently. Being able to detect activi...
Chapter
Full-text available
Gastrointestinal (GI) pathologies are periodically screened, biopsied, and resected using surgical tools. Usually, the procedures and the treated or resected areas are not specifically tracked or analysed during or after colonoscopies. Information regarding disease borders, development, amount, and size of the resected area get lost. This can lead...
Preprint
Full-text available
Deep learning-based tools may annotate and interpret medical tests more quickly, consistently, and accurately than medical doctors. However, as medical doctors remain ultimately responsible for clinical decision-making, any deep learning-based prediction must necessarily be accompanied by an explanation that can be interpreted by a human. In this s...
Preprint
Full-text available
Segmentation of findings in the gastrointestinal tract is a challenging but also an important task which is an important building stone for sufficient automatic decision support systems. In this work, we present our solution for the Medico 2020 task, which focused on the problem of colon polyp segmentation. We present our simple but efficient idea...
Conference Paper
Data is arguably the most important resource today as it fuels the algorithms powering services we use every day. However, in fields like medicine, publicly available datasets are few, and labeling medical datasets require tedious efforts from trained specialists. Generated synthetic data can be to future successful healthcare clinical intelligence...
Article
Full-text available
Artificial intelligence is currently a hot topic in medicine. However, medical data is often sparse and hard to obtain due to legal restrictions and lack of medical personnel for the cumbersome and tedious process to manually label training data. These constraints make it difficult to develop systems for automatic analysis, like detecting disease o...

Questions

Questions (7)
Question
Most of GAN architectures are unstable in training stage for generating a large scale images? What is the main reason for this? Is it gradient vanishing problem ? or any other reason?
Question
Tensor cores are powerful units in NVIDIA Volta GPUs. What are the technological differences between normal CUDA cores and Tensor cores?
Question
I have seen that most of researchers are using Python for machine learning. What is the reason?
For machine learning, we are expecting good performance. However, Python is not a good programming language for parallel computing? Isn't it? Then easiness is the only reason for python. Isn't it?
Question
I want to know deep details about Nvidia Kepler cache architecture. That means, Is it 2 way, 8 way associative, write back or write through, cache line sizes , how to mange cache for multiple threads ..etc.
Question
I compiled my cuda code (Aho-corasick algorithm) for kepler architecture (3.5) and it works. But when I compiled for Fermi architecture (2.0) it doesn't work? why?
Question
I have to expand the memory using a cuda kernel? (like realloc in C).
Question
I have run following code 1 and it returned some out put without error. But when I run same code with cudaMallocManaged() in code 2, it returned unhadled memory exception error (like segmentation fault in linux) in my
visual studio 2012. Please tell me what is the reason for this.
code 1
+++++++++++++
int main(){
int* data;
int* data1;
char* c1;
char* c1_d;
char* c2;
char* c2_d;
char c;
cudaMalloc((void **)&c1_d,sizeof(char)*100);
cudaMalloc((void **)&c2_d,sizeof(char)*100);
c2=(char *)malloc(sizeof(char)*100);
c1="hellloooo";
cudaMemcpy(c1_d,c1,sizeof(char)*100,cudaMemcpyHostToDevice);
kernel<<<1,100>>>(c1_d,c2_d);
cudaDeviceSynchronize();
cudaMemcpy(c2,c2_d,sizeof(char)*100,cudaMemcpyDeviceToHost);
for (int i=0;i<100;i++)
{
printf("%c",c2[i]);
}
printf("\n");
getchar();
return 0;
}
__global__ void kernel(char* c1,char* c2){
int i=threadIdx.x;
c2[i]=c1[i];
__syncthreads();
}
code 2 (this will return error. What is the reason)
==========
int main(){
char* c1;
char* c2;
cudaMallocManaged((void **)c1, sizeof(int)*100);
cudaMallocManaged((void **)c2, sizeof(int)*100);
c1="hellloooo";
kernel<<<1,100>>>(data,data1,c1_d,c2_d);
cudaDeviceSynchronize();
printf("\n");
for (int i=0;i<100;i++)
{
printf("%c",c2[i]);
}
printf("\n");
getchar();
return 0;
}
__global__ void kernel(int* d,int * d1,char* c1,char* c2){
int i=threadIdx.x;
c2[i]=c1[i];
__syncthreads();

Network

Cited By