
Osmar R. ZaïaneUniversity of Alberta | UAlberta · Department of Computing Science
Osmar R. Zaïane
PhD
About
480
Publications
110,510
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
12,684
Citations
Citations since 2017
Introduction
Additional affiliations
July 1999 - present
Publications
Publications (480)
Computer-aided lung cancer diagnosis (CAD) system on computed tomography (CT) helps radiologists guide preoperative planning and prognosis assessment. The flexibility and scalability of deep learning methods are limited in lung CAD. In essence, two significant challenges to be solved are (1) Label scarcity due to cost annotations of CT images by ex...
We present an empirical analysis of basic and depression specific multi-emotion mining in Tweets, using state of the art multi-label classifiers. We choose our basic emotions from a hybrid emotion model consisting of the commonly identified emotions from four highly regarded psychological models. Moreover, we augment that emotion model with new emo...
This work tackles the problem of unsupervised modeling and extraction of the main contrastive sentential reasons conveyed by divergent viewpoints on polarized issues. It proposes a pipeline approach centered around the detection and clustering of phrases, assimilated to argument facets using a novel Phrase Author Interaction Topic-Viewpoint model....
Modeling the dynamics characteristics in functional brain networks (FBNs) is important for understanding the functional mechanism of the human brain. However, the current works do not fully consider the potential complex spatial and temporal correlations in human brain. To solve this problem, we propose a temporal graph representation learning fram...
Medical image segmentation is a challenging task due to the high variation in shape, size and position of infections or lesions in medical images. It is necessary to construct multi-scale representations to capture image contents from different scales. However, it is still challenging for U-Net with a simple skip connection to model the global mult...
Retinal fundus images have been applied for the diagnosis and screening of eye diseases, such as Diabetic Retinopathy (DR) or Diabetic Macular Edema (DME). However, both low-quality fundus images and style inconsistency potentially increase uncertainty in the diagnosis of fundus disease and even lead to misdiagnosis by ophthalmologists. Most of the...
High-quality pseudo labels are essential for semi-supervised semantic segmentation. Consistency regularization and pseudo labeling-based semi-supervised methods perform co-training using the pseudo labels from multi-view inputs. However, such co-training models tend to converge early to a consensus during training, so that the models degenerate to...
The goal of information-seeking dialogue is to respond to seeker queries with natural language utterances that are grounded on knowledge sources. However, dialogue systems often produce unsupported utterances, a phenomenon known as hallucination. To mitigate this behavior, we adopt a data-centric solution and create FaithDial, a new benchmark for h...
The most popular topic modelling algorithm, Latent Dirichlet Allocation, produces a simple set of topics. However, topics naturally exist in a hierarchy with larger, more general super-topics and smaller, more specific sub-topics. We develop a novel topic modelling algorithm, Community Topic, that mines communities from word co-occurrence networks...
Association rule mining can be a powerful computational tool for exploring complex interactions between high-dimensional exposures and health outcomes. Given the high-dimensional nature of the data, many complex association rules may be identified. To narrow down on the most important rules for hypothesis-generating and future investigation in the...
Alzheimer’s disease (AD) is highly prevalent and a significant cause of dementia and death in elderly individuals. Motivated by breakthroughs of multi-task learning (MTL), efforts have been made to extend MTL to improve the Alzheimer’s disease cognitive score prediction by exploiting structure correlation. Though important and well-studied, three k...
PurposeFinding the biomarkers associated with autism spectrum disorder (ASD) is helpful for understanding the underlying roots of the disorder and can lead to earlier diagnosis and more targeted treatments. In essence, we are faced with two challenges (i) how to learn a node representation and a clean graph structure from original graph data with h...
Deep neural networks architecture provides a pow- erful technique for solving various problems including clas- sification. They owe their performance to the complex and layered data representation and processing built upon neural networks. The success of deep neural networks in various fields has resulted in less focus on other techniques like rule...
We describe the development of a model to detect user-level clinical depression based on a user's temporal social media posts. Our model uses a Depression Symptoms Detection (DSD) model, which is trained on the largest existing samples of clinician annotated tweets for clinical depression symptoms. We subsequently use our DSD model to extract clini...
A fundamental component of user-level social media language based clinical depression modelling is depression symptoms detection (DSD). Unfortunately, there does not exist any DSD dataset that reflects both the clinical insights and the distribution of depression symptoms from the samples of self-disclosed depressed population. In our work, we desc...
Identifying fake news is a very difficult task, especially when considering the multiple modes of conveying information through text, image, video and/or audio. We attempted to tackle the problem of automated misinformation/disinformation detection in multi-modal news sources (including text and images) through our simple, yet effective, approach i...
Recently, functional brain networks have been employed for classifying neurological disorders, such as autism spectrum disorders (ASDs). Graph convolutional networks (GCNs) have been shown to be successful in modeling applications with graph structures. However, brain network data is in general of complex structure with small sample size, and the u...
The diagnosis of chest diseases is a challenging task for assessing thousands of radiology subjects. Their diagnosis decisions heavily rely on the expert radiologists' manual annotations. It is important to develop automated analysis methods for the computer-aided diagnosis of chest diseases on chest radiography. To explore the label relationship a...
Algorithmic trading, due to its inherent nature, is a difficult problem to tackle; there are too many variables involved in the real-world which makes it almost impossible to have reliable algorithms for automated stock trading. The lack of reliable labelled data that considers physical and physiological factors that dictate the ups and downs of th...
How do we perform efficient inference while retaining high translation quality? Existing neural machine translation models, such as Transformer, achieve high performance, but they decode words one by one, which is inefficient. Recent non-autoregressive translation models speed up the inference, but their quality is still inferior. In this work, we...
Most recent semantic segmentation methods adopt a U-Net framework with an encoder-decoder architecture. It is still challenging for U-Net with a simple skip connection scheme to model the global multi-scale context: 1) Not each skip connection setting is effective due to the issue of incompatible feature sets of encoder and decoder stage, even some...
The most common Named Entity Recognizers are usually sequence taggers trained on fully annotated corpora, i.e. the class of all words for all entities is known. Partially annotated corpora, i.e. some but not all entities of some types are annotated, are too noisy for training sequence taggers since the same entity may be annotated one time with its...
The dynamic functional connectivity analysis provides valuable information for understanding functional brain activity underlying different cognitive processes. Modeling spatio-temporal dynamics in functional brain networks is critical for underlying the functional mechanism of autism spectrum disorder (ASD). In our study, we propose a machine lear...
Object detection is a fundamental problem in computer vision. Although impressive results have been achieved on large/medium-sized objects, the detection performance of small objects remains a challenging task. Automatic ship detection on remote sensing images is an important module in maritime surveillance system, and it is challenging due to the...
The goal of information-seeking dialogue is to respond to seeker queries with natural language utterances that are grounded on knowledge sources. However, dialogue systems often produce unsupported utterances, a phenomenon known as hallucination. Dziri et al. (2022)'s investigation of hallucinations has revealed that existing knowledge-grounded ben...
The most common Named Entity Recognizers are usually sequence taggers trained on fully annotated corpora, i.e. the class of all words for all entities is known. Partially annotated corpora, i.e. some but not all entities of some types are annotated, are too noisy for training sequence taggers since the same entity may be annotated one time with its...
Knowledge-grounded conversational models are known to suffer from producing factually invalid statements, a phenomenon commonly called hallucination. In this work, we investigate the underlying causes of this phenomenon: is hallucination due to the training data, or to the models? We conduct a comprehensive human study on both existing knowledge-gr...
To effectively train accurate Relation Extraction models, sufficient and properly labeled data is required. Adequately labeled data is difficult to obtain and annotating such data is a tricky undertaking. Previous works have shown that either accuracy has to be sacrificed or the task is extremely time-consuming, if done accurately. We are proposing...
Early detection and treatment of diabetic retinopathy (DR) can significantly reduce the risk of vision loss in patients. In essence, we are faced with two challenges: (i) how to simultaneously achieve domain adaptation from the different domains and (ii) how to build an interpretable multi-instance learning (MIL) on the target domain in an end-to-e...
Purpose:
Accurate diagnosis of autism spectrum disorder (ASD) plays a key role in improving the condition and quality of life for patients. In this study, we mainly focus on ASD diagnosis with functional brain networks (FBNs). The major challenge for brain networks modeling is the high dimensional connectivity in brain networks and limited number...
Identifying fake news is a very difficult task, especially when considering the multiple modes of conveying information through text, image, video and/or audio. We attempted to tackle the problem of automated misinformation/disinformation detection in multi-modal news sources (including text and images) through our simple, yet effective, approach i...
Purpose
Recently, functional brain networks (FBN) have been used for the classification of neurological disorders, such as Autism Spectrum Disorders (ASD). Neurological disorder diagnosis with FBN is a challenging task due to the high heterogeneity in subjects and the noise correlations in brain networks. Meanwhile, it is challenging for the existi...
Exposure to pollution in the environment is a major contributor to disease globally and is a topic of great significance. There remains, however, a dearth of knowledge about the levels and distribution of airborne pollutants in the environment, along with how exposure to complex mixtures of airborne chemicals impacts health outcomes. Recent collabo...
Diabetic retinopathy (DR) is one of the most serious complications of diabetes and is a prominent cause of permanent blindness. However, the low-quality fundus images increase the uncertainty of clinical diagnosis, resulting in a significant decrease on the grading performance of the fundus images. Therefore, enhancing the image quality is essentia...
Twitter is used to provide location-relevant information and event updates. It is important to identify location-relevant tweets in order to harness location-relevant information and event updates from Twitter. However, the identification of location-relevant tweets is a challenging problem as the location names are not always explicit. Instead, mo...
Alzheimer's disease (AD) is a gradually progressive neurodegenerative disease affecting cognition functions. Predicting the cognitive scores from neuroimage measures and identifying relevant imaging biomarkers are important research topics in the study of AD. Despite machine learning algorithms having many successful applications, the prediction mo...
The rapid growth of research in explainable artificial intelligence (XAI) follows on two substantial developments. First, the enormous application success of modern machine learning methods, especially deep and reinforcement learning, have created high expectations for industrial, commercial, and social value. Second, the emerging and growing conce...
Accurate diagnosis of psychiatric disorders plays a critical role in improving the quality of life for patients and potentially supports the development of new treatments. Graph convolutional networks (GCNs) are shown to be successful in modeling applications with graph structures. However, training an accurate GCNs model for brain networks faces s...
Most major events are often accompanied by misinformation on online Social Networking platforms. Due to its nature, the COVID-19 pandemic was bound to lead to an explosion of information online, much of it false or misleading. This information explosion, termed ``infodemic'' by the World Health Organization (WHO), has revealed the need for automati...
How do we perform efficient inference while retaining high translation quality? Existing neural machine translation models, such as Transformer, achieve high performance, but they decode words one by one, which is inefficient. Recent non-autoregressive translation models speed up the inference, but their quality is still inferior. In this work, we...
Alzheimer's disease (AD) is a severe neurodegenerative disorder that usually starts slowly and progressively worsens. Predicting the progression of Alzheimer's disease with longitudinal analysis on the time series data has recently received increasing attention. However, training an accurate progression model for brain network faces two major chall...
Explicitly modeling emotions in dialogue generation has important applications, such as building empathetic personal companions. In this study, we consider the task of expressing a specific emotion for dialogue generation. Previous approaches take the emotion as an input signal, which may be ignored during inference. We instead propose a search-bas...
Most recent semantic segmentation methods adopt a U-Net framework with an encoder-decoder architecture. It is still challenging for U-Net with a simple skip connection scheme to model the global multi-scale context: 1) Not each skip connection setting is effective due to the issue of incompatible feature sets of encoder and decoder stage, even some...
Association rule mining can be a powerful computational tool for exploring complex interactions between high-dimensional exposures and health outcomes. Given the high-dimensional nature of the data, many complex association rules may be identified. To narrow down on the most important rules for hypothesis-generating and future investigation in the...
Social network analysis encompasses the study of networked data and examines questions related to structures and patterns that can lead to the understanding of the data and the intrinsic relationships, such as identifying influential nodes, recognizing critical paths, predicting unobserved relationships, discovering communities, etc. All of these a...
BACKGROUND
Chatbots have been increasingly considered for applications in the health and social care fields. Currently, it remains unclear how a chatbot could engage users coping with complex health needs, such as parents of individuals with neurodevelopmental disorders (NDDs), who often need frequent and ongoing support. One approach to enhancing...
Background:
Chatbots have been increasingly considered for applications in the health care field. However, it remains unclear how a chatbot can assist users with complex health needs, such as parents of children with neurodevelopmental disorders (NDDs) who need ongoing support. Often, this population must deal with complex and overwhelming health...
Lung cancer is one of the most common and deadly malignant cancers. Accurate lung tumor segmentation from CT is therefore very important for correct diagnosis and treatment planning. The automated lung tumor segmentation is challenging due to the high variance in appearance and shape of the targeting tumors. To overcome the challenge, we present an...
We analyze the process of creating word embedding feature representations designed for a learning task when annotated data is scarce, for example, in depressive language detection from Tweets. We start with a rich word embedding pre-trained from a large general dataset, which is then augmented with embeddings learned from a much smaller and more sp...
We focus on exploring various approaches of Zero-Shot Learning (ZSL) and their explainability for a challenging yet important supervised learning task notorious for training data scarcity, i.e. Depression Symptoms Detection (DSD) from text. We start with a comprehensive synthesis of different components of our ZSL modeling and analysis of our groun...
In this paper, we propose a globally normalized model for context-free grammar (CFG)-based semantic parsing. Instead of predicting a probability, our model predicts a real-valued score at each step and does not suffer from the label bias problem. Experiments show that our approach outperforms locally normalized models on small datasets, but it does...
In this paper, we present empirical analysis on basic and depression specific multi-emotion mining in Tweets with the help of state of the art multi-label classifiers. We choose our basic emotions from a hybrid emotion model consisting of the common emotions from four highly regarded psychological models of emotions. Moreover, we augment that emoti...
Dialogue systems powered by large pre-trained language models (LM) exhibit an innate ability to deliver fluent and natural-looking responses. Despite their impressive generation performance, these models can often generate factually incorrect statements impeding their widespread adoption. In this paper, we focus on the task of improving the faithfu...
We propose an efficient model-based clustering approach for creating Gaussian Mixture Models from finite datasets. Models are extracted from HDBSCAN* hierarchies using the Classification Likelihood and the Expectation Maximization algorithm. Prior knowledge of the number of components of the model, corresponding to the number of clusters, is not ne...
Full article available at https://doi.org/10.5220/0010325806370644
Purpose
Recently, brain connectivity networks have been used for the classification of neurological disorder, such as Autism Spectrum Disorders (ASD) or Alzheimer's disease (AD). Network analysis provides a new way for exploring the association between brain functional deficits and the underlying structural disruption related to brain disorders. Ne...
Background
Canada is an ethnically-diverse country, yet its lack of ethnicity information in many large databases impedes effective population research and interventions. Automated ethnicity classification using machine learning has shown potential to address this data gap but its performance in Canada is largely unknown. This study conducted a lar...
In this paper, we design a simple yet powerful deep network architecture, U²-Net, for salient object detection (SOD). The architecture of our U²-Net is a two-level nested U-structure. The design has the following advantages: (1) it is able to capture more contextual information from different scales thanks to the mixture of receptive fields of diff...
Purpose The Work Assessment Triage Tool (WATT) is a clinical decision support tool developed using machine learning to help select interventions for patients with musculoskeletal disorders. The WATT categorizes patients based on individual characteristics according to likelihood of successful return to work following rehabilitation. A previous vali...
Purpose
In this paper, the challenging and thorny issue of assessing graduate attributes (GAs) is addressed. An interdisciplinary team at The University of Alberta ----developed a formative model of assessment centered on students and instructor interaction with course content.
Design/methodology/approach
The paper starts by laying the theoretical...
With the huge success of deep learning, other machine learning paradigms have had to take back seat. Yet other models, particularly rule-based, are more readable and explainable and can even be competitive when labelled data is not abundant. However, most of the existing rule-based classifiers suffer from the production of a large number of classif...
The power of associative classifiers is to determine patterns from the data and perform classification based on the features that are most indicative of prediction. Although they have emerged as competitive classification systems, associative classifiers suffer from limitations such as cumbersome thresholds requiring prior knowledge which varies wi...
Rapid development in computer technology has led to sophisticated methods of analyzing large datasets with the aim of improving human decision making. Artificial Intelligence and Machine Learning (ML) approaches hold tremendous potential for solving complex real-world problems such as those faced by stakeholders attempting to prevent work disabilit...
With the huge success of deep learning, other machine learning paradigms have had to take back seat. Yet other models, particularly rule-based, are more readable and explainable and can even be competitive when labelled data is not abundant. However, most of the existing rule-based classifiers suffer from the production of a large number of classif...
In this paper, we describe our mUlti-task learNIng for cOmmonsense reasoNing (UNION) system submitted for Task C of the SemEval2020 Task 4, which is to generate a reason explaining why a given false statement is non-sensical. However, we found in the early experiments that simple adaptations such as fine-tuning GPT2 often yield dull and non-informa...
In this paper, we design a simple yet powerful deep network architecture, U$^2$-Net, for salient object detection (SOD). The architecture of our U$^2$-Net is a two-level nested U-structure. The design has the following advantages: (1) it is able to capture more contextual information from different scales thanks to the mixture of receptive fields o...