Ioannis Katakis

Ioannis Katakis
  • University of Nicosia

About

63
Publications
64,436
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
8,683
Citations
Introduction
Skills and Expertise
Current institution
University of Nicosia

Publications

Publications (63)
Article
Aspect terms extraction (ATE), a key subtask for aspect-based sentiment analysis, opinion summarization, and topic modeling aims at extracting grammatical elements (nouns, phrases, and adjectives) from user reviews that reveal the discussed features of the entity under review. These aspect terms are usually the targets of the opinions expressed. Id...
Article
Full-text available
Central nervous system diseases (CNSDs) lead to significant disability worldwide. Mobile app interventions have recently shown the potential to facilitate monitoring and medical management of patients with CNSDs. In this direction, the characteristics of the mobile apps used in research studies and their level of clinical effectiveness need to be e...
Chapter
This paper presents Zenon, an affective, multi-modal conversational agent (chatbot) specifically designed for treatment of brain diseases like multiple sclerosis and stroke. Zenon collects information from patients in a non-intrusive way and records user sentiment using two different modalities: text and video. A user-friendly interface is designed...
Article
Sentiment analysis is a fast-accelerating discipline that develops algorithms for knowledge discovery from opinionated content. The challenges however, when it comes to analyzing user reviews are plenty. Bad-quality, informal use of language and lack of labels, are only a few obstacles. Most importantly, users, consciously or subconsciously, use di...
Article
Full-text available
This is the french translation of "Tsoumakas, G., & Katakis, I. (2007). Multi-label classification: An overview. International Journal of Data Warehousing and Mining (IJDWM), 3(3), 1-13. 10.4018/jdwm.2007070101"
Article
Full-text available
Machine Learning (ML) is now becoming a key driver empowering the next generation of drone technology and extending its reach to applications never envisioned before. Examples include precision agriculture, crowd detection, and even aerial supply transportation. Testing drone projects before actual deployment is usually performed via robotic simula...
Article
Full-text available
As forecasting becomes more and more appreciated in situations and activities of everyday life that involve prediction and risk assessment, more methods and solutions make their appearance in this exciting arena of uncertainty. However, less is known about what makes a promising or a poor forecast. In this article, we provide a multi-factor analysi...
Article
This commentary introduces a correlation analysis of the top-10 ranked forecasting methods that participated in the M4 forecasting competition. The M competitions attempt to promote and advance research in the field of forecasting by inviting both industry and academia to submit forecasting algorithms to be evaluated over a large corpus of real-wor...
Conference Paper
The ability to accurately understand opinionated content is critical for a large set of applications. Models targeting at learning from such content should overcome the inherent difficulties of the data. We propose a novel hybrid neural network embedded in a deep learning framework that can be used for sentiment classification. Our method consists...
Article
Full-text available
Sentiment analysis is a challenging task that attracted increasing interest during the last years. The availability of online data along with the business interest to keep up with consumer feedback generates a constant demand for online analysis of user-generated content. A key role to this task plays the utilization of domain-specific lexicons of...
Conference Paper
Social networks have become the de facto online resource for people to share, comment on and be informed about events pertinent to their interests and livelihood, ranging from road traffic or an illness to concerts and earthquakes, to economics and politics. This has been the driving force behind research endeavors that analyse such data. In this p...
Conference Paper
Full-text available
This paper examines the connectivity among political networks on Twitter. We explore dynamics inside and between the far right and the far left, as well as the relation between the structure of the network and sentiment. The 2015 Greek political context offers a unique opportunity to investigate political communication in times of political intensi...
Article
Online Social Networks (OSNs) constitute one of the most important communication channels and are widely utilized as news sources. Information spreads widely and rapidly in OSNs through the word-of-mouth effect. However, it is not uncommon for misinformation to propagate in the network. Misinformation dissemination may lead to undesirable effects,...
Conference Paper
Full-text available
Applications targeting Smart Cities tackle common challenges, however solutions are seldom portable from one city to another due to the heterogeneity of city ecosystems. A major obstacle involves the differences in the levels of available information. In this demonstration we present REMI, a reusable elements framework to handle varying degrees of...
Article
Full-text available
Hierarchy Of Multi-label classifiers (HOMER) is a multi-label learning algorithm that breaks the initial learning task to several, easier sub-tasks by first constructing a hierarchy of labels from a given label set and secondly employing a given base multi-label classifier (MLC) to the resulting sub-problems. The primary goal is to effectively addr...
Preprint
Hierarchy Of Multi-label classifiers (HOMER) is a multi-label learning algorithm that breaks the initial learning task to several, easier sub-tasks by first constructing a hierarchy of labels from a given label set and secondly employing a given base multi-label classifier (MLC) to the resulting sub-problems. The primary goal is to effectively addr...
Conference Paper
In this demo we present INSIGHT, a system that provides traffic event detection in Dublin by exploiting Big Data and Crowdsourcing techniques. Our system is able to process and analyze input from multiple heterogeneous urban data sources.
Conference Paper
Urban data management is already an essential element of modern cities. The authorities can build on the variety of automatically generated information and develop intelligent services that improve citizens daily life, save environmental resources or aid in coping with emergencies. From a data mining perspective, urban data introduce a lot of chall...
Article
Modern cities generate a flood of rich and varied data. New information sources like public transport and wearable devices provide opportunities for novel applications that will improve citizens׳ quality of life by reducing transportation time, enhancing city planning, and improving air quality to name a few applications. From a data science perspe...
Chapter
Event detection is a research area that attracted attention during the last years due to the widespread availability of social media data. The problem of event detection has been examined in multiple social media sources like Twitter, Flickr, YouTube and Facebook. The task comprises many challenges including the processing of large volumes of data...
Article
Modern cities are flooded with data. New information sources like public transport and wearable devices provide opportunities for novel applications that will improve citizens' quality of life. From a data science perspective, data emerging from smart cities give rise to a lot of challenges that constitute a new interdisciplinary field of research....
Article
Modern cities are flooded with data. New information sources like public transport and wearable devices provide opportunities for novel applications that will improve citizens' quality of life. From a data science perspective, data emerging from smart cities give rise to a lot of challenges that constitute a new inter-disciplinary field of research...
Article
Users in social networks utilize hashtags for a variety of reasons. In many cases, hashtags serve retrieval purposes by labeling the content they accompany. More often than not, hashtags are used to promote content, ideas, or conversations producing viral memes. This paper addresses a specific case of hashtag classification: meme-filtering. We argu...
Article
We present a Software Keyboard for smart touchscreen devices that learns its owner's unique dictionary in order to produce personalized typing predictions. The learning process is accelerated by analysing user's past typed communication. Moreover, personal temporal user behaviour is captured and exploited in the prediction engine. Computational and...
Conference Paper
Full-text available
Detecting traffic events using the sensor network infrastructure is an important service in urban environments that enables the authorities to handle traffic incidents. However, irregular measurements in such settings can derive either from faulty sensors or from unpredictable events. In this paper, we propose an efficient solution to resolve in re...
Conference Paper
Full-text available
The task of opinion mining has attracted interest during the last years. This is mainly due to the vast availability and value of opinions on-line and the easy access of data through conventional or intelligent crawlers. In order to utilize this information, algorithms make exten-sive use of word sets with known polarity. This approach is known as...
Conference Paper
The task of opinion mining has attracted interest during the last years. This is mainly due to the vast availability and value of opinions on-line and the easy access of data through conventional or intelligent crawlers. In order to utilize this information, algorithms make extensive use of word sets with known polarity. This approach is known as d...
Article
Full-text available
Voting advice applications (VAAs) are online tools that have become increasingly popular and purportedly aid users in deciding which party/candidate to vote for during an election. In this paper we present an innovation to current VAA design which is based on the introduction of a social network element. We refer to this new type of online tool as...
Conference Paper
Voting Advice Application (VAA) is a web application that recommends a candidate or a party to a voter. From an online questionnaire, which voters and candidates are called to answer, the VAA proposes to each individual voter the candidate who replied like him/her. It is very important the voters to reply in all questions of the questionnaire, beca...
Conference Paper
Full-text available
Voting advice applications (VAA) are very recently developed in order to aid users in deciding what to vote in elections. Every user is presented with a set of important issues and she is asked to submit her opinion by selecting one of a predefined set of answers (e.g. agree/disagree). The VAA gathers the same information for all candidates that ar...
Conference Paper
Full-text available
A key challenge for Grid and Cloud infrastructures is to make their services easily accessible and attractive to end-users. In this paper we introduce tagging capabilities to the Miner soft system, a powerful tool for software search and discovery in order to help end-users locate application software suitable to their needs. Miner soft is now able...
Article
Full-text available
A simple yet effective multilabel learning method, called label powerset (LP), considers each distinct combination of labels that exist in the training set as a different class value of a single-label classification task. The computational efficiency and predictive performance of LP is challenged by application domains with large number of labels a...
Chapter
Full-text available
A large body of research in supervised learning deals with the analysis of single-label data, where training examples are associated with a single label λ from a set of disjoint labels L. However, training examples in several application domains are often associated with a set of labels Y ⊆ L. Such data are called multi-label. Textual data, such a...
Article
Full-text available
Concept drift constitutes a challenging problem for the machine learning and data mining community that frequently appears in real world stream classification problems. It is usually defined as the unforeseeable concept change of the target variable in a prediction task. In this paper, we focus on the problem of recurring contexts, a special sub-ty...
Article
Full-text available
Multi-label classification methods are increasingly required by modern applications, such as protein function classification, music categorization, and semantic scene classification. This article introduces the task of multi-label classification, organizes the sparse related literature into a structured presentation and performs comparative experim...
Conference Paper
Full-text available
Semantic Web services have emerged as the solution to the need for automating several aspects related to service-oriented architectures, such as service discovery and composition, and they are realized by combining Semantic Web technologies and Web service standards. In the present paper, we tackle the problem of automated classification of Web ser...
Article
Full-text available
With the explosive growth of the Word Wide Web, information overload became a crucial concern. In a data-rich information-poor environ- ment like the Web, the discrimination of useful or desirable information out of tons of mostly worthless data became a tedious task. The role of Machine Learning in tackling this problem is thoroughly discussed in...
Chapter
Multi-label classification methods are increasingly required by modern applications, such as protein function classification, music categorization, and semantic scene classification. This article introduces the task of multi-label classification, organizes the sparse related literature into a structured presentation and performs comparative experim...
Conference Paper
Full-text available
This paper proposes a general framework for classify- ing data streams by exploiting incremental clustering in order to dynamically build and update an ensemble of incremental classi- fiers. To achieve this, a transformation function that maps batches of examples into a new conceptual feature space is pro- posed. The clustering algorithm is then ap...
Article
Full-text available
This paper contributes a novel algorithm for effective and computationally efficient multilabel classification in domains with large label sets L. The HOMER algorithm constructs a Hierarchy Of Mul-tilabel classifiERs, each one dealing with a much smaller set of labels compared to L and a more balanced example distribution. This leads to improved pr...
Chapter
Multi-label classification methods are increasingly required by modern applications, such as protein function classification, music categorization, and semantic scene classification. This article introduces the task of multi-label classification, organizes the sparse related literature into a structured presentation and performs comparative experim...
Article
Nowadays, multi-label classification methods are increasingly required by modern applications, such as protein function classification, music categorization and semantic scene classification. This paper introduces the task of multi-label classification, organizes the sparse related literature into a structured presentation and performs comparative...
Conference Paper
Full-text available
In this paper, we present a web-based, machine-learning enhanced news reader (PersoNews). The main advantages of PersoNews are the aggregation of many different news sources, machine learning filtering offering personalization not only per user but also for every feed a user is subscribed to, and finally the ability for every user to watch a more a...
Conference Paper
Full-text available
Multiple Classier systems have been developed in order to improve classication accuracy using methodologies for eectiv e classi- er combination. Classical approaches use heuristics, statistical tests, or a meta-learning level in order to nd out the optimal combination func- tion. We study this problem from a Reinforcement Learning perspective. In o...
Article
Full-text available
Machine learning is one of the older areas of artificial intelligence and concerns the study of computational methods for the discovery of new knowledge and for the management of existing knowledge. Machine learning methods have been applied to various application domains. However, in the few last years due to various technological advances and res...
Article
Full-text available
ABSTRACT Email has met tremendous popularity over the past f ew years. People are sending and receiving many messages per day, communicating with partners and friends, or exchanging files and information. Unfortunately, the phenomenon of email overload has grown over the past years becoming a personal headache for users and a financ ial issue for c...
Article
Full-text available
Real world text classiflcation applications are of special inter- est for the machine learning and data mining community, mainly because they introduce and combine a number of special di-culties. They deal with high dimensional, streaming, unstructured, and, in many occasions, concept drifting data. Another important peculiarity of streaming text,...
Conference Paper
Full-text available
In this paper we argue that incrementally updating the fea- tures that a text classiflcation algorithm considers is very important for real-world textual data streams, because in most applications the distri- bution of data and the description of the classiflcation concept changes over time. We propose the coupling of an incremental feature ranking...
Conference Paper
Full-text available
This paper deals with the combination of classification models that have been derived from running di#erent (heterogeneous) learning algorithms on the same data set. We focus on the Classifier Evaluation and Selection (ES) method, that evaluates each of the models (typically using 10-fold cross-validation) and selects the best one. We examine the p...
Article
Full-text available
Concept drift is a common phenomenon in streaming data environ-ments and constitutes an interesting challenge for researchers in the machine learning and data mining community. This paper proposes a probabilistic repre-sentation model for data stream classification and investigates the use of incre-mental clustering algorithms in order to identify...
Article
Full-text available
Nowadays, multi-label classification methods are increasingly required by modern applications, such as protein function classification, music categorization and semantic scene classification. This paper intro-duces the task of multi-label classification, organizes the sparse related literature into a structured presentation and performs comparative...

Network

Cited By