Yahya Slimani

Yahya Slimani
Université de la Manouba | ESCT · Institut Supérieur des Arts Multimédia de la Manouba

PhD
Full Professor in Computer Science Emeritus Professor since 2022

About

177
Publications
28,519
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,328
Citations
Introduction
Information retrieval for arabic language Cloud computing interoperability Information credibility on social media Communities in social networks
Additional affiliations
September 2012 - present
Université de la Manouba
Position
  • Professor (Full)
September 1989 - September 2012
University of Tunis El Manar
Position
  • Professor (Full)

Publications

Publications (177)
Article
Purpose A number of approaches and algorithms have been proposed over the years as a basis for automatic indexing. Many of these approaches suffer from precision inefficiency at low recall. The choice of indexing units has a great impact on search system effectiveness. The authors dive beyond simple terms indexing to propose a framework for multi-w...
Article
Integrating useful input information is essential to provide efficient recommendations to users. In this work, we focus on improving items ratings prediction by merging both multiple contexts and multiple criteria based research directions which were addressed separately in most existent literature. Throughout this article, Criteria refer to the it...
Article
Purpose The semantic relations between Arabic word representations were recognized and widely studied in theoretical studies in linguistics many centuries ago. Nonetheless, most of the previous research in automatic information retrieval (IR) focused on stem or root-based indexing, while lemmas and patterns are under-exploited. However, the authors...
Article
Full-text available
Fake news has invaded social media platforms where false information is being propagated with malicious intent at a fast pace. These circumstances required the development of solutions to monitor and detect rumor in a timely manner. In this paper, we propose an approach that seeks to detect emerging and unseen rumors on Twitter by adapting a pre-tr...
Article
This article presents a new query expansion (QE) method aiming to tackle term mismatch in information retrieval (IR). Previous research showed that selecting good expansion terms which do not hurt retrieval effectiveness remains an open and challenging research question. Our method investigates how global statistics of term co-occurrence can be use...
Article
Full-text available
One of the most challenging issues dealing with dynamic social networks, with evolving community structures, is to understand and model the way in which these structures evolve over time. Studies on tracking and predicting community evolution generally split these networks into time-series of static networks, each including interactions aggregated...
Article
Full-text available
The vendor lock‐in is a prominent issue in cloud computing. It is caused by cloud providers who offer proprietary services, which hinders the cloud interoperability. Client‐centric interoperability enables the migration of the data and applications across clouds; it gives the clients control over their workloads and a wider range of service choices...
Article
Over the last decade, cloud computing has emerged as a new paradigm for delivering various on‐demand virtualized resources as services. Cloud services have inherited not only the major characteristics of web services but also their classical issues, in particular, the interoperability issues and the heterogeneous nature of their hosting environment...
Chapter
Full-text available
The quest for trustworthy, reliable and efficient sources of information has been a struggle long before the era of internet. However, social media unleashed an abundance of information and neglected the establishment of competent gatekeepers that would ensure information credibility. That’s why, great research efforts sought to remedy this shortco...
Chapter
The massive growth of information produced and shared online has made retrieving relevant documents a difficult task. Query Expansion (QE) based on term co-occurrence statistics has been widely applied in an attempt to improve retrieval effectiveness. However, selecting good expansion terms using co-occurrence graphs is challenging. In this paper,...
Article
In this paper, we propose to build a morpho-semantic knowledge graph from Arabic vocalized corpora. Our work focuses on classical Arabic as it has not been deeply investigated in related works. We use a tool suite which allows analyzing and disambiguating Arabic texts, taking into account short diacritics to reduce ambiguities. At the morphological...
Chapter
In Information Retrieval (IR), user queries are often too short, making the selection of relevant documents hard. Pseudo-relevance feedback (PRF) is an effective method to automatically expand the query with new terms using a set of pseudo-relevant documents. However, a main issue in PRF is the selection of good expansion terms that allow improving...
Chapter
Full-text available
Both multi-criteria recommendation and context-aware recommendation are well addressed in previous research but separately in most of existing work. In this paper, we aim to contribute to the under-explored research problem which consists in tailoring the multi-criteria rating predictions to users involved in specific contexts. We investigate the a...
Article
Full-text available
With the emergence of Cloud Computing, the amount of data generated in different fields such as physics, medical, social networks, etc. is growing exponentially. This increase in the volume of data and their large scale make the problem of their processing more complex. Actually, the current datasets are very different in nature, ranging from small...
Conference Paper
Full-text available
In this paper, we investigate the prediction of community future occurring events in dynamic social networks, based on change rates of features that describe a community throughout its evolution life-cycle rather than absolute values of features. Besides, we explore the most predictive features for each event. Our experiments on DBLP and Facebook d...
Article
In this article, we describe an empirical evaluation of compounds indexing for Turkish texts. We dive beyond the keyword indexing to propose a framework for Turkish compounds extraction and indexing. We identify twelve Turkish compounds pattern types that we classify in six categories. To extract Turkish compounds, we rely on a light natural langua...
Conference Paper
Query Expansion is an important process in information retrieval, which consists in adding new related terms to the original query in order to better identify relevant documents. In this paper, we discuss the participation of the JARIR research group to the TREC 2018 Common Core Track. We present different Query Expansion methods, which are based o...
Article
This article describes how the idea of a hybrid cloud comes from the coupling of public and private clouds to more efficiently address user requirements. This article addresses the problem of resource provisioning in hybrid cloud. This article is mainly concerned about optimizing the resources provisioning task through the reduction of the tasks co...
Chapter
Existent literature proposes several approaches to enhance Arabic document retrieval using different indexing units. In anterior work [1, 2], we proposed to combine multiple indexing units which improved retrieval performance. This paper develops this approach and suggests enhancing term weighting through result aggregation and pseudo-relevance fee...
Chapter
This paper proposes a hybrid possibilistic approach for bilingual terminology extraction using possibility and necessity measures. On the one hand, we extract domain-relevant terms from the source language, and on the other hand, we build a co-occurrence-based translation graph, which is mined to translate terms in the target language. We compare o...
Article
Full-text available
Query expansion (QE) is an important process in information retrieval applications that improves the user query and helps in retrieving relevant results. In this paper, we introduce a hybrid query expansion model (HQE) that investigates how external resources can be combined to association rules mining and used to enhance expansion terms generation...
Chapter
Using either stems or roots as index terms offered considerable performance to Arabic Information Retrieval (IR) systems compared to the use of surface words for indexing. Many comparative works tried to find out the best from these two indexing approaches but until then, no of the two methods widely overtook the other. Each of the two index types...
Conference Paper
Full-text available
We propose and evaluate a Cross-language Information Retrieval model (CLIR) based on the extraction and the translation of Formal Concepts avoiding queries and/or documents translation. The contribution of this work is the unified formal framework that integrates Formal Concept Analysis (FCA) and information retrieval for effective CLIR. The model...
Conference Paper
Full-text available
Cloud computing is becoming a popular platform to deliver service-based applications (SBAs) based on service oriented architecture (SOA) principles. Monitoring the performance and functionality in all the layers which affects the final step of adaptations of SBAs deployed on multiple Cloud providers and adapting them to variations/events produced b...
Article
In recent years, cloud computing paradigm has attracted a lot of attention from both industry and academia. However, each cloud provider uses its own techniques (languages, standards, ontologies, or models, etc.) to describe cloud services. The diversity of these techniques leads to the vendor lock-in problem, and thus, the lack of a cloud service...
Article
Full-text available
Social information retrieval becomes a very challenging task with the increase use of social networks and the amount of social information they provide continuously in different fields. In this paper, we aim at exploring different kind of social information, namely descriptions (tags) and reactions (clicks) to build user and document profiles for p...
Article
Cloud service description (CSD) becomes an active area which attracts the attention of many research organizations, while no standard CSD exists. This lack of standardization is caused by the vendor lock-in problem, where cloud providers use various techniques (languages, standards, ontologies, models, etc.) to describe cloud services. Furthermore,...
Article
Using either stems or roots as index terms offered considerable performance to Arabic Information Retrieval IR systems compared to the use of surface words for indexing. Many comparative works tried to find out the best from these two indexing approaches but until then, no of the two methods widely overtook the other. Each of the two index types pe...
Conference Paper
Full-text available
Twitter has emerged as the most popular among microblogging service providers. The content provided in Twitter is large, diverse, and huge in quantity. Given the increasing amount of information available through such microblogging sites, it would be interesting to be able to retrieve useful tweets in response to a given information need. However,...
Article
Full-text available
Computational grids have the potential for solving large-scale scientific problems using heterogeneous and geographically distributed resources. At this scale, computer resources and network failures are no more exceptions, but belong to the normal system behavior. Therefore, one of the most valuable characteristics of grid tools, apart from the pe...
Conference Paper
Full-text available
Tweets are short 140 characters-limited messages that do not always conform to proper spelling rules. This spelling variation makes them hard to understand without some kind of context. For these reasons, the tweet contextualization task was introduced, aiming to provide automatic contexts to explain the tweets. We present, in this paper, two tweet...
Article
Full-text available
Microblogging sites, like Twitter, have emerged as a popular platform for expressing opinions. Bound to 140 characters, Twitter's publications (tweets) are very short and not always written maintaining formal grammar and proper spelling. The spelling variations increase the likelihood of vocabulary mismatch and make the tweets difficult to understa...
Conference Paper
Full-text available
In the past few years several cross-layer monitoring and adaptation technologies have been proposed. Although these are cross-layer adaptation technologies, however, in practice they focus on a particular layer. Some solutions involves two layers, yet none of the existing solutions do not consider all the layers during adaptation process. Furthermo...
Article
Full-text available
The literature on information retrieval shows the importance of information reliability as a key criterion for relevance judgment. However, information reliability evaluation is discussed in many disciplines such as history, Arabic storytelling, and computer science. Although these disciplines share common principles, they differ in many aspects, w...
Conference Paper
This article describes a new method to build comparable corpora from Twitter. Our strategy relies on the fact that Twitter is one of the most popular online social microblog allowing large audiences to express their thoughts and reactions about specific events or breaking news in various languages. Given two languages and a particular topic, We pro...
Conference Paper
In this paper, we propose an approach for constructing Arabic Ontology based on normalized dictionaries. This approach mainly consists in transforming non structured Arabic dictionaries into LMF (Lexical Markup Framework) based-normalized ones. We are basically exploiting Arabic dictionaries of Hadith for experimentation. Then, from an Arabic norma...
Article
Full-text available
The aim of the frequent subgraph mining task is to find frequently occurring subgraphs in a large graph database. However, this task is a thriving challenge, as graph and subgraph isomorphisms play a key role throughout the computations. Since subgraph isomorphism testing is a hard problem, subgraph miners are exponential in runtime. To alleviate t...
Article
Full-text available
Service-oriented computing offers efficient solutions for executing complex applications in an acceptable amount of time. These solutions provide important computing and storage resources, but they are too difficult for individual users to handle. In fact, Service-oriented architectures are usually sophisticated in terms of design, specifications,...
Article
International audience The feature selection for classification is a very active research field in data mining and optimization. Its combinatorial nature requires the development of specific techniques (such as filters, wrappers, genetic algorithms, and so on) or hybrid approaches combining several optimization methods. In this context, the support...
Article
Full-text available
Developing algorithms that discover all frequently occurring subgraphs in a large graph database is computationally extensive, as graph and subgraph isomorphisms play a key role throughout the computations. Since subgraph isomorphism testing is a hard problem, fragment miners are exponential in runtime. To alleviate the complexity issue, we propose...
Article
Full-text available
In this paper, we propose a new version of FP-Growth algorithm to find association rules. In this version, we vary the minsup value from one level to another. This variation is made in two cases: increasing and decreasing the minsup value. We performed a set of experiments to validate the usefulness of our proposition in the generation of associati...
Conference Paper
Full-text available
نقدم، في هذا المقال، مشروع "كنوز" المتمثل في مدونة مرجعية للبحث عن المعلومة باللغة العربية والإنجليزية. إن مشروع "كنوز" يرتكز على نصوص الحديثا لنبوي باللغتين المذكورتين ونقترح عبر بوابة "كنوز المصطفى" برنامجا لتقييم نتائج البحث عن إستعلامات تمت تجميعها وفق طريقة تريك. نهدف من خلال المدونة المرجعية إلى النهوض بالبحوث باللغة العربية ولغات أخرى.
Article
Full-text available
Efficient tool and platform for several areas, concept lattice are widely used in many fields of research. Dynamic environment requires an incremental algorithm to build formal concepts. It plays an essential role in the application of concept lattice. This paper presents a fast, efficient, incremental algorithm to compute formal concepts. Algorith...
Article
Grid reliability analysis and modeling are not easy tasks because of the complexity and large scale of the system. For some grid services which have large subtasks requiring time-consuming computation, the reliability of grid service could be rather low. To resolve this problem, RNFR and LNFR are proposed in literature. This paper describes the pro...