Barbara Catania

Università degli Studi di Genova | UNIGE · Dipartimento di Informatica, Bioingegneria, Robotica e Ingegneria dei Sistemi (DIBRIS)

Full Professor


Publications: 127
Reads: 15,842
Citations: 1,624

Publications (127)
Article
The data science era is characterized by data-driven automated decision systems (ADS) enabling, through data analytics and machine learning, automated decisions in many contexts, deeply impacting our lives. As such, their downsides and potential risks are becoming more and more evident: technical solutions, alone, are not sufficient and an interdis...
Conference Paper
Research on database and information technologies has been rapidly evolving over the last couple of years. This evolution was led by three major forces: Big Data, AI and Connected World that open the door to innovative research directions and challenges, yet exploiting four main areas: (i) computational and storage resource modeling and organizati...
Book
This book constitutes thoroughly reviewed and selected papers presented at Workshops and Doctoral Consortium of the 24th East-European Conference on Advances in Databases and Information Systems, ADBIS 2020, the 24th International Conference on Theory and Practice of Digital Libraries, TPDL 2020, and the 16th Workshop on Business Intelligence and B...
Article
Many solutions for coarse geolocating of users at the time they post a message exist. However, for many important applications, like traffic monitoring and event detection, finer geolocation at the level of city neighborhoods, i.e., at a sub-city level, is needed. Data-driven approaches often do not guarantee good accuracy and efficiency due to the...
Chapter
SPARQL query optimization is an important issue for RDF data stores that can benefit from the usage of caching frameworks. Most caching approaches rely on a precise match semantics, that limits the number of cache hits and, as a consequence, the potential benefit. Others propose relaxed matches for the entire query, which is precisely executed over...
Conference Paper
The traditional Web is evolving into the Web of Data, which gathers huge collections of structured data over distributed, heterogeneous data sources. Live queries are needed to get current information out of this global data space. In live query processing, source selection deserves special attention, because it allows us to identify the sources th...
Conference Paper
Geolocation of microblog messages has been largely investigated in the literature. Many solutions have been proposed that achieve good results at the city level. Existing approaches are mainly data-driven (i.e., they rely on a training phase). However, the development of algorithms for geolocation at sub-city level is still an open problem. In this...
Chapter
Since the late 1970s, the database research community has devoted huge efforts to devising more and more effective and efficient solutions for processing queries against data. In this chapter, we retrace some of the challenges that have been faced in the last 25 years to enable data to be effectively and efficiently queried and how the notions of querie...
Book
This book constitutes the thoroughly refereed short papers, workshops and Doctoral Consortium papers of the 20th East European Conference on Advances in Databases and Information Systems, ADBIS 2016, held in Prague, Czech Republic, in August 2016. The 11 short papers and one historical paper were carefully selected and reviewed from 85 submissions....
Book
This book constitutes the proceedings of the 42nd International Conference on Current Trends in Theory and Practice of Computer Science, SOFSEM 2016, held in Harrachov, Czech Republic, in January 2016. The 43 full papers presented in this volume were carefully reviewed and selected from 116 submissions. They are organized in topical sections named:...
Conference Paper
The wealth of information generated by users interacting with the network and its applications is often under-utilized due to complications in accessing heterogeneous and dynamic data and in retrieving relevant information from sources having possibly unknown formats and structures. Processing complex requests on such information sources is, thus,...
Book
This book constitutes the revised selected papers of the combined workshops on Web Information Systems Engineering, WISE 2014, held in Thessaloniki, Greece, in October 2014. The 19 selected papers presented were carefully revised and report from the four workshops: computational social networks, IWCSN 2014, enterprise social networks, Org2 2014, pe...
Chapter
Research on database and information system technologies has been rapidly evolving over the last few years. Advances concern new data types, new management issues, and new kinds of architectures and systems. The 17th East-European Conference on Advances in Databases and Information Systems (ADBIS 2013), held on September 1–4, 2013 in Genova,...
Book
The LNCS journal Transactions on Large-Scale Data- and Knowledge-Centered Systems focuses on data management, knowledge discovery, and knowledge processing, which are core and hot topics in computer science. Since the 1990s, the Internet has become the main driving force behind application development in all domains. An increase in the demand for r...
Conference Paper
The wealth of information generated by users interacting with the network and its applications is often under-utilized due to complications in accessing heterogeneous and dynamic data and in retrieving relevant information from sources having possibly unknown formats and structures. Processing complex requests on such information sources is, thus,...
Article
The 16th International Conference on Database Theory (ICDT 2013) was held in Genoa, Italy, March 18–22, 2013. Originally biennial, the ICDT conference has been held annually and jointly with EDBT ("Extending Database Technology") since 2009.
Conference Paper
In a data stream management system, users may not be acquainted with the actual data arriving on the stream. Therefore, they may issue queries that return an empty result over several windows. In the relational context, relaxation skyline queries have been proposed as a solution to the so-called empty answer problem. Given a query composed of selec...
Article
Knowledge patterns, such as association rules, clusters or decision trees, can be defined as concise and relevant information that can be extracted, stored, analyzed, and manipulated by knowledge workers in order to drive and specialize business decision processes. In this paper we deal with data mining patterns. The ability to manipulate different...
Article
Relaxation and approximation techniques have been proposed as approaches for improving the quality of query results, in terms of completeness and accuracy, in environments where the user may not be able to specify the query in a complete and exact way, since data are quite heterogeneous or she may not know all the characteristics of data at hand. T...
Conference Paper
Relaxation skyline queries have been proposed, in the relational context, as a solution to the so-called empty answer problem. Given a query composed of selection and join operations, a relaxation skyline query relies on the usage of a relaxation function (usually, a numeric function) to quantify the distance of each tuple (pair of tuples in case o...
Chapter
Traditional query processing techniques have played a major role in the success of relational Database Management Systems over the last decade. However, they do not obviously extend to much more challenging, unorganized and unpredictable data providers, typical of emerging data intensive applications and novel processing environments. For them, adv...
Chapter
The traditional query processing approach, by which queries are executed exactly according to a query execution plan selected before query execution starts, breaks down in heterogeneous and dynamic processing environments that are becoming more and more common as query processing contexts. In such environments, queries are often relaxed and query p...
Chapter
Approximation techniques for spatial data traditionally concern data capture and data representation issues. On the other hand, more recently developed approximation techniques refer to the query to be executed and not to data representation as in the past monolithic Geographic Information Systems and for this reason they are called query-based...
Conference Paper
The aim of the paper is to describe a project concerned with the development of a daily monitoring system for elderly people living alone. The system relies on a new non-invasive type of communication based on devices commonly owned by elderly people, to reduce the initial cost of deployment. All collected data could then be analyzed by a Medical Docto...
Conference Paper
Recommendation functionalities have been recently considered in traditional database systems as an approach for guaranteeing a satisfactory interaction with the database also to users with a low or moderate technical skill or in presence of huge volumes of, potentially heterogeneous, data. Recommendation is performed by extending query results with...
Chapter
Text databases provide rapid access to collections of digital documents. Such databases have become ubiquitous: text search engines underlie the online text repositories accessible via the Web and are central to digital libraries and online corporate document management.
Chapter
There has been a growing acceptance of the object-oriented data model as the basis of next generation database management systems (DBMSs). Both pure object-oriented DBMS (OODBMSs) and object-relational DBMS (ORDBMSs) have been developed based on object-oriented concepts. Object-relational DBMS, in particular, extend the SQL language by incorporatin...
Chapter
Innovative applications over distributed architectures, like the Web, often require the analysis of strongly related, highly heterogeneous data, stored in remote and autonomous data sources, that can be either totally available at query processing time (stored data) or become available in a continuous stream (data stream). In these contexts, search...
Article
Knowledge-intensive applications rely on the usage of knowledge artifacts, called patterns, to represent in a compact and semantically rich way huge quantities of heterogeneous raw data. Due to the characteristics of patterns, specific systems are required for pattern management in order to model, store, retrieve and manipulate patterns in an e...
Chapter
Patterns can be defined as concise, but rich in semantics, representations of data. Due to pattern characteristics, ad-hoc systems are required for pattern management, in order to deal with them in an efficient and effective way. Several approaches have been proposed, both by scientific and industrial communities, to cope with pattern management pr...
Article
Patterns can be defined as knowledge artifacts, providing a compact and semantically rich representation of a huge quantity of heterogeneous raw data. Due to the specific characteristics of patterns, ad hoc systems are required for pattern management, in order to model, store, retrieve, and manipulate patterns in an efficient and effective way. Pat...
Article
Information overloading is today a serious concern that may hinder the potential of modern web-based information systems. A promising approach to deal with this problem is represented by knowledge extraction methods able to produce artifacts (also called patterns) that concisely represent data. Patterns are usually quite heterogeneous and voluminou...
Article
Securing access to data in location-based services and mobile applications requires the definition of spatially aware access control systems. Even if some approaches have already been proposed either in the context of geographic database systems or context-aware applications, a comprehensive framework, general and flexible enough to cope with spati...
Article
Pattern management is nowadays a key component of any knowledge-intensive application. In the context of pattern management, interoperability issues are without doubt very important, since raw data and patterns may reside at different sites, often distributed, relying on different logical models and management technologies. Existing solutions for p...
Chapter
Spatial data are today needed in a wide range of application domains. Indeed, spatial properties are included in several application contexts requiring the management of very large data sets, such as, for instance, computer-aided design (CAD), very large scale integration (VLSI), robotics, and image processing. However, the primary target of system...
Chapter
We give a general overview of the European Esprit project CONCERTO. The central idea of CONCERTO is to represent the ‘meaning’ of digital documents making use of their associated ‘conceptual annotations’: a conceptual annotation is intended to supply a computer-usable description of the main information elements of the whole document or of a part o...
Conference Paper
In recent times, the proliferation of spatial data on the Internet is beginning to allow a much larger audience to access and share data currently available in various Geographic Information Systems (GISs). Unfortunately, even if the user can potentially access a huge amount of data, often, she does not have enough knowledge about the spatial domain sh...
Conference Paper
Patterns are concise, but semantically rich, representations of data. The approaches proposed in the literature and by commercial systems for pattern management usually deal with few types of knowledge artifacts and mainly concern pattern extraction issues. Little effort has been devoted to designing an overall framework dedicated to the management of...
Chapter
Knowledge-intensive applications rely on the usage of knowledge artifacts, called patterns, to represent in a compact and semantically rich way huge quantities of heterogeneous raw data. Due to the characteristics of patterns, specific systems are required for pattern management in order to model, store, retrieve and manipulate patterns in an e...
Chapter
Knowledge-intensive applications rely on the usage of knowledge artifacts, called patterns, to represent in a compact and semantically rich way huge quantities of heterogeneous raw data. Due to the characteristics of patterns, specific systems are required for pattern management in order to model, store, retrieve and manipulate patterns in an e...
Article
XML's increasing diffusion makes efficient XML query processing and indexing all the more critical. Given the semistructured nature of XML documents, however, general query processing techniques won't work. Researchers have proposed several specialized indexing methods that offer query processors efficient access to XML documents, although none are...
Article
Some XML query processors operate on an internal representation of XML documents and can leverage neither the XML storage structure nor the possible access methods dedicated to this storage structure. Such query processors are often used in organizations that usually process transient XML documents received from other organizations. In this paper,...
Article
Because XML is rapidly becoming the Internet standard for data representation and exchange, efficient XML document storage has become a core data management issue. Most early XML storage practices rely on conventional database management systems. However, such systems involve mappings and transformations between XML and the underlying database stru...
Conference Paper
XML documents are normally stored as plain text files. Hence, the natural and most convenient way to update XML documents is to simply edit the text files. But efficient query evaluation algorithms require XML documents to be indexed. Every element is given a unique identifier based on its location in the document or its preorder-traversal order, a...
Conference Paper
Patterns represent huge quantities of heterogeneous data in a compact and semantically rich way. Due to their characteristics, specific systems are required for pattern management, in order to model and manipulate patterns, with a possibly user-defined structure, in an efficient and effective way. In this demonstration we present allows the user to:...
Conference Paper
Securing access to data in location-based services and mobile applications requires the definition of spatially aware access control systems. Even if some approaches have already been proposed either in the context of geographic database systems or context-aware applications, a comprehensive framework, general and flexible enough to cope with spati...
Conference Paper
Several application contexts require the ability to use together and compare different geographic datasets (maps) concerning the same or overlapping areas. This is for example the case of mediator systems, integrating distinct data sources for query processing, and GISs dealing with multi-resolution maps. In both cases, distinct maps may represent...
Conference Paper
Securing access to data in location-based services and mobile applications requires the definition of spatially aware access-control systems. Even if some approaches have already been proposed either in the context of geographic database systems or context-aware applications, a comprehensive framework, general and flexible enough to deal with spati...
Article
Constraint databases have recently been proposed as a powerful framework to model and retrieve spatial data. The use of constraint databases should be supported by access data structures that make effective use of secondary storage and reduce query processing time. In this paper, we consider the indexing problem for objects represented by conjuncti...
Conference Paper
To represent and manage data mining patterns, several aspects have to be taken into account: (i) patterns are heterogeneous in nature; (ii) patterns can be extracted from raw data by using data mining tools (a-posteriori patterns) but also defined by the users and used for example to check how well they represent some input data source (a-priori pa...
Article
The EDBT'04 Workshop on Database Technologies for Handling XML Information on the Web (DataX'04) was held in Heraklion, Crete, on Sunday 14 March, 2004, and attracted approximately 30 participants from different countries.
Article
Patterns are concise, but semantically rich, representations of data. The approaches proposed in the literature to cope with pattern management problems usually deal with a single type of knowledge artifact and mainly concern pattern extraction issues. Little emphasis has been placed on defining an overall environment to represent and efficiently manage...
Conference Paper
To represent and manipulate data extracted from Web-server logs or application logs, clustering techniques can be used. The generated clusters are often of different types, are generated by using different algorithms, and should be homogeneously manipulated together with other knowledge mined from data, for example association rules or decision t...
Conference Paper
Patterns are concise, but semantically rich, representations of data. The approaches proposed in the literature to cope with pattern management problems usually deal with a single type of knowledge artifact and mainly concern pattern extraction issues. Little emphasis has been placed on defining an overall environment to represent and efficiently ma...
Conference Paper
Access control is an important component of any database management system. Several access control models have been proposed for conventional databases. However, these models do not seem adequate for geographical databases, due to the peculiarities of geographical data. Previous work on access control models for geographical data mainly concerns ra...
Conference Paper
Nowadays, the vast volume of collected digital data obliges us to employ processing methods like pattern recognition and data mining in order to reduce the complexity of data management. In this paper, we present the architecture and the logical foundations for the management of the produced knowledge artifacts, which we call patterns. To this...
Article
Static analysis of declarative languages deals with the detection, at compile time, of program properties that can be used to better understand the program semantics and to improve the efficiency of program evaluation. In logical update languages, an interesting problem is the detection of conflicting updates, inserting and deleting the same fact,...
Article
The increased availability of tools and technologies to access and use data has made the need for data protection more urgent. Moreover, emerging applications and data models call for more flexible and expressive access control models. This has led to an extensive research activity that has resulted in the definition of a variety of access co...
Conference Paper
Nowadays, the vast volume of collected digital data obliges us to employ processing methods like pattern recognition and data mining in order to reduce the complexity of data management. The outputs of these techniques are knowledge artifacts, heterogeneous in both structure and semantics. We claim that the concept of pattern is a good candidate for...
Conference Paper
Geographical maps can be represented and manipulated using different data models, each dealing with different map properties: geometrical properties, describing the shape, extension and location of the objects composing the map, combinatorial properties, interpreting the map as a set of disjoint elements satisfying a certain set of combinatorial co...
Chapter
The Web is characterized by a huge number of very heterogeneous data sources, which differ both in media support and format representation. In this scenario, there is a need for an integrated approach to querying heterogeneous Web documents. To this purpose, XML can play an important role since it is becoming a standard for data representation and ex...
Article
The problem of integrating knowledge from multiple and heterogeneous sources is a fundamental issue in current information systems. In order to cope with this problem, the concept of mediator has been introduced as a software component providing intermediate services, linking data resources and application programs, and making transparent the heter...
Article
Deductive database technology represents an important step towards the goal of developing highly-declarative database programming languages. In order to make deductive databases a practical technology, deductive rules have to be extended to provide a dynamic behavior. In particular, current applications require not only a support for updates and tr...
Conference Paper
Clustering algorithms for hypertext documents consider not only the document content but also the links existing between them. All the similarity functions proposed in the literature assume that just one type of link exists between documents, with a unique semantic meaning. With the rapid diffusion of XML documents, a specific language, called XLin...
Chapter
The Web is characterized by a huge number of very heterogeneous data sources, which differ both in media support and format representation. In this scenario, there is a need for an integrated approach to querying heterogeneous Web documents. To this purpose, XML can play an important role since it is becoming a standard for data representation and ex...
Article
This article presents an approach to concurrency control for transactions in a Multilevel Secure Database Management System (MLS/DBMS). The major problem is that concurrency control mechanisms used in traditional DBMSs are not adequate in a MLS/DBMS, since they may be exploited to establish covert channels. The approach presented in this article, w...
Article
Metadata represent the vehicle by which digital documents can be efficiently indexed and retrieved. The need for such kind of information is particularly evident in multimedia digital libraries, which store documents dealing with different types of media (text, images, sound, video). In this context, a relevant metadata function consists in superim...
Article
XML is becoming a standard for data communication over the Internet. Like HTML, it is a markup language, but it supports a richer set of features, such as user-defined tags that allow both data and descriptive information about data to be represented within a single document. At the same time, presentation aspects remain decoupled from data represe...