Pasquale Savino

Pasquale Savino
Italian National Research Council | CNR · Institute of Information Science and Technology "Alessandro Faedo" ISTI

Laurea in Fisica

About

124
Publications
10,163
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,310
Citations
Additional affiliations
January 1996 - present
Italian National Research Council
Position
  • CNR - Consiglio Nazionale delle Ricerche
Education
October 1981 - June 1982
Universtià di Pisa
Field of study
  • Computer Science
November 1974 - June 1979
Università di Pisa
Field of study
  • Physics

Publications

Publications (124)
Article
Virtual restoration of digital copies of the human documental heritage is crucial for facilitating both the traditional work of philologists and paleographers and the automatic analysis of the contents. Here we propose a practical and fast procedure for the correction of the typically complex background of recto–verso historical manuscripts. The pr...
Technical Report
Full-text available
The Artificial Intelligence for Media and Humanities laboratory (AIMH) has the mission to investigate and advance the state of the art in the Artificial Intelligence field, specifically addressing applications to digital media and digital humanities, and taking also into account issues related to scalability. This report summarize the 2020 activiti...
Chapter
Virtual restoration of the nowadays frequently available digital copies of the human documental heritage is crucial for facilitating both the traditional work of philologists and paleographers and the automatic analysis of the contents. Here we propose a fast procedure for the correction of the typically complex background of recto-verso historical...
Technical Report
Full-text available
The Artificial Intelligence for Multimedia Information Retrieval (AIMIR) research group is part of the NeMIS laboratory of the Information Science and Technologies Institute ``A. Faedo'' (ISTI) of the Italian National Research Council (CNR). The AIMIR group has a long experience in topics related to: Artificial Intelligence, Multimedia Information...
Article
Full-text available
In recent years, extensive campaigns of digitization of the documental heritage conserved in libraries and archives have been performed, with the primary goal to ensure the preservation and fruition of this important part of the human cultural and historical patrimony. Besides protecting conservation, the availability of high quality digital copies...
Article
Full-text available
Ancient manuscripts written on both pages of the sheet are frequently affected by ink bleeding from the reverse side. This phenomenon produces a significant degradation of both the foreground text and the general appearance of the manuscript. Effective digital image restoration techniques may require the use of the content of both document sides, t...
Article
Full-text available
Bleed-through is a frequent, pervasive degradation in ancient manuscripts, which is caused by ink seeped from the opposite side of the sheet. Bleed-through, appearing as an extra interfering text, hinders document readability and makes it difficult to decipher the information contents. Digital image restoration techniques have been successfully emp...
Article
Full-text available
Bleed-through is a commonly encountered degradation in ancient printed documents and manuscripts, which severely impair their readability. Digital image restoration techniques can be effective to remove or significantly reduce this degradation. In bleed-through document image restoration the main issue is to identify the bleed-through pixels and re...
Article
In this paper we consider the problem of removing achromatic reflections from a picture of a scene taken through a semi-transparent medium, assuming that the reflection pattern is due to a light source or another object located in front of the object of interest. While other works assume the availability of multiple observations, we consider the mo...
Article
We propose a fast automatic procedure for registration and restoration of images of recto-verso pairs of color manuscripts affected by bleed-through distortion. The registration algorithm assumes a rigid projective deformation of a side with respect to the other. The coefficients of the geometric transformation are computed from a large number of p...
Conference Paper
We propose a general approach to remove reflections from a color image acquired through a semi-transparent medium, and show its application to the restoration of images of paintings framed behind glass and manuscripts laminated for conservation purposes, affected by the reflection of a light source. The problem is modeled by assuming that the unwan...
Article
Full-text available
We address the problem of the removal of a text superimposed to a more important one, in a document image, considering the two instances of canceling back-to-front interferences from recto and verso images of archival documents and of recovering the erased text in palimpsests from multispectral images. Both problems are approached through a model w...
Article
The paper describes a technique that supports efficient and effective Content-Based Image Retrieval (CBIR) in very large image archives as well as automatic image tagging. The proposed technique uses a unified representation for image visual features and for image textual descriptions. Images are clustered according to their image visual features w...
Conference Paper
Optical imaging is one of the less expensive and less invasive techniques to capture information from tangible cultural heritage. An additional advantage is that it is faster than many other methods, providing comprehensive maps of the quantities under study in just one or a few steps. More specialized techniques, such as chemical analysis or spect...
Article
Wide access to large information collections is of great potential importance in many aspects - economic, environmental, health, cultural, social, etc. - of everyday life. Historical video documentaries hold by national audiovisual archives, constitute one of the most precious - from a historical and cultural viewpoint - and less accessible cultura...
Conference Paper
In this paper, we address the removal of severe back-to-front interferences in archival documents, when recto and verso images of the page are available. The problem is approached from a modeling point of view, considering the ideal images of the two separated texts as individual source patterns that overlap in the observed images through some para...
Conference Paper
Film archives, containing collections of cinema-related digital material, have been created in many European countries. Today, the EC Best Practice Network Project EFG (European Film Gateway) provides a single access point to 59 collections from 19 archives and across 14 European countries, for a total of 640,000 digital objects. This paper illustr...
Article
We propose a new efficient and accurate technique for generic approximate similarity searching, based on the use of inverted files. We represent each object of a dataset by the ordering of a number of reference objects according to their distance from the object itself. In order to compare two objects in the dataset, we compare the two correspondin...
Chapter
This chapter deals with the problem of processing and analyzing digital images of ancient or degraded documents to increase the possibilities of inferring their structures. Classification and recognition are needed to infer structure but, when dealing with degraded documents, they are particularly difficult to apply directly to unprocessed images....
Conference Paper
Content-based image retrieval is becoming a popular way for searching digital libraries as the amount of available multimedia data increases. However, the cost of developing from scratch a robust and reliable system with content-based image retrieval facilities for large databases is quite prohibitive. In this paper, we propose to exploit an approa...
Conference Paper
Full-text available
Historical documents often undergo various changes over time, that alter their original state and reduce their legibility. Digital techniques are widely applied for document preservation, archiving, analysis and dissemination over the Internet. The goal of restoration is to improve the document legibility for both human and automatic operators, to...
Conference Paper
Full-text available
In the demonstration we will show a system for searching by similarity and automatically classifying images in a very large dataset. The demonstrated techniques are based on the use of the MI-File (Metric Inverted File) as the access method for executing similarity search efficiently. The MI-File is an access methods based on inverted files that re...
Conference Paper
Full-text available
We propose a new approach to perform approximate similarity search in metric spaces. The idea at the basis of this technique is that when two objects are very close one to each other they 'see' the world around them in the same way. Accordingly, we can use a measure of dissimilarity between the view of the world, from the perspective of the two obj...
Article
Full-text available
We propose a new approach to perform approximate similarity search in metric spaces. The idea at the basis of this technique is that when two objects are very close one to each other they 'see' the world around them in the same way. Accordingly, we can use a measure of dissimilarity between the view of the world, from the perspective of the two obj...
Conference Paper
We propose a new approach to perform approximate similarity search in metric spaces. The idea at the basis of this technique is that when two objects are very close one to each other they 'see' the world around them in the same way. Accordingly, we can use a measure of dissimilarity between the view of the world, from the perspective of the two obj...
Article
Full-text available
A new approach for video-stream filtering that makes use of the features representing video content and exploits the properties of metric spaces can help reduce the filtering receiver's computational load.
Conference Paper
MultiMatch is a 30 month targeted research project under the Sixth Framework Programme, supported by the unit for Content, Learning and Cultural Heritage (Digicult) of the Information Society DG. MultiMatch is developing a multimedia/multilingual search engine designed specifically for the access, organization and personalized presentation of cultu...
Conference Paper
We propose a technique for automatic recognition of content in images. Our technique uses machine learning methods to build classifiers which are able to decide about the presence of semantic concepts in images. Our classifiers exploit a representation of images in terms of vectors of visual terms. A visual term represents a set of visually similar...
Conference Paper
We propose a technique for automatic recognition of content in images. Our technique uses machine learning methods to build classifiers able to decide about the presence of semantic concepts in images. Our classifiers exploits a representation of images in terms of vectors of visual terms. A visual term represents a set of visually similar regions...
Conference Paper
In this paper we present an approach for image similarity search that takes inspiration from text retrieval. Images are indexed using visual terms chosen from a visual lexicon. Each visual term represents a typology of visual regions, according to various criteria. The visual lexicon is obtained by analyzing a training set of images, to infer which...
Conference Paper
Full-text available
Cultural heritage content is everywhere on the web, in traditional environments such as libraries, museums, gal- leries and audiovisual archives, but also in popular maga- zines and newspapers, in multiple languages and multiple media. MultiMatch is a 30 month specific targeted research project under the Sixth Framework Programme, supported by the...
Conference Paper
Full-text available
We present a novel technique for processing image sim- ilarity search by using an approach that takes inspiration from text retrieval techniques. In our approach images are indexed by using visual terms taken from a visual lexicon obtained clustering regions of images in the dataset. A weighting and matching schema is defined that allow effec- tive...
Conference Paper
Image similarity is typically evaluated by using low level features such as color histograms, textures, and shapes. Image similarity search algorithms require computing similarity between low level features of the query image and those of the images in the database. Even if state of the art access methods for similarity search reduce the set of fea...
Conference Paper
Full-text available
We present a novel technique for processing image sim- ilarity search by using an approach that takes inspiration from text retrieval techniques. In our approach images are indexed by using visual terms taken from a visual lexicon obtained clustering regions of images in the dataset. A weighting and matching schema is defined that allow effec- tive...
Conference Paper
In this paper we present the MILOS1 Multimedia Content Management System. MILOS supports the storage and content based retrieval of any multimedia documents whose descriptions are provided by using arbitrary metadata models represented in XML. It provides developers of digital library applications with functionalities for dealing with heterogeneous...
Conference Paper
Full-text available
The digital library field is recently broadening its scope of applicability and it is also continuously adapting to the frequent changes occurring in the internet society. Accordingly, digital libraries are slightly moving from a controlled environment accessible only to professionals and domain-experts, to environments accessible to casual users t...
Chapter
The retrieval process in multimedia document systems is inherently different from the retrieval process in traditional (record oriented) database systems. While the latter can be considered an exact process (records either satisfy the query or not), the former is not an exact process and the system must take into account the uncertainty factor (i.e...
Conference Paper
Full-text available
Building new digital library applications requires a developement plat- form that offers standard and powerful building blocks to support application de- velopers. In this paper we discuss our experience of using MILOS, a multimedia content management system oriented to the construction of digital libraries, to build a demanding application dedicat...
Conference Paper
Full-text available
Given the lack of standard building component, in several cases digital library applications are built from scratch using ad-hoc ap-proaches to implement all required components. On the other hand, our claim is that the development of ad-hoc software modules for each new digital library is not convenient. It is necessary to define and design stan-d...
Conference Paper
Full-text available
(Extended abstract) Abstract. This paper describes the architecture of the MILOS Content Management System. MILOS supports the storage and content based retrieval of any XML document, as well as multimedia documents whose descriptions are provided by using heterogenous metadata models represented in XML. MILOS is flexible in the management of docum...
Conference Paper
Full-text available
We study the problem of finding relevant relationships among user defined nodes of XML documents. We define a language that determines the nodes as results of XPath expressions. The expressions are structured in a conjunctive normal form and the relationships among nodes qualifying in different conjuncts are determined as tree twigs of the searched...
Conference Paper
Full-text available
In this paper a technique for evaluating the effectiveness of MPEG-7 image features on specific image data sets is proposed. It is based on well defined statistical characteristics. The aim is to improve the effectiveness of the image retrieval process, based on the similarity computed on these features. This technique is validated with extensive e...
Article
Full-text available
Easy access to large information collections is of great importance in many aspects of everyday life. However, limitations in information and communication technologies have so far prevented the average person from taking much advantage of existing resources. Historical documentaries held by national archives constitute some of the most precious ye...
Conference Paper
Full-text available
Similarity search in metric spaces represents an important paradigm for content-based retrieval of many applications. Existing cen- tralized search structures can speed-up retrieval, but they do not scale up to large volume of data because the response time is linearly increasing with the size of the searched file. The proposed GHT* index is a scal...
Article
While pages on the Web contain more and more multimedia information, such as images, videos and audio, today search engines are mostly based on textual information. There is an emerging need of a new generation of search engines that try to exploit the full multimedia information present on the Web. The approach presented in this paper is based on...
Chapter
Full-text available
Video can be considered today as a primarily mean of communication, due to its richness in informative content and to its appeal. Indeed, the combination of audio and video is an extremely important communication channel: it is considered that approximately 50% of what is seen and heard simultaneously is retained. Due to all these considerations, a...
Article
Full-text available
In order to speedup retrieval in large collections of data, index structures partition the data into subsets so that query requests can be evaluated without examining the entire collection. As the complexity of modern data types grows, metric spaces have become a popular paradigm for similarity retrieval. We propose a new index structure, called D-...
Conference Paper
Similarity join in distance spaces constrained by the metric postulates is the necessary complement of more famous similarity range and the nearest neighbors search primitives. However, the quadratic computational complexity of similarity joins prevents from applications on large data collections. We first study the underlying principles of such jo...
Article
Full-text available
Similarity search structures for metric data typically bound object partitions by ball regions. Since regions can overlap, a relevant issue is to estimate the proximity of regions in order to predict the number of objects in the regions’ intersection. This paper analyzes the problem using a probabilistic approach and provides a solution that effect...
Conference Paper
Full-text available
Similarity retrieval is an important paradigm for searching in environments where exact match has little meaning. Moreover, in or- der to enlarge the set of data types for which the similarity search can efficiently be performed, the notion of mathematical metric space pro- vides a useful abstraction for similarity. In this paper we consider the pr...
Conference Paper
Full-text available
Wide access to large information collections is of great potential importance in many aspects -economic, environmental, health, cultural, social, etc. -of everyday life. Historical video documentaries hold by national audiovisual archives, constitute one of the most precious -from a historical and cultural viewpoint -and less accessible cultural in...