About
33 Publications
2,549 Reads
634 Citations
Introduction
Sławek Staworko currently works at University of Lille Nord de France. Sławek does research in Databases and Artificial Intelligence.
Publications (33)
We investigate graph transformations, defined using Datalog-like rules based on acyclic conjunctive two-way regular path queries (acyclic C2RPQs), and we study two fundamental static analysis problems: type checking and equivalence of transformations in the presence of graph schemas. Additionally, we investigate the problem of target schema elicita...
We study the problem of containment for shape expression schemas (ShEx) for RDF graphs. We identify a subclass of ShEx that has a natural graphical representation in the form of shape graphs, whose semantics is captured with a tractable notion of embedding of an RDF graph in a shape graph. When applied to pairs of shape graphs, an embedding is a...
We investigate the problem of aligning two RDF databases, an essential problem in understanding the evolution of ontologies. Our approaches address three fundamental challenges: 1) the use of "blank" (null) names, 2) ontology changes in which different names are used to identify the same entity, and 3) small changes in the data values as well as sm...
We investigate the problem of learning join queries from user examples. The user is presented with a set of candidate tuples and is asked to label them as positive or negative examples, depending on whether or not she would like the tuples as part of the join result. The goal is to quickly infer an arbitrary n-ary join predicate across an arbitrary...
We introduce a class of deterministic sequential top-down tree-to-word transducers (stws) and investigate a number of its fundamental properties and related problems. First, we investigate the problem of normalization of stws: we identify a subclass of earliest stws (estws) that is as expressive as stws and present an effective procedure for conver...
Finding high-level structure in scores is one of the main challenges in music information retrieval. Searching for a formalization enabling variety through fixed musical concepts, we use parametric grammars, an extension of context-free grammars with predicates that take parameters. Parameters are here small patterns of music that will be used wit...
We study the expressiveness and complexity of Shape Expression Schema (ShEx), a novel schema formalism for RDF currently under development by W3C. ShEx assigns types to the nodes of an RDF graph and allows constraining the admissible neighborhoods of nodes of a given type with regular bag expressions (RBEs). We formalize and investigate two alterna...
In this paper, we revisit the view based security framework for XML without imposing any of the previously considered restrictions on the class of queries, the class of DTDs, and the type of annotations used to define the view. First, we study query rewriting with views when the classes used to define queries and views are Regular XPath and MSO. Ne...
We consider unordered XML, where the relative order among siblings is ignored, and we investigate the problem of learning schemas from examples given by the user. We focus on the schema formalisms proposed in [10]: disjunctive multiplicity schemas (DMS) and its restriction, disjunction-free multiplicity schemas (MS). A learning algorithm takes as i...
We consider unordered XML, where the relative order among siblings is ignored, and propose two simple yet practical schema formalisms: disjunctive multiplicity schemas (DMS), and its restriction, disjunction-free multiplicity schemas (MS). We investigate their computational properties and characterize the complexity of the following static analysis...
Deterministic regular expressions are widely used in XML processing. For instance, all regular expressions in DTDs and XML Schemas are required to be deterministic. In this paper we show that determinism of a regular expression e can be tested in linear time. The best known algorithms, based on the Glushkov automaton, require O(σ|e|) time, where σ...
We investigate the problem of learning XML queries, path queries and tree pattern queries, from examples given by the user. A learning algorithm takes as input a set of XML documents with nodes annotated by the user and returns a query that selects the nodes in a manner consistent with the annotation. We study two learning settings that differ...
We study normalization of deterministic sequential top-down tree-to-word transducers (stWs), that capture the class of deterministic top-down nested-word to word transducers. We identify the subclass of earliest stWs (estWs) that yield unique normal forms when minimized. The main result of this paper is an effective normalization procedure for stWs...
We study the problem of update translation for views on XML documents. More precisely, given an XML view definition and a user-defined view update program, find a source update program that translates the view update without side effects on the view. Additionally, we require the translation to be defined on all possible source documents; this c...
Abstract: After recalling the general framework of inductive logic programming, we propose a subfamily of Horn clauses named MQD. Targeting applications in XML document classification, we define a clause language L_fcns that can represent trees and tree patterns. This language provides us with examples...
We study the problem of update propagation across views in the setting where both the view and the source database are XML documents. We consider a simple class of XML views that remove selected parts of the source document. The considered update operations permit inserting and deleting subtrees of the document. We focus on constructing propagations...
We study the equivalence problem of deterministic nested word to word transducers and show it to be surprisingly robust. Modulo polynomial time reductions, it can be identified with 4 equivalence problems for diverse classes of deterministic non-copying order-preserving transducers. In particular, we present polynomial time back and forth reductio...
In this paper, we revisit the view based security framework for XML without imposing any of the previously considered restrictions on the class of queries, the class of DTDs, and the type of annotations used to define the view. First, we show that the full class of Regular XPath queries is closed under query rewriting. Next, we address the problem...
A consistent query answer in an inconsistent database is an answer obtained in every (minimal) repair. The repairs are obtained by resolving all conflicts in all possible ways. Often, however, the user is able to provide a preference on how conflicts should be resolved. We investigate here the framework of preferred consistent query answers, in whi...
The framework of consistent query answers and repairs has been introduced to alleviate the impact of inconsistent data on the answers to a query. A repair is a minimally different consistent instance and an answer is consistent if it is present in every repair. In this article we study the complexity of consistent query answers and repair checking...
We investigate the problem of querying (regular) sets of XML documents represented with tree automata and we consider n-ary tree automata queries whose expressive power captures MSO on trees. Because finite automata can represent infinite sets of documents, we propose the notions of universal and existential query answers, answers tha...
We consider the problem of querying XML documents which are not valid with respect to given DTDs. We propose a framework for measuring the invalidity of XML documents and compactly representing minimal repairing scenarios. Furthermore, we present a validity-sensitive method of querying XML documents, which extracts more information from invalid X...
One of the goals of cleaning an inconsistent database is to remove conflicts between tuples. Typically, the user specifies how the conflicts should be resolved. Sometimes this specification is incomplete, and the cleaned database may still be inconsistent. At the same time, data cleaning is a rather drastic approach to conflict resolution: It remov...
We study here the impact of priorities on conflict resolution in inconsistent relational databases. We extend the framework of repairs and consistent query answers. We propose a set of postulates that an extended framework should satisfy and consider two instantiations of the framework: (locally preferred) l-repairs and (globally preferred) g-repai...
A consistent query answer in a possibly inconsistent database is an answer which is true in every (minimal) repair of the database. We present here a practical framework for computing consistent query answers for large, possibly inconsistent relational databases. We consider relational algebra queries without projection, and denial constraints. Bec...
Integrity constraints express important properties of data, but the task of preserving data consistency is becoming increasingly problematic with new database applications. For example, in the case of integration of several data sources, even if the sources are separately consistent, the integrated data can violate the integrity constraints. The tr...