Michael Steinbrunn’s research while affiliated with University of Passau and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (15)


Optimization and Evaluation of
  • Article

January 2003

·

16 Reads

·

1 Citation

Klaus Peithner

·

Michael Steinbrunn

this paper, we propose a novel technique, called ###### ##########, for evaluating such disjunctive queries. The bypass processing technique is based on new selection and join operators that produce two output streams: the ####-stream with tuples satisfying the selection (join) predicate and the #####-stream with tuples not satisfying the corresponding predicate. Splitting the tuple streams in this way enables us to bypass costly predicates whenever the fate of the corresponding tuple (stream) can be determined without evaluating this predicate. In the paper, we show how to systematically generate bypass evaluation plans utilizing a bottom-up building block approach. We show that our evaluation technique allows to incorporate the standard SQL semantics of null values. For this, we devise two different approaches: One is based on explicitly incorporating three-valued logic into the evaluation plans; the other one relies on two-valued logic by moving all negations to atomic conditions of the selection predicate. We describe how to extend an iterator-based query engine to support bypass evaluation with little extra overhead. This query engine was used to quantitatively evaluate the bypass evaluation plans against the traditional evaluation techniques utilizing a CNF- or DNF-based query predicate


Fig. 1. Evaluation plans for % oI ' g oI†•t oIYoP† ‚ I  ‚ P ††.
Fig. 2. ER-schema of the flight reservation system.
Fig. 4. CNF-based evaluation plan for ªImmigration Airports.º (Relative Cost: Factor 150).
Fig. 7. Visualization of the construction process.
Fig. 8. Preliminary query evaluation plan for aImmigration Airports.o 

+9

Optimization and Evaluation of Disjunctive Queries
  • Article
  • Full-text available

April 2000

·

4,770 Reads

·

37 Citations

IEEE Transactions on Knowledge and Data Engineering

Jens Claussen

·

Alfons Kemper

·

·

[...]

·

Michael Steinbrunn

It is striking that the optimization of disjunctive queries-i.e. those which contain at least one OR-connective in the query predicate-has been vastly neglected in the literature, as well as in commercial systems. In this paper, we propose a novel technique, called bypass processing, for evaluating such disjunctive queries. The bypass processing technique is based on new selection and join operators that produce two output streams: the TRUE-stream with tuples satisfying the selection (join) predicate and the FALSE-stream with tuples not satisfying the corresponding predicate. Splitting the tuple streams in this way enables us to “bypass” costly predicates whenever the “fate” of the corresponding tuple (stream) can be determined without evaluating this predicate. In the paper, we show how to systematically generate bypass evaluation plans utilizing a bottom-up building-block approach. We show that our evaluation technique allows us to incorporate the standard SQL semantics of null values. For this, we devise two different approaches: one is based on explicitly incorporating three-valued logic into the evaluation plans; the other one relies on two-valued logic by “moving” all negations to atomic conditions of the selection predicate. We describe how to extend an iterator-based query engine to support bypass evaluation with little extra overhead. This query engine was used to quantitatively evaluate the bypass evaluation plans against the traditional evaluation techniques utilizing a CNFor DNF-based query predicate

Download

Heuristic and Randomized Optimization for the Join Ordering Problem

December 1999

·

811 Reads

·

380 Citations

The VLDB Journal

Recent developments in database technology, such as deductive database systems, have given rise to the demand for new, cost-effective optimization techniques for join expressions. In this paper many different algorithms that compute approximate solutions for optimizing join orders are studied since traditional dynamic programming techniques are not appropriate for complex problems. First, two possible solution spaces, the space of left-deep and bushy processing trees, respectively, are evaluated from a statistical point of view. The result is that the common limitation to leftdeep processing trees is only advisable for certain join graph types. Basically, optimizers from three classes are analysed: heuristic, randomized and genetic algorithms. Each one is extensively scrutinized with respect to its working principle and its fitness for the desired application. It turns out that randomized and genetic algorithms are well suited for optimizing join expressions. They generate...


Bypassing Joins in Disjunctive Queries

December 1999

·

16 Reads

·

22 Citations

In this paper we develop a novel optimization strategy for disjunctive queries involving join predicates. This work is an extension of our previously published approach [KMPS94] for optimizing disjunctive selection predicates by generating two output streams from selection operators: a "true"-stream for objects (tuples) satisfying the selection predicate and a "false"-stream for those objects not satisfying the predicate. Then, each stream undergoes an individual, "customized" optimization. Here, we extend the basic idea of [KMPS94] to disjunctive queries with join operators. Analogously to selections, we propose to generate two output streams from a join operator: one "true"-stream for pairs of objects (of the two input streams) that satisfy the join predicate and one "false"- stream for those pairs not satisfying it. In combination with the extended selection predicate processing, this provides a large potential for efficiently evaluating disjunctive queries because it allows to "byp...


Figure 27: AB Algorithm ably higher performance for the genuine algorithm, as the experimental results in the cited reference indicate.
Optimizing Join Orders

December 1999

·

610 Reads

·

23 Citations

Recent developments in database technology, such as deductive database systems, have given rise to the demand for new, cost-effective optimization techniques for join expressions. In this paper many different algorithms that compute approximative solutions for optimizing join orders are studied since traditional dynamic programming techniques are not appropriate for complex problems. First, two possible solution spaces, the space of left-deep and bushy processing trees, respectively, are evaluated from a statistical point of view. The result is that the common limitation to left-deep processing trees is, from a purely statistical point of view, only advisable for certain cost models. Basically, optimizers from three classes are analysed: heuristic, randomized and genetic algorithms. Each one is extensively scrutinized with respect to its working principle and its fitness for the desired application. It turns out that randomized and genetic algorithms are well suited for optimizing join expressions. They generate solutions of high quality within a reasonable running time. The benefits of heuristic optimizers, namely the short running time, are often outweighed by merely moderate optimization performance.


Optimizing Boolean Expressions in Object Bases

December 1999

·

124 Reads

·

13 Citations

In this paper we address the problem of optimizing the evaluation of boolean expressions in the context of object-oriented data modelling. We develop a new heuristic for optimizing the evaluation sequence of boolean expressions based on selectivity and cost estimates of the terms constituting the boolean expression. The quality and efficiency of the heuristic is evaluated based on a quantitative analysis which compares our heuristic with the optimal, but infeasible algorithm and other known methods. The heuristic is based on the selectivity and evaluation-cost estimates of the terms of which the boolean expression is composed. Deriving these inputs of the heuristics, i.e., the selectivity and cost estimates, is then addressed. We use an adaptation of well-known sampling for estimating the selectivity of terms. The cost estimation is much more complex than in the relational context due to the possibility of invoking functions within a boolean expression. We develop the decapsulation met...


Bypassing Joins in Disjunctive Queries

December 1999

·

92 Reads

·

11 Citations

The bypass technique, which was formerly restricted to selections only [KMPS94], is extended to join operations. Analogous to the selection case, the join operator may generate two output streams---the join result and its complement---whose subsequent operator sequence is optimized individually. By extending the bypass technique to joins, several problems have to be solved. (1) An algorithm for exhaustive generation of the search space for bypass plans has to be developed. (2) The search space for bypass plans is quite large. Hence, partial exploration strategies still resulting in sufficiently efficient plans have to be developed. (3) Since the complement of a join can be very large, those cases where the complement can be restricted to the complement of the semijoin have to be detected. We attack all three problems. Especially, we present an algorithm generating the optimal bypass plan and one algorithm producing near optimal plans exploring the search space only partially. As soon ...


Optimizing Disjunctive Queries with Expensive Predicates

December 1999

·

25 Reads

·

52 Citations

In this work, we propose and assess a technique called bypass processing for optimizing the evaluation of disjunctive queries with expensive predicates. The technique is particularly useful for optimizing selection predicates that contain terms whose evaluation costs vary tremendously; e.g., the evaluation of a nested subquery or the invocation of a user-defined function in an object-oriented or extended relational model may be orders of magnitude more expensive than an attribute access (and comparison). The idea of bypass processing consists of avoiding the evaluation of such expensive terms whenever the outcome of the entire selection predicate can already be induced by testing other, less expensive terms. In order to validate the viability of bypass evaluation, we extend a previously developed optimizer architecture and incorporate three alternative optimization algorithms for generating bypass processing plans. 1 Introduction During the past few years we have witnessed tremendous...


Table 1 : Costs and selectivities for the conditions in \Tiring Flights"
Optimizing Disjunctive Queries in Object Bases

December 1999

·

25 Reads

·

1 Citation

In this work we propose and assess a novel technique, called bypass processing, for optimizing the evaluation of disjunctive queries in object-oriented database systems. In the object-oriented data model, selection predicates often contain terms whose evaluation cost varies tremendously; e.g., the evaluation cost of a user-defined function may be orders of magnitude more expensive than an attribute access (and comparison). The idea of bypass processing consists of avoiding the evaluation of such expensive terms whenever the outcome of the entire selection predicate can already be deduced by testing other, less expensive terms. In order to validate the viability of bypass evaluation, we extend a previously developed optimizer architecture and incorporate three alternative optimization algorithms for generating bypass processing plans and---for comparison purpose---one algorithm for generating conventional query evaluation plans. The optimizer is assessed with respect to runn...


Heuristic and Randomized Optimization for the

January 1997

·

12 Reads

·

8 Citations

The VLDB Journal

Recent developments in database technology, such as deductive databasesystems, have given rise to the demand for new, cost-effective optimizationtechniques for join expressions. In this paper many different algorithmsthat compute approximate solutions for optimizing join orders are studiedsince traditional dynamic programming techniques are not appropriatefor complex problems. First, two possible solution spaces, the space ofleft-deep and bushy processing trees, respectively, are...


Citations (10)


... Further details regarding the SA algorithm are described in Figure S1. (Steinbrunn et al. 1997). ...

Reference:

Systematic simulation of tumor cell invasion and migration in response to time-varying rotating magnetic field
Heuristic and Randomized Optimization for the
  • Citing Article
  • January 1997

The VLDB Journal

... Rather, we present multiple cost models which represent different real-world scenarios, and show that our proofs of optimality hold for all these cases. As a result, our BestD algorithm does not need to make any sort of independence assumption between predicate atoms, as has been done by much of the past work [15] [10]. In addition, our work does not require that the input predicate to be in CNF or DNF and is able to avoid the exponential blowup of terms that often plague these forms. ...

Optimizing Boolean Expressions in Object-Bases.

... Our algorithm deals with predicates in their disjunctive normal form. A predicate is in its in DNF form if and only if it is a disjunction of one or more conjunctions of one or more literals [11]. Qp will then have the following form: ...

Optimizing Disjunctive Queries with Expensive Predicates.

ACM SIGMOD Record

... Second, it includes operators over sum types which we use to encode the semantics of NULL values. Finally, NRA e was chosen for its validated use for optimization of nested queries [Claußen et al. 1997;Cluet and Moerkotte 1993;Moerkotte 2020] and NULL values [Claußen et al. 2000]. ...

Optimization and Evaluation of Disjunctive Queries

IEEE Transactions on Knowledge and Data Engineering

... Neumann pioneered the work on generating optimal DAG-structured query plans [63,66], while others heuristically share CSEs via materialized views [63,84,95] or common operators [6,17,32]. Recent work further introduced a greedy algorithm with guaranteed approximation factor [42]. Sideways information passing such as semi-join reductions [10], magic sets [8], bypass plans for disjunctive queries [86], or adaptive information passing [38,67] also deal with DAGs, but none of these techniques are integrated with query compilation. Although most ML systems have compiler and runtime support for DAGs, their rewrite systems-such as SystemML's static and dynamic rewrites [12]-also deal with CSEs in a heuristic manner. ...

Bypassing Joins in Disjunctive Queries
  • Citing Article
  • December 1999

... It starts at an initial temperature and cools down at a specified rate. As the temperature gradually decreases, the algorithm leverages probabilistic jumps to explore the solution space randomly, searching for the global optimum of the objective function, so it can probabilistically escape local optima and eventually converge to the global optimum [19]. The optimization of DCS model is a search problem in multidimensional space, where each solution makes a potential probability correction scheme. ...

Heuristic and Randomized Optimization for the Join Ordering Problem

The VLDB Journal

... Bypass. Other than OrderP, one of the other seminal works on optimizing disjunctions is the bypass technique [14]. When faced with a disjunction, the bypass technique stores the records which do not satisfy a predicate atom, and only those records are processed for the remaining predicate atoms. ...

Optimizing Disjunctive Queries with Expensive Predicates
  • Citing Article
  • December 1999

... For UNION we add the union of the results of the pattern groups as if it were a base relation, for OPTIONAL we add the result as a base relation using an outer join. In principle it would be possible to optimize these queries more aggressively, but most interesting optimizations require the usage of bypass plans [53] or other non tree-structured execution plans, which is beyond the scope of this work. And these optimizations would only pay off for complex queries; when the disjunctive elements are simple, our nested translation and optimization scheme produces the optimal solution. ...

Bypassing Joins in Disjunctive Queries