Morton M. Astrahan’s research while affiliated with IBM and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (32)


System R: A relational data base management system
  • Conference Paper

January 2006

·

35 Reads

·

1 Citation

Lecture Notes in Computer Science

Morton M. Astrahan

·

Donald D. Chamberlin

·

W. Frank King

·

Irving L. Traiger

System R: An Architectural Overview.

January 1999

·

52 Reads

·

36 Citations

IBM Systems Journal

Mike W. Blasgen

·

Morton M. Astrahan

·

Donald D. Chamberlin

·

[...]

·

Robert A. Yost

We have described the architecture of System R, including the Relational Data System and the Research Storage System. The RDS supports a flexible spectrum of binding times, ranging from precompilation of “canned transactions” to on-line execution of ad hoc queries. The advantages of this approach may be summarized as follows: 1. For repetitive transactions, all the work of parsing, name binding, and access path selection is done once at precompilation time and need not be repeated. 2. Ad hoc queries are compiled on line into small machine-language routines that execute more efficiently than an interpreter. 3. Users are given a single language, SQL, for use in ad hoc queries as well as in writing PL/I and COBOL transaction programs. 4. The SQL parser, access path selection routines, and machine language code generator are used in common between query processing and precompilation of transaction programs. 5. When an index used by a transaction program is dropped, a new access path is automatically selected for the transaction without user intervention.


System R: An architectural overview

January 1999

·

33 Reads

·

58 Citations

IBM Systems Journal

We have described the architecture of System R, including the Relational Data System and the Research Storage System. The RDS supports a flexible spectrum of binding times, ranging from precompilmion of “canned transactions” to on-line execution of ad hoc queries. The advantages of this approach may be summarized as follows: 1. For repetitive transactions, all the work of parsing, name binding, and access path selection is done once at precompilation time and need not be repeated. 2. Ad hoc queries are compiled on line into small machine-language routines that execute more efficiently than an interpreter. 3. Users are given a single language, SQL, for use in ad hoc queries as well as in writing PL/I and COBOL transaction programs. 4. The SQL parser, access path selection routines, and machine language code generator are used in common between query processing and precompilation of transaction programs. 5. When an index used by a transaction program is dropped, a new access path is automatically selected for the transaction without user intervention.


Bibliography

July 1996

·

27 Reads

·

1 Citation

, pages 387--406. Prentice-Hall, 1991. [AHU74] A. V. Aho, J. E. Hopcroft, and J. D. Ullman. The Design and Analysis of Computer Algorithms. Addison Wesley, 1974. [AJPO88] Ada Joint Program Office. Common Ada Programming Support Environment (APSE) Interface Set (CAIS), Revision A. Technical Report DoD-STD-1838A, U.S. Department of Defense, 1988. [ASU86] A. V. Aho, R. Sethi, and J. D. Ullmann. Compilers -- Principles, Techniques and Tools. Addison Wesley, 1986. [Bay95] B. Bayard. Konzeption einer objektorientierten Erweiterung der Designsprache und des Werkzeugs OPUS. Master's thesis, University of Dortmund, Dept. of Computer Science, Software Technology, 1995. 241 242 BIBLIOGRAPHY [BCD + 88] P. Borras, D. Cl'ement, T. Despeyroux, J. Incerpi, G. Kahn, B. Lang, and V. Pascual. CENTAUR: The System. ACM SIGSOFT Software Engineering Notes, 13(5):14--24, 1988. Proc. of the ACM SIGSOFT/SIGPLAN Software Engineering


Approximating the number of unique values of an attribute without sorting

December 1987

·

21 Reads

·

43 Citations

Information Systems

Counts of unique values are frequently needed information in database systems. Especially, they are essential in query optimization and physical database design. Traditionally, exact counts were obtained by sorting, which is an expensive operation. In this paper we present three algorithms for counting unique values by probabilistic methods. These algorithms require only one pass over the data, and produce approximations to the true count with certain standard deviations. For deviations acceptable in practical environments (~10%), the algorithms require only modest amounts of memory space and computation time. We have implemented all three algorithms in System R. We also present the results of the experiments on accuracy and performance of these algorithms.


A History and Evaluation of System R.
  • Article
  • Full-text available

October 1981

·

689 Reads

·

226 Citations

Communications of the ACM

System R, an experimental database system, was constructed to demonstrate that the usability advantages of the relational data model can be realized in a system with the complete function and high performance required for everyday production use. This paper describes the three principal phases of the System R project and discusses some of the lessons learned from System R about the design of relational systems and database systems in general.

Download

Support for Repetitive Transactions and Ad Hoc Queries in System R.

March 1981

·

33 Reads

·

90 Citations

ACM Transactions on Database Systems

System R supports a high-level relational user language called SQL which may be used by ad hoc users at terminals or as an embedded data sublanguage in PL/I or COBOL. Host-language programs with embedded SQL statements are processed by the System R precompiler which replaces the SQL statements by calls to a machine-language access module. The precompilation approach removes much of the work of parsing, name binding, and access path selection from the path of a running program, enabling highly efficient support for repetitive transactions. Ad hoc queries are processed by a similar approach of name binding and access path selection which takes place on-line when the query is specified. By providing a flexible spectrum of binding times, System R permits transaction-oriented programs and ad hoc query users to share a database without loss of efficiency. System R is an experimental database management system designed and built by members of the IBM San Jose Research Laboratory as part of a research program on the relational model of data. This paper describes the architecture of System R, and gives some preliminary measurements of system performance in both the ad hoc query and the “canned program” environments.




Access Path Selection in a Relational Database Management System

January 1979

·

134 Reads

·

1,885 Citations

In a high level query and data manipulation language such as SQL, requests are stated non-procedurally, without reference to access paths. This paper describes how System R chooses access paths for both simple (single relation) and complex queries (such as joins), given a user specification of desired data as a boolean expression of predicates. System R is an experimental database management system developed to carry out research on the relational model of data. System R was designed and built by members of the IBM San Jose Research Laboratory.


Citations (26)


... The transaction manager is responsible for insuring that a distributed transaction is either committed or aborted at all sites visited by the transaction. The R* transaction manager uses a nested form of the 2-phase commit protocol [3,6,11] to insure that all sites of a transaction come to the same conclusion as to the outcome of the transaction. Transactions are assigned globally unique transaction identifiers by the transaction manager at the root of the process tree. ...

Reference:

Computation and Communication in R*: A Distributed Database Manager
System R: An Architectural Update
  • Citing Article
  • January 1979

... P.G. Selinger et.al [4] have discussed in their paper that most modern query optimizers determine the best plan for executing a given query by mathematically modeling the execution cost for each of many alternative query evaluation plans and choosing the one with the cheapest estimated cost. The execution cost is largely dependent upon the number of rows that may be processed by each operator in the query evaluation plan in the query. ...

Access Path Selection in a Relational Database
  • Citing Article
  • January 1979

... Although minimal length routings are not always optimal, they are E D FIGURE 3 useful and easy to generate . Indeed, a common routing algorithm (used, for example, in the Highly Available Systems project at IBM (Aghili et al., 1983 ) produces random minimal length routings . Thus, it becomes important to find networks for which all minimal length routings are fault tolerant. ...

A prototype for a highly available database system
  • Citing Article

... A series of experiments was conducted at the San Jose IBM Research Laboratory to evaluate the success of the System R optimizer in choosing among the available access paths for typical SQL statements. The results of these experiments are reported in [6] . For the purpose of the experiments , the optimizer was modified in order to observe its behavior. ...

Evaluation of the system r access path selection mechanism
  • Citing Article

... These estimation errors often stem from the simplifying assumptions that cardinality estimators rely on -such as uniformity of data distribution, independence of predicates, and the principle of inclusion -which frequently fail to represent the intricacies of real-world data [25,27]. Despite decades of research aimed at improving cardinality estimation through better data summaries [6,21,41], sampling techniques [26,33], or machine learning models [20,23,48,24,29,30,36,45,38], the challenge of accurate cardinality estimation remains. ...

Access path selection in a relational database system
  • Citing Article
  • January 1979