How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
: The emergence of the CD-ROM as a storage medium for full-text databases raises the question of the maximum size database that can be contained by this medium. As an example, the problem of storing the Tr'esor de la Langue Fran¸caise on a CD-ROM is examined in this paper. The text alone of this database is 700 MB long, more than a CD-ROM can hold....
This paper describes a new approach for dealing with the vocabulary problem in human-computer interaction. Most approaches to retrieving textual materials depend on a lexical match between words in users' requests and those in or assigned to database objects. Because of the tremendous diversity in the words people use to describe the same object, l...
The existence of machine readable text makes possible the development of new techniques that assist the literary scholar in locating interesting passages of text. In this paper we explore in a preliminary manner the possibility of adapting techniques developed in the field of document retrieval to the full text context. As an alternative to the con...
Computer programs that access significant amounts of text usually include code that manipulates the textual objects that comprise it. Such programs include electronic mail readers, typesetters and, in particular, full-text information retrieval systems. Such code is often unsatisfying in that access to textual objects is either efficient, or flexib...
A new method for automatic indexing and retrieval is described. The approach is to take advantage of implicit higher-order structure in the association of terms with documents (“semantic structure”) in order to improve the detection of relevant documents on the basis of terms found in queries. The particular technique used is singular-value decompo...
A novel architecture for full-text information retrieval systems is described. The architecture’s most distinctive feature is a server that is implemented as an interpreter for a lazily evaluated functional programming language. The consequences of this approach for time and space performance are discussed, concentrating especially on the functiona...
In a new method for automatic indexing and retrieval, implicit higher-order structure in the association of terms with documents is modeled to improve estimates of term-document association, and therefore the detection of relevant documents on the basis of terms found in queries. Singular-value decomposition is used to decompose a large term by doc...
An international workshop on Distributed Expert-Based Information Systems (DEBIS) was held at Rutgers University in March 1987. The aims of the workshop were to discuss problems and issues in the design of such systems, and to develop research and implementation strategies for them. The workshop attendees discussed both models and implementations o...
Typescript. Thesis (Ph. D.)--Purdue University, 1984. Includes bibliographical references (leaves 89-91). Microfilm. s
Many years ago I read a paper on a hardware implementation of an information retrieval system. It was implemented as a circuit board, where the query would be set by putting jumpers on one side of the board and the result would be indicated by LEDs or the equivalent on another side of the board. The math behind it was very insightful, and I'd love to find it again, but I've been unable to. The paper was written (probably well) before 1975, perhaps even in the 1950's. I vaguely remember that the primary author's name began with an S but that's as far as I've gotten. (I'm not thinking of Vannevar Bush's Memex.)
Can anyone help?