Yves Marcoux

Université de Montréal | UdeM · EBSI

Ph.D. Informatique

36
Publications
1,173
Reads
98
Citations
Citations since 2017
0 Research Items
8 Citations

Publications (36)
Article
If the information conveyed by the markup in a document can be identified with the set of inferences we can draw from that markup, as has been proposed in earlier work, then the sets of inferences licensed by documents form an infinitely large lattice, by means of which the relative information content of any two documents (equivalence, subsumption...
Article
Full-text available
After a brief overview of the evolution of interface design and its focus on the user experience, we present the creation process as cycles of ideations-tests illustrated by an analogy with software engineering (cycle theory test). This analogy will rule out the overestimated value of usability testing that relies solely on tests to establish a corre...
Article
Full-text available
The social mission of information professionals is to provide society with high quality information storage and retrieval services. In order to fulfill this mission, the professionals need to have an understanding of the tools they use that is sufficiently thorough for predicting the behavior of these tools in all normal circumstances. One important c...
Article
Structured text retrieval is the possibility of using the structure of documents (e.g., SGML tags) in search criteria, in addition to full-text operations (adjacency, etc.). Intuitively, using the structure can improve the performance of the retrieval, but for many reasons, it is difficult to compare structured and unstructured retrieval by real ex...
Article
We propose an evolution path for the traditional "PARC" interface found today on most PCs. The proposal, which we call the "post-PARC interface", draws not only on concepts developed at major research laboratories like PARC and HCIL, but also on the work of independent researchers. It is based on two novel concepts: the representation of documents...
Conference Paper
The MLCD Overlap Corpus (MOC) is a collection of samples of texts and text fragments with overlapping structures. The main immediate goal of the MOC project is to build a corpus of well understood and well documented examples of overlap, discontinuity, alternate ordering, and related phenomena in various notations, for use in the investigation of m...
Article
Full-text available
Intertextual semantics (IS; Marcoux & Rizkallah, 2009) is one of several semiotic approaches for the design of information objects. It uses natural language - shared among designer, developer, and users - as an explicitation tool for developing an agreed-upon, shared meaning for an artifact. So far, it has been applied constructively to the design...
Conference Paper
The type/token distinction introduced by C. S. Peirce and taken up by many others is familiar when applied to individual symbols or characters in a writing system, and also when applied at a higher level to words (and word-like objects). Some writers apply the distinction not only at some basic or foundational level but also as a description of hig...
Article
In most discussions about information and knowledge management, natural language is described as too fuzzy, ambiguous, and changing to serve as a basis for the development of large-scale tools and systems. Instead, artificial formal languages are developed and used to represent, hopefully in an unambiguous and precise way, the information or knowle...
Article
Full-text available
Intertextual semantics (IS) [1] [4] is a framework in which the meaning of marked-up documents is given in natural language. While formal semantics aims at conveying the meaning of documents to machines, IS aims at conveying it to humans. In the current framework, the IS of a model (DTD, schema) is expressed by peritexts associated with each elemen...
Chapter
Excerpt from the first chapter of the book « Introduction aux sciences de l'information », on the evolution of the professions from a comparative France-North America perspective.
Article
Full-text available
The accessibility of research corpora in the public space is a necessity for the advancement of knowledge. This allows not only for the sharing of original texts for analysis, but also for the application of different analytical approaches and viewpoints. Many sociological and judicial considerations may explain why researchers resist giving access...
Conference Paper
We establish a necessary and sufficient condition for a graph to correspond to the structure of an overlapping markup document, such as a well-formed TexMECS document (not using interrupted or virtual elements). This provides a test for determining if any given graph can be serialized into a TexMECS document—or any other similar language—using only...
Conference Paper
In an information system, if data-entry agents (authors) do not understand properly the intentions of the designer of the structure they are populating (the modeler), they can make errors, resulting in incomplete or incorrect information entering the system. Thus, modeler-author communication is of prime importance in the performance of a system. I...
Conference Paper
Full-text available
The presence of XML in many recent hypermedia management tools and methods (W3I3, SMIL, etc.) shows better than ever that both structural and textual criteria will continue to play a fundamental role in content-based multimedia retrieval and management. Extracting precise and fine-grained information from structured documents requires powerful patt...
Article
Full-text available
Conference Paper
A high performance information filtering system has three main requirements: it must be effective in supplying users with useful information, it must do so in a timely fashion, and it must be able to handle a large throughput of information and a ...
Article
Full-text available
We present a formal data model and query language for hierarchically structured text and acyclic hypertext. The data model is related to SGML (Standard Generalized Markup Language, ISO 8879) and the retrieval mechanism is based on pattern-matching using context-free grammars (CFGs) with generalized regular expressions on the right-hand sides of pro...
Article
In order to ensure wider and faster dissemination of the annual statistics on public libraries and regional public library service centres, the ministère de la Culture et des Communications du Québec agreed with the École de bibliothéconomie et des sciences de l'information (EBSI) to make avail...
Article
The creation of electronic information in the form of structured documents is steadily gaining popularity. It is thus necessary to develop information retrieval tools fitted to this type of document. In this article, we present the results of a research project aimed at identifying human-computer interface elements that can support information retr...
Article
Full-text available
We establish a polynomial upper bound on the time complexity of an s-1-1 function in programming systems with a linear-time composition function. This improves the doubly exponential upper bound of [M. Machtey and P. Young, An introduction to the general theory of algorithms. (1978; Zbl 0376.68027)], the only previously known upper bound, and inval...
Conference Paper
The author establishes a polynomial upper bound on the time complexity of s-1-1 in programming systems with a linear time instance of composition. He also exhibits a family of acceptable such programming systems for which the upper bound is optimal. He deduces several bounds on the time complexity of composition, s-1-1, and various classes of con...
Article
Full-text available
This article proposes a format for the exchange of textual corpora among text analysis systems. The proposed format follows the recommendations of the Text Encoding Initiative. It has been the object of an agreement in principle reached by the members of the ATONET network of researchers in August 2005. The basic proposal has already been used to d...
Article
Full-text available
Writing valid XML can be viewed as a collaborative process in which (roughly speaking) the modeler supplies the structure (markup), and the author the contents. When an information management chain includes document creation by a human, failure to mobilize and properly support the author in his task may result in errors or loss of valuable informat...
Article
Full-text available
At Extreme 2006 (Marcoux 2006), we introduced an approach to structured-document modeling based on natural language, with a strong preoccupation for the facilitation of modeler-author communication. The goal was to develop a modeling framework which would (1) facilitate the communication of the "semantic intentions of the modeler" to (human) author...
Article
Full-text available
Why would a "library school" take an interest in a technology like SGML? The reason is that, even though the essence of the School's fundamental concern has not changed since its founding in 1960, the concrete manifestations of that concern have been profoundly modified and adapted to technological developments, especially since 19...
Article
Full-text available
In the tradition of computer-assisted text analysis, annotation and categorization are among the operations that make it possible to enrich textual data as the analysis proceeds, informed by statistical tools and various modes of comparative reading. In general, however, the units thus enriched are occurrenc...
Conference Paper
The problem of overlapping structures has long been familiar to the structured document community. In a poem, for example, the verse and line structures overlap, and having them both available simultaneously is convenient, and sometimes necessary (for example for automatic analyses). However, only structures that embed nicely can be represented dir...
Conference Paper
When marking up a document we chop it up into elements. Elements are parts of the document, some of which contain further elements, i.e., have parts of their own. Thus, the part-whole relation is central to the way markup works. Mereology is precisely the theory of part-whole relationships, but has not yet found much application in markup theory. I...
Conference Paper
In [Sperberg-McQueen et al. 2000a], Sperberg-McQueen et al. describe a framework in which the semantics of a structured document is represented by the set of inferences (statements) licensed by the document, that is, statements which can be considered to hold on the basis of the document. The authors suggest that an adequate set of basic inferenc...
Article
Proceedings of the conference "Le numérique : impact sur le cycle de vie du document", organized at the Université de Montréal by EBSI and ENSSIB, October 13 to 15, 2004. Its objective was to address the following question in an interdisciplinary manner: "Digitization, the dissemination of original digital formats, new indexing methods and...
Article
Full-text available
Presentation at the conference "Le numérique : impact sur le cycle de vie du document", organized at the Université de Montréal by EBSI and ENSSIB, October 13 to 15, 2004. Good information management is one of the conditions for organizational performance. The traditional tools developed for paper documentation are now inadequate...

Projects (7)
Project
A group of QDA software developers working together on an interoperability standard that enables users to migrate their research projects from one software package to another and to archive their whole project. For more information, see http://www.qdasoftware.org