Syd Bauman’s research while affiliated with Brown University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (12)


Future Developments for TEI ODD
  • Article

July 2012

·

37 Reads

·

·

·

[...]

·

Piotr Banski

Interchange vs. Interoperability

January 2011

·

92 Reads

·

27 Citations

This paper explores some of the ramifications of the inherent tension between the desire to analyze our own data and to share it with others — between our desire to be expressive and our desire to be conformant — particularly with respect to interchangeability vs. interoperability , positing that focusing on interoperability is to focus on sharing at the expense of analysis.


TEI HORSEing Around.

January 2005

·

20 Reads

·

4 Citations

The Text Encoding Initiative's typed segment-boundary delimiter method is only one of several proposed mechanisms for handling overlap in TEI documents. HORSE (aka CLIX) defines a method by which an XML element is used normally when possible and as an improved version of the typed segment-boundary delimiter method when an overlap problem is encountered. A significant portion of the rules necessary for validation of HORSE markup can be expressed using Schematron. This, combined with an utter hack that can "HORSEify" the declaration of elements in a TEI Relax NG grammar, can provide a potential significant step forward in handling overlap in TEI documents.


Towards an international standard on feature structure representation (2)
  • Conference Paper
  • Full-text available

May 2004

·

99 Reads

·

10 Citations

This paper describes the preliminary results of a joint initiative of the TEI (Text Encoding Initiative) Consortium and the ISO Committee TC 37SC 4 (Language Resource management) to provide a standard for the representation and interchange of feature structures. The paper published in the proceedings of this workshop is in fact an extension of a paper published in the LREC 2004 proceedings, and about 50% are identical with it.

Download


TEI and the Encoding of the Physical Structure of Books

April 1999

·

12 Reads

·

2 Citations

Computers and the Humanities

The TEI Guidelines provide little detail on how to encode a text within the physical structures of the book in which it is contained. This paper describes the physical structures of an early printed book and presents two methods for encoding a text within that structure through use of the TEI elements and .


Names Proper and Improper: Applying the TEI to the Classification of Proper Nouns

July 1997

·

19 Reads

·

6 Citations

Computers and the Humanities

This paper discusses the encoding of proper names using the TEI Guidelines, describing the practice of the Women Writers Project at Brown University, and the CELT Project at University College, Cork. We argue that such encoding may be necessary to enable historical and literary research, and that the specific approach taken will depend on the needs of the project and the audience to be served. Because the TEI Guidelines provide a fairly flexible system for the encoding of proper names, we conclude that projects may need to collaborate to determine more specific constraints, to ensure consistency of approach and compatibility of data.


Migrating Language Resources from SGML to XML: the Text Encoding Initiative Recommendations

75 Reads

·

1 Citation

The largest effort in the area of standardisation of computer encoding of language resources has been the Text Encoding Initiative (TEI), established in 1987. TEI chose as its underlying standard SGML (Standard Generalized Markup Language), and in the years before the inception of XML, a number of projects encoded their data according to some SGML DTD, TEI compliant, or otherwise. These projects could now benefit from migrating their data to XML. Apart from validation, the most compelling reason for migration is the scarcity of SGML-aware software and the abundance of XML-based tools and related recommendations. However, despite the fact that XML is a subset of SGML, migration is not a trivial process, especially in the case of large holdings of legacy language resources. This is why in 2002 the TEI Consortium established a Task Force on SGML to XML migration. The TF has now produced a number of reports that simplify and make explicit the conversion of SGML TEI (version P3) to XML TEI (version P4) documents. The reports are also relevant for a general audience of SGML users that are considering migrating their language resources to XML. This paper presents the recommendations made by the TF, concentrating on strategic considerations, the practical guide, and one case study, the conversion of the British National Corpus.


Freedom to Constrain

9 Reads

·

2 Citations

Where should attribute constraints live? In an external schema? In the document’s own metadata? In a separate file? Several possibilities are examined, raising lots of questions and offering a few answers.



Citations (6)


... It implemented a tool called the Wayback Machine that makes it possible to go back in time to consult URLs in the way in which they were presented earlier. 23 . What is displayed by the Wayback Machine, though, is only the interface, that is, what is displayed on one specific URL at one specific point in time -like a screenshot from the past. ...

Reference:

From Handwriting to Footprinting: Text and Heritage in the Age of Climate Crisis
Interchange vs. Interoperability
  • Citing Conference Paper
  • January 2011

... The above-mentioned ODD ("One Document Does it all") files are the TEI's recipe for what Bauman (2008) calls "literate encoding", by reference to the literate programming paradigm (Knuth, 1984): TEI schemas are defined in TEI documents, with the typical TEI header and a standard text body with the addition of special elements that provide instructions for constructing schemes out of the content models and attribute classes offered by the TEI, cf. (Burnard and Rahtz, 2004). ...

Freedom to Constrain
  • Citing Conference Paper

... But original histori-cal documents were not constrained to be "well-formed" when written [6]. Upon conversion to a CFG-conformant markup language, features like quotations overlap with other markup divisions such as lines and verses [8,9]. And different layers of encoding such as formatting structures and text-to-image links overlap so much that the same text must be encoded twice: once for each layer of encoding [10]. ...

TEI HORSEing Around.
  • Citing Conference Paper
  • January 2005

... We also have reason to believe that historians, for example , could find this facility useful, insofar as these fictional narratives also contain descriptions of real locations, characterizations of contemporary public figures, and so on. Flanders et al. (1998, p. 285) argue that references to people in historical sources are of intrinsic interest since they may reveal " networks of friendship, enmity, and collaboration ; familial relationships; and political alliances [. . . ] class position, intellectual affiliations, and literary bent of the author " . ...

Names Proper and Improper: Applying the TEI to the Classification of Proper Nouns
  • Citing Article
  • July 1997

Computers and the Humanities

... Il s'agit en effet, d'une part, de transcrire des contenus dans l'univers numérique pour les faire bénéficier des outils d'analyse, de recherche et de diffusion actuels et, d'autre part, de rendre compte de la forme tant typographique qu'éditoriale du document original, nécessaire pour l'étude de ces ouvrages. Une réflexion menée dès 1987 au sein d'une communauté internationale d'acteurs de l'édition électronique et de l'ingénierie linguistique et textuelle, rejoints depuis par ceux des humanités numériques, a produit à partir de 1994 un schéma d'encodage et de description des textes qui s'appuie sur un langage de balisage des contenus pour rendre compte tant de la forme typographique et éditoriale du document originel que de la signification de ses contenus : la Text Encoding Initiative (TEI,Bauman et Catapano, 1999). Le langage de balisage XML, qui a remplacé en 2003 le langage SGML dans la dernière version de la TEI, présente divers avantages, en plus de sa capacité à étiqueter des contenus dans une structure conforme aux recommandations de la TEI. ...

TEI and the Encoding of the Physical Structure of Books
  • Citing Article
  • April 1999

Computers and the Humanities