Sarah L. Jacobson’s research while affiliated with Buffalo State University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (5)


Modeling the scholars: Detecting intertextuality through enhanced word-level n-gram matching
  • Article

December 2014

·

91 Reads

·

27 Citations

Digital Scholarship in the Humanities

·

·

Thomas Buck

·

[...]

·

Sarah Jacobson

The study of intertextuality, or how authors make artistic use of other texts in their works, has a long tradition, and has in recent years benefited from a variety of applications of digital methods. This article describes an approach for detecting the sorts of intertexts that literary scholars have found most meaningful, as embodied in the free Tesserae website http://tesserae.caset.buffalo.edu/. Tests of Tesserae Versions 1 and 2 showed that word-level n-gram matching could recall a majority of parallels identified by scholarly commentators in a benchmark set. But these versions lacked precision, so that the meaningful parallels could be found only among long lists of those that were not meaningful. The Version 3 search described here adds a second stage scoring system that sorts the found parallels by a formula accounting for word frequency and phrase density. Testing against a benchmark set of intertexts in Latin epic poetry shows that the scoring system overall succeeds in ranking parallels of greater significance more highly, allowing site users to find meaningful parallels more quickly. Users can also choose to adjust both recall and precision by focusing only on results above given score levels. As a theoretical matter, these tests establish that lemma identity, word frequency, and phrase density are important constituents of what make a phrase parallel a meaningful intertext.


Fig. 1 Tesserae user interface. 
Fig. 2 Tesserae results. 
Table 2 All parallels reported by Tesserae and four commentaries, by type
Fig. 3 Types 4–5 parallels reported by Tesserae and four commentators. Tesserae returned a significant number of matches unremarked by commentators. 
The Tesserae Project: Intertextual analysis of Latin poetry
  • Article
  • Full-text available

June 2013

·

1,111 Reads

·

43 Citations

Literary and Linguistic Computing

Tesserae is a web-based tool for automatically detecting allusions in Latin poetry. Although still in the start-up phase, it already is capable of identifying significant numbers of known allusions, as well as similar numbers of allusions previously unnoticed by scholars. In this article, we use the tool to examine allusions to Vergil’s Aeneid in the first book of Lucan’s Civil War. Approximately 3,000 linguistic parallels returned by the program were compared with a list of known allusions drawn from commentaries. Each was examined individually and graded for its literary significance, in order to benchmark the program’s performance. All allusions from the program and commentaries were then pooled in order to examine broad patterns in Lucan’s allusive techniques which were largely unapproachable without digital methods. Although Lucan draws relatively constantly from Vergil’s generic language in order to maintain the epic idiom, this baseline is punctuated by clusters of pointed allusions, in which Lucan frequently subverts Vergil’s original meaning. These clusters not only attend the most significant characters and events but also play a role in structuring scene transitions. Work is under way to incorporate the ability to match on word meaning, phrase context, as well as metrical and phonological features into future versions of the program.

Download

Intertextuality in the Digital Age

September 2012

·

1,677 Reads

·

20 Citations

Transactions of the American Philological Association (1974-)

This paper describes a new digital approach to intertextual study involving the creation of a free online tool for the automatic detection of parallel phrases. A test comparison of Vergil’s Aeneid and Lucan’s Civil War shows that the tool can identify a substantial number of meaningful intertexts, both previously recorded and unrecorded. Analysis of these results demonstrates how automatic detection can provide more comprehensive and accessible perspectives on intertextuality as an aggregate phenomenon. Identification of the language features necessary to detect intertexts also provides a path toward improved automatic detection and more precise definitions of intertextuality.


Evidence of Intertextuality: Investigating Paul the Deacon's Angustae Vitae

August 2011

·

790 Reads

·

29 Citations

Literary and Linguistic Computing

In this study, we use computational methods to evaluate and quantify philological evidence that an eighth century CE Latin poem by Paul the Deacon was influenced by the works of the classical Roman poet Catullus. We employ a hybrid feature set composed of n-gram frequencies for linguistic structures of three different kinds—words, characters, and metrical quantities. This feature set is evaluated using a one-class support vector machine approach. While all three classes of features prove to have something to say about poetic style, the character-based features prove most reliable in validating and quantifying the subjective judgments of the practicing Latin philologist. Word-based features were most useful as a secondary refining tool, while metrical data were not yet able to improve classification. As these features are developed in ongoing work, they are simultaneously being incorporated into an existing online tool for allusion detection in Latin poetry.


Citations (4)


... These intertextual networks provide traceable evidence for the dissemination and evolutionary trajectory of human ideas. As a manifestation of intertextual relationships, text reuse serves as quantitative evidence for various cultural studies themed on the similarity (Sturgeon, 2018b;Burns et al., 2021), influence (Büchler et al., 2013;Forstall et al., 2014), and evolution (Hartberg and Wilson, 2017;Duan et al., 2023) of literary works. The feasibility of this text analysis approach has been validated across different languages, including Latin (Coffee et al., 2012b), French (Ganascia et al., 2014), English (Smith et al., 2013), and ancient Chinese (Sturgeon, 2018a). ...

Reference:

Evol project: a comprehensive online platform for quantitative analysis of ancient literature
Modeling the scholars: Detecting intertextuality through enhanced word-level n-gram matching
  • Citing Article
  • December 2014

Digital Scholarship in the Humanities

... 11 https://www.etrap.eu/research/tracer/. See alsoBüchler et al. (2014) andFranzini et al. (2019). 12 https://github.com/tesserae/tesserae/. See alsoCoffee et al. (2013). 13 https://github.com/dasmiq/passim/. ...

The Tesserae Project: Intertextual analysis of Latin poetry

Literary and Linguistic Computing

... As a manifestation of intertextual relationships, text reuse serves as quantitative evidence for various cultural studies themed on the similarity (Sturgeon, 2018b;Burns et al., 2021), influence (Büchler et al., 2013;Forstall et al., 2014), and evolution (Hartberg and Wilson, 2017;Duan et al., 2023) of literary works. The feasibility of this text analysis approach has been validated across different languages, including Latin (Coffee et al., 2012b), French (Ganascia et al., 2014), English (Smith et al., 2013), and ancient Chinese (Sturgeon, 2018a). The Evol platform employed the text reuse technique to effectively quantify and visually represent the instances of text reuse, thereby facilitating the identification and exploration of potential cultural phenomena. ...

Intertextuality in the Digital Age

Transactions of the American Philological Association (1974-)

... This indicates the increasingly close relationship between linguistics and humanities research, as well as the growing application of linguistic tools and theories to research on literature (e.g. Noe 1988;Hoover 2001;Forstall, Jacobson, and Scheirer 2011;Zheng and Jin 2022) and text analysis (e.g. Holmes and Forsyth 1995;Koppel, Argamon, and Shimoni 2002;Gorman 2019). ...

Evidence of Intertextuality: Investigating Paul the Deacon's Angustae Vitae

Literary and Linguistic Computing