October 2024
·
8 Reads
·
1 Citation
SN Computer Science
This paper presents a statistical model of the opinion column, a discourse genre typical of the press. The model was derived from a relatively small sample (ca. 4000 texts), and it takes into account discourse variables such as text and paragraph length, discourse markers, deixis and modalization. In order to test the accuracy of the model, it was evaluated against a different corpus of mixed column and non-column documents. The idea was to test whether the model is able to identify those texts pertaining to the target genre. Results show that it is indeed accurate, with results ranging from 85 to 77% precision and 40–61% recall, depending on how restrictive the application of the model is.