Conference Paper

Sentence and Expression Level Annotation of Opinions in User-Generated Discourse

Conference: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics

ABSTRACT: In this paper, we introduce a corpus of consumer reviews from the rateitall and the eopinions websites annotated with opinion-related information. We present a two-level annotation scheme. In the first stage, the reviews are analyzed at the sentence level to determine (i) whether a sentence is relevant to a given topic, and (ii) whether it expresses an evaluation of that topic. In the second stage, on-topic sentences containing evaluations of the topic are further analyzed at the expression level to pinpoint the properties of the evaluations (semantic orientation, intensity) and their functional components (opinion terms, targets and holders). We discuss the annotation scheme, the inter-annotator agreement for the different subtasks, and our observations.
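
The two-level scheme described in the abstract can be pictured as a pair of nested records. The following is only a minimal sketch under stated assumptions: the class names, field names, label values ("positive", "strong") and the example sentences are hypothetical and are not taken from the paper or its annotation guidelines. It only illustrates that expression-level annotation applies to sentences that are both on-topic and evaluative.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class OpinionExpression:
    """Second stage: expression-level annotation (hypothetical field names)."""
    opinion_term: str              # the evaluative word or phrase
    target: Optional[str] = None   # what the opinion is about
    holder: Optional[str] = None   # who holds the opinion
    orientation: str = "neutral"   # semantic orientation; label set is assumed
    intensity: str = "average"     # intensity; label set is assumed


@dataclass
class SentenceAnnotation:
    """First stage: sentence-level annotation (hypothetical field names)."""
    text: str
    on_topic: bool                 # (i) relevant to the given topic
    evaluative: bool               # (ii) expresses an evaluation of the topic
    expressions: List[OpinionExpression] = field(default_factory=list)

    def needs_expression_level(self) -> bool:
        # Only on-topic sentences that contain an evaluation go to stage two.
        return self.on_topic and self.evaluative


def second_stage_candidates(sentences: List[SentenceAnnotation]) -> List[SentenceAnnotation]:
    """Return the sentences that qualify for expression-level annotation."""
    return [s for s in sentences if s.needs_expression_level()]


# Invented example review; not data from the corpus.
review = [
    SentenceAnnotation("The battery life is great.", on_topic=True, evaluative=True),
    SentenceAnnotation("I bought it last week.", on_topic=True, evaluative=False),
    SentenceAnnotation("My old phone was terrible.", on_topic=False, evaluative=True),
]

candidates = second_stage_candidates(review)
candidates[0].expressions.append(
    OpinionExpression(
        opinion_term="great",
        target="battery life",
        holder="author",
        orientation="positive",
        intensity="strong",
    )
)
```

Keeping the sentence-level flags separate from the expression list mirrors the staged workflow: the first pass can be completed and checked for agreement before any expression-level work begins.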


Available from: Niklas Jakob, May 28, 2015
  • ABSTRACT: Opinion mining is an interesting area of research because of its applications in various fields. Collecting opinions of people about products and about social and political events and problems through the Web is becoming increasingly popular every day. The opinions of users are helpful for the public and for stakeholders when making certain decisions. Opinion mining is a way to retrieve information through search engines, Web blogs and social networks. Because of the huge number of reviews in the form of unstructured text, it is impossible to summarize the information manually. Accordingly, efficient computational methods are needed for mining and summarizing the reviews from corpora and Web documents. This study presents a systematic literature survey regarding the computational techniques, models and algorithms for mining opinion components from unstructured reviews.
    05/2014; 26(3). DOI:10.1016/j.jksuci.2014.03.009
  • ABSTRACT: We present a fine-grained scheme for the annotation of polar sentiment in text that accounts for explicit sentiment (so-called private states), as well as implicit expressions of sentiment (polar facts). Polar expressions are annotated below sentence level and classified according to their subjectivity status. Additionally, they are linked to one or more targets with a specific polar orientation and intensity. Other components of the annotation scheme include source attribution and the identification and classification of expressions that modify polarity. In previous research, little attention has been given to implicit sentiment, which represents a substantial amount of the polar expressions encountered in our data. An English and Dutch corpus of financial newswire text, consisting of over 45,000 words each, was annotated using our scheme. A subset of this corpus was used to conduct an inter-annotator agreement study, which demonstrated that the proposed scheme can be used to reliably annotate explicit and implicit sentiment in real-world textual data, making the created corpora a useful resource for sentiment analysis.
    Language Resources and Evaluation 01/2015; DOI:10.1007/s10579-015-9297-4 · 0.52 Impact Factor
  • ABSTRACT: Writing comments on products or news has become a popular activity in social media. The amount of opinionated text available online has been growing rapidly, increasing the need for techniques that can analyze opinions expressed in such text so that reviews can be easily absorbed by users. To date, most techniques depend on annotated corpora. However, almost all existing corpora are sentence-level works that ignore important global sentiment information in other sentences. Given the rise of advanced applications, more fine-grained corpora are needed, even at the sentence level. The authors aim to create a fine-grained corpus for Chinese sentiment analysis, and more importantly, explore new sentiment analysis tasks by analyzing the annotated corpus. The proposed fine-grained annotation scheme not only introduces cross-sentence and global sentiment information (such as "target entity") but also includes new sentence-level elements (such as "implicit aspect"). Based on this scheme, this corpus can provide a more fine-grained platform for researchers to study algorithms for advanced applications. In addition, an in-depth analysis of the annotated corpus is made and several important but ignored tasks, such as the target-aspect pair extraction task, are explored, which can give useful hints about future directions.
    Intelligent Systems, IEEE 01/2015; 30(1):36-43. DOI:10.1109/MIS.2014.33 · 1.92 Impact Factor