Conference Paper

A feature dependent method for opinion mining and classification

DLSI, Univ. Alicante, Alicante
DOI: 10.1109/NLPKE.2008.4906796 Conference: Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on
Source: IEEE Xplore

ABSTRACT Mining the web for customer opinion on different products is both a useful, as well as challenging task. Previous approaches to customer review classification included document level, sentence and clause level sentiment analysis and feature based opinion summarization. In this paper, we present a feature driven opinion summarization method, where the term ldquodrivenrdquo is employed to describe the concept-to-detail (product class to product-specific characteristics) approach we took. For each product class we first automatically extract general features (characteristics describing any product, such as price, size, design), for each product we then extract specific features (as picture resolution in the case of a digital camera) and feature attributes (adjectives grading the characteristics, as for example high or low for price, small or big for size and modern or faddy for design). Further on, we assign a polarity (positive or negative) to each of the feature attributes using a previously annotated corpus and Support Vector Machines Sequential Minimal Optimization machine learning with the Normalized Google Distance. We show how the method presented is employed to build a feature-driven opinion summarization system that is presently working in English and Spanish. In order to detect the product category, we use a modified system for person names classification. The raw review text is split into sentences and depending on the product class detected, only the phrases containing the specific product features are selected for further processing. The phrases extracted undergo a process of anaphora resolution, Named Entity Recognition and syntactic parsing. Applying syntactic dependency and part of speech patterns, we extract pairs containing the feature and the polarity of the feature attribute the customer associates to the feature in the review. Eventually, we statistically summarize the polarity of the opinions different customers expressed about the product on the -
-
web as percentages of positive and negative opinions about each of the product features. We show the results and improvements over baseline, together with a discussion on the strong and weak points of the method and the directions for future work.

0 Bookmarks
 · 
78 Views
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Opinion mining is an interesting area of research because of its applications in various fields. Collecting opinions of people about products and about social and political events and problems through the Web is becoming increasingly popular every day. The opinions of users are helpful for the public and for stakeholders when making certain decisions. Opinion mining is a way to retrieve information through search engines, Web blogs and social networks. Because of the huge number of reviews in the form of unstructured text, it is impossible to summarize the information manually. Accordingly, efficient computational methods are needed for mining and summarizing the reviews from corpuses and Web documents. This study presents a systematic literature survey regarding the computational techniques, models and algorithms for mining opinion components from unstructured reviews.
    Journal of King Saud University - Computer and Information Sciences. 05/2014;
  • [Show abstract] [Hide abstract]
    ABSTRACT: The explosion of the Web 2.0 platforms, with massive volume of user generated data, has presented many new opportunities as well as challenges for organizations in understanding consumer's behavior to support for business planning process. Feature based sentiment mining has been an emerging area in providing tools for automated opinion discovery and summarization to help business managers with achieving such goals. However, the current feature based sentiment mining systems were only able to provide some forms of sentiments summary with respect to product features, but impossible to provide insight into the decision making process of consumers. In this paper, we will present a relatively new decision support method based on Choquet Integral aggregation function, Shapley value and Interaction Index which is able to address such requirements of business managers. Using a study case of Hotel industry, we will demonstrate how this technique can be applied to effectively model the user's preference of (hotel) features. The presented method has potential to extend the practical capability of sentiment mining area, while, research findings and analysis are useful in helping business managers to define new target customers and to plan more effective marketing strategies.
    Fuzzy-IEEE; 01/2012
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Social media constitutes a major component of Web 2.0 and includes social networks, blogs, forum discussions, micro-blogs, etc. Users of social media generate a huge volume of reviews and comments on daily basis. These reviews and comments reflect the opinions of users about different issues, such as: products, news, entertainments, or sports. Therefore different establishments may need to analyze these reviews and comments. For examples: It is essential for companies to know the pros and cons of their products or services in the eyes of customers. Governments may want In addition to know the attitude of people towards certain decisions, services, etc. Although the manual analysis of textual reviews and comments can be more accurate than the automatic methods, nonetheless, it is time consuming, expensive, and can be In addition subjective. In addition, the huge amount of data contained in social networks can make it impractical to perform analysis manually. This paper focuses on evaluating social content in Arabic language and contexts. Currently, Middle East is an area rich of major political and social reforms. The social media can be a rich source of information to evaluate such contexts. In this research we developed an opinion mining and analysis tool to collect different forms of Arabic language (i.e. Standard or MSA, and colloquial). The tool accepts comments or opinions as input and generates polarity based outputs related to the comments. For example the output can be whether the comment or review is: (subjective or objective), (positive or negative), and (strong or weak). The evaluation of the performance of the developed tool showed that it yields more accurate results when it is applied on domain-based Arabic reviews relative to general-based Arabic reviews.
    International Journal of Advanced Computer Science and Applications 05/2014; 5(5):181-195. · 1.32 Impact Factor