Shapan Das Uzzal’s scientific contributions

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (3)


Sentiment Analysis of Bengali Comments With Word2Vec and Sentiment Information of Words
  • Conference Paper

April 2017

·

936 Reads

·

101 Citations

Md Al-Amin

·

·

Shapan Das Uzzal

The vector representation of Bengali words using word2vec model (Mikolov et al. (2013)) plays an important role in Bengali sentiment classification. It is observed that the words that are from same context stay closer in the vector space of word2vec model and they are more similar than other words. In this article, a new approach of sentiment classification of Bengali comments with word2vec and Sentiment extraction of words are presented. Combining the results of word2vec word co-occurrence score with the sentiment polarity score of the words, the accuracy obtained is 75.5%.


Fig. 1. Parts of speech ratio model. 
Fig. 2. Cosine similarity using TF-IDF. 
Fig. 3. Cosine similarity using custom TF-IDF.
Fig. 4. Naïve Bayes model using Uni-gram & stammer. 
Fig. 5. Naïve Bayes model using Bi-gram, stammer & normalizer. 
A Comprehensive Study on Sentiment of Bengali Text
  • Conference Paper
  • Full-text available

April 2017

·

303 Reads

·

8 Citations

Sentiment Analysis is one of the most important and challenging research topic in the field of natural language processing and opinion mining. In this article, six different approaches are discussed to determine the actual sentiment of the sentence and analyzed their performances. In parts of speech ratio method, the Parts of Speech (POS) of the queries are tagged and the POS ratio and the hamming distance between positive classifier and query and negative classifier and query are computed. To detect the sentiment more accurately, cosine similarity using TF-IDF is applied which is calculated by computing TF, DF and IDF and calculate positive vector, negative vector and query vector. In Cosine similarity using custom TF-IDF, custom POS tagger is used and TF, DF and IDF are computed. Another method with Naïve Bayes model using Uni-gram & stammer also gives good performance. In this approach, prior probability and conditional probability are calculated and the root words of the words are extracted. Naïve Bayes model using Bi-gram, stammer and normalizer is better than the other models. The last method discussed is Word Embedding with Hellinger PCA which presents the idea of word co-occurrence matrix and Skip-Gram to determine the actual contexts of the words, Hellinger PCA to determine most similar words and generate a sliding window of most probable context words around each word.

Download

Word Embedding with Hellinger PCA to Detect the Sentiment of Bengali Text

December 2016

·

146 Reads

·

19 Citations

The sentiment of a sentence or a comment can be detected more accurately by applying Word Embeddings. This article presents the idea of word co-occurrence matrix and Skip-Gram to determine the actual contexts of the words, Hellinger PCA to determine the most similar words and generate a sliding window of most probable context words around each word. It is shown that, by applying Word Embeddings to classify the sentiment of a comment achieves higher accuracy with larger corpus. For our corpus of 2500 comments, the accuracy achieved is 70%, which is rapidly increasing with the size of the corpus.

Citations (3)


... Many works have been done on Bengali text sentiment detection. Al-Amin et al. [16] A study found that the language of comments was Bengali, using Word2Vec. The approach of this study depends on a large amount of data, which helps to make incremental improvements in accuracy. ...

Reference:

Leveraging Multi-Modal Ensemble Approach for Sentiment and Emotion Classification in Bengali Text
Sentiment Analysis of Bengali Comments With Word2Vec and Sentiment Information of Words
  • Citing Conference Paper
  • April 2017

... SA assists data interpreters with complex attempts to estimate public evaluation, counsellor label and merchandise credit, and assume customer activities. In joining, data analytics firms often integrate third-party SA APIs into their customer activity administration, social media observation, or human resources analytics policies, to pass valuable insights to their clients [1]. Nowadays people use social media, blog sites, news sites or others to express as well as share their statements or opinions in regular life. ...

A Comprehensive Study on Sentiment of Bengali Text

... Liu et al. [3] presented a rank model for the multimodal fusion concept to reduce the cost of tensor-based techniques. Hellinger [8] used word embedding and PCA to identify the opinion of Bengali co mments. Similar terms appear more frequently in the same context of Word2vec [9]. ...

Word Embedding with Hellinger PCA to Detect the Sentiment of Bengali Text
  • Citing Conference Paper
  • December 2016