scholarly journals Product Review Mining for Opinion Using Ensemble Classifier

Author(s):  
G. Vinodhini ◽  
RM. Chandrasekaran

Online product reviews is considered as a major informative resource which is useful for both customers and manufacturers. The online reviews are unstructured-free-texts in natural language form. The task of manually scanning through huge volume of review is very tedious and time consuming. Therefore it is needed to automatically process the online reviews and provide the necessary information in a suitable form. In this paper, we dedicate our work to the task of classifying the reviews based on the opinion, i.e. positive or negative opinion. This paper mainly addresses using ensemble approach of Support Vector Machine (SVM) for opinion mining. Ensemble classifier was examined for feature based product review dataset for three different products. We showed that proposed ensemble of Support Vector Machine is superior to individual baseline approach for opinion mining in terms of error rate and Receiver operating characteristics Curve.   Key words: Opinion, Classification, Machine Learning.

2020 ◽  
Vol 8 (6) ◽  
pp. 2862-2867

E-commerce is a website or mobile application platform that help people to buy products. Before purchasing the product, customer will decide to buy it or not by reading the review from previous buyer. There is a problem that there are a lot of review so it will take a long time for customer to read it all. This research will be using sentiment analysis method to classify the review data. Sentiment analysis or opinion mining is a machine learning approach to classify and analyse texts or documents about human’s sentiments, emotions, and opinions. In this research, sentiment analysis was used to classify product reviews from e-commerce websites into positive or negative classes. The results could be processed further and be used to summarize customers' opinions about a certain product without reading every single review. The goal of this research is to optimize classification performance by using feature selection technique. Terms Frequency-Inverse Document Frequency (TF-IDF) feature extraction, Backward Elimination feature selection, and five different classifiers (Naïve Bayes, Support Vector Machine, K-Nearest Neighbour, Decision Tree, Random Forest) were used in analysing the sentiment of the reviews. In this research, the dataset used are Indonesian language and classified into two classes(positive and negative). The best accuracy is achieved by using TF-IDF, Backward Elimination and Support Vector Machine (SVM) with a score of 85.97%, which increases by 7.91% if compared to the process without feature selection. Based on the results, Backward Elimination feature selection succeeded in improving all performance for all classifiers used in this research.


2020 ◽  
Vol 11 (1) ◽  
pp. 49-57
Author(s):  
Soumadip Ghosh ◽  
Arnab Hazra ◽  
Abhishek Raj

Sentiment analysis denotes the analysis of emotions and opinions from text. The authors also refer to sentiment analysis as opinion mining. It finds and justifies the sentiment of the person with respect to a given source of content. Social media contain vast amounts of the sentiment data in the form of product reviews, tweets, blogs, and updates on the statuses, posts, etc. Sentiment analysis of this largely generated data is very useful to express the opinion of the mass in terms of product reviews. This work is proposing a highly accurate model of sentiment analysis for reviews of products, movies, and restaurants from Amazon, IMDB, and Yelp, respectively. With the help of classifiers such as logistic regression, support vector machine, and decision tree, the authors can classify these reviews as positive or negative with higher accuracy values.


2011 ◽  
Vol 219-220 ◽  
pp. 1513-1517
Author(s):  
Rui Liu ◽  
Yi An ◽  
Lang Song

Automatic opinion mining and summarization from online reviews are very useful for customers and merchants. This paper proposes a method to extract opinions from Chinese product reviews. Firstly, reviews are pre-processed and the sentiment features are extracted based on a sentiment lexicon. Then, it finds out the matching target attribute using the extracted sentiment features base on the using co-occurrence knowledge of topic feature and sentiment feature. After the opinions were found, it generates the summary for products according to the most common opinions.


This substantial issue is increasingly important in business and culture. It presents many challenging research scenarios but guarantees a relevant insight for everybody interested in view evaluation and social networking analysis. This paper's key aim is to detect sentiment polarity such as positive, negative, and emoji representation with customer feedback on various products. Opinion mining from e-commerce sites has a significant part in making purchase decisions and founders to boost their product and marketing strategies. But, it becomes very difficult for the clients to understand and assess the product's actual view manually. Because of this, we need an automated way. The majority of the researchers used machine learning algorithms to do an automated representation of phrase embedding. Among the popular techniques in machine learning has been used the support vector machine (SVM). The weighted support vector machine (WSVM) is the improved version for the standard SVM to grow the outlier sensitivity issue. In this paper, the word2Vec version uses to extract the attributes from customer reviews in WSVM based on opinion analysis of product reviews in E-commerce websites. The experiment result shows that the suggested WSVM can works better on the opinion classification job doing any version applied.


Author(s):  
Enakshi Jana ◽  
V. Uma

With the immense increase of the number of users of the internet and simultaneously the massive expansion of the e-commerce platform, millions of products are sold online. To improve user experience and satisfaction, online shopping platform enables every user to give their reviews for each and every product that they buy online. Reviews are long and contain only a few sentences which are related to a particular feature of that product. It becomes very difficult for the user to understand other customer views about different features of the product. So, we need accurate opinion-based review summarization which will help both customers and product manufacture to understand and focus on a particular aspect of the product. In this chapter, the authors discuss the abstractive document summarization method to summarize e-commerce product reviews. This chapter has an in-depth explanation about different types of document summarization and how that can be applied to e-commerce product reviews.


Entropy ◽  
2018 ◽  
Vol 20 (11) ◽  
pp. 884 ◽  
Author(s):  
Tingyu Zhang ◽  
Ling Han ◽  
Wei Chen ◽  
Himan Shahabi

The main purpose of the present study is to apply three classification models, namely, the index of entropy (IOE) model, the logistic regression (LR) model, and the support vector machine (SVM) model by radial basis function (RBF), to produce landslide susceptibility maps for the Fugu County of Shaanxi Province, China. Firstly, landslide locations were extracted from field investigation and aerial photographs, and a total of 194 landslide polygons were transformed into points to produce a landslide inventory map. Secondly, the landslide points were randomly split into two groups (70/30) for training and validation purposes, respectively. Then, 10 landslide explanatory variables, such as slope aspect, slope angle, altitude, lithology, mean annual precipitation, distance to roads, distance to rivers, distance to faults, land use, and normalized difference vegetation index (NDVI), were selected and the potential multicollinearity problems between these factors were detected by the Pearson Correlation Coefficient (PCC), the variance inflation factor (VIF), and tolerance (TOL). Subsequently, the landslide susceptibility maps for the study region were obtained using the IOE model, the LR–IOE, and the SVM–IOE model. Finally, the performance of these three models was verified and compared using the receiver operating characteristics (ROC) curve. The success rate results showed that the LR–IOE model has the highest accuracy (90.11%), followed by the IOE model (87.43%) and the SVM–IOE model (86.53%). Similarly, the AUC values also showed that the prediction accuracy expresses a similar result, with the LR–IOE model having the highest accuracy (81.84%), followed by the IOE model (76.86%) and the SVM–IOE model (76.61%). Thus, the landslide susceptibility map (LSM) for the study region can provide an effective reference for the Fugu County government to properly address land planning and mitigate landslide risk.


Opinion Mining (OM) is also called as Sentiment Analysis (SA). Aspect Based Opinion Mining (ABOM) is also called as Aspect Based Sentiment Analysis (ABSA). In this paper, three new features are proposed to extract the aspect term for Aspect Based Sentiment Analysis (ABSA). The influence of the proposed features is evaluated on five classifiers namely Decision Tree (DT), Naive Bayes (NB), K-Nearest Neighbor (KNN), Support Vector Machine (SVM) and Conditional Random Fields (CRF). The proposed features are evaluated on the Two datasets on Restaurant and Laptop domains available in International Workshop on Semantic Evaluation 2014 i.e. SemEval 2014. The influence of proposed features is evaluated using Precision, Recall and F1 measures. The proposed features are highly influencing for aspect term extraction on classifiers. The performance of SVM and CRF classifiers with proposed features is more influencing for aspect term extraction compared with NB, DT and KNN classifiers.


Author(s):  
Dimple Chehal ◽  
Parul Gupta ◽  
Payal Gulati

Sentiment analysis of product reviews on e-commerce platforms aids in determining the preferences of customers. Aspect-based sentiment analysis (ABSA) assists in identifying the contributing aspects and their corresponding polarity, thereby allowing for a more detailed analysis of the customer’s inclination toward product aspects. This analysis helps in the transition from the traditional rating-based recommendation process to an improved aspect-based process. To automate ABSA, a labelled dataset is required to train a supervised machine learning model. As the availability of such dataset is limited due to the involvement of human efforts, an annotated dataset has been provided here for performing ABSA on customer reviews of mobile phones. The dataset comprising of product reviews of Apple-iPhone11 has been manually annotated with predefined aspect categories and aspect sentiments. The dataset’s accuracy has been validated using state-of-the-art machine learning techniques such as Naïve Bayes, Support Vector Machine, Logistic Regression, Random Forest, K-Nearest Neighbor and Multi Layer Perceptron, a sequential model built with Keras API. The MLP model built through Keras Sequential API for classifying review text into aspect categories produced the most accurate result with 67.45 percent accuracy. K- nearest neighbor performed the worst with only 49.92 percent accuracy. The Support Vector Machine had the highest accuracy for classifying review text into aspect sentiments with an accuracy of 79.46 percent. The model built with Keras API had the lowest 76.30 percent accuracy. The contribution is beneficial as a benchmark dataset for ABSA of mobile phone reviews.


Sign in / Sign up

Export Citation Format

Share Document