A machine learning-based sentiment analysis of online product reviews with a novel term weighting and feature selection approach

作者:

Highlights:

摘要

Recently, online shopping has turned into a mainstream means for users to purchase as well as consume with the upsurge development of Internet technology. User satisfaction can be improved effectively by doing Sentiment Analysis (SA) of a large quantity of user reviews on e-commerce platforms. It is still challenging to envisage the accurate sentiment polarities of the user reviews because of the changes in sequence length, textual order, along with complicated logic. This paper proposes a new optimized Machine Learning (ML) algorithm called the Local Search Improvised Bat Algorithm based Elman Neural Network (LSIBA-ENN) for the SA of online product reviews. The proposed work of SA encompasses ‘4’ major steps: i) Data Collection (DC), ii) preprocessing, iii) Features Extraction (FE) or Term Weighting (TW), Feature Selection (FS), and polarity or Sentiment Classifications (SC). Initially, the Web Scrapping Tool (WST) is utilized to extract the customer reviews of the products for which the data is gathered as of the E-commerce websites. Next, preprocessing is carried out on the web scrap extracted data. Those preprocessed data go through TW and FS for additional processing by means of Log Term Frequency-based Modified Inverse Class Frequency (LTF-MICF) and Hybrid Mutation based Earth Warm Algorithm (HMEWA). Lastly, the HM-EWA data is rendered to the LSIBA-ENN, which classifies the customer reviews’ sentiment as positive, negative, and neutral. For the performance analysis of the proposed and prevailing classifiers, ‘2’ yardstick datasets are taken. The outcomes exhibit that the LSIBA-ENN attains the best performance in SC when weighted against the existing top-notch algorithms. The observations of the reviewer are exact. The prevailing ENN proffers recall of 87.79 when utilizing the proposed LTF-MICF scheme, whereas ENN only achieve 83.55, 84.03, 85.48, and 86.04 of recall whilst utilizing W2V, TF, TF-IDF, and TF-DFS schemes respectively.

论文关键词:Sentiment analysis,Polarity classification,Machine learning,Sentiment analysis of online product reviews,Term weighting,Elman Neural Network (ENN),Bat algorithm (BA)

论文评审过程:Received 15 March 2021, Revised 5 May 2021, Accepted 1 June 2021, Available online 16 June 2021, Version of Record 16 June 2021.

论文官网地址:https://doi.org/10.1016/j.ipm.2021.102656