Keras Model for Text Classification in Amazon Review Dataset using LSTM
PDF

Keywords

Ecommerce
LSTM model
feed-forward mechanism
feedback mechanism
Keras
TensorFlow
relu
sigmoid
GPU device

How to Cite

.S, Thivaharan, and Srivatsun .G. 2021. “Keras Model for Text Classification in Amazon Review Dataset Using LSTM”. Journal of Artificial Intelligence and Capsule Networks 3 (2): 72-89. https://doi.org/10.36548/jaicn.2021.2.001.

Abstract

With the use of Ecommerce, Industry 4.0 is being effectively used in online product-based commercial transactions. An effort has been made in this article to extract positive and negative sentiments from Amazon review datasets. This will give an upper hold to the purchaser to decide upon a particular product, without considering the manual rating given in the reviews. Even the number words in an inherent positive review exceeds by one, where the present classifiers misclassify them under negative category. This article addresses the aforementioned issue by using LSTM (Long-Short-Term-Memory) model, as LSTM model has a feedback mechanism based progression unlike the other classifiers, which are dependent on feed-forward mechanism. For achieving better classification accuracy, the dataset is initially processed and a total of 100239 short and 411313 long reviews have been obtained. With the appropriate Epoch iterations, it is observed that, this proposed model has gain the ability to classify with 89% accuracy, while maintaining a non-bias between the train and test datasets. The entire model is deployed in TensorFlow2.1.0 platform by using the Keras framework and python 3.6.0.

PDF

References

Chakrabarty, Navoneel, and Sanket Biswas. "Navo Minority Over-sampling Technique (NMOTe): A Consistent Performance Booster on Imbalanced Datasets." Journal of Electronics 2, no. 02 (2020): 96-136.

Smys, S., and Jennifer S. Raj. "Analysis of Deep Learning Techniques for Early Detection of Depression on Social Media Network-A Comparative Study." Journal of trends in Computer Science and Smart technology (TCSST) 3, no. 01 (2021): 24-39.

Haoxiang, Wang, and S. Smys. "Big Data Analysis and Perturbation using Data Mining Algorithm." Journal of Soft Computing Paradigm (JSCP) 3, no. 01 (2021): 19-28.

Joe, Mr C. Vijesh, and Jennifer S. Raj. "Location-based Orientation Context Dependent Recommender System for Users." Journal of trends in Computer Science and Smart technology (TCSST) 3, no. 01 (2021): 14-23.

Thilaka, B., Janaki Sivasankaran, and S. Udayabaskaran. "Optimal Time for Withdrawal of Voluntary Retirement Scheme with a Probability of Acceptance of Retirement Request." Journal of Information Technology 2, no. 04 (2020): 201-206.

Siddique, Fathma, Shadman Sakib, and Md Abu Bakr Siddique. "Recognition of handwritten digit using convolutional neural network in python with tensorflow and comparison of performance for various hidden layers." 2019 5th International Conference on Advances in Electrical Engineering (ICAEE). IEEE, 2019.

Moreno-Marcos, Pedro Manuel, et al. "Temporal analysis for dropout prediction using self-regulated learning strategies in self-paced MOOCs." Computers & Education 145 (2020): 103728.

Li, Zhu, et al. "Kernel dependence regularizers and gaussian processes with applications to algorithmic fairness." arXiv preprint arXiv:1911.04322 (2019).

Awan, Ammar Ahmad, et al. "HyPar-Flow: Exploiting MPI and Keras for Scalable Hybrid-Parallel DNN Training using TensorFlow." arXiv preprint arXiv:1911.05146 (2019).

Adam, Edriss Eisa Babikir. "Deep Learning based NLP Techniques In Text to Speech Synthesis for Communication Recognition." Journal of Soft Computing Paradigm (JSCP) 2, no. 04 (2020): 209-215..

Zou, Difan, et al. "Gradient descent optimizes over-parameterized deep ReLU networks." Machine Learning 109.3 (2020): 467-492.

Goel, Priyanka, and S. Sivaprasad Kumar. "Certain class of starlike functions associated with modified sigmoid function." Bulletin of the Malaysian Mathematical Sciences Society 43, no. 1 (2020): 957-991.

Chakraborty, Rupak, Rama Sushil, and M. L. Garg. "An improved PSO-based multilevel image segmentation technique using minimum cross-entropy thresholding." Arabian Journal for Science and Engineering 44.4 (2019): 3005-3020.

Pooja.C, Thivaharan.s, "Workload based Cluster Auto Scaler using Kuberbet Monitors", International Journal Compliance Engineering Journal (IJCENG), 2021, Vol.12, Issue.6, pp. 40-47, ISSN:0898-3577, DOI:16.10089.CEJ.2021.V12I6.285311.36007.

S. Thivaharan., G. Srivatsun. and S. Sarathambekai., "A Survey on Python Libraries Used for Social Media Content Scraping," 2020 International Conference on Smart Electronics and Communication (ICOSEC), Trichy, India, 2020, pp. 361-366, doi: 10.1109/ICOSEC49089.2020.9215357.

Lnenicka, Martin, and Jitka Komarkova. "Big and open linked data analytics ecosystem: Theoretical background and essential elements." Government Information Quarterly 36.1 (2019): 129-144.

Dube, Thando, Rene Van Eck, and Tranos Zuva. "Review of Technology Adoption Models and Theories to Measure Readiness and Acceptable Use of Technology in a Business Organization." Journal of Information Technology 2, no. 04 (2020): 207-212.

Smilkov, Daniel, Nikhil Thorat, Yannick Assogba, Ann Yuan, Nick Kreeger, Ping Yu, Kangyi Zhang et al. "Tensorflow. js: Machine learning for the web and beyond." arXiv preprint arXiv:1901.05350 (2019).

Manoharan, Samuel. "Embedded Imaging System Based Behavior Analysis of Dairy Cow." Journal of Electronics 2, no. 02 (2020): 148-154.

Salman Taherizadeh., VladoStankovski., “Auto-scaling Applications in Edge Computing: Taxonomy and Challenges” Conference: International Conference on Big Data and Internet of Thing (BDIOT2017) - ACM, At London, United Kingdom