YouTube Comment Sentiment Classification System
PDF

Keywords

NLP (Natural Language Processing)
Stemming and Lemmatization
Tokenization
YouTube Comments
Named Entities
Machine Learning

How to Cite

J, Jana Sorupaa, Anto Nivedha J, Arsha R, and Muthulakshmi K. 2024. “YouTube Comment Sentiment Classification System”. Journal of Artificial Intelligence and Capsule Networks 6 (1): 90-104. https://doi.org/10.36548/jaicn.2024.1.007.

Abstract

With more than 2 billion viewers per month, YouTube is the most widely used video-sharing website worldwide. On this website, users can watch, upload, and share videos covering a wide range of subjects. YouTube comments include facts, opinions, and responses to videos in addition to starting discussions. The number of YouTube comments makes it difficult to manually analyze them all. The study of reading, understanding, and creating text in human languages encompasses a broad range of methods and techniques under the umbrella of natural language processing or NLP. The primary goal of the research is to find and analyze YouTube comments, which, when used with natural language processing algorithms, might be beneficial for the channels' continued development. One of the NLP methods used for this research was tokenization, which is used to break down text into individual words or tokens. Stemming and lemmatization are used to reduce the root words and normalize the variation. Categorization is performed by identifying the named entities, such as people, organizations, and locations. The machine translation was used to convert the comments from one language to another.

PDF

References

Athar, Awais, and Simone Teufel. "Detection of implicit citations for sentiment detection." In Proceedings of the workshop on detecting structure in scholarly discourse, pp. 18-26. 2012.

Abdalgader, Khaled, and Aysha Al Shibli. "Experimental results on customer reviews using lexicon-based word polarity identification method." IEEE Access 8 (2020): 179955-179969.

Poria, Soujanya, Erik Cambria, Alexander Gelbukh, Federica Bisio, and Amir Hussain. "Sentiment data flow analysis by means of dynamic linguistic patterns." IEEE Computational Intelligence Magazine 10, no. 4 (2015): 26-36.

Xu, Guixian, Yueting Meng, Xiaoyu Qiu, Ziheng Yu, and Xu Wu. "Sentiment analysis of comment texts based on BiLSTM." Ieee Access 7 (2019): 51522-51532.

Ishaq, Adnan, Sohail Asghar, and Saira Andleeb Gillani. "Aspect-based sentiment analysis using a hybridized approach based on CNN and GA." IEEE Access 8 (2020): 135499-135512.

Vinodhini, G., and R. M. Chandrasekaran. "Sentiment analysis and opinion mining: a survey." International Journal 2, no. 6 (2012): 282-292.

Cambria, Erik, and Bebo White. "Jumping NLP curves: A review of natural language processing research." IEEE Computational intelligence magazine 9, no. 2 (2014): 48-57.

Athar, Awais. Sentiment analysis of scientific citations. No. UCAM-CL-TR-856. University of Cambridge, Computer Laboratory, 2014.

Pokharel, Rhitabrat, and Dixit Bhatta. "Classifying youtube comments based on sentiment and type of sentence." arXiv preprint arXiv:2111.01908 (2021).

Yang, Yiming, and Jan O. Pedersen. "A comparative study on feature selection in text categorization." In Icml, vol. 97, no. 412-420, p. 35. 1997.

Krishna, Amar, Joseph Zambreno, and Sandeep Krishnan. "Polarity trend analysis of public sentiment on YouTube." In Proceedings of the 19th international conference on management of data, pp. 125-128. 2013.

Bhuiyan, Hanif, Jinat Ara, Rajon Bardhan, and Md Rashedul Islam. "Retrieving YouTube video by sentiment analysis on user comment." In 2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA), pp. 474-478. IEEE, 2017.

Alsubait, Tahani, and Danyah Alfageh. "Comparison of machine learning techniques for cyberbullying detection on youtube arabic comments." International Journal of Computer Science & Network Security 21, no. 1 (2021): 1-5.

Deng, Li, and Yang Liu, eds. Deep learning in natural language processing. Springer, 2018.

Mulholland, Eleanor, Paul Mc Kevitt, Tom Lunney, and Karl-Michael Schneider. "Analysing emotional sentiment in people’s YouTube channel comments." In Interactivity, Game Creation, Design, Learning, and Innovation: 5th International Conference, ArtsIT 2016, and First International Conference, DLI 2016, Esbjerg, Denmark, May 2–3, 2016, Proceedings 5, pp. 181-188. Springer International Publishing, 2017.

Muhammad, Abbi Nizar, Saiful Bukhori, and Priza Pandunata. "Sentiment analysis of positive and negative of youtube comments using naïve bayes–support vector machine (nbsvm) classifier." In 2019 International Conference on Computer Science, Information Technology, and Electrical Engineering (ICOMITEE), pp. 199-205. IEEE, 2019.

Savigny, Julio, and Ayu Purwarianti. "Emotion classification on youtube comments using word embedding." In 2017 international conference on advanced informatics, concepts, theory, and applications (ICAICTA), pp. 1-5. IEEE, 2017.

Boudad, Naaima, Rdouan Faizi, Richard O. Haj Thami, and Raddouane Chiheb. "Sentiment Classification of Arabic Tweets: A Supervised Approach." J. Mobile Multimedia 13, no. 3&4 (2017): 233-243.

Manning, Christopher D., Mihai Surdeanu, John Bauer, Jenny Rose Finkel, Steven Bethard, and David McClosky. "The Stanford CoreNLP natural language processing toolkit." In Proceedings of 52nd annual meeting of the association for computational linguistics: system demonstrations, pp. 55-60. 2014.

Agrawal, Ameeta, and Aijun An. "Unsupervised emotion detection from text using semantic and syntactic relations." In 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, vol. 1, pp. 346-353. IEEE, 2012.

Jiashen Liu, Jia Liu, Xinyi Wang, and Jian Zhang. "Sentiment analysis on social media: A survey." IEEE Transactions on Computational Social Systems, 7(3),2020, pp. 682-705.