A Review of Text Sentiment Analysis Methods and Applications
DOI:
https://doi.org/10.54097/fbem.v10i1.10171Keywords:
Natural Language Processing, Sentiment Analysis, Machine Learning, Deep Learning.Abstract
This paper reviews the application of natural language processing in sentiment analysis. Sentiment analysis is an important task aimed at automatically identifying and inferring sentiment tendencies and sentiment intensity in texts. This paper first introduces the application areas of sentiment analysis, including practical applications of text sentiment analysis. Then, text pre-processing techniques such as word separation, deactivation removal and punctuation processing are discussed. Then, feature extraction and representation methods are explored, including bag-of-words model, TF-IDF, word embedding and Word2Vec, attention mechanism and Transformer. In addition, methods for sentiment analysis, such as sentiment dictionaries and rule-based methods, traditional machine learning methods, and deep learning-based methods, are presented. Finally, the application areas of sentiment analysis are discussed and conclusions are given. The review in this paper will help readers understand the current status and development trend of natural language processing applications in sentiment analysis, as well as the advantages and disadvantages of different methods in sentiment analysis.
Downloads
References
Zhong Jiawa, Liu Wei, Wang Sili et al. Review of Methods and Applications of Text Sentiment Analysis[J]. Data Analysis and Knowledge Discovery,2021,5(06):1-13.
Wang YJ, Zhu JQ, Wang ZM et al. Review of Applications of natural language processing in text sentiment analysis [J/OL]. Computer Applications:1-12 [2023-07-03].
Zhao Jingsheng, Song Mengxue, Gao Xiang, et al. A study of text representation in natural language processing[J]. Journal of Software, 2021, 33(1): 102-128.
Li Yachao, Xiong Deyi, Zhang Min. A review of neural machine translation[J]. Journal of Computer Science, 2018, 41(12): 2734-2755.
Ao Sheng,Xu Lan,Ao Qingwen.Application of NLP Chinese word separation technology in bridge report data processing[J]. Traffic World,2020(17):3-5.DOI:10.16248/j.cnki.11-3723/u.2020.17.001.
Wang YY, Dong GW. Research on the problems and countermeasures of netroots libraries based on jieba subtext[J]. Jiangsu Science and Technology Information,2023,40(06):35-38.
Yin R, Wang Q, Li P, et al. Multi-granularity chinese word embedding[C]//Proceedings of the 2016 conference on empirical methods in natural language processing. 2016: 981-986.
Zhao Pengfei,Zhao Chunjiang,Wu Huarui et al. BERT-based multi-feature fusion for named entity recognition in agriculture[J]. Journal of Agricultural Engineering, 2022, 38(03): 112-118.
Chen Zhiyan,Li Xiaojie,Zhu Shuhua et al. A two-way maximal matching word splitting method based on Hash structured dictionary[J]. Computer Science,2015,42(S2):49-54.
Li Ship. Statistics in Chinese word splitting[J]. China Statistics, 2020(10):34-35.
Liu, X.Y.Y., Sheng, Y.H., Qin, J.R., et al. A semantic matching method for spatio-temporal trajectories based on hidden Markov model[J]. Geography and Geographic Information Science,2023,39(03):1-6.
Liu, Yunzhong, Lin, Yaping, Chen, Zhiping. Hidden Markov model-based text information extraction[J]. Journal of System Simulation, 2004, 16(3): 507-510.
Li Ronglu, Wang Jianhui, Chen Xiaoyun, et al. Chinese text classification using maximum entropy model[D]. , 2005.
Zhou J.S., Dai X.Y., Yin Cunyan, et al. Automatic recognition of Chinese organization names based on cascaded conditional random field model[J]. Journal of Electronics, 2006, 34(5): 804.
Huang Bo, Liu Chuancai. Automatic Chinese text summarization based on weighted Textrank[J]. Application Research of Computers/Jisuanji Yingyong Yanjiu, 2020, 37(2).
Zhang, Liu, Wang, Liwei, Huang, Bo, et al. Sentiment classification model and experimental study of microblog comments based on word vectors with multi-scale convolutional neural network[J]. Library Intelligence Work, 2019, 63(18): 99.
Gong, Shuang-Shuang, Chen, Yu-Feng, Xu, Jin-An, et al. A multi-word expression extraction method for Chinese based on web text[J]. Journal of Shandong University (Science Edition), 2018, 53(9): 40-48.
Zheng Zheng,Wang Huadan,Hao Lihua. A study of the U.S. military language dictionary from a terminological perspective[J]. China Science and Technology Terminology, 2021, 23(03): 33-41.
Zhou Zhi,Wang Chunying,Zhu Jia-Li. Research on the construction of an emotion lexicon in book domain based on ultra-short reviews[J]. Intelligence Theory and Practice, 2021, 44 (09): 183-189+197.DOI:10.16353/j.cnki.1000-7490.2021.09.026.
Liu P, Wang W, Qiu L, et al. CDCPP: (CDCPP: Cross-Domain Chinese Punctuation Prediction)[C]//Proceedings of the 19th Chinese National Conference on Computational Linguistics. 2020: 593-603.
Yi, Shun-Ming, Yi, Hao, Zhou, Guo-Dong. Research on Twitter sentiment classification method using sentiment feature vector[J]. Small Microcomputer Systems, 2016, 37(11): 2454-2458.
Zhang Huaping, Liu Qun. A coarse classification model for Chinese words based on the N shortest path method[J]. Chinese Journal of Information, 2002, 16(5): 1-7.
Tang Huifeng, Tan Songbo, Cheng Xueqi. A comparative study of Chinese sentiment classification techniques based on supervised learning [D]. , 2007.
Lu YH, Li YF. An improved TF-IDF algorithm for computing the weights of textual feature items[J]. Library Intelligence Work, 2013, 57(03): 90.
Han, Tong-Hui, Yang, Dong-Qiang, Ma, Hong-Wei. A method for constructing text feature representations using sentiment word statistics[J]. Application Research of Computers/Jisuanji Yingyong Yanjiu, 2019, 36(7).
Ramos J. Using tf-idf to determine word relevance in document queries[C]//Proceedings of the first instructional conference on machine learning. 2003, 242(1): 29-48.
Xue Yanfei, Mao Qirong, Zhang Jianming. A deep recommendation model incorporating contextual information [J]. Application Research of Computers/Jisuanji Yingyong Yanjiu, 2021, 38(4).
Rong X. word2vec parameter learning explained[J]. arXiv preprint arXiv:1411.2738, 2014.
Shi L, Wang Y, Cheng Y, et al. A review of attention mechanisms in natural language processing[J]. Data Analysis and Knowledge Discovery, 2020, 4(5): 1-14. Ren Huan, Wang Xuguang. A review of attention mechanisms[J]. Computer Applications, 2021, 41(S01): 1-6.
Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need[J]. Advances in neural information processing systems, 2017, 30.
Chen L C, Lee C M, Chen M Y. Exploration of social media for sentiment analysis using deep learning[J]. Soft Computing, 2020, 24: 8187-8197. Pan D, Yuan J, Li L, et al. Deep neural network-based classification model for Sentiment Analysis [C]//2019 6th International Conference on Behavioral, Economic and Socio-Cultural Computing (BESC). IEEE, 2019: 1-4.
Zhang ZQ, Ye Q, Li YJ. A review of sentiment analysis of Internet product reviews[J]. Journal of Management Science, 2010, 13(006): 84-96.
Deng S, Sinha A P, Zhao H. Adapting sentiment lexicons to domain-specific social media texts[J]. Decision Support Systems, 2017, 94: 65-76.
Chen Yan-Fang, Li Zhi-Yu, Liang Zhuan, et al. A review of online social network rumor detection[J]. Journal of Computer Science, 2018, 41(7): 1648-1676.
Li Jingmei, Sun Lihua, Zhang Qiaorong, et al. A plain Bayesian classifier for text processing[J]. Journal of Harbin Engineering University, 2003, 24(1): 71-74.
Xiao Z, Liu F, Li B. A semantic distance-based SVM sentiment classification method for Web reviews[J]. Computer Science, 2014, 41(9): 248-252.
Liu Gang, Zhang Weishi. Sentiment analysis of Internet users' evaluation based on decision tree[J]. Modern Computer: Midterm Journal, 2017 (11): 15-19.
Neethu M S, Rajasree R. Sentiment analysis in twitter using machine learning techniques[C]//2013 fourth international conference on computing, communications and networking technologies (ICCCNT). IEEE, 2013: 1-5.
O'Shea K, Nash R. An introduction to convolutional neural networks[J]. arXiv preprint arXiv:1511.08458, 2015.
Sherstinsky A. Fundamentals of recurrent neural network (RNN) and long short-term memory (LSTM) network[J]. Physica D: Nonlinear Phenomena, 2020, 404: 132306.
Zhao YY, Wang ZY, Wang P, et al. A review of task-based dialogue systems[J]. Journal of Computer Science, 2020, 43(10): 1862-1896.
Shan, W.F., Li, C.Y., Chen, J., et al. Application of convolutional neural network and self-attentive mechanism to identify geomagnetic field disturbance events[J]. Seismic Geomagnetic Observation and Research, 2022, 43(5): 49-63.
Shaw P, Uszkoreit J, Vaswani A. Self-attention with relative position representations[J]. arXiv preprint arXiv:1803.02155, 2018.
Voita E, Talbot D, Moiseev F, et al. Analyzing multi-head self-attention: Specialized heads do the heavy lifting, the rest can be pruned[J]. arXiv preprint arXiv:1905.09418, 2019.
Chen Chen, Xia-Bing Zhou, Zhong-Qing Wang, et al. Conversational sentiment classification based on multi-party attention modeling[J]. Chinese Journal of Informatics, 2022, 36(12): 173-181.
Li Yuqing,Li Xin,Han Xu et al. A bilingual lexicon-based sentiment analysis method for micro-bodied categories[J]. Journal of Electronics,2016,44(09):2068-2073.
Zhao Yan-Yan,Qin Bing,Shi Qiu-Hui et al. Construction of large-scale sentiment lexicon and its application in sentiment classification[J]. Journal of Chinese Information, 2017, 31(02): 187-193.
Asghar M Z, Khan A, Ahmad S, et al. Lexicon-enhanced sentiment analysis framework using rule-based classification scheme[J]. PloS one, 2017, 12(2): e0171649.
Peng M, Xi J, Dai XY, et al. Collaborative filtering recommendation algorithm based on sentiment analysis and LDA topic model[J]. Chinese Journal of Information, 2017, 31(02):194-203.
Li Jinmeng. Research on natural language emotion analysis based on eye-tracking [D]. Beijing University of Posts and Telecommunications, 2021. DOI:10.26969/d.cnki.gbydu.2021.001688.








