Fake News Detection based on Deep Learning
DOI:
https://doi.org/10.54097/fcis.v4i1.9479Keywords:
False News Detection, Tensorflow, CNN and LSTM Fusion, Lightgbm, XGboostAbstract
The rapid popularization of the Internet has broken the professional threshold of information dissemination, enabling more and more people to easily obtain information, share and express views through social media, which has greatly enriched people's daily life. However, due to the huge number of users of social media, false news fabricated for various purposes is emerging in endlessly. Moreover, with the progress of technology, false news is no longer simply spread in the form of text, but more spread through the combination of text, pictures and video, which greatly increases the confusion of false news. The experiment in this paper is based on tensorflow to detect false news. During the experiment, LR was used to obtain the fusion coefficients of CNN and LSTM models, that is the regression coefficient of LR, and then calculated the optimal threshold with the fused model on the verification set. In addition, in terms of model selection, lightgbm and xgboost were selected to train the model on the training set for false news, and predicted the news text on the testing set. The results of three experiments show that the effect of using xgboost model is the best, and the F1 score obtained in the experiment is the highest.
Downloads
References
Yujun Gao, Gang Liang, Fangting Jiang, Chun Xu, Jin Yang, Junren Chen,Hao Wang.A review of social network humor detection[J]. Journal of Electronics, 2020, 48(7): 1421-1435.
Shengsheng Qian,Tianzhu Zhanng,Changsheng Xu.A review of multimedia social event analysis[J].Computer Science, 2021, 48(3): 97-112.
CHEN L, LIANG J, XIE C, XIAO Y. Short Text Entity Linking with Fine-grained Topics[C]//Proceedings of the 27th ACM International Conference on Information and Knowledge Management. Torino, Italy: Association for Computing Machinery, 2018: 457-466.
YU F, LIU Q, WU S, WANG L, TAN T. A Convolutional Approach for Misinformation Identification. [C]//IJCAI. 2017: 3901-3907.
MA J, GAO W, WEI Z, LU Y, WONG K F. Detect Rumors Using Time Series of Social Context Information on Microblogging [M]//Social Media Content Analysis: Natural Language Processing and Beyond. World Scientific, 2018: 67-77.
QI P, CAO J, YANG T, GUO J, LI J. Exploiting multi-domain visual information for fake news detection[C]//2019 IEEE international conference on data mining (ICDM). IEEE, 2019: 518-527.
GUPTA M, ZHAO P, HAN J. Evaluating event credibility on twitter [C]//Proceedings of the 2012 SIAM international conference on data mining. SIAM, 2012: 153-164.
ZHAO L, HU Q, WANG W. Heterogeneous feature selection with multi-modal deep neural networks and sparse group lasso [J]. IEEE Transactions on Multimedia, 2015, 17(11): 1936-19 48.
JOULIN A, GRAVE E, BOJANOWSKI P, MIKOLOV T. Bag of tricks for efficient text classification[J]. arXiv preprint arXiv:1607.01759, 2016.
WANG Y, MA F, JIN Z, YUAN Y, XUN G, JHA K, SU L, GAO J. Eann: Event adversarial neural networks for multi-modal fake news detection[C]//Proceedings of the 24th acm sigkdd international conference on knowledge discovery & data mining. 2018: 849-857.
KHATTAR D, GOUD J S, GUPTA M, VARMA V. Mvae: Multimodal variational autoencoder for fake news detection [C] // The world wide web conference. 2019: 2915-2921.
CHURCH K, HANKS P. Word association norms, mutual information, and lexicography[J]. Computational linguistics, 1990, 16(1): 22-29.
SINGHAL S, SHAH R R, CHAKRABORTY T, KUMARAGURU P, SATOH S. Spotfake: A multi-modal framework for fake news detection[C]//2019 IEEE fifth international conference on multimedia big data (BigMM). IEEE, 2019: 39-47.
DEVLIN J, CHANG M W, LEE K, TOUTANOVA K. Bert: Pre-training of deep bidirectional transformers for language understanding [J]. arXiv preprint arXiv:1810.04805, 2018.
DENG J, DONG W, SOCHER R, LI L J, LI K, FEI-FEI L. Imagenet: A large-scale hierarchical image database[C]//2009 IEEE conference on computer vision and pattern recognition. Ieee, 2009: 248-255.
SINGHAL S, KABRA A, SHARMA M, SHAH R R, CHAKRABORTY T, KUMARAGURU P. Spotfake+: A multimodal framework for fake news detection via transfer learning (student abstract) [C]//Proceedings of the AAAI conference on artificial intelligence: 34. 2020: 13915-13916.
YANG Z, DAI Z, YANG Y, CARBONELL J, SALAKHUTDINOV R R, LE Q V. Xlnet: Generalized autoregressive pretraining for language understanding[J]. Advances in neural information processing systems, 2019, 32.
ZHOU X, WU J, ZAFARANI R. Safe: similarity-aware multi-modal fake news detection (2020) [J]. Preprint. arXiv, 2020, 200304981: 2.
HU G, DING Y, QI S, WANG X, LIAO Q. Multi-depth graph convolutional networks for fake news detection[C]//Natural Language Processing and Chinese Computing: 8th CCF International Conference, NLPCC 2019, Dunhuang, China, October 9–14, 2019, Proceedings, Part I 8. Springer, 2019: 698-710.