Construction of Unhealthy Webpage Filtering Mode Based on Data Mining Technology
Keywords:Data mining, Web filtering, Feature word, Vector space model, Characteristic vector library.
The unhealthy category eigenvector library is constructed by adaptive sample library, and the unhealthy category model is constructed on this basis to realize the filtering of unhealthy webpages. Our expermient proves that this mode can filter unhealthy webpage at higher speed and satisfying precision.
Zhu Guojie.Research and Implementation of Bad Webpage Detection System Based on Text Features [D].Zhejiang University. 2020.
Wang Lei.Research on double filtering method of bad web pages based on content recognition [D]. Jilin University. 2012.
Si Derui. Research on Web Filtering Technology Based on Text Content [D]. Lanzhou University. 2008.
Jiawei Han, Fan Ming&Meng Xiaofeng. Data Mining Concept and Technology ( 3rd Edition ). Machinery Industry Press. 2012.
Tang Jiangang, Xiong Guoping. Research and Application about Unhealthy Webpage Filter Model Based on Words’ Frequency and DataMining Technology[J]. Journal of Xiamen University (Natural Science), 2007(11).
Ronghua Lu. Design of Bad Information Filtering System for Web Pages[J]. Frontiers in Computing and Intelligent Systems, 2022(3).
Yao Mei. Research on Key Technologies Based on Web Content Filtering [J]. Information and Computer (Theoretical Edition . 2022 (14).
How to Cite
This work is licensed under a Creative Commons Attribution 4.0 International License.