Analysis of Chinese Text Automatic Proofreading Technology
DOI:
https://doi.org/10.54097/ijeh.v4i1.1406Keywords:
Text proofreading, Chinese information processing, Deep learning, Real-word error.Abstract
The development of computer technology promotes the research on automatic text proofreading technology. The research of Chinese text automatic proofreading started late but developed rapidly. This paper analyzes the difficulties of Chinese text proofreading and reviews the research methods of Chinese text automatic proofreading technology, including statistical-based approach, rule-based approach, deep learning-based approach and hybrid approach. Although some new progress has been made in Chinese text automatic proofreading in recent years, there are many problems remain to be settled due to the complexity of Chinese and the scale of the universal data set of Chinese proofreading corpus.
Downloads
References
Yangsen Z. and Shiwen Y., Summary of Text Automatic Proofreading Technology, Application Research of Computers, vol.6, 2006, pp.8-12.
Karen K, Techniques for Automatically Correcting Words in Text, ACM Computing Surveys, vol.4, 1992, pp.377-439.
Deshen S. and Liangzhi W. et al, Chinese Spelling Error Detection Based on Statistics, Computer and Communication, vol.8, 1992, pp.19-26.
Chaohuang C., A Pilot Study on Automatic Chinese Spelling Error Correction, Communication of COLIPS, vol.4, 1994, pp.143-149.
Yangsen Z. and Bingqing D., Automatic Errors Detecting of Chinese Texts Based on the Bi-neighborship, Journal of Chinese Information Processing, vol.3, 2001, pp.36-43.
Rong L., A Chinese Spelling Check System for the OCR Output, Journal of Chinese Information Processing, vol.5, 2009, pp.92-97.
Chang Chaohuang C., A New Approach for Automatic Chinese Spelling Correction, In Proceedings of the Natural Language Processing Pacific Rim Symposium, 1995, pp.278-283.
Jianhua L. and Xiaolong W. The Research of Multi-Feature Chinese Text Proofreading Algorithms, Computer Engineering & Science, vol.3, 2001, pp.93.
Ting L. and Hongbin S. et al, Principle of Chinese Computer Aided Proofreading System, Chinese Information, vol.2, 1997, pp.21-23.
Rongxiang Y. and Kekang H., Proofreading Chinese Manuscript with Computer, Journal of Computer Research and Development, vol.5, 1997, pp.346-350.
Xiaojin G. and Zhensheng L., Automatically Detecting Syntactic Errors in Chinese Texts, Computer Engineering and Applications, vol.8, 2003, pp.98-100.
Yangsen Z. and Jia Z., Study of Semantic Error Detecting Method for Chinese Text, Chinese Journal of Computers, vol.4, 2017, pp.911-924.
Shilong M. et al, Deep Learning with Big Data: State of the Art and Development, CAAI Transactions on Intelligent Systems, vol.6, 2016, pp.728-742.
Boqing R., An Intelligent Chinese Text Proofreading Method Based on Deep Learning, vol.4, 2017, pp.55-58.
Xin X., Research on the Method of Correcting Chinese Homophony Misspelling Based on Machine Translation Model, Heilongjiang University, 2017.
Zhou J.and Li C., et al, Chinese Grammatical Error Correction Using Statistical and Neural Models, NLPCC-2018,cham, pp.117-128.
Yongcai T. and Wenle W., et al, Text Proofreading Model with LSTM and Integrated Algorithm, Journal of Chinese Computer Systems, vol.5, 2020, pp.967-971.
Yuzhong H. and Xianguo Y. et al, FASPell: A Fast, Adaptable, Simple, Powerful Chinese Spell Checker Based On DAE-Decoder Paradigm, Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019), 2019.
Xingyi C. and Weidi X. et al, SpellGCN: Incorporating Phonological and Visual Similarities into Language Models for Chinese Spelling Check, ACL 2020. arXivpreprint arXiv:2004.14166.
Shaohua Z. and Haoran H. et al, Spelling Error Correction with Soft-Masked BERT, ACL 2020. arXiv preprint arXiv:2005.07421.
Yan W. and Xiukun L.et al, Research on and Implementation of Chinese Text Proof-reading System, Journal of Harbin Institute of Technology, vol.2, 2001, pp.60-64.
Yangsen Z. and Anjie T. et al, Chinese Text Proofreading for Political News Field, Journal of Chinese Information Processing, vol.6, 2014, pp.79-84+128.
Zhao H. and Cai D. et al, A Hybrid Model for Chinese Spelling Check, ACM Transactions on Asian and Low-Resource Language Information Processing, vol.3, 2017, pp.1-22.
Dezhi G., Research on Chinese Real-word Error Automatic Detection and Correction, Jiangsu University of Science and Technology, 2017.
Junpei Z. and Chen L. et al, Chinese Grammatical Error Correction Using Statistical and Neural Models, CCF International Conference on Natural Language Processing and Chinese Computing. Berlin: Springer, 2018: 117-128.
Yongmei T. and Yixiao Y. et al, Grammatical Error Correction Using LSTM and N-gram, Journal of Chinese Information Processing, vol.6, 2018, pp.19-27.
Yongcai T. and Zhaoyang H. et al, Study of Chinese Word Collocation Feature Extraction and Text Proofreading, Journal of Chinese Computer Systems, vol.11, 2018, pp.2485-2490.








