Research On Chinese Text Error Correction

Posted on:2023-03-09

Degree:Master

Type:Thesis

Country:China

Candidate:N K Lin

Full Text:PDF

GTID:2568306815971199

Subject:Cyberspace security

Abstract/Summary:

PDF Full Text Request

Chinese text error correction is a challenging task in Chinese natural language processing,which aims to detect and correct textual errors made by users when using Chinese.With the development of natural language processing and the emergence of seq2 seq models,Chinese text error correction research has made significant progress.However,the size of the existing annotated corpus for the Chinese grammatical error correction task is still difficult to support the training of seq2 seq models.Therefore,recent research has mainly focused on data augmentation strategies,and limited research tried to adapt the existing corpus size by reducing the training difficulty of the models.In addition,the current evaluation system for Chinese grammatical error correction is inadequate,and the evaluation results are to some extent influenced by the Chinese word segment algorithm,so a more reasonable evaluation system for Chinese grammatical error correction tasks needs to be explored.For the Chinese spelling error correction task,the existing research only focuses on the correction ability of the model,but neglects to pay attention to the model’s ability of identifying confusing words.In order to provide a more reasonable evaluation method for Chinese grammatical error correction tasks and to solve the problem in training Chinese grammatical error correction models,this paper investigates the evaluation framework and model optimization methods for Chinese grammatical error correction tasks and uses spelling error correction models to assist grammatical error correction tasks.The main research of the thesis includes:(1)An evaluation framework for Chinese grammatical error correction.In this paper,the test set provided by NLPCC is manually annotated and expanded to address the shortcomings of the existing evaluation metric and evaluation framework,so that the gold-standard correction annotations of the original test set samples can correspond to more correction results.To address the shortcomings of the Chinese grammar error correction evaluation system,three new evaluation metrics are proposed for the Chinese grammatical error correction task.(2)Transformer-based grammatical error correction model.This paper introduces the application of the Transformer model to grammatical error correction tasks,and designs several strategies to improve the effectiveness of the grammatical error correction model,and verifies the superior performance of the Transformer model and the effectiveness of the designed strategies through comparative experiments.(3)A grammatical error correction model based on model compression and grammatical generalization.This paper proposes a Chinese grammatical error correction model that is easier to learn.Experimental results validate the effectiveness of the proposed model.In addition,the effectiveness of the proposed model is also verified on the English grammatical error correction dataset.Meanwhile,ablation experiments and parameter exploration experiments were conducted in this paper to verify the effectiveness of each strategy and the optimal values of the parameters.(4)A spelling error correction model based on reverse contrastive learning.In this paper,a reverse contrastive learning strategy is proposed and applied to build a spelling error correction model to assist in improving the performance of grammatical error correction.The reverse contrastive learning strategy is described in detail,and the constructed spelling error correction model is used in the pre-processing and postprocessing stages of the Chinese grammatical error correction task to verify the effectiveness of the spelling error task for improving the effectiveness of the Chinese grammatical error correction task.This paper achieves certain performance improvements in both Chinese grammatical error correction and Chinese spelling error correction tasks,and provides new research ideas and research directions for Chinese grammatical error correction and Chinese spelling error correction tasks.

Keywords/Search Tags:

Chinese grammatical error correction, Chinese spelling error correction, grammar error correction evaluation, parameter sharing, grammar generalization

PDF Full Text Request

Related items

1	Research On Error Correction Method Of Chinese Short Text Based On BERT
2	Research On Word Error Correction Methods Of Chinese Text
3	Research On Chinese Grammar Error Correction Based On Deep Learning
4	Chinese Spelling Error Correction Algorithm Incorporating Multimodal Semantic Features And Applications
5	Research On Automatic Error Correction Of Tibetan Grammar
6	Incorporating Confusion Set Knowledge In Chinese Grammar Error Correction
7	Research On Key Techniques Of Chinese Grammar Error Correction Based On Neural Network
8	Research And Implementation Of Grammar Error Correction Model Based On Deep Learning
9	The Research Of Grammar Error Correction
10	Research On Deep Learning Error Correction Method Of Chinese Text