A Study Of Self-training Methods For Machine Reading Comprehension Span Extraction Tasks

Posted on:2023-12-15

Degree:Master

Type:Thesis

Country:China

Candidate:M E Fu

Full Text:PDF

GTID:2568307103985179

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Machine Reading Comprehension(MRCC)aims to teach machines to answer the correct answer to a question after understanding a given passage of text,and it is also the foundation and long-term goal of natural language understanding.Several different forms of machine reading comprehension tasks already exist,such as extractive and inferential reading comprehension tasks,and researchers usually focus on one type of task,but real-life application situations often require models that can handle many different types of tasks simultaneously.Secondly natural language processing models are often trained on large samples of labeled data with supervised learning methods in the expectation that the model will learn more potential knowledge.However,in practical application scenarios such as legal,financial,and medical fields labeled data is severely lacking,and labeling a large number of samples is relatively expensive.In summary,how to effectively handle multi-task reading comprehension data and unlabeled data becomes an important part of this research.To address the problem of multi-task reading comprehension data processing,existing methods have been used to handle different reading comprehension tasks separately by introducing additional auxiliary loss functions.However,multi-task learning models based on auxiliary loss often use an average loss weighting method,and such processing does not achieve a balance between multiple tasks in model training.Secondly,for the use of unlabeled data,self-training methods can effectively utilize both labeled and unlabeled data to improve the performance of deep learning models.In the field of natural language processing,self-training methods are widely used in text classification and sequence labeling tasks,however,most of them predict the probability distribution of target labels based on sentence embeddings to select pseudo-labeled samples,which is not suitable for span extraction tasks,which require models to predict the answer span of a question from the word level.The innovative work in this paper is as follows: we propose a self-training method for reading comprehension span extraction,which consists of two parts: a multi-task fusion training reading comprehension model and a word-level based pseudo-label selector.The multi-task fusion training reading comprehension model effectively solves the problem that the multi-task learning model based on the auxiliary loss function cannot achieve the balance between multiple tasks in training by unifying the outputs of different task modules as the output of the span extraction task.The word-level-based pseudolabel selector uses the confidence level of the start and end positions in the model prediction output to obtain valuable pseudolabel data,effectively applying the selftraining method to the reading comprehension span extraction task and effectively solving the problem of obtaining pseudolabel at the word-level for the text self-training method.We conducted experiments on SQu AD2.0,CAIL2019,and medical advice text datasets,and the results show that our proposed self-training method for machine reading comprehension span extraction achieves 1-2% improvement in the performance of machine reading comprehension models in legal and medical fields.

Keywords/Search Tags:

deep learning, machine reading comprehension, pre-training models, multi-task learning, self-training method

PDF Full Text Request

Related items

1	Research On Machine Reading Comprehension Based On Pre-Trained Language Model
2	Research On Machine Reading Comprehension Method Based On Deeping Learning
3	Research On Machine Reading Comprehension Based On Deep Learning
4	Research Of Extractive Chinese Machine Reading Comprehension
5	Research On Machine Reading Comprehension Algorithms Based On Deep Learning
6	Design And Implementation Of Road Image Visibility Recognition System Based On Deep Learning
7	Research On Multi-task Learning Algorithm In Machine Reading Comprehension
8	Reasearch On Machine Reading Comprehension Methods Based On Incorporating External Knowledge
9	Research Of Interpretable Multi-Hop Machine Reading Comprehension Algorithms
10	Research On Key Issues In Machine Reading Comprehension Models