Chinese Machine Reading Comprehension Based On Improved R-NET

Posted on:2020-03-03

Degree:Master

Type:Thesis

Country:China

Candidate:F Pan

Full Text:PDF

GTID:2428330590496508

Subject:Control engineering

Abstract/Summary:

PDF Full Text Request

The goal of natural language processing is to enable computers to process text on the basis of understanding its inherent meaning.Machine reading comprehension refers to a machine that answers questions based on the content of a given article,which is similar to human reading comprehension tests.In recent years,with the development of deep learning and the rise of artificial intelligence,machine reading comprehension,as a representative of intellectualization in the field of natural language processing,has attracted extensive attention from industry and scholars at home and abroad.The development of Chinese machine reading comprehension lags behind that of foreign countries because of its late start.The appearance of CMRC 2018 data set fills in the blank of Chinese span-extraction machine reading comprehension data set.Therefore,this thesis attempts to apply the R-NET model,which is excellent in the English segment extraction machine reading comprehension data set,to the CMRC 2018 Chinese data set and get the Chinese machine reading comprehension benchmark model R-NET(Skip-gram).The objective is to improve the EM value and F1 value of R-NET(Skip-gram)on CMRC 2018 data set.Firstly,the thesis introduces the research background,significance and research status at home and abroad.,and explains the related concepts used in R-NET model,including recurrent neural network,word vector,attention mechanism and Pointer Network.Next,the SQuAD data set used by R-NET and CMRC 2018 data set used in this thesis,as well as their same evaluation of EM and F1,are described respectively.After introducing the four layers of R-NET in detail,R-NET is applied to CMRC 2018 data set.The Skip-gram are selected after comparison,and the benchmark model R-NET(Skip-gram)is obtained.Then,on the basis of R-NET(Skip-gram),the thesis proposes three improvements from word vector,gate function and check layer: 1.Directional Skip-gram word vector is introduced into R-NET model for the first time to generate R-NET(Directional Skip-gram),which strengthens the feature extraction of articles and problems.Directional Skip-gram strengthens feature extraction of articles and questions,and increases EM and F1 values on dev set by 2.050% and 1.518% respectively;2.Try to change the gate function of R-NET(Skip-gram)to tanh and get R-NET(tanh).As gate function,tanh strengthens the negative eigenvalues and improves the circulation of useful information among models.The EM and F1 values on dev set are increased by 2.795% and 2.895% respectively;3.Referring to human's habit of checking when doing reading comprehension problems,the thesis adds check layer to R-NET(Skip-gram)and obtains R-NET(check layer).The experimental results show that the check layer emphasizes the importance of problem features and compensates for the loss of problem features in the process of information transmission.The EM and F1 values on dev set are increased by 1.926% and 1.833%,respectively.At the same time,the check layer increases the complexity of the model and deepens the over-fitting.Finally,the P-Reader model is proposed by combining R-NET(Directional Skip-gram),R-NET(tanh)and R-NET(check layer).Compared with the R-NET(Skip-gram)benchmark model,the model complexity of P-Reader is higher,but the EM and F1 values on dev set are increased by 6.337% and 5.126%,respectively.In order to overcome the over-fitting caused by the small training set of CMRC 2018,the processed DRCD data set is added to the training set,which further improves the performance of P-Reader.And on the Open Challenge Leaderboard,P-Reader ranked third in the test set(single model)and first in the challenge set(single model).In order to show the effect of P-Reader more intuitively,a simple simulation interface is also designed in this thesis.

Keywords/Search Tags:

Machine reading comprehension, Deep learning, R-NET, CMRC 2018

PDF Full Text Request

Related items

1	Research And Implementation Of Chinese Machine Reading Comprehension Based On Deep Learning
2	Research On Machine Reading Comprehension Based On Deep Learning
3	Research On Machine Reading Comprehension Algorithm Based On Deep Representation Learning
4	Research On Machine Reading Comprehension Algorithms Based On Deep Learning
5	Research On Deep Learning-Based Machine Reading Comprehension Model
6	Research On Machine Reading Comprehension Method Based On Deeping Learning
7	Research On Machine Reading Comprehension Based On Deep Learning
8	Research On Particular-form Oriented Machine Reading Comprehension
9	Research And Implementation Of Chinese Machine Reading Comprehension Method Based On Deep Learning
10	Research On Tibetan Extractive Machine Reading Comprehension Based On Deep Learning