Research On Machine Reading Comprehension Method Based On Deeping Learning

Posted on:2022-11-20

Degree:Master

Type:Thesis

Country:China

Candidate:Y M Xu

Full Text:PDF

GTID:2518306746981319

Subject:Automation Technology

Abstract/Summary:

PDF Full Text Request

Machine reading comprehension is one of the hot research topics in the field of natural language processing,and its significance is to enable machines to understand text semantics and have the ability to reason,extract text information and answer related questions.With the continuous development of deep learning technology and the release of large-scale machine reading comprehension datasets,a large number of excellent models have been continuously proposed and improved.However,the traditional word2 vec word embedding technology cannot solve the problems of polysemy and long-distance dependence of semantics,and the semantic information of the context with a long distance cannot be well encoded;the semantic information of the question and the document is not sufficiently interactive and fused,so that the model cannot find the part of the document that is helpful for answering the question.In response to these problems,an ensemble learning method is proposed,which uses BERT-wwm-ext as the text encoder and multiple heterogeneous deep learning models as the base model.The validity of the model is verified on Chinese Squad dataset.The main work is as follows:（1）Build base classification ensemble modelIn order to solve the problems in traditional natural language processing（NLP）,the encoding layer cannot obtain the information of processing context,the polysemy of a word and the feature of Chinese word segmentation are not considered.The BERT-wwm-ext model released by i FLYTEK Labs of Harbin Institute of Technology was selected as the pre-training model,and combined with the advantages of the classic model in the field of machine reading comprehension,a volume based on the attention mechanism and the dimension reduction of model parameters through convolution was proposed.It also includes a product network network,a lightweight convolutional network based on a self-attention mechanism,and a Bi-LSTM model based on a bidirectional long-term and short-term memory network supplemented by an attention mechanism.The convolution-based model improves the abstraction ability of the model,while the bidirectional LSTM has strong semantic extraction ability,and combining them can achieve complementary advantages.（2）Improved stacking ensemble learning methodAiming at the improvement of the classic model,an improved ensemble learning method is proposed to integrate three machine reading comprehension models to improve the effect of the ensemble model.The output layer of the base model is removed,and the output features of the base model are directly input into the meta-model for fusion learning.The experimental results show that the ensemble learning method can effectively improve the recognition effect of the model.

Keywords/Search Tags:

Machine reading comprehension, Extraction, Deep learning, Integrated learning, Pre-training model

PDF Full Text Request

Related items

1	A Study Of Self-training Methods For Machine Reading Comprehension Span Extraction Tasks
2	Research On Machine Reading Comprehension Based On Pre-Trained Language Model
3	Research On Machine Reading Comprehension Based On Deep Learning
4	Data Catagory Oriented Research Of End-to-end Machine Reading Comprehension Based On Deep Learning
5	Research Of Extractive Chinese Machine Reading Comprehension
6	Research On Deep Learning-Based Machine Reading Comprehension Model
7	Research On Methods Of Machine Reading Comprehension Of Unstructured Chinese News Text
8	Research On Event Extraction Methods Based On The Machine Reading Comprehension Framework
9	Research And Implementation Of Chinese Machine Reading Comprehension Based On Deep Learning
10	Research On Machine Reading Comprehension Algorithms Based On Deep Learning