A Study On The Readability Of Second Language Learners’ Learning Materials Under Deep Learning

Posted on:2024-07-27

Degree:Master

Type:Thesis

Country:China

Candidate:H Ding

Full Text:PDF

GTID:2555307067972119

Subject:Cyberspace security

Abstract/Summary:

Automatic Readability Assessment (ARA) has attracted significant attention as an emerging field of research. Its core objective is to evaluate the readability of a text automatically from a diverse set of text features. Whereas traditional ARA methods often consider only a single feature or a partial set of features, this thesis proposes an approach based on multi-feature extraction. A corpus collection comprising multiple corpora is constructed, and the sources and statistical characteristics of the corpora are analyzed. Difficulty characteristics are then extracted along three dimensions (structural features, word frequency features, and deep features), which reveals how language structural features are distributed across the different corpora. To address the limitations of manual feature extraction in traditional ARA methods, this thesis introduces a hierarchical network that incorporates pre-trained language models. By combining features extracted from the pre-trained language models with the hierarchical architecture, the approach avoids reliance on manual feature engineering and improves prediction accuracy and practicality. Experimental results show that the deep features outperform the other feature groups in representing text difficulty. The study also compares representations from different layers of the pre-trained model and selects the last layer as the best difficulty representation for text. The proposed model achieves accuracies of 89.76%, 85.32%, and 51.56% on three publicly available corpora, surpassing baseline models such as convolutional neural networks and long short-term memory networks. Finally, this thesis reframes the ARA task as a difficulty ranking problem solved with pairwise ranking methods. The experiments show a strong correlation in difficulty rankings across corpora, indicating that pairwise ranking captures a notion of difficulty that is consistent from one corpus to another. This finding further underscores the transfer learning ability of the proposed ranking model across diverse corpora.
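The reframing of ARA as a pairwise difficulty ranking problem can be illustrated with a minimal sketch. It is not the thesis's model: it assumes a plain linear scorer trained with a logistic pairwise loss, and the two numeric features per text, the toy values, and the function names are all hypothetical stand-ins for the deep features the abstract describes.

```python
import math
import random
from itertools import combinations


def make_pairs(features, levels):
    """Turn level-labelled texts into (harder - easier) difference vectors."""
    pairs = []
    for i, j in combinations(range(len(features)), 2):
        if levels[i] == levels[j]:
            continue  # texts of equal level give no ordering signal
        hard, easy = (i, j) if levels[i] > levels[j] else (j, i)
        pairs.append([h - e for h, e in zip(features[hard], features[easy])])
    return pairs


def train_pairwise(pairs, epochs=200, lr=0.1, seed=0):
    """Fit weights w so that w . (hard - easy) > 0, using logistic loss."""
    rng = random.Random(seed)
    order = list(pairs)  # copy so the caller's list is not shuffled
    w = [0.0] * len(order[0])
    for _ in range(epochs):
        rng.shuffle(order)
        for d in order:
            margin = sum(wi * di for wi, di in zip(w, d))
            # Gradient of log(1 + exp(-margin)) w.r.t. w is -d * sigmoid(-margin).
            grad_scale = 1.0 / (1.0 + math.exp(min(margin, 50.0)))
            w = [wi + lr * grad_scale * di for wi, di in zip(w, d)]
    return w


def rank_by_difficulty(w, features):
    """Return text indices ordered from easiest to hardest by score."""
    scores = [sum(wi * xi for wi, xi in zip(w, x)) for x in features]
    return sorted(range(len(features)), key=lambda k: scores[k])


# Hypothetical features per text: [sentence-length score, word-length score].
train_features = [[0.8, 4.0], [1.0, 4.2], [1.5, 4.8],
                  [1.7, 5.1], [2.3, 5.6], [2.5, 6.0]]
train_levels = [1, 1, 2, 2, 3, 3]  # graded difficulty labels of one corpus

pairs = make_pairs(train_features, train_levels)
w = train_pairwise(pairs)

# Texts from a second, differently graded corpus are only ordered, not labelled.
other_features = [[2.0, 5.5], [0.9, 4.1], [1.4, 4.6]]
print(rank_by_difficulty(w, other_features))  # prints [1, 2, 0]
```

Because the scorer is trained only on relative order, it never depends on the grade scale of the training corpus, which is one plausible reason a ranking formulation can carry over to corpora with different level schemes.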

Keywords/Search Tags:

Automatic Readability Assessment, Deep Neural Networks, Pre-trained Language Models, Pairwise Ranking Algorithm, Difficulty Ranking Problem, Cross-Corpus Prediction

Related items

1	Song Ranking Prediction Based On Random Fores
2	Research On Composition Of Symbolic Music Algorithm Based On Deep Neural Network
3	Automatic Labanotation Generation Of Continuous Movement Based On Deep Learning
4	Automatic readability assessment
5	Liu Yu Replaces Jin And Reconciles Wen And Wu
6	Measuring The Progress Of Neural Machine Translation:A Combined Approach Of Diagnostic Evaluation And Ranking
7	Research On Semantic Understanding And Advertising Ranking In E-commerce Sponsored Search
8	A Tentative Study Of L2 Lexical Difficulty And Grading Of Lexical Difficulty Factors
9	An Empirical Study Of College Students’ Self-perceptions Of Identity Orientations From The Perspective Of Identity Ranking
10	Research On Military Ranking In Qin