Research And Implementation Of Chinese Automatic Abstract Technology Based On Machine Learning

Posted on:2021-01-22

Degree:Master

Type:Thesis

Country:China

Candidate:L Y Xia

Full Text:PDF

GTID:2427330602483542

Subject:Applied statistics

Abstract/Summary:

PDF Full Text Request

In the era of big data,information on the Internet has exploded,and people are more searching for knowledge and browsing news on the Internet.Therefore,it is a common demand for people to obtain main information quickly and efficiently.The abstract is a refined summary of an article,which not only reflects the subject of the article but also greatly reduces the cost of obtaining the main information.With the development of computer technology,the use of computers to automatically obtain text summary information becomes a reality.In the field of natural language processing,continuously improving the accuracy of automatic summarization has become an important research direction.In this thesis,an in-depth study of extractable automatic summarization based on machine learning methods is conducted.In terms of feature extraction of text information,first summarizes the text features based on statistics and rules;secondly integrates Chinese linguistic features,such as:part-of-speech features,dependent syntactic features,semantic role features,and semantic dependent features;and finally introduces depth-based Word2vec word vector features for learning.The sentences in the text are converted into 347-dimensional feature vectors as input to the machine learning model.On the basis of considering the form of artificial summarization of the data set,using these rich features of text information,six classic regression algorithm models are used to automatically extract the text information.Compared with traditional methods,machine learning methods with rich feature sets improve the performance of automatic summarization.On this basis,the abstracts of current affairs news were automatically extracted using the model with excellent performance,and good results were obtained.

Keywords/Search Tags:

automatic summarization, Feature set, Word vector, Machine learning

PDF Full Text Request

Related items

1	Research And Realization Of Mathematics Automatic Marking System In Primary And Middle Schools Based On Machine Learning
2	Several Classification Algorithms And Their Applications In Statistical Learning
3	Research On Automatic Solution Model Of Word Problem Based On Digital Correlation Features
4	Research On Automatic Program Evaluation Algorithm Based On Machine Learning
5	Research And Realization Of Automatic Scoring System For Subjective Questions In Online Education
6	Application Of Support Vector Machine In The Analysis Of Population Data
7	The Prediction Of Sailing Speed Based On Support Vector Machine
8	Gym Exercises Monitoring Based On Pressure Sensing Smart Gloves
9	A Research On Learning Process Evaluation Based On Support Vector Machine
10	Application Of Support Vector Dimensionality Reduction Machine For Multi-instance Learning