Research And Application Of Fraud Phone Recognition Method Based On NLP

Posted on:2022-11-19

Degree:Master

Type:Thesis

Country:China

Candidate:T T Jiang

Full Text:PDF

GTID:2506306770969299

Subject:Computer Software and Application of Computer

Abstract/Summary:

PDF Full Text Request

In recent years,domestic telecommunications network fraud activities have been rampant,and the incidence of cases has increased year by year,posing a great threat to social stability.Telephone fraud is a common form of telecommunications fraud,which refers to the criminal activities that fraudsters defraud money of victims through the phone.It occupies a relatively high proportion especially among the elderly.How to manage telephone fraud effectively is a major challenge for operators.At present,natural language processing（NLP）technology is in the stage of rapid development.NLP tasks can be divided into four categories including sequence labeling tasks,classification tasks,sentence relationship judgment,and generative tasks.Text classification is a classic problem in classification tasks,which is widely used in spam filtering,news classification,part-of-speech tagging,emotion analysis and other fields.Inspired by this,the thesis applies text classification tasks to fraud phone identification at the terminal.Firstly,we convert the voice on phone into text,then use text classification technology to classify phone texts automatically to determine whether it is a fraudulent call or not.The main works of this thesis are as follows:⑴ Building phone text datasets.There are no public telephone text datasets at home and abroad currently,because telephone calls involve personal privacy.The data collected in this thesis comes from various phone fraud cases that users have encountered on websites such as Baidu,Know,Micro Blog,and Little Red Book.Detailed introductions about datasets are in section 3.2.⑵ Two kinds of fraud phone classification models based on attention mechanism are proposed.The first is a classification model based on Bi LSTM-Attention.That is,attention mechanism is introduced after the bidirectional long short term memory（Bi LSTM）neural network model,which solves the problem that a single neural network cannot extract key features effectively.The second is the classification model that integrates the self-attention mechanism.That is,self-attention mechanism is introduced after the convolutional neural networks（CNN）model.Self-attention mechanism can extract the internal dependencies of the sequence and reduce the reliance on external information.Comparative experiments are carried out on phone text datasets to verify the performance of the two proposed models.The experimental results show that the proposed models have improved in terms of accuracy and F1 value and so on.⑶ A fraud phone fusion classification model based on improved Transformer and CNN is proposed.The output vector after word embedding model is input to Encoder of Transformer and Text CNN model respectively in order to give full play to the advantages of Transformer in extracting long-distance features and CNN in extracting local features.Then we splice with the two output vectors and input to the softmax layers to classify finally.The performance of the model is verified on the phone text datasets.

Keywords/Search Tags:

fraud phone, phone text, NLP, text classification, attention mechanism

PDF Full Text Request

Related items

1	Research On Classification Method Of Fraud Phone Text Based On Deep Learning
2	Research On Precision Classification Technology Of Appeal Short Text
3	Research On Multi-Label Judicial Text Classification Algorithm Based On Attention Mechanism
4	Research On Chinese Case Text Understanding Technology
5	Design And Implementation Of Legal Text Retrieval And Classification System
6	Research On Information Extraction Method And Its Application In Knowledge Graphs Of The Government Text
7	Research On Corporate Service Text Classification System Based On Domestic Policy
8	Research On Commonweal Judicial Intelligence Based On Text Mining
9	Research On Text Classification Of Government-people Interactive Messages Based On Deep Learning
10	Qualitative Analysis Of Several Subsequent Infringe Property Behaviors Of Stealing Mobile Phone