Prediction Of Type ? Secreted Effectors Based On Word Embedding And Deep Learning

Posted on:2021-05-06

Degree:Master

Type:Thesis

Country:China

Candidate:X F Fu

Full Text:PDF

GTID:2480306503472204

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

The type ? secreted effectors(T3SEs)are one of the indispensable proteins in the growth and reproduction of Gram-negative bacteria.In particular,the pathogenesis of Gram-negative bacteria depends on the type ? secreted effectors,and by injecting T3 SEs into a host cell,the host cell's immunity can be destroyed.The high diversity of T3SE sequences and the lack of defined secretion signals make it difficult to identify and predict.Moreover,the related study of the pathological system associated with T3SE remains a hot topic in bioinformatics.Some computational tools have been developed to meet the growing demand for the recognition of T3SEs and the studies of type III secretion systems(T3SS).Although these tools can help biological experiments in certain procedures,there is still room for improvement,even for the current best model,as the existing methods adopt hand-designed feature and traditional machine learning methods.In this study,we propose two T3SE prediction models.The first model is a predictor model based on word embedding and deep learning,called WEDeepT3.Our work consists of three key steps.First,we train protein word vectors in a large-scale protein sequence corpus.Second,we combine word embeddings with traditional protein features(PSSM)to build a more comprehensive feature representation.Finally,we build a deep neural network model for predicting T3SE.The second model is a self-attention-based prediction model.Our work involves two key steps.First,we build a large-scale protein sequence corpus and pre-train the language model on the corpus.Then,we fine-tune the parameters of the saved model on the T3SE dataset.Both prediction models are universal models in protein sequence research,and experimental results show that the performance of both models exceeds the existing T3SE prediction models.

Keywords/Search Tags:

T3SE, Word Embedding, PSSM, Self-attention

PDF Full Text Request

Related items

1	The Binning Of Metagenomic Sequence Based On Statistical Model And Word Embedding
2	Key Technology And Application Of Understanding Elementary Mathematical Problem Based On Word Embedding
3	Research On Biomedical Word Sense Disambiguation Based On Attention Neural Network Model
4	Climate Event Detection Algorithm Based On Climate Category Word Embedding
5	Research On Network Embedding Method Based On Graph Attention Mechanism
6	{1, 2, 3} N-space On Any Word Seepage
7	Oxidoreductase Classification Prediction Based On HOG-Cos-PSSM Feature Description And TKSE Ensemble Method
8	Optimization And Application Of DNA Sequence Word Segmentation Method
9	Predicting MHC-Ⅱ Binding Affinity Using DE And PSSM
10	Transcriptome Analysis Of Acidovorax Citrulli--Cucumber Interaction And Preliminary Identification Of T3SE Genes In Acidovorax Citrulli