Incomplete Multi-view Inductive Matrix Completion For Gene-disease Associations Prediction

Posted on:2023-04-28

Degree:Master

Type:Thesis

Country:China

Candidate:J Xu

Full Text:PDF

GTID:2530306836973579

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

Genetic diseases seriously threaten human health,and deciphering the associations between genes and diseases has become an important goal of biomedical research.Discovering genes closely related to diseases is extremely important for disease prevention,diagnosis and treatment.With the continuous mining of various biological data and the rapid development of computer technology,many methods have proposed for gene-disease associations prediction.However,most existing methods use a two-step strategy with the beginning feature fusion step and the following associations prediction step,ignoring the reciprocal relationship between these two steps.Meanwhile,these methods do not fully excavate the multi-source feature information of genes and diseases,which are often affected by data redundancy and missing.In response to these problems,this article proposes two models from different perspectives,the main research contents are as follows:(1)Aiming at the shortcoming of the existing two-stage models and the problem of multi-source data fusion,a one-step multi-view inductive matrix completion model is proposed.The model employs the multi-view representation learning to fully capture the consistency and complementary information of the multi-view data,and thus obtain the common latent representations for genes/diseases.It is also suitable for incomplete multi-view data.In addition,we also introduce the adaptive weighting scheme into traditional inductive matrix completion model,penalizing the known and the unknown associations differently to adapt large-scale PU(Positive-Unlabeled)learning problem.Multi-view representation learning and weighted inductive matrix completion are integrated into one jointly model to learn latent representations and predictive matrices simultaneously,and promote each other,which can not only improve latent representation learning,but also boost the final prediction performance.Finally,extensive experiments conducted on real gene-disease dataset demonstrate the superior performance of our method compared to other methods.(2)Aiming at the limitations of shallow linear models in extracting nonlinear features and learning complex associations,a deep multi-view inductive matrix completion model is proposed.The model integrates information from multiple views into an intact representation by the nested autoencoder networks.Our model jointly performs view-specific representation learning(with the inner autoencoder networks)and multi-view shared representation learning(with the outer autoencoder networks)in a unified framework,flexibly balancing the complementary and consistency of multi-view data.The multi-modal low-rank bilinear pooling network for associations prediction is used to fully mine complex gene-disease associations.Finally,experimental results on the real-world dataset demonstrate its effectiveness and superiority.

Keywords/Search Tags:

Low-rank Learning, Multi-view Learning, Deep Latent Representation Learning, Inductive Matrix Completion, Gene-disease Associations Prediction

PDF Full Text Request

Related items

1	Research On Prediction Of Microbe-Disease Associations Based On Multi-View Representation Learning
2	A Multi-level Feature Augmented Deep Graph Representation Learning Model
3	Link Prediction Via Modular Structures And Network Representation Learning
4	Associations Prediction Between SnoRNA And Diseases By Matrix Inference
5	The Study Of MiRNA Prediction Based On Matrix Completion And Active Learning
6	Research On Representation Learning Method For Transcriptome Data
7	Research On Protein Fold Recognition Based On Multi-view Learning Algorithm
8	A Research On Deep Multi-View Clustering Model Based On Graph Representation Learning
9	Research On Protein Function Prediction Based On Iterative Features And Graph Features
10	Learning on the Graph: Link Prediction, Multi-label Learning, and Applications to Integrative Complex Disease Studies