Font Size: a A A

Bioinformatics Models And Application In Predicting Protein Solubility

Posted on:2015-01-16Degree:MasterType:Thesis
Country:ChinaCandidate:S J TangFull Text:PDF
GTID:2250330428498205Subject:Systems Biology
Abstract/Summary:PDF Full Text Request
Protein solubility plays a crucial role in structural and biophysical research. In manycircumstance, these studies require proteins in soluble status. Besides, upon the request ofconcerning drugs, there is also a demand in industry. Experimental approaches alwaysaccompanied by limitations, for instance, cost in both money and time. Therefore,predicting protein solubility by the use of bioinformatics models can promote the studieseffectively.This paper studied the bioinformatics models for predicting protein solubility, andselected several models to introduce and compare. Nevertheless, we analyzed and testedthese models on independent data set. In addition, we made a summary and evaluationfrom algorithms, performance, and functionality support perspectives.This paper extracted data from protein information database, selected amino acidsequence as well as some physical and chemical properties as features, and then classifiedamino acids. SVM was used to train data and build predictive model. The result of10-foldcross-validation represented capability of this model, where the accuracy reached0.759,with a sensitivity of0.761, specificity of0.757, and MCC value of0.518, indicating thatour predictive model had achieve a satisfactory performance.
Keywords/Search Tags:protein solubility, support vector machine (SVM), prediction, bioinformatics model
PDF Full Text Request
Related items