A proposal of a hybrid model to predict the secondary protein structures based on amino acid sequences
Küçük Resim Yok
Tarih
2020
Yazarlar
Dergi Başlığı
Dergi ISSN
Cilt Başlığı
Yayıncı
Erişim Hakkı
info:eu-repo/semantics/openAccess
Özet
Aim: Predicting the secondary structure of proteins based on amino acid sequences is one of the most significant issues inbioinformatics that requires clarification. A high accuracy in determining the secondary structure is a key to programmaticallyuncover 3D structure of proteins and for individual drug applications of programmable proteins. The success rates in predicting thesecondary structures (Q3 score) were around 0.60 when relevant research was initiated and now the rates have reached to the limitof 0.80.Material and Methods: In this study, the secondary structure was predicted through 3-state (Helix, Strand and Turn). Artificial neuralnetworks and machine learning algorithms were used as a hybrid model and a framework was developed. The probability of thepaired presence of amino acids in sequences was used in digitizing amino acid sequences. Calculations were completed separatelyfor each secondary structural element and the cascade mean filter was used as a threshold method to clarify the differences. Thegenerated matrices were used to digitize the protein sequences. Secondary structure was predicted through the Helix-Strand, HelixTurn, Strand-Turn, and subsequently, a final decision as Helix, Strand and Turn was reached via machine learning models.Results: It was determined that the success rates in the dual estimation of secondary structural elements were 0.797 for helixstrand, 0.848 for helix-turn and 0.829 for strand-turn. The average success rate for paired estimation of secondary structuralelements was calculated as 0.824. In the proposed model, accuracy was calculated as 0.742 for Helix, 0.703 for Strand and 0.880for Turn. Q3 score was obtained as 0.775.
Açıklama
Anahtar Kelimeler
Kaynak
Annals of Medical Research
WoS Q Değeri
Scopus Q Değeri
Cilt
27
Sayı
1