Application of molecular descriptors for recognition of phosphorylation sites in amino acid sequences

Karasev D.A.1 , Savosina P.I.1, Sobolev B.N.2, Filimonov D.A.2, Lagunin A.A.1

1. Institute of Biomedical Chemistry, Moscow, Russia; Pirogov Russian National Research Medical University, Moscow, Russia
2. Institute of Biomedical Chemistry, Moscow, Russia
Section: OMICS-Technologies
DOI: 10.18097/PBMC20176305423      PubMed Id: 29080875
Year: 2017  Volume: 63  Issue: 5  Pages: 423-427
Recognition of the phosphorylation sites in proteins is required for reconstruction of regulatory processes in living systems. This task is complicated because the phosphorylation motifs in amino acid sequences are considerably degenerated. To improve the prediction efficacy researchers often use additional descriptors, which should reflect physicochemical features of site-surrounding regions. We have evaluated the reasonability of this approach by applying molecular descriptors (MNA) for structural presentation of the peptide segments. Comparative testing was performed using the prognostic method PASS and two input data types: sets of the MNA descriptors represented peptides as chemical structures and amino acid sequences written using a one-letter code. Training sets were classified in accordance with the established types of the enzymes (protein kinases), modifying corresponding phosphorylation sites. The accuracy estimates obtained by prognosis validation for various classes of substrates were significantly different with both the letters and molecular descriptors. In case of the letter description, the prognosis accuracy demonstrated less dependence on the length of peptides in the training set, while in the case of structural descriptors the accuracy level was determined by the peptide size and descriptor characteristics (MNA levels). The maximal prognosis accuracy related to various kinase families was achieved at different sizes of molecular fragments covered by the MNA descriptors of corresponding levels. This obviously reflected structural differences in surroundings of phosphorylation sites modified by various protein kinases. The use of molecular descriptors provided the prognostic results comparable with the results obtained using traditional letter representation. The prognosis accuracy demonstrated less dependence on the method describing site-surrounding peptides at higher accuracy rates. Applying the MNA descriptors it is possible to achieve better accuracy in the cases when the letter description cannot provide acceptable accuracy.
Download PDF:  
Keywords: protein phosphorylation, molecular descriptors, amino acid sequences, phosphorylation motifs, site prediction

Karasev, D. A., Savosina, P. I., Sobolev, B. N., Filimonov, D. A., Lagunin, A. A. (2017). Application of molecular descriptors for recognition of phosphorylation sites in amino acid sequences. Biomeditsinskaya Khimiya, 63(5), 423-427.
 2024 (vol 70)
 2023 (vol 69)
 2022 (vol 68)
 2021 (vol 67)
 2020 (vol 66)
 2019 (vol 65)
 2018 (vol 64)
 2017 (vol 63)
 2016 (vol 62)
 2015 (vol 61)
 2014 (vol 60)
 2013 (vol 59)
 2012 (vol 58)
 2011 (vol 57)
 2010 (vol 56)
 2009 (vol 55)
 2008 (vol 54)
 2007 (vol 53)
 2006 (vol 52)
 2005 (vol 51)
 2004 (vol 50)
 2003 (vol 49)