Review is devoted to computational prediction of functionally related proteins by comparative genomics. Growing possibilities of biotechnology for genome sequencing lead to generation of sequences for millions of genes. However, function of majority of these genes is unknown, and can be determined experimentally only for a few of them. Therefore, accurate and robust methods for in silico prediction (annotation) of gene functions are highly required. We describe here the main techniques of comparative genomics, including the standard method based on transferring functions between homologous sequences and also context-based methods, including phylogenetic profiles and gene-neighbor approaches. Modern methods of comparative genomics allow obtaining correct functional annotations for more than a half of all organism proteins.
Pyatnitskiy M.A., Lisitsa A.V., Archakov A.I. (2009) Prediction of functionally related proteins by comparative genomics in silico. Biomeditsinskaya Khimiya, 55(3), 230-246.
Pyatnitskiy M.A. et al. Prediction of functionally related proteins by comparative genomics in silico // Biomeditsinskaya Khimiya. - 2009. - V. 55. -N 3. - P. 230-246.
Pyatnitskiy M.A. et al., "Prediction of functionally related proteins by comparative genomics in silico." Biomeditsinskaya Khimiya 55.3 (2009): 230-246.
Pyatnitskiy, M. A., Lisitsa, A. V., Archakov, A. I. (2009). Prediction of functionally related proteins by comparative genomics in silico. Biomeditsinskaya Khimiya, 55(3), 230-246.
Gavin A.C., Aloy P., Grandi P., Krause R., Boesche M., Marzioch M., Rau C., Jensen L.J., Bastuck S. et al. (2006) Nature, 440, 631-636. Scholar google search
Li S., Armstrong C.M., Bertin N., Ge H., Milstein S., Boxem M., Vidalain P.O., Han J.D., Chesneau A. et al. (2004) Science, 303, 540-543. CrossRef Scholar google search
Gabaldon T., Huynen M.A. (2004) Cell. Mol. Life Sci., 61, 930-944. Scholar google search
Huynen M.A., Snel B., von Mering C., Bork P. (2003) Curr. Opin. Cell. Biol., 15, 191-198. Scholar google search
Eisenberg D., Marcotte E.M., Xenarios I., Yeates T.O. (2000) Nature, 405, 823-826. Scholar google search
Marcotte E.M., Pellegrini M., Ng H.L., Rice D.W., Yeates T.O., Eisenberg D. (1999) Science, 285, 751-753. CrossRef Scholar google search
Ravasz E., Somera A.L., Mongru D.A., Oltvai Z.N., Barabasi A.L. (2002) Science, 297, 1551-1555. CrossRef Scholar google search
Wu J., Mellor J.C., DeLisi C. (2005) Genome Inform., 16, 142-149. Scholar google search
Altschul S.F., Madden T.L., Schaffer A.A., Zhang J., Zhang Z., Miller W., Lipman D.J. (1997) Nucleic Acids Res., 25, 3389-3402. Scholar google search
Gusfield D. (1997) Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. Cambridge University Press, Cambridge, UK. Scholar google search
Mushegian A.R. (2007) Foundations of Comparative Genomics. Academic Press. Scholar google search
Campuzano V., Montermini L., Molto M.D., Pianese L., Cossee M., Cavalcanti F., Monros E., Rodius F., Duclos F., et al. (1996) Science, 271, 1423-1427. CrossRef Scholar google search
Huynen M.A., Snel B., Bork P., Gibson T.J. (2001) Hum. Mol. Genet., 10, 2463-2468. Scholar google search
Chen O.S., Hemenway S., Kaplan J. (2002) Proc. Natl. Acad. Sci. USA, 99, 12321-12326. Scholar google search
Wu J., Kasif S., DeLisi C. (2003) Bioinformatics, 19, 1524-1530. Scholar google search
Loganantharaj R., Atwi M. (2007) BMC Bioinformatics, 8, Suppl. 7, S25. Scholar google search
Pagel P., Wong P., Frishman D. (2004) J. Mol. Biol, 344, 1331-1346. Scholar google search
Rodionov D.A., Gelfand M.S. (2005) Trends Genet., 21, 385-389. Scholar google search
Haft D.H., Paulsen I.T., Ward N., Selengut J.D. (2006) BMC Biol., 4, 29. Scholar google search
Li J.B., Gerdes J.M., Haycraft C.J., Fan Y., Teslovich T.M., May-Simera H., Li H., Blacque O.E., Li L. et al. (2004) Cell, 117, 541-552. Scholar google search
Huynen M.A., Diaz-Lazcoz Y., Bork P. (1997) Trends Genet., 13, 389-390. Scholar google search
Makarova K.S., Wolf Y.I., Koonin E.V. (2003) Trends Genet., 19, 172-176. Scholar google search
Morett E., Korbel J.O., Rajan E., Saab-Rincon G., Olvera L., Olvera M., Schmidt S., Snel B., Bork P. (2003) Nat. Biotechnol., 21, 790-795. Scholar google search
Negre B., Casillas S., Suzanne M., Sanchez-Herrero E., Akam M., Nefedov M., Barbadilla A., de Jong P., Ruiz A. (2005) Genome Res., 15, 692-700. Scholar google search
Han J.D., Dupuy D., Bertin N., Cusick M.E., Vidal M. (2005) Nat. Biotechnol., 23, 839-844. Scholar google search
Bader J.S., Chaudhuri A., Rothberg J.M., Chant J. (2004) Nat. Biotechnol., 22, 78-85. Scholar google search
Date S.V., Marcotte E.M. (2003) Nat. Biotechnol., 21, 1055-1062. Scholar google search
Ogata H., Goto S., Sato K., Fujibuchi W., Bono H., Kanehisa M. (1999) Nucleic Acids Res., 27, 29-34. Scholar google search
Ashburner M., Ball C.A., Blake J.A., Botstein D., Butler H., Cherry J.M., Davis A.P., Dolinski K., Dwight S.S. et al. (2000) Nat. Genet., 25, 25-29. Scholar google search
Schlicker A., Domingues F.S., Rahnenfuhrer J., Lengauer T. (2006) BMC Bioinformatics, 7, 302. Scholar google search
Strong M., Graeber T.G., Beeby M., Pellegrini M., Thompson M.J., Yeates T.O., Eisenberg D. (2003) Nucleic Acids Res., 31, 7099-7109. Scholar google search
Strong M., Mallick P., Pellegrini M., Thompson M.J., Eisenberg D. (2003) Genome Biol., 4, R59. Scholar google search
Bowers P.M., Pellegrini M., Thompson M.J., Fierro J., Yeates T.O., Eisenberg D. (2004) Genome Biol., 5, R35. Scholar google search
Mellor J.C., Yanai I., Clodfelter K.H., Mintseris J., DeLisi C. (2002) Nucleic Acids Res., 30, 306-309. Scholar google search
Date S.V., Marcotte E.M. (2005) Bioinformatics, 21, 2558-2559. Scholar google search
von Mering C., Jensen L.J., Kuhn M., Chaffron S., Doerks T., Kruger B., Snel B., Bork P. (2007) Nucleic Acids Res., 35, D358-D362. Scholar google search