CombFunc: predicting protein function using heterogeneous data sources

Nucleic Acids Res. 2012 Jul;40(Web Server issue):W466-70. doi: 10.1093/nar/gks489. Epub 2012 May 27.

Abstract

Only a small fraction of known proteins have been functionally characterized, making protein function prediction essential to propose annotations for uncharacterized proteins. In recent years many function prediction methods have been developed using various sources of biological data from protein sequence and structure to gene expression data. Here we present the CombFunc web server, which makes Gene Ontology (GO)-based protein function predictions. CombFunc incorporates ConFunc, our existing function prediction method, with other approaches for function prediction that use protein sequence, gene expression and protein-protein interaction data. In benchmarking on a set of 1686 proteins CombFunc obtains precision and recall of 0.71 and 0.64 respectively for gene ontology molecular function terms. For biological process GO terms precision of 0.74 and recall of 0.41 is obtained. CombFunc is available at http://www.sbg.bio.ic.ac.uk/combfunc.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Gene Expression
  • Internet
  • Protein Interaction Maps
  • Proteins / chemistry
  • Proteins / genetics
  • Proteins / physiology*
  • Sequence Analysis, Protein
  • Software*

Substances

  • Proteins