ENCODE annotations with UniProtKB
The ENCODE sequences were analysed using the UniProt KnowledgeBase (UniProtKB) Annotation Pipeline.
Over 75% of the data set can be mapped to genes whose products have been manually annotated in UniProtKB, and most can also be classified according to the Gene Ontology (> 60% coverage of all three ontologies).
Of the ENCODE peptides, 212 have been identified in mass spectrometry experiments, and 617 protein interactions have been shown to exist for proteins in the data set, defining three major interaction clusters.
This work was done by Jorge Duarte, Paul Kersey, Phil Jones, Sam Kerrien, Daniela Wieser and Henning Hermjakob of
Rolf Apweiler's Sequence Databases group at the EBI.