Nuclc. Acids. Res. OUP
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH ARTICLES TABLE OF CONTENTS
Compilation Paper
Categories List
Alphabetical List
Search Summary Papers

InterPro

http://www.ebi.ac.uk/interpro/

Apweiler, R.1, Attwood, T.K.2, Bairoch, A.3, Bateman, A.4, Biswas, M1, Bork, P6, Bucher, P.5, Copley, R.6, Croning, M.D.R.1, Durbin, R.4, Falquet, L.5, Fleischmann, W.1, Gouzy, J.7, Griffiths-Jones, S.4, Hermjakob, H.1, Hulo, N.3, Jonassen, I.8, Kahn, D.7, Kanapin, A.1, Karavidopoulou, Y.1, Lopez, R.1, Mulder, N.J.1, Oinn, T.M.1, Pagni, M.5, Ponting, C.P.9, Servant, F.1, Sigrist, C.J.A.3, Zdobnov, E.M.1

1EMBL Outstation - European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
2School of Biological Sciences, The University of Manchester, Manchester, UK
3Swiss Institute for Bioinformatics, Geneva, Switzerland
4The Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
5Swiss Institute for Experimental Cancer Research, Lausanne, Switzerland
6Biocomputing Unit EMBL, Heidelberg, Germany
7CNRS/INRA, Toulouse, France
8Department of Informatics, University of Bergen, HIB, Bergen, Norway
9MRC Functional Genetics Unit, Department of Human Anatomy &Genetics, University of Oxford, UK

Contact   mulder@ebi.ac.uk


Database Description

Databases with signatures diagnostic for protein families, domains or functional sites are important tools for the computational functional classification of newly determined sequences that lack biochemical characterisation. InterPro is an integrated documentation resource for protein families, domains and functional sites, which amalgamates the efforts of the PROSITE, PRINTS, Pfam, ProDom and SMART database projects. Each InterPro entry includes a functional description, annotation, literature references, and links back to the relevant member database(s). A new feature includes mapping of InterPro entries to the relevant Gene Ontology (GO) terms, a useful tool for automatic mapping of proteins to GO terms. Release 3.2 of InterPro (July 2001) contains 3,939 entries, representing 2850 families, 1009 domains, 65 repeats and 15 sites of post-translational modification (PTMs). Each InterPro entry lists all the matches against SWISS-PROT and TrEMBL (nearly 2,000,000 hits from around 580,000 proteins in SWISS-PROT and TrEMBL). Integration of the next member database, TIGRFAMs, is currently in progress. The database is accessible for text- and sequence-based searches at http://www.ebi.ac.uk/interpro/. Questions can be emailed to Interhelp@ebi.ac.uk.

Recent Developments

New diagnostic signatures from the member databases have been incorporated into InterPro. The SMART (a Simple Modular Architecture Research Tool) database, which allows the identification and annotation of genetically mobile domains and the analysis of domain architectures has been integrated into InterPro. This includes more than 500 domain families found in signalling, extracellular and chromatin-associated proteins. Currently in progress is the integration of the next member database, TIGRFAMs, which provides Hidden Markov Models to identify protein families with identical functions. Other new features includes an additional graphical user interface for viewing protein matches in a condensed graphical view, as well as mapping to Gene Ontology (GO) terms. Where possible, InterPro entries have been manually GO terms so that the terms apply to all proteins matching the entry.

REFERENCES

  1. Apweiler R., Attwood T.K., Bairoch A., Bateman A., Birney E., Biswas M., Bucher P., Cerutti L., Corpet F., Croning M.D., Durbin R., Falquet L., Fleischmann W., Gouzy J., Hermjakob H., Hulo N., Jonassen I., Kahn D., Kanapin A., Karavidopoulou Y., Lopez R., Marx B., Mulder N.J., Oinn T.M., Pagni M., Servant F., Sigrist C.J., Zdobnov E.M. (2001) The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucleic Acids Res., 29, 37-40.

Category   Protein Sequence Motifs

Go to the abstract in the NAR 2001 Database Issue.

 

Compilation Paper
Categories List
Alphabetical List
Search Summary Papers