Biology databases: protein families, domains, functional sites, etc.

  • InterPro: database of protein families, domains and functional sites.
  • TIGRFAMs: protein families based on Hidden Markov Models or HMMs.
  • MEROPS: peptidases database.
  • TRANSFAC: transcription Factor Database.




If you know of additional data collections of protein family information which can be useful for biomedical text mining applications, please be so kind and contact me: martink@cnb.uam.es. This way you help to improve completeness of this list of text mining resources for biology and biomedicine. Especially for researches with a computer sciences or computational linguistics background which lack a more extensive knowledge of bioinformatics resources this information migth be useful. Some strategies have been implemented to analyze word frequencies related to protein family associated literature (see Andrade et al.)







HOME