Learn to use InterPro a database that integrates protein signature data from the major protein databases into a single comprehensive resource (Gene3D, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, SUPERFAMILY and TIGRFAMs ). InterPro can be used to search for data on protein families, domains and sites. Use InterProScan to quickly find domain and family information about novel protein sequences. Learn to use this comprehensive resource to quickly and easily gather protein signature information from all the protein signature databases at once.

You will learn:

  • to use both the basic and advanced search tools to find detailed information on entries in InterPro
  • how to understand and customize the display of your search results
  • to use InterProScan to query novel protein sequences for information on domains and families


This tutorial is a part of the tutorial group Motifs and Domains. You might find the other tutorials in the group interesting:

Melina II: A Web-Based Tool for Promoter Analysis

MINT: Molecular Interaction Database

Gibbs Motif Sampler: A motif finder and analysis tool

MEME Suite Sequence Annotation Tools: MEME suite motif finding and annotating tools

MDscan: Motif Discovery scan for nucleotide and protein motifs

PROSITE: Database of protein domains, families and functional sites

MEME Suite TOMTOM and GOMO algorithms: Motif discovery tool that can detect conserved motifs in a set of DNA or protein sequences that you provide

MEME Suite GLAM2 Algorithm: Part of a motif discovery tool that can detect conserved motifs in a set of DNA or protein sequences.

MEME Algorithm: Multiple Expectation Maximum for Motif Elicitation

MEME Suite Overview: Motif-based sequence analysis tools

SMART: Protein domain annotation and analysis of domain architectures

This tutorial is a part of the tutorial group Interaction resources. You might find the other tutorials in the group interesting:

Pathway Interaction Database: A resource of pathway and network data and displays

MINT: Molecular Interaction Database

Cytoscape: An open-source software platform used for visualization and analysis of molecular interaction and network data

BiologicalNetworks: Analyze and visualize molecular interaction networks

BioSystems: Database of Biological Systems

Reactome Legacy: Older version of the current Reactome knowledgebase of biological processes.

Reactome: Knowledgebase of biological processes

GeneMANIA: GeneMANIA: Fast Gene Function Predictions

GenMAPP: A freely available open source software application for visualizing microarray data in the context of biological pathways.

VisANT: A web-based or downloadable software platform used for visualization and analysis of networks and interaction pathways

IntAct protein interaction database: IntAct is an open source database and analysis resource for protein interaction data

KEGG: KEGG, The Kyoto Encyclopedia of Genes and Genomes


Algorithms and Analysis : This category contains various tools that may help perform analysis of different genomics data types. This may include sequence alignment, large-scale or complex queries, motif finding, or comparative assessments.

EBI : This category includes all resources maintained at the European Bioinformatics Institute (EBI)

Proteins : Tools that are primarily used in the storage, retrieval, or exploration of amino acid based data. Some tools may also involve nucleotide sequence information.


Video Tip of the Week: Aquaria, streamlined access to protein structures for biologists: This week's Video Tip of the Week is Aquaria, a new resource for exploring protein structures, mutations, and similarities to other proteins. It's a very well-designed and interactive experience for en...

NAR database issue (always a treasure trove): The advance access release of most of the  NAR database issue articles is out. As usual, this this database issue includes a wealth of new and updated data repositories and analysis tools. We'll be wri...

Update on Friday's Service Announcement - All's Well: From the looks of my searches this morning, all the weekend updates and downtime that I wrote about on Friday has ended happily. I am now getting lovely results from InterPro rather than error messages...

Service Announcement: EMBL-EBI Hinxton data centre offline for scheduled maintenance: Was following a protein information trail that ended in the error message "Sorry! That service is temporarily unavailable. Data centre offline until Monday, Oct 24, 2011" If you were planning on stayin...

Guest Post: WAVe - Pedro Lopes: This next post in our continuing semi-regular Guest Post series is from Pedro Lopez, developer of WAVe at the University of Aveiro Bioinformatic Group in Aveiro Portugal. If you are a provider of a fre...


Recent BioMed Central research articles citing this resource

De Silva Ryan Jeremy et al., Genetic clustering and polymorphism of the merozoite surface protein-3 of Plasmodium knowlesi clinical isolates from Peninsular Malaysia. Parasites Vectors (2017) doi:10.1186/s13071-016-1935-1

Kim Seongok et al., The complete genome sequence of Cronobacter sakazakii ATCC 29544 T , a food-borne pathogen, isolated from a child’s throat. Gut Pathogens (2017) doi:10.1186/s13099-016-0150-0

Caragata P. E. et al., The transcriptome of the mosquito Aedes fluviatilis (Diptera: Culicidae), and transcriptional changes associated with its native Wolbachia infection Multicellular invertebrate genomics. BMC Genomics (2017) doi:10.1186/s12864-016-3441-4

Lisov V. Alexander et al., Xylanases of Cellulomonas flavigena : expression, biochemical characterization, and biotechnological potential. AMB Express (2017) doi:10.1186/s13568-016-0308-7

Adams Jon et al., A workforce survey of Australian chiropractic: the profile and practice features of a nationally representative sample of 2,005 chiropractors Integration into healthcare. BMC Complementary and Alternative Medicine (2017) doi:10.1186/s12906-016-1542-x