All data collected by the IMPC is freely availabe. Besides viewing in the web portal, it can also be downloaded for independent analysis. Several channels are available, each tailored for accessing data for individual items, small sets, or in bulk.
Data access through the web portal
Data on individual items – such as genes or experiments – are summarized on their dedicated pages in the web portal (gene or chart pages, respectively). Tables on those pages are connected with export links. Those export links create files that can be opened in a spreadsheet.
For summary data for small subsets of genes, the batch query tool provides a convenient interface that can be customized using a web form. This can be used to download phenotyping status, significant phenotypes, and other fields.
Snapshots of the entire dataset captured at the time of data releases are available for bulk download via FTP (ftp://ftp.ebi.ac.uk/pub/databases/impc/). These archives allow for a historical perspective.
The full range of up-to-date IMPC data can be accessed through an application programming interface, or API. The IMPC infrastructure is powered by [SOLR](https://lucene.apache.org/solr/guide/8_2/) and the full [SOLR query syntax](https://lucene.apache.org/solr/guide/8_2/the-standard-query-parser.html) and [SOLR query parameters](https://lucene.apache.org/solr/guide/8_2/common-query-parameters.html) are available for use.
Data are stored across several compartments, or cores, and each can be queried independently. The cores provide many fields that can be searched on. For example, it is possible to search and filter by phenotyping centre, mouse colony, phenotype, and many other settings. Many of these filtering criteria are shared across the cores, so there are common data access patterns. There are, however, fields that are specific to each core.