Help & Documentation

MCP Solr Server

What is it?

The Solr server is the most powerful and flexible of the four servers. It provides access to the main IMPC database, which is organised into 10 specialised collections (called “cores”). Each core contains a different type of data.

What can you ask about?

This server covers the broadest range of IMPC data. Here is what each data collection contains:

Data collectionWhat it containsWhen to use it
geneGene information: symbols, names, IDs, associated phenotypes and diseasesWhen you want to know about a specific gene or find genes linked to a phenotype
genotype-phenotypeStatistically significant results connecting genes to phenotypesWhen you want to find what phenotypes a gene causes, or which genes cause a phenotype
statistical-resultDetailed statistical analysis results with effect sizes and p-valuesWhen you need the full statistical evidence behind a gene-phenotype association
mpMammalian Phenotype (MP) ontology terms and their hierarchyWhen you need to look up phenotype term definitions or navigate the phenotype hierarchy
impc_imagesImages of mutant mice: X-ray, lacZ staining, histopathology, etc.When you want to find images related to a gene or phenotype
phenodigmGene-disease associations calculated by the PhenoDigm algorithmWhen you want to find mouse models relevant to a human disease
experimentRaw experimental measurements (QC-passed)When you need the actual experimental data points and measurements
mgi-phenotypePhenotype annotations from Mouse Genome Informatics (MGI)When you want to cross-reference phenotype data with the MGI database
pipelineExperimental protocols, procedures, and parameter definitionsWhen you want to know what tests were performed and how
productProduct information and availabilityWhen you want to check product availability for a gene or allele

Example questions you can ask:

Finding genes and phenotypes:

  • “What phenotypes are associated with the gene Bmp4?”
  • “Which genes are linked to abnormal heart morphology?”
  • “Show me all significant phenotype associations for Notch1.”
  • “What genes have been phenotyped on chromosome 11?”

Disease associations:

  • “What mouse models are relevant to Alzheimer’s disease?”
  • “Which IMPC genes are associated with diabetes?”
  • “Find disease associations for the gene Lepr.”

Images and experiments:

  • “Show me X-ray images for Akt2 knockout mice.”
  • “What experimental procedures were used for the gene Pax6?”
  • “Find lacZ expression images for Bmp4.”

Statistics and data:

  • “What are the most significant phenotypes for Arid1b with p-value below 0.0001?”
  • “How many genes have been phenotyped so far in IMPC?”
  • “What is the statistical evidence for Mecp2 and abnormal behavior?”

What to expect in responses

The AI will return structured data that may include gene symbols, phenotype terms, p-values, effect sizes, image links, and more, depending on your question. Results are typically returned in a readable format with the most relevant information highlighted.