Microbiome Explorer

Compare the taxonomic profiles from a collection of microbiome samples

The world is filled with an incredible diversity of microbial life, and one of the achievements of modern technology is the ability to construct a rather accurate census of the collection of microbes which are present in any particular physical specimen.
The Microbiome Explorer is designed to provide a convenient interface for comparing the collections of microbes which are present across a group of samples.

Example Datasets

All datasets were obtained from the amazing curatedMetagenomicData project (doi: 10.1038/nmeth.4468).

DiseaseBrowserNum. StudiesNum. Samples
NonelinkN/AN/A
Inflammatory Bowel Disease (IBD)link62,881
Colorectal Cancer (CRC)link111,395
Cirrhosislink2282
Type 1 Diabetes (T1D)link3214
Shiga-toxigenic E. coli (STEC)link142

Input Data

To display a new dataset using the Microbiome Explorer simply format an abundance table (and optional annotation table) in the format described below, and then upload them to the blank Microbiome Explorer app provided above.
Once the data has loaded, use the “Clone” button to save a copy of the updated app as HTML.

Input Data: Abundance Table
The microbial abundance values may be provided as either:

  • Integer values indicating the number of reads assigned to each taxon, or
  • Scalar (percentage or proportion) values indicating the fraction of the community predicted to consist of a particular taxon

The abundance table may be CSV or TSV, with support for gzip-compression. The table must be oriented such that each row contains information for a different taxon and each column denotes a different sample. The first row contains sample names and the first column contains taxon names. The format of taxon names is shown below, using a particular format which includes each nested rank delimited with a | or ; character. Any rows starting with # are ignored.

SID0002_gtiSID0004_cleSID0006_evhSID0008_kcuSID0009_rhg
k__Bacteria|p__Bacteroidetes|c__Bacteroidia|o__Bacteroidales|f__Bacteroidaceae|g__Bacteroides|s__Bacteroides_vulgatus16.298614.899120.18930.3181520.1322
k__Bacteria|p__Bacteroidetes|c__Bacteroidia|o__Bacteroidales|f__Bacteroidaceae|g__Bacteroides|s__Bacteroides_uniformis10.408119.41032.92210.275284.07602
k__Bacteria|p__Bacteroidetes|c__Bacteroidia|o__Bacteroidales|f__Bacteroidaceae|g__Bacteroides|s__Bacteroides_plebeius9.710260000
k__Bacteria|p__Firmicutes|c__Clostridia|o__Clostridiales|f__Lachnospiraceae|g__Lachnospiraceae_unclassified|s__Eubacterium_rectale8.055870.021322.9665616.82330.12402
k__Bacteria|p__Bacteroidetes|c__Bacteroidia|o__Bacteroidales|f__Rikenellaceae|g__Alistipes|s__Alistipes_finegoldii6.7836000.016730
k__Bacteria|p__Bacteroidetes|c__Bacteroidia|o__Bacteroidales|f__Bacteroidaceae|g__Bacteroides|s__Bacteroides_caccae6.5360802.5360200
k__Bacteria|p__Bacteroidetes|c__Bacteroidia|o__Bacteroidales|f__Tannerellaceae|g__Parabacteroides|s__Parabacteroides_distasonis5.594685.35512.104841.0540512.0291
k__Bacteria|p__Firmicutes|c__Bacilli|o__Lactobacillales|f__Streptococcaceae|g__Streptococcus|s__Streptococcus_salivarius3.929240.006633.04441.176550.01676
k__Bacteria|p__Firmicutes|c__Clostridia|o__Clostridiales|f__Eubacteriaceae|g__Eubacterium|s__Eubacterium_sp_CAG_1803.4011805.5181200
k__Bacteria|p__Bacteroidetes|c__Bacteroidia|o__Bacteroidales|f__Tannerellaceae|g__Parabacteroides|s__Parabacteroides_merdae2.7116901.403610.888620

Input Data: Annotation Table
The annotation table has a much more flexible format. Each row contains information for a different sample, and the first column contains the sample identifier (matching the column names from the abundance table). The first row must contain the header labels for each column.

subject_idstudy_conditiondiseaseage_categoryNCBI_accessiondisease_subtypedisease_stage
SID0002_gtiloombaSUB_subject-0002controlfatty_liveradultSRR5275484NAFLD0
SID0004_cleloombaSUB_subject-0004controlfatty_liveradultSRR5275483NAFLD0
SID0006_evhloombaSUB_subject-0006cirrhosiscirrhosisseniorSRR5275481cirrhosis4
SID0008_kculoombaSUB_subject-0008controlfatty_liveradultSRR5275480NAFLD0
SID0009_rhgloombaSUB_subject-0009controlfatty_liveradultSRR5275479NAFLD0

Analysis Features

The Microbiome Explorer can be used to display a number of different types of analysis for a single dataset. Use the Add and Remove buttons to modify the number of different analyses which are displayed. Any one of the individual displays may be modified using the menus provided at the top of each section.

Every one of the different analysis types may be annotated with a customized legend text, making it easy to describe the major findings of each display to the reader.

Abundant Organisms

Display the most abundant organisms as either a stacked bar chart or a heatmap.

Options:

  • Plot Type: Stacked Bars or Heatmap
  • Taxonomic Level: Class, Order, Family, etc.
  • Filter Samples By: Only show those samples which match the metadata filter
  • Color Samples By: Add a marginal color annotation indicating the value of one or more metadata annotations
  • Max # of Organisms: Set the number of organisms displayed
  • Sort Samples By: Optionally sort samples based on the provided metadata
  • Heatmap Color Palette: Controls the colors used in the heatmap
  • Annotation Color Palette: Controls the colors used in the marginal sample annotations
  • Annotation Size: Controls the relative size of the marginal sample annotations
  • Figure Height: Controls the height of the figure (width scales with the browser)
  • Title: Add a title to the plot
  • Legend: Add legend text below the plot

Ordination (PCA/t-SNE)

Display a scatterplot representing the similarity of samples in terms of beta-diversity using either PCA or t-SNE.

Options:

  • Ordination Type: PCA or t-SNE
  • Taxonomic Level: Class, Order, Family, etc.
  • Filter Samples By: Only show those samples which match the metadata filter
  • Color Samples By: Add a color to each point indicating the value of one or more metadata annotations
  • Show PCA Loadings: Optionally add a line to the PCA plots showing which organisms are associated with different principal coordinates
  • 3D Plot: Optionally display a three-dimensional plot which can be rotated by the viewer (by clicking and dragging on the plot)
  • Title: Add a title to the plot
  • Legend: Add legend text below the plot

Alpha Diversity

Display a histogram representing the complexity of samples in terms of alpha-diversity using either Shannon or Simpson diversity.

Options:

  • Diversity Metric: Shannon or Simpson
  • Taxonomic Level: Class, Order, Family, etc.
  • Filter Samples By: Only show those samples which match the metadata filter
  • Compare Samples By: For continuous values, show a scatterplot; for categorical values, show multiple histograms
  • Number of bins: Controls the granularity of the histogram
  • Figure Height: Controls the height of the figure (width scales with the browser)
  • Title: Add a title to the plot
  • Legend: Add legend text below the plot

Beta Diversity

Display a histogram representing the similarity of samples in terms of beta-diversity.

Options:

  • Diversity Metric: Bray-Curtis, Euclidean, or Jensen-Shannon
  • Taxonomic Level: Class, Order, Family, etc.
  • Filter Samples By: Only show those samples which match the metadata filter
  • Compare Samples By: For continuous values, show a scatterplot; for categorical values, show multiple histograms
  • Number of bins: Controls the granularity of the histogram
  • Figure Height: Controls the height of the figure (width scales with the browser)
  • Title: Add a title to the plot
  • Legend: Add legend text below the plot

Differential Abundance

Compare the abundance of individual organisms between samples on the basis of a particular metadata annotation

Options:

  • Compare Samples By: Select the metadata category used to compare samples
  • Filter Samples: Only use those samples which match the metadata filter
  • Taxonomic Level: Class, Order, Family, etc.
  • Figure Height: Controls the height of the figure (width scales with the browser)
  • Title: Add a title to the plot
  • Legend: Add legend text below the plot

Single Organism

Show the relative abundance of a single organism across samples, including categorical or continuous comparisons across samples

Options:

  • Organism: The organism to display
  • Plot Type: Box or Scatter
  • Compare Samples By: Select the metadata category used to compare samples
  • Filter Samples: Only use those samples which match the metadata filter
  • Log10 Scale: Optionally scale the vertical axis by log10
  • Figure Height: Controls the height of the figure (width scales with the browser)
  • Title: Add a title to the plot
  • Legend: Add legend text below the plot

Compare Two Organisms

Compare the relative abundance of two organisms across samples, including categorical or continuous comparisons across samples

Options:

  • Organism 1 & 2: The organisms to display
  • Color Samples By: Optionally annotate samples by metadata annotations
  • Filter Samples: Only use those samples which match the metadata filter
  • Log10 Scale: Optionally scale the axes by log10
  • Figure Height: Controls the height of the figure (width scales with the browser)
  • Title: Add a title to the plot
  • Legend: Add legend text below the plot