MACARRoN
Metabolome Analysis and Compound Annotation with Ranked Relevance on Networks
What is MACARRoN?
MACARRoN (Metabolome Analysis and Compound Annotation with Ranked Relevance on Networks) is a workflow for systematic prioritization of metabolites from untargeted metabolomics studies. It integrates metabolomics data with microbial community profiles (e.g., from MetaPhlAn or HUMAnN) to identify metabolites that are:
- Chemically distinct (novel or understudied)
- Differentially abundant between groups
- Microbiome-associated
MACARRoN answers the question: โWhich metabolites in this dataset are the most biologically relevant and worth following up on?โ
- ๐ GitHub
- ๐ Bioconductor
- ๐๏ธ Paper: Bhosle et al. 2022, Nature Communications
When to Use MACARRoN
Use MACARRoN when you have untargeted metabolomics data (LC-MS) and you want to:
- Identify novel microbiome-linked metabolites
- Prioritize metabolites for follow-up biochemical characterization
- Integrate metabolomics with metagenomics data
- Reduce the โunknown compoundโ problem in metabolomics
Installation
R / Bioconductor
if (!requireNamespace("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install("MACARRoN")From GitHub
devtools::install_github("biobakery/macarron")Basic Usage
MACARRoN is an R package. A typical workflow:
library(MACARRoN)
# Load input data
metabolomics_data <- read.csv("metabolomics.csv", row.names = 1)
metadata <- read.csv("metadata.csv", row.names = 1)
annotations <- read.csv("annotations.csv", row.names = 1)
# Run full MACARRoN pipeline
mac_output <- MACARRoN::MACARRoN(
met_dat = metabolomics_data,
metadata = metadata,
annotation_table = annotations,
output = "macarron_output/"
)Input Requirements
| Input | Format | Description |
|---|---|---|
| Metabolomics data | CSV/TSV | Feature ร sample abundance matrix |
| Sample metadata | CSV/TSV | Sample covariates and group labels |
| Annotation table | CSV/TSV | Chemical annotations (from MS libraries) |
Output
MACARRoN outputs a ranked table of metabolite features with the following scores:
| Column | Description |
|---|---|
MAC_score |
Composite prioritization score |
Avg_Abundance |
Mean abundance across samples |
Prevalence |
Fraction of samples with detection |
Qval |
Adjusted p-value from MaAsLin2 |
Effect_size |
Effect size of differential abundance |
Annotation_cluster |
Chemical cluster identity |
Tips & Gotchas
Pair with MaAsLin2 โ MACARRoN internally calls MaAsLin2 for the differential abundance step. Ensure MaAsLin2 is installed.
Missing annotations โ Many features in untargeted metabolomics are unannotated. MACARRoN can still prioritize these based on abundance and association patterns.
Hierarchical clustering of metabolite features is a key MACARRoN step. Features that cluster together chemically are scored together.