Our paper on Compositional Tensor Factorization (CTF) of microbial dynamics is now published in @NatureBiotech! nature.com/articles/s4158…
It might change how you analyze longitudinal microbiome data. Thread below:
In a cross-sectional study, you run a PCoA and look at the top PCs. But with temporal data, applying PCoA separately on each time point may mask important information that is carried over time.
For your longitudinal data analysis you should use CTF!
CTF is an unsupervised dimensionality reduction method tailored for repeated measures.It decomposes your temporal or spatial microbiome data into loading vectors corresponding to subjects, microbial features, and time/space - while accounting for its compositionality.
In simulations, generated based on real longitudinal data distributions, CTF showed higher accuracy than existing beta-diversity metrics including Jaccard, Bray–Curtis, Aitchison, unweighted UniFrac and weighted UniFrac.
With real data, tracking infant gut development over time, CTF was *tenfold* more accurate than other beta-diversity metrics at discriminating vaginally from cesarean born infants based on their microbiomes.
The cool part is that CTF identified the same taxa as being associated with birth mode across independent studies, suggesting a birth mode signature across infants. These datasets were collected and processed using different protocols and by different laboratories!
The robustness of this microbial signature across multiple datasets shows that CTF can be used to identify important players in the microbiome that are associated with host phenotypes.
• • •
Missing some Tweet in this thread? You can try to
force a refresh
We focused on 3 major barriers to understanding #breastmilk
1⃣Milk contains thousands of components, yet most studies analyze just one or a few
2⃣Milk changes over time, yet longitudinal studies are rare
3⃣We lack computational tools & methods to study milk as a complex SYSTEM
From a #DataScience perspective, to study complex biological processes such as #humanmilk and lactation, it is essential to take an integrative and ‘multi-layer’ approach.
We did it, @DaveZeevi! Our paper is now published in @ScienceMagazine! science.sciencemag.org/content/370/65…
We present a data-driven, computational perspective on how selective pressures resulting from nutrient limitation shape microbial coding sequences. Thread below:
We study ‘resource-driven’ selection using metagenomic and single-cell data of marine microbes, while adopting concepts common in statistical genetics like linear mixed models with variance components.
Using tailored algorithms, we partition the variance in selection metrics, calculated using marine microbes, and show that a significant portion of the selection is explained by the environment and is associated with nitrogen availability.