Method

Efficient computation of Faith's phylogenetic diversity with applications in characterizing microbiomes

    • 1 University of California, San Diego;
    • 2 IBM T. J. Watson Research Center;
    • 3 IBM Research Europe;
    • 4 Arizona State University;
    • 5 IBM Almaden Research Center;
    • 6 Finnish Institute for Health and Welfare;
    • 7 Baker Heart and Diabetes Institute;
    • 8 University of Turku;
    • 9 University of Cambridge
Published September 3, 2021. https://doi.org/10.1101/gr.275777.121
Download PDF Please log-in to or register for your personal account in order to access PDF Cite Article Permissions Share
cover of Genome Research Vol 36 Issue 4
Current Issue:

Abstract

The number of publicly available microbiome samples is continually growing. As dataset size increases, bottlenecks arise in standard analytical pipelines. Faith’s phylogenetic diversity is a highly utilized phylogenetic alpha diversity metric that has thus far failed to effectively scale to trees with millions of vertices. Stacked Faith's Phylogenetic Diversity (SFPhD) enables calculation of this widely adopted diversity metric at a much larger scale by implementing a computationally efficient algorithm. The algorithm reduces the amount of computational resources required, resulting in more accessible software with a reduced carbon footprint, as compared to previous approaches. The new algorithm produces identical results to the previous method. We further demonstrate that the phylogenetic aspect of Faith's PD provides increased power in detecting diversity differences between younger and older populations in the FINRISK study's metagenomic data.

Loading
Loading
Back to top