Fig. 5: Uncharacterized proteins are predicted to have broadly distributed and species-specific functions with high confidence in the human microbiome.
From: Predicting functions of uncharacterized gene products from microbial communities

a, Enumeration of the fraction of annotated taxa (first column), AUROC values per taxon (second column; n = 3,297 total term–species pairs for prediction) and numbers of annotations (third column) for species-shared BP terms predicted by FUGAsseM in the HMP2. The top 15 terms with the largest number of species with at least one assignment are listed in decreasing order of average preserved annotations across species before running FUGAsseM (full results in Supplementary Tables 15 and 16). Box plots display the median (line at the 50th percentile), IQR (box spanning the 25th to 75th percentiles) and whiskers (extending to 1.5 × the IQR). b, Fraction of annotated taxa (first column), AUROC values per taxon (second column; n = 100 total term–species pairs for prediction) and numbers of annotations (third column) for species-specific BP terms predicted by FUGAsseM from the HMP2 cohort. The 15 least frequently predicted terms are listed in decreasing order of mean number of preserved annotations as in a (full results in Supplementary Tables 17 and 18). Box plots are displayed as in a.
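
The box plot elements described in a (median, 25th to 75th percentile box, whiskers at 1.5 × the IQR) can be illustrated with a minimal Python sketch. This is not the authors' plotting code. It assumes linearly interpolated percentiles and the common Tukey convention in which each whisker stops at the most extreme observation within 1.5 × the IQR of the box. The function names `percentile` and `box_plot_stats` and the input values are hypothetical.

```python
def percentile(sorted_vals, p):
    """Linearly interpolated percentile (p in [0, 100]) of a pre-sorted list."""
    if not sorted_vals:
        raise ValueError("need at least one value")
    pos = (len(sorted_vals) - 1) * p / 100.0
    lo = int(pos)                              # index at or below the position
    hi = min(lo + 1, len(sorted_vals) - 1)     # next index, clamped to the end
    frac = pos - lo
    return sorted_vals[lo] + (sorted_vals[hi] - sorted_vals[lo]) * frac


def box_plot_stats(values):
    """Summary statistics for one box: median, quartiles, IQR, whiskers, outliers."""
    vals = sorted(values)
    q1 = percentile(vals, 25)
    med = percentile(vals, 50)
    q3 = percentile(vals, 75)
    iqr = q3 - q1

    # Fences sit 1.5 x IQR beyond the box edges (assumed Tukey convention).
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr

    # Whiskers end at the most extreme observations still inside the fences.
    inside = [v for v in vals if lower_fence <= v <= upper_fence]
    outliers = [v for v in vals if v < lower_fence or v > upper_fence]

    return {
        "median": med,
        "q1": q1,
        "q3": q3,
        "iqr": iqr,
        "whisker_low": inside[0],
        "whisker_high": inside[-1],
        "outliers": outliers,
    }


if __name__ == "__main__":
    # Hypothetical per-taxon values for a single term (illustrative only).
    example = [1, 2, 3, 4, 100]
    print(box_plot_stats(example))
```

In the figure, one such summary would be computed per term from the per-taxon values shown in the corresponding column.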