Fig. 3: MTX-based coexpression contributes substantially to FUGAsseM-full protein function predictions.
From: Predicting functions of uncharacterized gene products from microbial communities

aâc, Distribution of RF importance scores from the FUGAsseM-full modelâs second (data integration) layer. Only GO terms with sufficient performance (resulting in predictions with confidence probabilityââ¥â0.75) are included for BP (a; nâ=â14,249 total termâspecies pairs for prediction), MF (b; nâ=â18,553 total termâspecies pairs for prediction) and CC (c; nâ=â1,623 total termâspecies pairs for prediction). The full list is provided in Supplementary Table 11. dâf, Distribution of importance scores for successful FUGAsseM models (those that assigned GO annotations to proteins with a prediction probabilityââ¥â0.75) showing newly predicted annotations that were not used for training, based on accumulated experimental evidence over time (nâ=â65 total termâspecies pairs for BP (d), 34 total termâspecies pairs for MF (e) and 11 total termâspecies pairs for CC (f)). The full list is provided in Supplementary Table 12. Box plots display the median (line at the 50th percentile), IQR (box spanning the 25th to 75th percentiles), whiskers (extending to 1.5 à the IQR) and mean values (dark points).