Our team from EMBL-EBI have published a new preprint titled “Expansion of novel biosynthetic gene clusters from diverse environments using SanntiS”.
This preprint represents a proof of concept for BlueRemediomics, particularly in showing how we can apply new tools to existing data in the MGnify microbiome resource and drive the experimental validation of in silico predictions.
In the preprint, EMBL-EBI applied SanntiS, a newly developed in-house machine learning tool for identifying biosynthetic gene clusters (BGCs), to MGnify data. SanntiS achieved high precision and recall in both genomic and metagenomic datasets and captured a broad range of BGCs, illustrating the significance of metagenomic datasets for comprehensively understanding the diversity and distribution of BGCs in microbial communities.
The experimental work acknowledges UKRI guarantee funding for BlueRemediomics and sets the stage for further planned work in the project!