Dataset: cluster analysis - cross-phyla protein clustering