Fig. 4 | BMC Genomics

Fig. 4

From: Functional annotation of a divergent genome using sequence and structure-based similarity

Examples of high-confidence structure-based hits for BUSCO genes, cell-division cycle and endoplasmic reticulum resident proteins. (a) BUSCO scores of a selection of microsporidian genomes compared to the score of V. necatrix. The genus Encephalitozoon is colored light grey. The V. necatrix BUSCO score bar is colored yellow, with an extension in green representing the four additional genes identified using Foldseek. (b) AlphaFold structures of E. cuniculi (magenta) and V. necatrix (gold) proteins corresponding to the four microsporidia BUSCO genes. These four genes were identified exclusively via structural matching because of their low protein sequence identity. (c) Unambiguous identification of cell-division control protein 45, endoplasmic reticulum resident protein 44 and coiled-coil domain-containing protein 47 through structural similarity searches. Sequence-based searches yield only moderate-to-low-confidence hits comprising uncharacterized proteins, annotated protein domains or proteins with incorrect functional annotation. Sequence identity was calculated with ClustalW (v2.1), and TM-scores were generated using TM-align (https://zhanggroup.org/TM-align/). TM-scores were normalized by the length of the reference protein. Gold, identified microsporidian proteins; magenta, homologs; AF, AlphaFold; PDB, Protein Data Bank.
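To illustrate what normalizing a TM-score by the reference length means, the following minimal Python sketch evaluates the standard TM-score formula for a set of already aligned residue pairs. It is not TM-align itself: it performs no alignment or superposition search, and the function names, the 0.5 Å floor on the distance scale and the example distances are assumptions made purely for illustration.

```python
def d0_scale(l_ref: int) -> float:
    """Length-dependent distance scale d0 (in angstroms) of the TM-score formula."""
    # Assumption: short references are clamped to a floor of 0.5 so that the
    # cube root is never taken of a non-positive number.
    if l_ref <= 15:
        return 0.5
    return max(0.5, 1.24 * (l_ref - 15) ** (1.0 / 3.0) - 1.8)


def tm_score(distances: list[float], l_ref: int) -> float:
    """TM-score for aligned residue pairs, normalized by the reference length.

    distances: C-alpha distances (angstroms) of aligned pairs after superposition.
    l_ref:     number of residues in the reference protein.
    """
    d0 = d0_scale(l_ref)
    # Each aligned pair contributes a value in (0, 1]; unaligned reference
    # residues contribute nothing but still count in the denominator.
    return sum(1.0 / (1.0 + (d / d0) ** 2) for d in distances) / l_ref


if __name__ == "__main__":
    # Hypothetical example: 80 aligned pairs, each 2.0 angstroms apart.
    pairs = [2.0] * 80
    print(round(tm_score(pairs, l_ref=100), 3))  # normalized by a 100-residue reference
    print(round(tm_score(pairs, l_ref=200), 3))  # same pairs, longer reference
```

Because the sum is divided by the reference length, the same set of aligned pairs gives a lower score against a longer reference, which is why the choice of reference protein matters when comparing the values reported in panel (c).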
