Skip to main content

Table 2 Summary of annotations of the C. sinensis unigenes

From: Deep sequencing of the Camellia sinensis transcriptome revealed candidate genes for major metabolic pathways of tea-specific compounds

  Sequences (n) Annotations (n) Functional classification
All assembled unigenes 127,094 - -
Gene annotations against plant proteins of NR 41,483 41,483 -
Gene annotations against Arabidopsis protein of NR 50,214 50,214  
Unique gene annotations against NR 53,937 53,937  
Gene annotations against UniProt 25,462 25,462 -
Gene annotations against InterPro 32,646 32,646 3,485 domains/families
Gene annotations against Pfam 40,638 40,638 6,673 domains/families
Gene annotations against COG 11,241 15,701 24 categories
Gene annotations against KEGG 16,939 16,939 214 pathways
GO annotations for NR protein hits 3,577 14,283 3 main categories 43 sub-categories
GO annotations for Arabidopsis protein hits 32,017 157,650 3 main categories 41 sub-categories
All annotated Unigenes 55,088 - -
Unigenes matching all six databases 9,139 - -