Skip to main content

Table 1 Novel transcript contigs in brain and cell lines.

From: Revealing the missing expressed genes beyond the human reference genome by RNA-Seq

Items

Brain

Cell lines

Total number of transcript contigs

254769

204625

Number of contigs unaligned to GRCh37, RefSeq genes and EST

16225

11638

Number of unmapped contigs aligned to Human Fosmid sequences

41

25

Number of unmapped contigs aligned to HuRef genome

184

100

Number of unmapped contigs aligned to Celera genome

181

90

Number of unmapped conitgs aligned to YH novel sequences

103

42

Number of unmapped contigs aligned to NA18507 novel sequences

137

46

Number of unmapped contigs aligned to chimpanzee genome

119

56

Number of unmapped contigs aligned to macaque genome

64

36

Total number of aligned unmapped contigs

313

173

Total contig length (bp)

50324

29664

N50 contig size (bp)

195

194

  1. *The brain and cell line transcript contigs were aligned to the human reference genome (GRCh37), RefSeq genes and EST sequences with 90% indentify and 90% coverage as threshold. The unalignable brain and cell line transcript contigs were then aligned to human Fosmid sequences, HuRef genome, Celera genome, Asian (YH) and African (NA18507) novel sequences, chimpanzee and macaque genomes with 90% identity and 100% coverage as the threshold.