Id | Sequence | Number | Forward (%) |
---|
1 | Consensus | 2678 | 46 |
2 | C114Del | 486 | 1* |
3 | T101Del | 357 | 99* |
4 | T39G | 242 | 46 |
5 | A40C | 101 | 56 |
6 | T121A | 92 | 41 |
7 | T74G | 78 | 47 |
8 | T80Del | 78 | 100* |
9 | G84C | 78 | 41 |
10 | C42G | 76 | 53 |
11 | G1A | 74 | 43 |
12 | A110G | 65 | 48 |
13 | A112T | 61 | 38 |
14 | T19C | 59 | 37 |
15 | T39G-C114Del | 57 | 0* |
16 | A151C | 56 | 52 |
17 | G79C | 55 | 53 |
18 | C89T | 53 | 49 |
19 | G1T | 46 | 63 |
20 | T39G-T101Del | 41 | 98* |
- The sequences are ordered and numbered according to the number of identical copies of the sequence in the monomer dataset. The “Sequence” column indicates how each sequence differs from the consensus sequence of the C1 family, using standard notations. The “Number” column displays the number of identical copies of the sequence in the monomer dataset. The “Forward” column displays the percentage of reads obtained in the forward orientation (i.e. the orientation of our reference sequence)
- Strong biases for read orientation are shown with an asterix (*)