| N=10 | 20 | 30 | 40 | 50 | 60 | 70 | 80 | 90 | 100 |
---|
Total | 10 | 20 | 30 | 40 | 50 | 60 | 70 | 80 | 90 | 100 |
Discarded | 0 | 0 | 2 | 3 | 5 | 6 | 7 | 9 | 9 | 13 |
Remaining | 10 | 20 | 28 | 37 | 45 | 54 | 63 | 71 | 81 | 87 |
Survival rate | 1.0 | 1.0 | .93 | .93 | .90 | .90 | .90 | .89 | .90 | .87 |
- Discarded indicates the number of residues abandoned during preprocessing. During preprocessing, the model checks if there are residues having too much similarity in their neighboring sequences. If two residues have too much similarity in neighboring sequence contexts, they cannot be distinguished by NGS-seq due to the limited read length. Table shows that remaining rate is consistent even when N reaches 100, which indicates that the model can handle panels with large size without losing too much residues