
Table 2 Test results on an independent test set of 12 MCF7 cells obtained from GSE86316. Two passes of the cell identification process were invoked because two cell lines (G41726.MCF7.5 and G28020.KPL-1.1) both pass the correlation/p-value test. After removing the variants common to the two cell lines and applying a deeper coverage requirement, the difference between their correlations is much larger than in the first test.

From: CeL-ID: cell line identification using RNA-seq data

| Samples | First test with all variants (17,730): match cell name | First test: corr. coef. | First test: p-value | Second test after removing common variants (2631): match cell name | Second test: corr. coef. | Second test: p-value |
|---|---|---|---|---|---|---|
| Sample 1 | G41726.MCF7.5 | 0.91 | 8.59E-21 | G41726.MCF7.5 | 0.53 | 0.084 |
| | G28020.KPL-1.1 | 0.89 | 1.74E-19 | G28020.KPL-1.1 | 0.34 | 0.996 |
| | G25206.NCI-H1694.1 | 0.43 | 0.781 | | | |
| Sample 2 | G41726.MCF7.5 | 0.9 | 6.63E-20 | G41726.MCF7.5 | 0.49 | 0.309 |
| | G28020.KPL-1.1 | 0.88 | 1.07E-18 | G28020.KPL-1.1 | 0.32 | 1 |
| | G30560.TO_175.T.1 | 0.42 | 0.863 | | | |
| Sample 3 | G41726.MCF7.5 | 0.9 | 5.14E-20 | G41726.MCF7.5 | 0.5 | 0.266 |
| | G28020.KPL-1.1 | 0.88 | 6.73E-19 | G28020.KPL-1.1 | 0.35 | 0.993 |
| | G30560.TO_175.T.1 | 0.42 | 0.869 | | | |
| Sample 4 | G41726.MCF7.5 | 0.9 | 1.64E-20 | G41726.MCF7.5 | 0.55 | 0.053 |
| | G28020.KPL-1.1 | 0.89 | 3.50E-19 | G28020.KPL-1.1 | 0.37 | 0.983 |
| | G30560.TO_175.T.1 | 0.42 | 0.847 | | | |
| Sample 5 | G41726.MCF7.5 | 0.9 | 4.22E-20 | G41726.MCF7.5 | 0.49 | 0.304 |
| | G28020.KPL-1.1 | 0.88 | 7.66E-19 | G28020.KPL-1.1 | 0.32 | 0.999 |
| | G30560.TO_175.T.1 | 0.42 | 0.843 | | | |
| Sample 6 | G41726.MCF7.5 | 0.9 | 5.94E-20 | G41726.MCF7.5 | 0.5 | 0.283 |
| | G28020.KPL-1.1 | 0.88 | 1.10E-18 | G28020.KPL-1.1 | 0.31 | 1 |
| | G41731.Hs_936.T.5 | 0.42 | 0.859 | | | |
| Sample 7 | G41726.MCF7.5 | 0.9 | 3.48E-20 | G41726.MCF7.5 | 0.51 | 0.185 |
| | G28020.KPL-1.1 | 0.88 | 6.27E-19 | G28020.KPL-1.1 | 0.35 | 0.995 |
| | G30560.TO_175.T.1 | 0.42 | 0.854 | | | |
| Sample 8 | G41726.MCF7.5 | 0.9 | 1.90E-20 | G41726.MCF7.5 | 0.52 | 0.122 |
| | G28020.KPL-1.1 | 0.89 | 3.88E-19 | G28020.KPL-1.1 | 0.33 | 0.999 |
| | G30560.TO_175.T.1 | 0.41 | 0.89 | | | |
| Sample 9 | G41726.MCF7.5 | 0.91 | 1.89E-21 | G41726.MCF7.5 | 0.56 | 0.025 |
| | G28020.KPL-1.1 | 0.9 | 2.53E-20 | G28020.KPL-1.1 | 0.39 | 0.954 |
| | G25206.NCI-H1694.1 | 0.48 | 0.455 | | | |
| Sample 10 | G41726.MCF7.5 | 0.91 | 3.85E-21 | G41726.MCF7.5 | 0.55 | 0.037 |
| | G28020.KPL-1.1 | 0.9 | 4.70E-20 | G28020.KPL-1.1 | 0.38 | 0.971 |
| | G25206.NCI-H1694.1 | 0.46 | 0.572 | | | |
| Sample 11 | G41726.MCF7.5 | 0.91 | 5.55E-21 | G41726.MCF7.5 | 0.54 | 0.058 |
| | G28020.KPL-1.1 | 0.89 | 7.66E-20 | G28020.KPL-1.1 | 0.38 | 0.966 |
| | G25206.NCI-H1694.1 | 0.46 | 0.571 | | | |
| Sample 12 | G41726.MCF7.5 | 0.9 | 1.42E-20 | G41726.MCF7.5 | 0.53 | 0.086 |
| | G28020.KPL-1.1 | 0.89 | 1.68E-19 | G28020.KPL-1.1 | 0.37 | 0.985 |
| | G25206.NCI-H1694.1 | 0.45 | 0.642 | | | |

Note:
1. The first test takes all variants with DP ≥ 10 and FREQ > 0 in at least one sample. A total of 17,730 variants are included.
2. The second test takes variants with DP ≥ 20 for which the difference between max(FREQ of MCF7 and KPL-1.1) and min(FREQ of MCF7 and KPL-1.1) is > 10. A total of 2631 variants are used in the second test for all 12 samples.
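
The two variant filters described in the notes can be sketched roughly as follows. This is a minimal illustration, not the CeL-ID implementation: the record layout (a `dp` field plus a `freq` mapping from sample name to variant frequency), the function names, the toy data, and the reading of FREQ as a percentage are all assumptions made for the example. The p-value step is left out because its derivation is not described here.

```python
from math import sqrt

def pearson(x, y):
    """Plain Pearson correlation coefficient of two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def first_pass(variants, min_dp=10):
    """First test: keep variants with DP >= 10 and FREQ > 0 in at least one sample."""
    return [v for v in variants
            if v["dp"] >= min_dp and any(f > 0 for f in v["freq"].values())]

def second_pass(variants, cell_a, cell_b, min_dp=20, min_gap=10):
    """Second test: keep deeply covered variants whose FREQ differs between the
    two candidate cell lines by more than min_gap (max - min of the two values)."""
    return [v for v in variants
            if v["dp"] >= min_dp
            and abs(v["freq"][cell_a] - v["freq"][cell_b]) > min_gap]

def freq_correlation(variants, sample_a, sample_b):
    """Correlate the FREQ profiles of two samples over the given variants."""
    return pearson([v["freq"][sample_a] for v in variants],
                   [v["freq"][sample_b] for v in variants])

# Hypothetical toy records; FREQ is assumed to be a percentage (0-100).
variants = [
    {"dp": 25, "freq": {"query": 100.0, "MCF7": 100.0, "KPL-1": 0.0}},
    {"dp": 30, "freq": {"query": 0.0,   "MCF7": 0.0,   "KPL-1": 50.0}},
    {"dp": 28, "freq": {"query": 60.0,  "MCF7": 70.0,  "KPL-1": 20.0}},
    {"dp": 12, "freq": {"query": 50.0,  "MCF7": 50.0,  "KPL-1": 50.0}},
    {"dp": 40, "freq": {"query": 50.0,  "MCF7": 45.0,  "KPL-1": 50.0}},
    {"dp": 5,  "freq": {"query": 100.0, "MCF7": 100.0, "KPL-1": 100.0}},
    {"dp": 22, "freq": {"query": 0.0,   "MCF7": 0.0,   "KPL-1": 0.0}},
]

kept_first = first_pass(variants)
kept_second = second_pass(variants, "MCF7", "KPL-1")
r_mcf7 = freq_correlation(kept_second, "query", "MCF7")
r_kpl1 = freq_correlation(kept_second, "query", "KPL-1")
```

In this toy setup the second filter drops the variants on which the two candidate cell lines agree, so the remaining correlations are driven only by the sites that can tell them apart, which is the effect the table shows for the second test.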