Skip to main content

Table 1 Summary information about the series of colorectal cancer (CRC) samples that were collected to produce the integrated data set analyzed in this work

From: Survival marker genes of colorectal cancer derived from consistent transcriptomic profiling

GEO dataset

Sample Source

Sample Description

Total samples in dataset

PubMed PMID

Authors and Year

Samples discarded

Samples processed

GSE14333

Royal Melbourne Hospital, Western Hospital and Peter MacCallum Cancer Center, AUSTRALIA. H Lee Moffitt Cancer Center, USA

primary colorectal cancers

290

19996206

Jorissen RN et al. (2009)

64

226

GSE17536

Moffitt Cancer Center, USA

colorectal cancer patients

177

19914252

Smith JJ et al. (2010)

0

177

GSE31595

Roskilde Hospital, DENMARK

patients with stage II and III colorectal cancer

37

–

Thorsteinsson M et al. (2011)

0

37

GSE33113

Academic Medical Center in Amsterdam, NETHERLANDS

primary tumor resections from stage II colorectal patients

90

22496204

Kemper K et al. (2012)

0

90

GSE38832

Vandervilt University Medical Center, USA

tumor samples collected from colorectal patients

122

25320007

Tripathi MK et al. (2014)

0

122

GSE39084

Toulouse Hospital, FRANCE

sporadic early onset primary colorectal carcinomas

70

25083765

Kirzin S et al. (2014)

1

69

GSE39582

Institut G. Roussy (Villejuif), Hosp. Saint Antoine (Paris), Hosp. G.Pompidou (Paris), Hosp. Hautepierre (Strasbourg), Hosp. Purpan (Toulouse), Institut P. Calmettes (Marseille), Centre Antoine Lacassagne (Nice), FRANCE

colorectal cancer samples

566

23700391

Marisa L et al. (2013)

14

552

Total number

  

1352

   

1273

  1. All the CRC samples were tested for global gene expression profiling using high-density microarrays Human Genome U133 Plus 2.0 from Affymetrix (that measure the signal of 20,141 human genes). The total collection included 1352 samples, but only 1273 were finally used. A group of 79 samples were discarded because they did not have survival data or they presented anomalous data distributions with respect to the other samples of the same series