
Genome wide analysis of DNA copy number neutral loss of heterozygosity (CNNLOH) and its relation to gene expression in esophageal squamous cell carcinoma



Genomic instability plays an important role in human cancers. We previously characterized genomic instability in esophageal squamous cell carcinomas (ESCC) in terms of loss of heterozygosity (LOH) and copy number (CN) changes in tumors using the Affymetrix GeneChip Human Mapping 500K array in 30 cases from a high-risk region of China. In the current study we focused on copy number neutral (CN = 2) LOH (CNNLOH) and its relation to gene expression in ESCC.


Overall we found that 70% of all LOH observed was CNNLOH. Ninety percent of ESCCs showed CNNLOH (median frequency in cases = 60%) and this was the most common type of LOH in two-thirds of cases. CNNLOH occurred on all 39 autosomal chromosome arms, with highest frequencies on 19p (100%), 5p (96%), 2p (95%), and 20q (95%). In contrast, LOH with CN loss represented 19% of all LOH, occurred in just half of ESCCs (median frequency in cases = 0%), and was most frequent on 3p (56%), 5q (47%), and 21q (41%). LOH with CN gain was 11% of all LOH, occurred in 93% of ESCCs (median frequency in cases = 13%), and was most common on 20p (82%), 8q (74%), and 3q (42%). To examine the effect of genomic instability on gene expression, we evaluated RNA profiles from 17 pairs of matched normal and tumor samples (a subset of the 30 ESCCs) using Affymetrix U133A 2.0 arrays. In CN neutral regions, expression of 168 genes (containing 1976 SNPs) differed significantly in tumors with LOH versus tumors without LOH, including 101 genes that were up-regulated and 67 that were down-regulated.


Our results indicate that CNNLOH has a profound impact on gene expression in ESCC, which in turn may affect tumor development.


Genomic instability is important for cancer development and can manifest as copy number (CN) gain or loss as well as loss of heterozygosity (LOH). Copy number neutral LOH (CNNLOH) has been observed in tumors following the widespread application of SNP array technology [1, 2]. CNNLOH is common in many tumor types, including basal cell carcinoma [3], acute myeloid leukemia [4, 5], medulloblastoma [6], melanoma [7], follicular lymphoma [8], colorectal cancers [9-11], glioblastoma [12, 13], cutaneous squamous cell carcinomas [14], acute promyelocytic leukemia [15], acute lymphoblastic leukemia [16], ovarian tumor [17], and esophageal adenocarcinoma [18], and has recently been reviewed for myeloid malignancies [19]. CNNLOH is thought to result from mitotic recombination or nondisjunction in somatic tumor cells [3]. However, the distribution of complex DNA alterations and its relation to gene expression in tumors have not been characterized in ESCC.

ESCC is a common malignancy worldwide and one of the most common cancers in the Chinese population; Shanxi Province in north central China has some of the highest esophageal cancer rates in the world [20, 21]. Previously, we identified several regions of LOH and CN alteration in ESCC using microsatellite markers and low- and high-density SNP arrays [22-27], where the majority of ESCC patients from this high-risk population were found to have high genomic instability and high frequency of LOH on several chromosome arms. However, we have not found causal mutations in candidate genes within the LOH regions identified. For example, 82% of 56 ESCCs showed LOH when tested with four microsatellite markers flanking ANXA1 (9q11-q21), but no somatic mutations were detected in these patients [28]. Another example is BRCA2, which also showed frequent LOH in ESCC (57% for D13S260, 83% for D13S767), but only infrequent somatic mutations in these cancer patients (2/56, 3.5%) [29, 30]. Contrary to expectation, expression of BRCA2 was often increased (unpublished data).

In the present study, we analyzed DNA from 30 micro-dissected ESCC tumors, adjacent normal tissue, and blood DNA from the same patient using the Affymetrix 500K SNP array to identify the distribution of complex DNA alterations, including CNNLOH, and we related CNNLOH to expression of the genes affected as assessed with the Affymetrix U133A 2.0 array in these patients.


Case selection

This study was approved by the Institutional Review Boards of the Shanxi Cancer Hospital and the US National Cancer Institute (NCI). Cases diagnosed with ESCC between 1998 and 2001 in the Shanxi Cancer Hospital in Taiyuan, Shanxi Province, PR China, and considered candidates for curative surgical resection were identified and recruited to participate in this study. None of the cases had prior therapy and Shanxi was the ancestral home for all. After obtaining informed consent, cases were interviewed to obtain information on demographics, cancer risk factors (eg, smoking, alcohol drinking, and detailed family history of cancer), and clinical information. The cases evaluated here were part of a larger case-control study of upper gastrointestinal cancers conducted in Shanxi Province [31-33].

Biological specimen collection and processing

Venous blood (10 ml) was taken from each case prior to surgery and germ-line DNA from whole blood was extracted and purified using the standard phenol/chloroform method.

Tumor and adjacent normal tissues were dissected at the time of surgery and stored in liquid nitrogen until used. One 5-micron section was H&E stained and reviewed by a pathologist from the NCI to guide the micro-dissection. Five to ten consecutive 8-micron sections were cut from fresh frozen tumor and adjacent normal tissues. Tumor and normal cells were manually micro-dissected under light microscopy. DNA was extracted from micro-dissected tumor as previously described [34] using the protocol from the Puregene DNA Purification Tissue Kit (Gentra Systems, Inc., Minneapolis, MN). RNA was extracted from 17 of these micro-dissected tumor and matched normal tissue pairs using the protocol from the PureLink Micro-to-Midi Total RNA Purification System (Catalog number 12183-018, Invitrogen, Carlsbad, CA). RNA quality and quantity were determined using the RNA 6000 Labchip/Agilent 2100 Bioanalyzer (Agilent Technologies, Germantown, MD). The same tissue blocks were used for extraction of both DNA and RNA for each case studied.

Target preparation for GeneChip Human Mapping 500 K array set

The Affymetrix GeneChip Human Mapping 500 K array set contains ~262,000 (Nsp I array) and ~238,000 (Sty I array) SNPs (mean probe spacing = 5.8 Kb, mean heterozygosity = 27%). A detailed gene chip protocol can be found at

Experiments were conducted according to the protocol (GeneChip Mapping Assay manual) supplied by Affymetrix, Inc. (Santa Clara, CA). Genotype calls were generated by GTYPE v 4.0 software (Affymetrix). Germ-line, tumor and adjacent normal DNA from each case were run together in parallel in the same experiment (ie, same batch, same day). The GEO accession numbers for these array data are GSE15526 and GSE20347.

Probe preparation and hybridization for Human Genome U133A 2.0 array

The Affymetrix Human Genome U133A 2.0 array is a single array used to interrogate expression of 14,500 well-characterized human genes. Array experiments were performed using 1-5 μg total RNA each. We followed the protocol provided by the manufacturer to carry out reverse transcription, labeling, and hybridization.

GeneChip 500 K array data analysis

Probe intensity data from Affymetrix 500 K SNP arrays were used to identify DNA alterations in the present study. To avoid gender-related issues, SNPs mapped to either the X or Y chromosome were excluded.

Copy number (CN) loss or gain was based on comparisons of either adjacent normal to germ-line DNA or tumor to germ-line DNA. Microarray data were first normalized using the gtype-probeset-genotype package included in Affymetrix Power Tools version 1.85. Each tumor sample was individually normalized via the BRLMM algorithm along with 99 blood samples. These blood samples were obtained from the 30 ESCC cases evaluated in the present study plus 69 healthy controls (age-, sex-, and region-matched to cases) who were all part of a larger case-control study of upper gastrointestinal cancers conducted in Shanxi Province (as noted above). Paired CN analysis was then performed on each sample using the Affymetrix Power Tools paired-copy-number workflow, which implements the Affymetrix Copy Number Analysis Tool (CNAT) algorithm. DNA obtained from the blood of each case served as the normal control; a sliding window of 100 kb was chosen to optimize the identification of extended regions of CN alteration (see The output of the CNAT program is CN state rather than an absolute CN prediction: normal CN corresponds to a state of 2; states 0 and 1 correspond to CN loss; and states 3 and 4 correspond to CN gain.

In the present study, we modified the method for identifying LOH used in our previous studies [26, 27]. Here, LOH was determined using the Affymetrix Power Tools copynumber-pipeline program paired-LOH workflow. Input was *.CHP files generated with the gtype-probeset-genotype package as described above. Matched blood DNA served as the reference for LOH analysis for each tumor and normal adjacent sample.

Combination of LOH and CN alterations

We defined six combinations of copy number state and LOH status. LOH positive loci may have CN loss (CN ≤ 1), be CN neutral (CNNLOH, CN = 2), or show CN gain (CN ≥ 3); likewise, LOH negative loci may show CN loss, gain, or neutrality. LOH and CN segments were defined independently for each sample as contiguous blocks of informative SNPs that possessed the same LOH and CN state. Endpoints of LOH/CN segments were defined by informative SNPs. Some uninformative SNPs were located between these LOH/CN segments; we considered these SNPs to have an undefined LOH/CN state (see Additional file 1/Figure S1). Segment sizes were empirically observed from the data.
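The six LOH/CN combinations and the segment definition can be illustrated with a short Python sketch. The function names and the toy SNP records are hypothetical and are not part of the original analysis, which used Affymetrix Power Tools; the sketch also assumes that only informative SNPs, already sorted by position, are passed in.

```python
from itertools import groupby

def cn_class(cn_state):
    """Map a CN state (0-4) to loss, neutral, or gain."""
    if cn_state <= 1:
        return "loss"
    if cn_state == 2:
        return "neutral"
    return "gain"

def combined_state(cn_state, loh):
    """Return one of the six LOH/CN combinations, e.g. ('neutral', True) for CNNLOH."""
    return (cn_class(cn_state), bool(loh))

def segments(snps):
    """Group informative SNPs (sorted by position) into runs sharing one state.

    snps: list of (position, cn_state, loh) tuples for one chromosome arm.
    Returns a list of (start, end, state); endpoints are informative SNPs.
    """
    out = []
    for state, run in groupby(snps, key=lambda s: combined_state(s[1], s[2])):
        run = list(run)
        out.append((run[0][0], run[-1][0], state))
    return out

# Toy data: hypothetical positions and calls
example = [(100, 2, True), (250, 2, True), (400, 1, True), (900, 3, False)]
print(segments(example))
```

In this toy input the first two SNPs form one CNNLOH segment, while the remaining two SNPs each form a single-SNP segment because their states differ.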

Comparison of CN status in DNA from blood versus micro-dissected adjacent normal tissue

DNA isolated from normal adjacent tissue is frequently used as a control in microarray experiments. In the present study we used DNA isolated from peripheral blood. We expected peripheral blood DNA to be a superior control for two reasons: first, unlike adjacent normal tissue, it does not run the risk of being contaminated with tumor cells; second, adjacent normal tissue may actually be precancerous and contain genetic lesions. To examine whether blood DNA and adjacent normal esophageal DNA were equivalent controls, we compared copy number state calls for blood and adjacent normal tissue from each of the 30 ESCC patients. We found that the two controls were equivalent: within individual patients, 99.29% to 99.99% of all copy number calls were identical. Overall, 99.96% of SNPs in blood and 99.93% in normal adjacent tissue were in the CN = 2 state.
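The per-patient comparison of the two controls amounts to a concordance calculation. A minimal Python sketch follows, assuming the CN state calls for blood and adjacent normal tissue are available as two equally ordered lists; the function name and the toy calls are hypothetical.

```python
def concordance(calls_a, calls_b):
    """Fraction of SNPs whose copy number state is identical in two samples."""
    if len(calls_a) != len(calls_b):
        raise ValueError("call vectors must cover the same SNPs")
    if not calls_a:
        raise ValueError("no SNPs to compare")
    same = sum(1 for a, b in zip(calls_a, calls_b) if a == b)
    return same / len(calls_a)

# Toy CN state calls for ten SNPs (hypothetical values)
blood = [2, 2, 2, 2, 3, 2, 2, 2, 2, 2]
adjacent_normal = [2, 2, 2, 2, 3, 2, 2, 1, 2, 2]
print(f"{100 * concordance(blood, adjacent_normal):.1f}% identical")
```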

Human Genome U133A 2.0 array data analysis and relation between CNNLOH and mRNA expression

The Robust Multiarray Average (RMA) algorithm [35, 36] implemented in Bioconductor in R was used for background correction and normalization across all samples. For each sample log2 fold changes in gene expression were calculated by subtracting the adjacent normal RMA value from the corresponding tumor RMA value.
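Because RMA values are on the log2 scale, the subtraction described above directly yields a log2 fold change. The Python sketch below illustrates this with hypothetical probe set names and values; it is not the authors' R/Bioconductor code.

```python
def log2_fold_changes(tumor_rma, normal_rma):
    """Per-probe-set log2 fold change: tumor RMA minus matched normal RMA."""
    return {probe: tumor_rma[probe] - normal_rma[probe]
            for probe in tumor_rma if probe in normal_rma}

# Toy RMA values for one tumor/normal pair (hypothetical)
tumor = {"probe_a": 9.5, "probe_b": 6.0}
normal = {"probe_a": 8.5, "probe_b": 7.0}

lfc = log2_fold_changes(tumor, normal)
fold = {p: 2 ** v for p, v in lfc.items()}  # back-transform to a linear ratio
print(lfc, fold)
```

A log2 fold change of 1 corresponds to a two-fold increase in the tumor, and a value of -1 to a two-fold decrease.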

To determine whether any gene showed a difference in the tumor versus normal gene expression fold change that was dependent on LOH state, we performed the following steps: (i) First, genes assayed by the U133A microarray were mapped onto each LOH/CN segment of each sample. Map locations of genes were taken from the Affymetrix version na29 microarray annotation file. Note that probe sets from the same gene may have different reference sequences which differ in their chromosomal locations. Also, not every gene will map to every sample - in a particular sample, a gene may map to a gap between LOH/CN regions. (ii) Next, we identified genes for which at least two of the 17 ESCC samples with expression data were LOH negative and at least two samples were LOH positive. (iii) We then performed two-sided unpaired t-tests comparing the log2 fold changes for a probe set in LOH positive and LOH negative samples. A P-value < 0.01 was considered significant. (iv) Finally, SNPs on the 500 K microarray were mapped to the reference sequence for each expression probe set. Since probe sets from the same gene may have different reference sequences, they may differ in the number of SNPs assigned to them (Additional file 2/Figure S2).
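Steps (ii) and (iii) can be sketched in Python as follows. This is an illustration under stated assumptions rather than the authors' code: the text does not say whether a pooled-variance or Welch test was used, so the sketch assumes the pooled-variance (Student) form, and it approximates the P-value by numerically integrating the t density so that only the standard library is needed. All function names are hypothetical.

```python
import math
from statistics import mean, variance

def t_two_sided_p(t, df, steps=2000):
    """Two-sided P-value for Student's t via Simpson integration of the density."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    pdf = lambda x: coef * (1 + x * x / df) ** (-(df + 1) / 2)
    upper = abs(t)
    h = upper / steps                      # steps must be even for Simpson's rule
    total = pdf(0.0) + pdf(upper)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * pdf(i * h)
    area = total * h / 3                   # integral of the density from 0 to |t|
    return max(0.0, 1.0 - 2.0 * area)

def loh_expression_test(lfc_pos, lfc_neg, min_per_group=2):
    """Pooled-variance unpaired t-test of log2 fold changes, LOH+ vs LOH-.

    Returns (t, p), or None if a group has fewer than min_per_group samples
    (step ii) or the pooled variance is zero.
    """
    n1, n2 = len(lfc_pos), len(lfc_neg)
    if n1 < min_per_group or n2 < min_per_group:
        return None
    df = n1 + n2 - 2
    pooled = ((n1 - 1) * variance(lfc_pos) + (n2 - 1) * variance(lfc_neg)) / df
    if pooled == 0:
        return None
    t = (mean(lfc_pos) - mean(lfc_neg)) / math.sqrt(pooled * (1 / n1 + 1 / n2))
    return t, t_two_sided_p(t, df)

# Toy log2 fold changes for one probe set (hypothetical values)
result = loh_expression_test([1.0, 2.0, 3.0], [2.0, 3.0, 4.0])
significant = result is not None and result[1] < 0.01
print(result, significant)
```

Probe sets for which the function returns None would simply be excluded from testing, which mirrors the eligibility filter in step (ii).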


In the present study we determined copy number and loss of heterozygosity (LOH) status in DNA isolated from germ-line and micro-dissected tumor and matched adjacent normal samples from 30 ESCC patients using the Affymetrix 500 K SNP array. The average genotype call rate was 96% (89-99%): the 250 K Nsp I array was 96% (90-98%) and 250 K Sty I array was 95% (89-99%). Genotype call rates were similar for all three tissue types examined. We first analyzed whether copy numbers were similar between DNAs from the two normal tissues: germ-line (blood) and micro-dissected adjacent normal samples. Our analysis indicated that DNA CN values were similar between the two normal tissues (Additional file 3 - Table S1), as expected. Our results indicate that germ-line DNA can be used as a normal control in studies of CN alteration; it is more readily available than matched adjacent normal tissue.

Complex DNA alterations in ESCC

The distribution of DNA alterations in each of the 30 ESCC cases is summarized in Table 1 (with LOH) and in Additional file 4/Table S2 (without LOH). We divided genomic regions into three groups based on CN states: CN loss, neutral, and gain. We found that 50%, 90%, and 93% of cases showed LOH in the CN loss, neutral, and gain groups, respectively (Table 1). For each chromosome, we also calculated the percentage of SNPs involved in LOH for each group. They ranged between 20-57%, 7-100%, and 2-100% for the CN loss, neutral, and gain groups, respectively (Table 1). Our results suggest that LOH with CN neutral or gain are common phenomena in ESCC. For SNPs without LOH, we also calculated the percent of SNPs in each CN state; averages were 5%, 84%, and 11% for CN loss, neutral, and gain, respectively.

Table 1 LOH by copy number in ESCC cases by individual case (N = 30)

The distribution of the six types of DNA alterations for all 30 cases by chromosome arm is shown in Table 2 (with LOH) and Additional file 5/Table S3 (without LOH). CNNLOH was observed on all chromosome arms, but most frequently on 19p (100%), 5p (96%), 2p (95%), and 20q (95%). The highest frequencies of LOH with CN loss (CN = 1) were found on 3p (56%), 5q (47%), and 21q (41%); relatively high frequencies were also seen on 18q (31%), 11q (29%), 1p (28%), 19q (27%), and 11p (25%). LOH with CN gain was most common on 20p (82%), 8q (74%) and 3q (42%) (Table 2 and Figure 1). Taken together, our results show that LOH with CNN or CN gain were much more frequent than LOH with CN loss on every chromosome arm but one (ie, 3p).

Table 2 LOH by copy number in ESCC cases by chromosomal arm (N = 30 cases)
Figure 1 Patterns of loss of heterozygosity and copy number variation in 30 ESCC samples for chromosome 3. Each row (numbered 1 - 30) represents an individual ESCC sample. Circles indicate the positions of SNPs showing LOH. SNP positions are color coded as follows: black indicates copy number neutral LOH; blue indicates LOH accompanied by copy number reduction; red indicates LOH with copy number gain. An ideogram of the chromosome is at the bottom of the figure.

Results of CN alterations in the non-LOH group by chromosome arm are summarized in Additional file 5/Table S3. Briefly, a frequency of CN loss ≥ 10% was observed on eight chromosome arms (3p, 4p, 4q, 5q, 8p, 9p, 11q, and 13q). A frequency of CN gain ≥ 10% was observed on 13 chromosome arms (1q, 2p, 2q, 3q, 5p, 7p, 7q, 8q, 12p, 14q, 18p, 20p, and 20q).

Relation between genomic alterations and gene expression

The average present call rate on the Human Genome U133A array was 53% (range 51-61%) for the 34 chips from the 17 sample pairs with sufficient tissue for RNA isolation and testing. To investigate the relation between LOH/CNV and gene expression levels, we intersected genes on the Affymetrix U133A chip with SNPs on the 500 K SNP array. SNPs that mapped within genes are summarized in Additional file 6/Table S4 and include 169,687 SNPs within 12,225 genes.
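Intersecting SNPs with genes is an interval lookup. The Python sketch below shows one way to count the SNPs that fall within each gene on a single chromosome; the coordinates, gene names, and function name are hypothetical, and the original analysis relied on the Affymetrix annotation files rather than this code.

```python
from bisect import bisect_left, bisect_right

def count_snps_in_genes(snp_positions, genes):
    """Count SNPs falling inside each gene interval on one chromosome.

    snp_positions: sorted list of SNP coordinates.
    genes: dict of gene name -> (start, end), both inclusive.
    """
    counts = {}
    for name, (start, end) in genes.items():
        counts[name] = (bisect_right(snp_positions, end)
                        - bisect_left(snp_positions, start))
    return counts

# Toy coordinates (hypothetical)
snps = [1_000, 5_800, 11_600, 17_400, 23_200]
genes = {"GENE_A": (5_000, 12_000), "GENE_B": (20_000, 22_000)}
print(count_snps_in_genes(snps, genes))
```

A gene such as GENE_B, which contains no SNP, cannot be assigned an LOH/CN state from within-gene SNPs in this simplified scheme.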

We were interested in identifying genes that were differentially expressed between the LOH and non-LOH groups in CN neutral regions. A total of 4,572 genes qualified for this analysis (see Methods). Among these, 168 genes showed significant differences in expression between tumors with and without LOH (P < 0.01) (Additional file 7/Table S5). Based on chance alone (at the P < 0.01 level), differences in only 45 genes would be expected; expression differences were therefore observed in over three times as many genes as expected. One hundred and one (60%) of the 168 genes showed higher expression levels in CNNLOH than in the normal group (ie, CNN, no LOH), whereas 67 genes (40%) showed lower expression levels in CNNLOH (Additional file 7/Table S5). Twenty-eight of the 101 up-regulated genes (32 probes) and 18 of the 67 down-regulated genes (19 probes) showed expression differences ≥ 2-fold (Table 3). These findings suggest that in the CN neutral state, LOH can affect gene expression.
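The chance expectation quoted above follows from multiplying the number of genes tested by the significance threshold. A short Python check using only the figures given in the text:

```python
genes_tested = 4572
alpha = 0.01
observed = 168

expected_by_chance = genes_tested * alpha    # about 45.7 genes
enrichment = observed / expected_by_chance   # about 3.7-fold
print(f"expected {expected_by_chance:.1f}, observed/expected = {enrichment:.1f}")
```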

Table 3 Comparison of gene expression in copy number neutral (CNN) genes with LOH and without LOH (normal) (N = 46 genes significantly differentially-expressed 2-fold or greater)*

We also compared expression of genes with LOH versus no LOH in CN loss genes. We identified six of 600 genes which showed significantly different expression between the LOH groups. All six genes showed increased expression in tumors with LOH (Table 4a).

Table 4 Comparison of gene expression in copy number loss/gain genes with LOH and without LOH*

Finally, we compared gene expression in the CN gain state between tumors with and without LOH. We found that six of 354 genes showed significant differences in expression between the two groups, including two down-regulated and four up-regulated genes (Table 4b).


We characterized ESCC tumors for complex DNA alterations - LOH and CNV - and related these genomic alterations to gene expression. To our knowledge, this is the first report to comprehensively address the distribution of complex DNA alterations in ESCC and its relation to gene expression on a genome-wide scale.

Ninety percent of cases showed CNNLOH in their tumors and, over all cases, CNNLOH was found on every chromosome arm, indicating that it is a common phenomenon.

The frequency of CNNLOH observed here in ESCC was much less than has been reported in other cancers [3-19]. For example, in colon cancer and basal cell carcinoma nearly all LOH was associated with copy number neutral regions [3, 10]. In general, CNNLOH occurs with variable frequency in different genomic regions in tumors of different origin. There are several differences between the study reported here and previous studies which likely influenced the results. First, DNA from micro-dissected tumor and adjacent normal was used in the present study, while either cancer DNA without matched controls or cancer cell lines were used in most other reported studies. Second, we examined LOH and CN alterations using the same SNP array platform, while other studies used SNPs for LOH and CGH arrays for CN analyses. Third, the criteria for identifying LOH differed among the studies reported. Finally, the types of cancers studied previously differ from the present study which is the first report of CNNLOH in ESCC.

In previous LOH studies, we reported high-frequency LOH on several chromosome arms, including 3p, 4p, 4q, 9p, 9q, 13q, 17p, and 17q [23, 26, 27]. By integrating LOH and CN alteration data in the present study, we can now say that the LOH on 3p is primarily due to CN loss LOH, while the LOH on the other seven chromosome arms is predominantly due to CNNLOH.

Our results showed that CNNLOH can change expression levels of genes in ESCC, either increasing or decreasing them. We do not know why CNNLOH changes gene expression, but one possibility is that the two alleles may have different gene expression levels. For example, if allele A expression is greater than allele B, the expression level for the 3 genotypes would be ordered as AA > AB > BB. CNNLOH with retention of two B alleles (genotype BB) would then show lower expression than genotype AB. Conversely, CNNLOH with loss of the allele B would result in two copies of allele A and a higher level of expression than that of AB cells. Another possibility is that the two alleles have different expression due to different epigenetic states, with LOH resulting in copies with two extreme epigenetic states. A third possibility is that one allele harbors a mutation and subsequent LOH leads to a homozygous mutant. Several studies have shown that CNNLOH regions can harbor mutated genes. For example, JAK2 V617F, FLT3-ITD, AML1/RUNX1, WT1, and NPM1 mutations were all found in CNNLOH regions in AML [15]. These various hypotheses merit testing in the future.
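The first hypothesis, unequal expression from the two alleles, can be made concrete with a toy calculation. The Python sketch below assumes a purely additive model in which a genotype's expression is the sum of its two alleles' levels; both the model and the numeric levels are illustrative assumptions, not measurements from this study.

```python
def genotype_expression(allele_levels, genotype):
    """Total expression of a genotype as the sum of its two alleles' levels."""
    return sum(allele_levels[a] for a in genotype)

levels = {"A": 3.0, "B": 1.0}             # hypothetical per-allele levels, A > B

ab = genotype_expression(levels, "AB")    # heterozygous, before LOH
aa = genotype_expression(levels, "AA")    # CNNLOH retaining two copies of A
bb = genotype_expression(levels, "BB")    # CNNLOH retaining two copies of B
print(aa, ab, bb)
```

Under these assumptions copy number stays at two in all three genotypes, yet expression rises or falls depending on which allele is retained.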

The study design in the present study has several important features: (i) we compared CN status between DNA from germ-line and micro-dissected adjacent normal tissue; (ii) we used micro-dissected DNA from tumor tissue; (iii) we assessed both LOH and CN alterations simultaneously using the same array platform; and (iv) we integrated complex DNA alterations and gene expression data on a genome-wide level using both high density SNP and expression arrays in the same cases. A noteworthy weakness of our study is the relatively small number of cases evaluated (including a particularly small number of cases with both LOH and RNA expression data to evaluate, due in part to the 500K chip mean heterozygosity of 27%), which limited our power to detect significant differences in loci between LOH and non-LOH groups. In addition, findings for ESCC from this high-risk region may not be generalizable to populations elsewhere in the world.

In summary, we investigated the distribution of complex DNA alterations in ESCCs at the genome-wide level and determined that CN neutral is the most common CN state in LOH, and that CNNLOH is a very common phenomenon overall. Importantly, we also showed that CNNLOH could alter the expression level of genes affected in ESCC.


CNNLOH is a common phenomenon in many cancers, including ESCC, and non-disjunction and/or somatic recombination are the most likely mechanisms for its occurrence. CNNLOH can result in changes in gene expression which are functionally significant. Expression differences in CNNLOH suggest that alleles are different in terms of their gene expression potential, and that these differences may result from differences in genotype and/or epigenetics.



SNP: single nucleotide polymorphism

ESCC: esophageal squamous cell carcinoma

CNNLOH: copy number neutral loss of heterozygosity

CN: copy number.


  1. 1.

    Huang J, Wei W, Zhang J, Liu G, Bignell GR, Stratton MR: Whole genome DNA copy number changes identified by high density oligonucleotide arrays. Hum Genomics. 2004, 1: 287-299.

    CAS  Article  Google Scholar 

  2. 2.

    Bignell GR, Huang J, Greshock J, Watt S, Butler A, West S: High-resolution analysis of DNA copy number using oligonucleotide microarrays. Genome Res. 2004, 14: 287-295. 10.1101/gr.2012304.

    CAS  Article  Google Scholar 

  3. 3.

    Teh MT, Blaydon D, Chaplin T, Foot NJ, Skoulakis S, Raghavan M: Genomewide single nucleotide polymorphism microarray mapping in basal cell carcinomas unveils uniparental disomy as a key somatic event. Cancer Res. 2005, 65: 8597-8603. 10.1158/0008-5472.CAN-05-0842.

    CAS  Article  Google Scholar 

  4. 4.

    Fitzgibbon J, Smith LL, Raghavan M, Smith ML, Debernardi S, Skoulakis S: Association between acquired uniparental disomy and homozygous gene mutation in acute myeloid leukemias. Cancer Res. 2005, 65: 9152-9154. 10.1158/0008-5472.CAN-05-2017.

    CAS  Article  Google Scholar 

  5. 5.

    Raghavan M, Lillington DM, Skoulakis S, Debernardi S, Chaplin T, Foot NJ: Genome-wide single nucleotide polymorphism analysis reveals frequent partial uniparental disomy due to somatic recombination in acute myeloid leukemias. Cancer Res. 2005, 65: 375-378.

    CAS  PubMed  Google Scholar 

  6. 6.

    Langdon JA, Lamont JM, Scott DK, Dyer S, Prebble E, Bown N: Combined genome-wide allelotyping and copy number analysis identify frequent genetic losses without copy number reduction in medulloblastoma. Genes Chromosomes Cancer. 2006, 45: 47-60. 10.1002/gcc.20262.

    CAS  Article  Google Scholar 

  7. 7.

    Stark M, Hayward N: Genome-wide loss of heterozygosity and copy number analysis in melanoma using high-density single-nucleotide polymorphism arrays. Cancer Res. 2007, 67: 2632-2642. 10.1158/0008-5472.CAN-06-4152.

    CAS  Article  Google Scholar 

  8. 8.

    Ross CW, Ouillette PD, Saddler CM, Shedden KA, Malek SN: Comprehensive analysis of copy number and allele status identifies multiple chromosome defects underlying follicular lymphoma pathogenesis. Clin Cancer Res. 2007, 13: 4777-4785. 10.1158/1078-0432.CCR-07-0456.

    CAS  Article  Google Scholar 

  9. 9.

    Gaasenbeek M, Howarth K, Rowan AJ, Gorman PA, Jones A, Chaplin T: Combined array-comparative genomic hybridization and single-nucleotide polymorphism-loss of heterozygosity analysis reveals complex changes and multiple forms of chromosomal instability in colorectal cancers. Cancer Res. 2006, 66: 3471-3479. 10.1158/0008-5472.CAN-05-3285.

    CAS  Article  Google Scholar 

  10. 10.

    Andersen CL, Wiuf C, Kruhoffer M, Korsgaard M, Laurberg S, Orntoft TF: Frequent occurrence of uniparental disomy in colorectal cancer. Carcinogenesis. 2007, 28: 38-48. 10.1093/carcin/bgl086.

    CAS  Article  Google Scholar 

  11. 11.

    van PM, Middeldorp A, Tops CM, van ER, van der Klift HM, Vasen HF: Genome-wide copy neutral LOH is infrequent in familial and sporadic microsatellite unstable carcinomas. Fam Cancer. 2008, 7: 319-330. 10.1007/s10689-008-9194-8.

    Article  Google Scholar 

  12. 12.

    Lo KC, Bailey D, Burkhardt T, Gardina P, Turpaz Y, Cowell JK: Comprehensive analysis of loss of heterozygosity events in glioblastoma using the 100K SNP mapping arrays and comparison with copy number abnormalities defined by BAC array comparative genomic hybridization. Genes Chromosomes Cancer. 2008, 47: 221-237. 10.1002/gcc.20524.

    CAS  Article  Google Scholar 

  13. 13.

    Kuga D, Mizoguchi M, Guan Y, Hata N, Yoshimoto K, Shono T: Prevalence of copy-number neutral LOH in glioblastomas revealed by genomewide analysis of laser-microdissected tissues. Neuro Oncol. 2008, 10: 995-1003. 10.1215/15228517-2008-064.

    CAS  Article  Google Scholar 

  14. 14.

    Purdie KJ, Lambert SR, Teh MT, Chaplin T, Molloy G, Raghavan M: Allelic imbalances and microdeletions affecting the PTPRD gene in cutaneous squamous cell carcinomas detected using single nucleotide polymorphism microarray analysis. Genes Chromosomes Cancer. 2007, 46: 661-669. 10.1002/gcc.20447.

    CAS  Article  Google Scholar 

  15. 15.

    Akagi T, Shih LY, Kato M, Kawamata N, Yamamoto G, Sanada M: Hidden abnormalities and novel classification of t(15;17) acute promyelocytic leukemia (APL) based on genomic alterations. Blood. 2009, 113: 1741-1748. 10.1182/blood-2007-12-130260.

    CAS  Article  Google Scholar 

  16. 16.

    Sulong S, Moorman AV, Irving JA, Strefford JC, Konn ZJ, Case MC: A comprehensive analysis of the CDKN2A gene in childhood acute lymphoblastic leukemia reveals genomic deletion, copy number neutral loss of heterozygosity, and association with specific cytogenetic subgroups. Blood. 2009, 113: 100-107. 10.1182/blood-2008-07-166801.

    CAS  Article  Google Scholar 

  17. 17.

    Gorringe KL, Ramakrishna M, Williams LH, Sridhar A, Boyle SE, Bearfoot JL: Are there any more ovarian tumor suppressor genes? A new perspective using ultra high-resolution copy number and loss of heterozygosity analysis. Genes Chromosomes Cancer. 2009, 48: 931-942. 10.1002/gcc.20694.

    CAS  Article  Google Scholar 

  18. 18.

    Nancarrow DJ, Handoko HY, Smithers BM, Gotley DC, Drew PA, Watson DI: Genome-wide copy number analysis in esophageal adenocarcinoma using high-density single-nucleotide polymorphism arrays. Cancer Res. 2008, 68: 4163-4172. 10.1158/0008-5472.CAN-07-6710.

    CAS  Article  Google Scholar 

  19. 19.

    O'Keefe C, McDevitt MA, Maciejewski JP: Copy neutral loss of heterozygosity: a novel chromosomal lesion in myeloid malignancies. Blood. 2010, 115: 2731-2739. 10.1182/blood-2009-10-201848.

    CAS  Article  Google Scholar 

  20. 20.

    Li JY: Epidemiology of esophageal cancer in China. Natl Cancer Inst Monogr. 1982, 62: 113-120.

    CAS  PubMed  Google Scholar 

  21. 21.

    Qiao YL, Hou J, Yang L, He YT, Liu YY, Li LD: [The trends and preventive strategies of esophageal cancer in high-risk areas of Taihang Mountains, China]. Zhongguo Yi Xue Ke Xue Yuan Xue Bao. 2001, 23: 10-14.

    CAS  PubMed  Google Scholar 

  22. 22.

    Hu N, Roth MJ, Emmert-Buck MR, Tang ZZ, Polymeropolous M, Wang QH: Allelic loss in esophageal squamous cell carcinoma patients with and without family history of upper gastrointestinal tract cancer. Clin Cancer Res. 1999, 5: 3476-3482.

    CAS  PubMed  Google Scholar 

  23. 23.

    Hu N, Roth MJ, Polymeropolous M, Tang ZZ, Emmert-Buck MR, Wang QH: Identification of novel regions of allelic loss from a genomewide scan of esophageal squamous-cell carcinoma in a high-risk Chinese population. Genes Chromosomes Cancer. 2000, 27: 217-228. 10.1002/(SICI)1098-2264(200003)27:3<217::AID-GCC1>3.0.CO;2-A.

    CAS  Article  Google Scholar 

  24. 24.

    Huang J, Hu N, Goldstein AM, Emmert-Buck MR, Tang ZZ, Roth MJ: High frequency allelic loss on chromosome 17p13.3-p11.1 in esophageal squamous cell carcinomas from a high incidence area in northern China. Carcinogenesis. 2000, 21: 2019-2026. 10.1093/carcin/21.11.2019.

    CAS  Article  Google Scholar 

  25. 25.

    Hu N, Su H, Li WJ, Giffen C, Goldstein AM, Hu Y: Allelotyping of esophageal squamous-cell carcinoma on chromosome 13 defines deletions related to family history. Genes Chromosomes Cancer. 2005, 44: 271-278. 10.1002/gcc.20242.

    CAS  Article  Google Scholar 

  26. 26.

    Hu N, Wang C, Hu Y, Yang HH, Kong LH, Lu N: Genome-wide loss of heterozygosity and copy number alteration in esophageal squamous cell carcinoma using the Affymetrix GeneChip Mapping 10 K array. BMC Genomics. 2006, 7: 299-10.1186/1471-2164-7-299.

    Article  Google Scholar 

  27. 27.

    Hu N, Wang C, Ng D, Clifford R, Yang HH, Tang ZZ: Genomic characterization of esophageal squamous cell carcinoma from a high-risk population in China. Cancer Res. 2009, 69: 5908-5917. 10.1158/0008-5472.CAN-08-4622.

    CAS  Article  Google Scholar 

  28. 28.

    Hu N, Flaig MJ, Su H, Shou JZ, Roth MJ, Li WJ: Comprehensive characterization of annexin I alterations in esophageal squamous cell carcinoma. Clin Cancer Res. 2004, 10: 6013-6022. 10.1158/1078-0432.CCR-04-0317.

    CAS  Article  Google Scholar 

  29. 29.

    Li G, Hu N, Goldstein AM, Tang ZZ, Roth MJ, Wang QH: Allelic loss on chromosome bands 13q11-q13 in esophageal squamous cell carcinoma. Genes Chromosomes Cancer. 2001, 31: 390-397. 10.1002/gcc.1158.

    CAS  Article  Google Scholar 

  30. 30.

    Hu N, Li G, Li WJ, Wang C, Goldstein AM, Tang ZZ: Infrequent mutation in the BRCA2 gene in esophageal squamous cell carcinoma. Clin Cancer Res. 2002, 8: 1121-1126.

    CAS  PubMed  Google Scholar 

  31. 31.

    Ng D, Hu N, Hu Y, Wang C, Giffen C, Tang ZZ: Replication of a genome-wide case-control study of esophageal squamous cell carcinoma. Int J Cancer. 2008, 123: 1610-1615. 10.1002/ijc.23682.

  32.

    Gao Y, Hu N, Han X, Giffen C, Ding T, Goldstein A: Family history of cancer and risk for esophageal and gastric cancer in Shanxi, China. BMC Cancer. 2009, 9: 269. 10.1186/1471-2407-9-269.

  33.

    Gao Y, Hu N, Han X, Giffen C, Ding T, Goldstein AM: Jasmine tea consumption and upper gastrointestinal cancer in China. Cancer Causes Control. 2009, 20: 1997-2007. 10.1007/s10552-009-9394-z.

  34.

    Emmert-Buck MR, Bonner RF, Smith PD, Chuaqui RF, Zhuang Z, Goldstein SR: Laser capture microdissection. Science. 1996, 274: 998-1001. 10.1126/science.274.5289.998.

  35.

    Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U: Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003, 4: 249-264. 10.1093/biostatistics/4.2.249.

  36.

    Bolstad BM, Irizarry RA, Astrand M, Speed TP: A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics. 2003, 19: 185-193. 10.1093/bioinformatics/19.2.185.


This research was supported by the Intramural Research Program of the NIH, the National Cancer Institute, the Division of Cancer Epidemiology and Genetics, and the Center for Cancer Research.

Author information



Corresponding authors

Correspondence to Philip R Taylor or Maxwell P Lee.

Additional information

Authors' contributions

NH, AMG, TD, and PRT designed, conducted, and supervised the field and clinical studies; PRT obtained funding for the project; NH and CW designed and performed the laboratory analyses; NH, RJC, HHY, and MPL conducted the statistical analyses; NH and MPL drafted the manuscript; NH, RJC, AMG, PRT, and MPL conceptualized the data analyses and revised and edited the manuscript. All authors read and approved the final manuscript.

Nan Hu and Robert J Clifford contributed equally to this work.

Electronic supplementary material

Additional file 1: Figure S1. Definition of LOH/CN blocks within a single sample. (XLS 36 KB)

Additional file 2: Figure S2. The relationship between genes, Affymetrix expression probesets, and SNPs. (PDF 238 KB)

Additional file 3: Table S1. Comparison of copy number alterations between DNA from blood and microdissected normal tissue. (XLS 18 KB)

Additional file 4: Table S2. Copy number status of SNPs without LOH by case (N = 30). (XLS 18 KB)

Additional file 5: Table S3. CN status of SNPs without LOH by chromosome arm (N = 30 cases). (XLS 28 KB)

Additional file 6: Table S4. No. of genes matched on Affymetrix 500 K SNP and U133A expression arrays. (XLS 14 KB)

Additional file 7: Table S5. Comparison of gene expression in copy number neutral (CNN) genes with LOH and without LOH (normal) (N = 168 genes significantly differentially-expressed). (XLS 49 KB)

Rights and permissions

This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Hu, N., Clifford, R.J., Yang, H.H. et al. Genome wide analysis of DNA copy number neutral loss of heterozygosity (CNNLOH) and its relation to gene expression in esophageal squamous cell carcinoma. BMC Genomics 11, 576 (2010).

Keywords

  • Esophageal Squamous Cell Carcinoma
  • Copy Number Alteration
  • Esophageal Squamous Cell Carcinoma Patient
  • Copy Number Gain
  • Copy Number Loss