- Research article
- Open Access
Development of the first marmoset-specific DNA microarray (EUMAMA): a new genetic tool for large-scale expression profiling in a non-human primate
BMC Genomics volume 8, Article number: 190 (2007)
The common marmoset monkey (Callithrix jacchus), a small non-endangered New World primate native to eastern Brazil, is becoming increasingly used as a non-human primate model in biomedical research, drug development and safety assessment. In contrast to the growing interest for the marmoset as an animal model, the molecular tools for genetic analysis are extremely limited.
Here we report the development of the first marmoset-specific oligonucleotide microarray (EUMAMA) containing probe sets targeting 1541 different marmoset transcripts expressed in hippocampus. These 1541 transcripts represent a wide variety of different functional gene classes. Hybridisation of the marmoset microarray with labelled RNA from hippocampus, cortex and a panel of 7 different peripheral tissues resulted in high detection rates of 85% in the neuronal tissues and on average 70% in the non-neuronal tissues. The expression profiles of the 2 neuronal tissues, hippocampus and cortex, were highly similar, as indicated by a correlation coefficient of 0.96. Several transcripts with a tissue-specific pattern of expression were identified. Besides the marmoset microarray we have generated 3215 ESTs derived from marmoset hippocampus, which have been annotated and submitted to GenBank [GenBank: EF214838 – EF215447, EH380242 – EH382846].
We have generated the first marmoset-specific DNA microarray and demonstrated its use to characterise large-scale gene expression profiles of hippocampus but also of other neuronal and non-neuronal tissues. In addition, we have generated a large collection of ESTs of marmoset origin, which are now available in the public domain. These new tools will facilitate molecular genetic research into this non-human primate animal model.
The common marmoset monkey (Callithrix jacchus), a small non-endangered New World primate native to eastern Brazil, is becoming increasingly used as a non-human primate model in biomedical research, drug development and safety assessment. Not only their close genetic, physiological and metabolic similarity to humans but also several unique physiological differences between Old World and New World primates, underlie its widespread use as an animal model in studies involving aspects of infectious disease, stem cell research, neural and cognitive sciences, toxicology and drug development and reproductive biology. Moreover, its availability, size, ease of breeding in laboratory conditions and its clear advantages over other non-human primates in terms of animal welfare, costs and practicality contribute to its popularity as an attractive alternative to other non-human primate species such as the rhesus macaque . In contrast to the growing interest for the marmoset as an animal model, the molecular tools for genetic analysis are extremely limited.
Although a Marmoset Genome Project was initiated in 2004 and is ongoing , today there are still only very few mRNA sequences of marmoset origin available in the public domain. To allow gene expression profiling of different marmoset organs and delineation of complex biological pathways, we have developed a marmoset-specific DNA microarray, called EUMAMA (EUropean MArmoset MicroArray) starting from large scale EST sequencing of marmoset transcripts to generate the required sequence information.
The marmoset array is an initiative within a European Commission-funded consortium aimed at investigating the long term effects of prenatal exposure to glucocorticoids  (hormones secreted by the adrenal cortex in response to stress) in a non-human primate model. The working hypothesis of this consortium is that excessive foetal exposure to glucocorticoid hormones (GCs) exerts organisational effects on the development of a variety of target tissues resulting in altered gene expression patterns that may persist throughout life. These 'programming' effects are believed to shape vulnerability to a variety of adult diseases including heart disease, diabetes, kidney failure and brain disorders [4–8]. Since the effects of prenatal GC exposure are so widespread, we chose to initially focus on the effects in the brain by using hippocampal RNA as a source to generate EST sequences on the marmoset array. The hippocampus is a brain structure involved in learning and memory processes, mood and regulation of the stress response via the hypothalamic-pituitary-adrenal axis. GCs are essential for normal hippocampal development, exerting a wide spectrum of effects, resulting in altered structural plasticity and function [9–12]. Therefore, the obvious applicability of the marmoset microarray is to study neuronal gene expression. However, we show here that the microarray can also be used to generate expression profiles in a wide variety of non-neuronal marmoset tissues.
Besides the marmoset microarray we have generated 3215 ESTs derived from marmoset hippocampus, which have been functionally annotated and submitted to GenBank [GenBank: EF214838 – EF215447, EH380242 – EH382846]. These new tools will facilitate molecular genetic research into this non-human primate animal model.
Mapping, annotation and submission of sequences to GenBank
A total of 3441 high quality sequences (Phred score: Q15>600; 87.5% of the sequences also passed the threshold of Q20>600 bp; the overall minimal Q20 score was 429 bp) were selected for annotation . After removal of redundant sequences, 3215 high quality unique 3' EST sequences remained which were trimmed to an average length of 678 bases. Annotation of the EST dataset to the RefSeq dataset of human genes allowed unambiguous assignment of gene names for 2250 sequences. A remaining 292 sequences could also be confidently assigned with gene names by mapping to the human genome or cross-species mapping. Of these 2542 (2250 + 292) sequences assigned with a gene name, 610 (19%) sequences could also be annotated with an open reading frame (ORF) (Figure 1A). These ORF-containing sequences were submitted to GenBank [GenBank: EF214838 – EF215447]. The majority of the remaining 673 sequences without an assigned gene name consisted of a group of 642 (20%) which could not be matched to a known mRNA but were mappable on the genome and 31 (1%) which could not mapped to RefSeq genes nor to a mammalian genome (Figure 1A). The latter 31 genes most likely represent novel genes that may be uniquely expressed in the marmoset hippocampus. All 2605 (81%) EST sequences without an ORF were submitted to dbEST [GenBank: EH380242 – EH382846].
Gene selection and construction of the marmoset microarray
A subset of 1448 of the annotated 3'ESTs were selected by the EUPEAH consortium to be represented on the marmoset array, supplemented with 68 marmoset cDNA sequences available in GenBank. We selected the GeneChip® CustomExpress® Array Program of Affymetrix for design and manufacturing of the marmoset array (format 100–2187 containing approx. 1,600 gene sequences at 11 probe pairs per probe set). This choice was based on the previous experience of Affymetrix working with consortia to design specific dedicated arrays but also due to the advantages of standardisation of the hybridisation and data extraction procedure. The sequence data was submitted to the Affymetrix Chip Design Group and used to design 11 perfect match and mismatch 25-mer oligonucleotides per sequence according to the standard algorithms used in Affymetrix commercially available catalogue arrays. The most optimal probes sets per sequence were selected for in situ synthesis on the marmoset array. Each unique transcript was represented by a single probe set, except for 11 transcripts where there was no obvious ideal probe set. In these 11 cases 2 different probes sets were synthesized in situ representing different nucleotides of the same transcript. As controls the standard Affymetrix control set present on all Affymetrix expression arrays, consisting of quality control and alignment controls as well as target preparation and hybridisation controls, was tiled. Since we did not have a set of full-length housekeeping control genes of the marmoset for assessment of sample quality, scaling and/or normalization of arrays, the species-specific control sets of rhesus monkey and human were tiled. In addition, both 5' and 3' sequences of 11 marmoset genes abundantly expressed in hippocampus were included as controls for assessing the overall labeling efficiency and quality of biotin-labeled target sample (Table 1).
The final design of the marmoset microarray consisted of a total of 1649 probe sets of which 97 control probe sets and 1552 probe sets representing 1541 different marmoset transcripts. A total of 71 probe sets were derived from 68 different sequences already available in GenBank, while the remaining 1481 probe sets were derived from 1473 of our proprietary sequences obtained by sequencing marmoset ESTs. The majority (1445) of these 1473 ESTs were derived from the set of 3215 ESTs that have been submitted to GenBank (Figure 1B). Another 28 ESTs had already been selected for representation on the marmoset array based on our initial annotation, but did not pass the criteria we applied for GenBank submission.
The majority of the ESTs represented on the marmoset array assigned with a gene symbol (in total 1251) could be classified according to gene ontology based on biological function (1034) or molecular process (1106), representing a wide variety of different functional classes (Additional file 1).
Expression profiling of hippocampus and several non-neuronal tissues
Detection rates, indicated by present calls, on RNA derived from hippocampus and cortex were high with 85% present calls. Even though the sequences represented on the marmoset GeneChip are of hippocampal origin, detection rates on RNA derived from various peripheral tissues, including kidney, liver, adipose tissue, aorta, skeletal muscle, ovary and testis, were also high. All peripheral tissues tested so far displayed approximately 70% present calls, with the exception of adipose tissue that had an overall level of detection of 78% (Table 2).
Correlation coeffcients were calculated based on hybridisation signals on the marmoset microarray for all possible combinations of tissues (Table 3). The highest correlation coefficient (0.96) was obtained when comparing gene expression in both neuronal tissues, i.e. hippocampus and cortex, indicating a high degree of similarity in expression profiles. In contrast, comparing hippocampal gene expression to all peripheral tissues resulted in a severe drop in correlation coefficient (Figure 2), ranging from 0.69 in muscle to 0.50 in liver.
Half of the probe sets of (767/1552 = 49.4%) were detectable (defined by a present call) in all tissues we tested so far, albeit in most cases at varying levels. Sorting of the expression profiles based on detection call allowed the identification of transcripts expressed exclusively in 1 of the 9 tissues tested. In liver, for example, we identified 7 probe sets with a present or marginal call that had absent calls in the other 8 tissues tested. Three of these 7 probes sets represented cytochrome p-450, enzymes that mediate the oxidative biotransformation of most drugs and are known to be predominantly expressed in liver . Detection of 5 probe sets was restricted to the kidney, one of which represented (prepro)renin. Renin is part of the renin-angiotensin system known to play a major role in volume regulation and is expressed at high levels in the kidney . Sperm protein 17 was exclusively expressed in testis. An explanation for the fact that we find a number of genes that appear to be exclusively expressed in other tissues than brain, while the origin of the ESTs is brain tissue, may be that these genes are predominantly derived from the list of 68 pre-existing GenBank sequences on the array, which are not necessarily expressed in brain. Indeed, this appears to be the case in many instances. For example, the 3 cytochrome p-450 probe sets exclusively expressed in liver are derived from pre-existing GenBank sequences, as well as 4 of the 5 kidney-specific probe sets (including renin) and 3 of the 4 testis-specific probe sets (including sperm protein 17). An additional explanation is that expression levels in brain may be too low to detect using microarrays, but were detectable in our normalised cDNA library of hippocampus.
Detection of a total of 82 probe sets was restricted to neuronal tissue (hippocampus and cortex) (Additional file 3). Among these 82 probe sets were several ion channels (KCNAB3, KCNF1, KCNH3, SCN1a, CACNB1), neurotransmitter receptors (GABRA5, 5-HTR2C, GRIN1), neural cell adhesion molecules (NCAM, OPCML) and diverse components involved in neurotransmitter secretion (SYN2, SNPH), axonogenesis (SLITRK1) and neurotrophic factor signalling (NTRK2). It must be noted that although the sequences have been annotated, the information is not that detailed that for each gene the exact isoform or splice variant is indicated. For example, a closer look at the data of CACNB1 indicates that only isoform 1 is targeted by our sequence, which indeed is expressed in brain, while isoform 2 is expressed in muscle. The users of the marmoset microarray have access to the exact probe set sequences as well as the EST sequences, allowing unambiguous determination of which part of the transcript is targeted and which splice variants or isoforms are detected.
Ninety probe sets were not detected in any of the tissues tested, which may be an intrinsic property of the selected probes or due to potential sequencing errors resulting in mismatches of the probe sets with the labeled mRNA (Additional file 3).
We have constructed a marmoset-specific oligonucleotide microarray representing 1541 different marmoset transcripts expressed in hippocampus. Although the sequences are primarily derived from a cDNA library of hippocampus, we have demonstrated that the marmoset array can also be efficiently used to characterise gene expression in other tissues of non-neuronal origin. The obtained detection rates are high, albeit somewhat lower in peripheral tissues than in neuronal tissue. The reproducibility of hybridisation is high, judged by the correlation (0.957) between both neuronal expression profiles (hippocampus and cortex). Moreover, the obtained tissue-specific patterns of expression are in line with what can be expected, with neuronal genes expressed exclusively in hippocampus and cortex, cytochrome p-450s in liver, preprorenin in kidney and sperm protein 17 in testis.
In the process of development of the marmoset microarray we have generated a set of 3215 marmoset 3' ESTs which have been submitted to GenBank. The number of marmoset sequences present today in publically available sequence databases including GenBank are 1433 (query "callithrix jacchus" at the National Center for Biotechnology Information ), which is a mix of working draft genomic sequence of the marmoset genome project, (partial) mRNA sequences and ESTs and contains several redundancies. Our contribution of 3215 different 3' ESTs is a substantial expansion of the number of marmoset sequences. Both the marmoset array and the generated ESTs therefore significantly expand the available molecular tools for genetic analysis in this non-human primate animal model.
To our knowledge EUMAMA is the first DNA microarray specific for the common marmoset. As is the case with all first-generation microarrays for a species, some day in the future our marmoset microarray is likely to become obsolete and be replaced by an array targeting a larger part of the transcriptome. Until there is a better replacement (e.g when the marmoset genome is sequenced) this microarray is the only molecular tool allowing analysis of gene expression of a reasonably comprehensive number of genes expressed in marmoset brain. The strength of this array is not only that it is species-specific, but also that there are corresponding marmoset sequences available for genetic follow-up of differentially expressed transcripts. So far the availability of microarrays specific for non-human primates is very limited. Oligonucleotide microarrays for the rhesus macaque (Macaca mulatta) have recently become commercially available at both Affymetrix and Agilent . However, today expression profiling in non-human primates including chimpanzee (Pan troglodytes), orangutan (Pongo pygmaeus), African green monkey (Chlorocebus aethiops), common marmoset (Callithrix jacchus) and cynomolgus monkeys (Macaca fascicularis) is still performed using human arrays [18–23], despite the fact that sequence mismatches affect hybridisation intensity and likely result in a high level of false negatives or underrepresented expressed genes [18, 24, 25]. Interspecies use of microarrays is particularly problematic when using human microarrays to study gene expression in non-human primates that are more divergent to humans than the great apes. While chimpanzee or orangutan have an average nucleotide sequence divergence of 0.8–1% and 3% respectively [26–28], the average sequence divergence of human and rhesus macaque is approximately 5% [29, 30] increasing to 11% when comparing the marmoset and human genome . A 4–8% difference in sequence will cause each human-specific 25-mer probe on a microarray to contain on average a single nucleotide mismatch with its target . A large effect of sequence divergence on hybridisation signal was found in a study using a multi-primate cDNA array containing human, chimpanzee, orangutan and rhesus sequences, even between organisms that are only diverged ~1% . The general picture that emerges is that the use of single species arrays for comparison between species may be problematic and yield spurious results , warranting careful data analysis and a significant amount of confirmatory analysis using other techniques. Meanwhile, the clear need for generation of species-specific cDNA and oligonucleotide-based microarray resources remains .
In summary, we have generated the first marmoset-specific DNA microarray and demonstrated its use to characterise large scale gene expression profiles of hippocampus but also of other neuronal and peripheral tissues. In addition, we have generated a large collection of ESTs of marmoset origin which are now available in the public domain. These new tools will facilitate molecular genetic research into this non-human primate animal model.
Generation of marmoset cDNA sequences
Total RNA was isolated from hippocampi of 5 adult marmosets, both males and females ranging from 3 to 10 years in age, After checking the quality of the isolated RNA it was pooled and used to construct a custom normalised cDNA library in pCMVSport6.1 (Invitrogen, Carlsbad, USA). For the development of the marmoset microarray a total of 23,300 clones were gridded in 61 384-well plates. In total approximately 5700 sequencing reactions were performed at Baseclear (Leiden, The Netherlands) using an anchored oligo-dT primer on 15 randomly selected 384-well plates to generate partial 3' EST sequences.
RNA preparation and labelling
Total RNA was isolated using TRIzol® reagent (Invitrogen Life Technologies, Carlsbad, CA, USA) according to the manufacturer's instructions. After isolation, total RNA was purified using the QIAGEN RNEasy® Mini Kit RNA Cleanup procedure (QIAGEN Inc., Valencia, CA, USA). RNA quality was assessed with the LabChip® RNA 6000 Nano Assay on the 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). Per RNA sample, 5 μg was used as input into the MessageAmp™ II-Biotin Enhanced Single Round aRNA Amplification Kit (AM1791, Ambion, Austin, USA) according to the manufacturer's instructions. Briefly, total RNA was converted to double-stranded cDNA followed by in vitro transcription to synthesize biotin-labeled aRNA. After purification the size distribution of the aRNA was checked with the LabChip® RNA 6000 Nano Assay on the 2100 Bioanalyzer (Agilent Technologies) and the quantity was assessed by small sample spectophotometry on a Nanodrop ND-1000 Spectrophotometer (NanoDrop Technologies, Wilmington, Delaware USA).
The biotinylated aRNA was fragmented and 15 μg was hybridised to marmoset arrays at the Leiden Genome Technology Center (LGTC®), Leiden University, The Netherlands, according to the manufacturer's recommendations (Affymetrix, Santa Clara, USA).
Affymetrix GeneChip® Operating Software (GCOS) was used to estimate signal intensities plus signal reliabilities. Data was median normalized in excel and subsequently scatterplots were constructed and correlation coefficients calculated using R version 2.2.0 . Probe sets assigned with a gene symbol were functionally classified according to biological process and molecular function using Netaffx Analysis Center .
Mapping and annotation of EST sequences
In order to obtain high-quality data the EST sequences were trimmed before further analysis. At the 3' end, at least 15% of each sequence was removed so that final DNA fragment lengths were 700 bases. In addition, 20 bases from the 5' end of the EST sequences were trimmed. For the annotation of the EST dataset, the sequences were reverse complemented and compared to the RefSeq dataset of human genes (NCBI Refseq build 36.1) using a locally installed version of the Blat software . Remaining sequences that could not be unambiguously assigned to a gene name were mapped to the human genome sequence (NCBI Refseq assembly version 36.1) according to the standard parameters of the software. ESTs with a hit on the genome were annotated with the gene name of the gene which is located at the respective chromosome region. Ambiguities, e.g. if an EST sequence covered more than one gene, were resolved manually. Finally, a Blat search with increasing sensitivity parameters was performed to map ESTs across species.
The annotation procedure described above allowed a reliable annotation of EST data even if sequencing errors would cause frame-shifts in coding regions. In order to annotate open reading frames (ORFs) associated with the ESTs, ORFs with a length of at least 100 bases starting from the 5' end of the reverse complementary sequences were extracted. The translated amino acid sequences were mapped against the human protein sequences from UniProt (Release 50 of May 2006, ) using Blat . All sequences without any hit on a protein sequence starting at the first position ofthe 5' end where rejected. The remaining candidate ORFs were first validated by a comparison of the gene names from the UniProt hits to the gene names of the above determined RefSeq hits. To be included as bona fide ORFs, the translated ORF sequences had to fulfil an additional criterion. The length of the peptide sequences derived from the ESTs had to cover at least 80% of the peptide sequence length which can be expected from the corresponding part of the protein sequence from UniProt.
Mansfield K: Marmoset models commonly used in biomedical research. Comp Med. 2003, 53: 383-392.
Mansfield K, Tardif S, Eichler E: White Paper for Complete Sequencing of the Common Marmoset (Callithrix jacchus) genome. 2004, [http://www.genome.gov/Pages/Research/Sequencing/SeqProposals/MarmosetSeq.pdf]
Glucocorticoid Hormone Programming in Early Life and its Impact on Adult Health . [http://www.eupeah.org]
Grino M: Prenatal nutritional programming of central obesity and the metabolic syndrome: role of adipose tissue glucocorticoid metabolism. Am J Physiol Regul Integr Comp Physiol. 2005, 289: R1233-5.
Hales CN, Barker DJ: The thrifty phenotype hypothesis. Br Med Bull. 2001, 60: 5-20. 10.1093/bmb/60.1.5.
Barker DJ: Fetal origins of coronary heart disease. Br Heart J. 1993, 69: 195-196.
Hauser J, Dettling-Artho A, Pilloud S, Maier C, Knapman A, Feldon J, Pryce CR: Effects of prenatal dexamethasone treatment on postnatal physical, endocrine and social development in the common marmoset monkey. Endocrinology. 2007
Seckl JR: Prenatal glucocorticoids and long-term programming. Eur J Endocrinol. 2004, 151 Suppl 3: U49-62. 10.1530/eje.0.151U049.
Sousa N, Almeida OF: Corticosteroids: sculptors of the hippocampal formation. Rev Neurosci. 2002, 13: 59-84.
de Kloet ER, Joels M, Holsboer F: Stress and the brain: from adaptation to disease. Nat Rev Neurosci. 2005, 6: 463-475. 10.1038/nrn1683.
Fujioka A, Fujioka T, Ishida Y, Maekawa T, Nakamura S: Differential effects of prenatal stress on the morphological maturation of hippocampal neurons. Neuroscience. 2006, 141: 907-915. 10.1016/j.neuroscience.2006.04.046.
Coe CL, Kramer M, Czeh B, Gould E, Reeves AJ, Kirschbaum C, Fuchs E: Prenatal stress diminishes neurogenesis in the dentate gyrus of juvenile rhesus monkeys. Biol Psychiatry. 2003, 54: 1025-1034. 10.1016/S0006-3223(03)00698-X.
Phred - Quality Base Calling. [http://www.phrap.com/phred]
Sarlis NJ, Gourgiotis L: Hormonal effects on drug metabolism through the CYP system: perspectives on their potential significance in the era of pharmacogenomics. Curr Drug Targets Immune Endocr Metabol Disord. 2005, 5: 439-448. 10.2174/156800805774912971.
Nguyen G, Burckle C, Sraer JD: The renin receptor: the facts, the promise and the hope. Curr Opin Nephrol Hypertens. 2003, 12: 51-55. 10.1097/00041552-200301000-00009.
National Center for Biotechnology Information. [http://www.ncbi.nlm.nih.gov]
Wallace JC, Korth MJ, Paeper B, Proll SC, Thomas MJ, Magness CL, Iadonato SP, Nelson C, Katze MG: High-density rhesus macaque oligonucleotide microarray design using early-stage rhesus genome sequence information and human genome annotations. BMC Genomics. 2007, 8: 28-10.1186/1471-2164-8-28.
Walker SJ, Wang Y, Grant KA, Chan F, Hellmann GM: Long versus short oligonucleotide microarrays for the study of gene expression in nonhuman primates. J Neurosci Methods. 2006, 152: 179-189. 10.1016/j.jneumeth.2005.09.007.
Enard W, Khaitovich P, Klose J, Zollner S, Heissig F, Giavalisco P, Nieselt-Struwe K, Muchmore E, Varki A, Ravid R, Doxiadis GM, Bontrop RE, Paabo S: Intra- and interspecific variation in primate gene expression patterns. Science. 2002, 296: 340-343. 10.1126/science.1068996.
Khaitovich P, Hellmann I, Enard W, Nowick K, Leinweber M, Franz H, Weiss G, Lachmann M, Paabo S: Parallel patterns of evolution in the genomes and transcriptomes of humans and chimpanzees. Science. 2005, 309: 1850-1854. 10.1126/science.1108296.
Marvanova M, Menager J, Bezard E, Bontrop RE, Pradier L, Wong G: Microarray analysis of nonhuman primates: validation of experimental models in neurological disorders. Faseb J. 2003, 17: 929-931.
Caceres M, Lachuer J, Zapala MA, Redmond JC, Kudo L, Geschwind DH, Lockhart DJ, Preuss TM, Barlow C: Elevated gene expression levels distinguish human from non-human primate brains. Proc Natl Acad Sci U S A. 2003, 100: 13030-13035. 10.1073/pnas.2135499100.
Dillman JF, Phillips CS: Comparison of non-human primate and human whole blood tissue gene expression profiles. Toxicol Sci. 2005, 87: 306-314. 10.1093/toxsci/kfi243.
Gilad Y, Rifkin SA, Bertone P, Gerstein M, White KP: Multi-species microarrays reveal the effect of sequence divergence on gene expression profiles. Genome Res. 2005, 15: 674-680. 10.1101/gr.3335705.
Karssen AM, Li JZ, Her S, Patel PD, Meng F, Evans SJ, Vawter MP, Tomita H, Choudary PV, Bunney WE, Jones EG, Watson SJ, Akil H, Myers RM, Schatzberg AF, Lyons DM: Application of microarray technology in primate behavioral neuroscience research. Methods. 2006, 38: 227-234. 10.1016/j.ymeth.2005.09.017.
Hellmann I, Zollner S, Enard W, Ebersberger I, Nickel B, Paabo S: Selection on human genes as revealed by comparisons to chimpanzee cDNA. Genome Res. 2003, 13: 831-837. 10.1101/gr.944903.
Ebersberger I, Metzler D, Schwarz C, Paabo S: Genomewide comparison of DNA sequences between humans and chimpanzees. Am J Hum Genet. 2002, 70: 1490-1497. 10.1086/340787.
Chen FC, Li WH: Genomic divergences between humans and other hominoids and the effective population size of the common ancestor of humans and chimpanzees. Am J Hum Genet. 2001, 68: 444-456. 10.1086/318206.
Gilad Y, Man O, Paabo S, Lancet D: Human specific loss of olfactory receptor genes. Proc Natl Acad Sci U S A. 2003, 100: 3324-3327. 10.1073/pnas.0535697100.
Wall JD, Frisse LA, Hudson RR, Di Rienzo A: Comparative linkage-disequilibrium analysis of the beta-globin hotspot in primates. Am J Hum Genet. 2003, 73: 1330-1340. 10.1086/380311.
She X, Liu G, Ventura M, Zhao S, Misceo D, Roberto R, Cardone MF, Rocchi M, Green ED, Archidiacano N, Eichler EE: A preliminary comparative analysis of primate segmental duplications shows elevated substitution rates and a great-ape expansion of intrachromosomal duplications. Genome Res. 2006, 16: 576-583. 10.1101/gr.4949406.
Hacia JG: Genome of the apes. Trends Genet. 2001, 17: 637-645. 10.1016/S0168-9525(01)02494-5.
Ihaka R, Gentleman R: R: A language for data analysis and graphics. Journal of Computational and Graphical Statistics. 1996, 3: 299-314. 10.2307/1390807.
NetAffx Analysis Center. [http://www.affymetrix.com/analysis/index.affx]
Kent WJ: BLAT--the BLAST-like alignment tool. Genome Res. 2002, 12: 656-664. 10.1101/gr.229202. Article published online before March 2002.
Wu CH, Apweiler R, Bairoch A, Natale DA, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, Martin MJ, Mazumder R, O'Donovan C, Redaschi N, Suzek B: The Universal Protein Resource (UniProt): an expanding universe of protein information. Nucleic Acids Res. 2006, 34: D187-91. 10.1093/nar/gkj161.
This work was supported by the European Commission grant QLRT-2001-02758 (EUPEAH) and the Centre for Medical Systems Biology (CMSB), Leiden, The Netherlands. The microarray hybridisations were performed at Leiden Genome Technology Center, Leiden, The Netherlands.
NAD generated the microarray, from sequence generation to design, coordinated the study and drafted the manuscript. MM carried out the RNA isolation, labeling and data analysis. SA and VWA contributed the marmoset sequences for CAT, EF2, GCLM, GPx1, GPx4, GSR and GSS and assisted with the initial gene selection. HZ isolated the hippocampal RNA used for the cDNA library and assisted in selection of marmoset housekeeping genes. CS was responsible for isolation and shipment of all marmoset tissues. BED and MAH performed the initial mapping and annotation of the sequences in 2004. BW and AR reannotated and performed the mapping of the sequences in 2006 and were responsible for GenBank submission. ERdK and EF had an advisory role in all steps of the array generation and validation. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Gene content of the marmoset microarray and Gene Ontology Classification. Excel file on content of the marmoset array. From left to right are columns indicating the Probe set ID on the marmoset microarray, the corresponding gene symbol (if known) and gene symbol aliases as well as the gene title. In the next column the source of the sequence is indicated, meaning whether it is derived from the set of 3215 ESTs or an already existing GenBank accession or if is a control sequence. Subsequently the GenBank accessions of the marmoset sequences and the gene ontology classification according to biological process, molecular function and cellular compartment, pathway and if applicable a pathway hyperlink are indicated. (XLS 796 KB)
Additional file 2: Overview of 3215 sequences submitted to GenBank and dbEST. Excel file giving a detailed overview of the panel of 3215 marmoset ESTs. From left to right the columns indicate the name of the EST clone, the corresponding gene symbol and aliases. The column "source" indicates if the sequence was mappable to a genome, could be assigned a gene name or not or contained a (partial) ORF. The next columns indicate the GenBank Accession number and whether the sequence was submitted to GenBank of dbEST. Finally, the last 2 columns indicate whether the sequence is represented on the marmoset array and if so by which probe set(s). (XLS 554 KB)
Additional file 3: Overview of tissue-specific expression profiles. Excel file showing hybridisation signals and detection calls for all tissues tested on the array. From left to right the Probe set ID on the marmoset array, the corresponding gene symbol (if known) and gene symbol aliases as well as the gene title are indicated. In the next columns the signals and the corresponding detection calls are listed for each tissues and finally a string of all detection calls is listed to facilitate identification of transcripts with a tissue-specific pattern of expression. (XLS 1 MB)
About this article
Cite this article
Datson, N.A., Morsink, M.C., Atanasova, S. et al. Development of the first marmoset-specific DNA microarray (EUMAMA): a new genetic tool for large-scale expression profiling in a non-human primate. BMC Genomics 8, 190 (2007). https://doi.org/10.1186/1471-2164-8-190
- Common Marmoset
- Sperm Protein
- Normalise cDNA Library
- Molecular Genetic Research
- Large Scale Gene Expression Profile