- Methodology article
- Open Access
Development of a chicken 5 K microarray targeted towards immune function
© Smith et al; licensee BioMed Central Ltd. 2006
- Received: 17 November 2005
- Accepted: 13 March 2006
- Published: 13 March 2006
The development of microarray resources for the chicken is an important step in being able to profile gene expression changes occurring in birds in response to different challenges and stimuli. The creation of an immune-related array is highly valuable in determining the host immune response in relation to infection with a wide variety of bacterial and viral diseases.
Here we report the development of chicken immune-related cDNA libraries and the subsequent construction of a microarray containing 5190 elements (in duplicate). Clones on the array originate from tissues known to contain high levels of cells related to the immune system, namely Bursa, Peyer's patch, thymus and spleen. Represented on the array are genes that cluster with existing chicken ESTs as well as genes that are unique to our libraries. Some of these genes have no known homologies and represent novel genes in the chicken collection. A series of reference genes (i.e. genes of known immune function) is also present on the array. Functional annotation data are provided for as many of the genes on the array as possible.
Six new chicken immune cDNA libraries have been created and nearly 10,000 sequences submitted to GenBank [GenBank: AM063043-AM071350; AM071520-AM072286; AM075249-AM075607]. A 5 K immune-related array has been developed from these libraries. Individual clones and arrays are available from the ARK-Genomics resource centre.
- Gene Ontology
- Infectious Bursal Disease Virus
- Ensembl Database
- Genomic Solution
- UCSC Table Browser
In recent years, the tools available to the field of chicken genomics have increased greatly. Detailed genetic and physical maps have been constructed, as well as BAC contig maps [2, 3] and a radiation hybrid panel. There is also a substantial EST collection and an SNP database, and many full-length cDNAs have been sequenced. The development of these resources has culminated in the recent publication of the chicken draft sequence. The chicken can now be regarded as an important model organism for comparative genomics, occupying a potentially informative evolutionary position. The chicken is also an extremely useful model for developmental biologists and geneticists, as well as being a commercially important species.
The latest tools being developed for the chicken are microarrays. Several small tissue-specific arrays are in use by individual labs. These include an intestinal array (3,072 clones), a macrophage-specific array (4,906 clones), a lymphocyte array (3,011 clones) and an 11 K array based on genes found in heart progenitor cells. A 13 K genome-wide array is also available from ARK-Genomics (Roslin, UK) and from the Fred Hutchinson Cancer Research Centre (Seattle, USA). We have designed a 5 K immune-related array from libraries prepared from tissues (Bursa, spleen, Peyer's patch, thymus) of birds that had previously been inoculated with a combination of vaccines against common avian diseases caused by bacterial, protozoan and viral pathogens (E. coli, Newcastle Disease Virus (NDV), Infectious Bursal Disease Virus (IBDV), coccidiosis, Marek's Disease (MD) and salmonella). The tissues we chose are highly representative of T and B cell populations and were used in order to maximise the number of immunologically related genes present in our libraries. Many known immune genes that have recently been identified in the chicken EST collections have also been added to the array. This array provides a valuable, cost-effective resource for the investigation of immunological gene expression. It has been created from a pool of stimulated immune tissues and contains genes that represent a wide spectrum of immune functions as well as previously unidentified sequences. Each gene on the array is functionally annotated as far as possible: gene ontology data and Blast information are provided for each clone where that information is available.
Construction of the array
Clones sequenced from each library.
No. of clones
Chicken immune 1 ('B cell' standard)
Chicken immune 2 ('B cell' normalized 1)
Chicken immune 3 ('B cell' normalized 2)
Chicken immune 4 ('T cell' standard)
Chicken immune 5 ('T cell' normalized 1)
Chicken immune 6 ('T cell' normalized 2)
Genes on the array
List of known immune genes added as reference genes to the array.
CC chemokine receptor 6
CC CKR 11
CD98 light chain
Chemokine receptor like 2
complement receptor 1
CX 3C chemokine receptor 1
Cytokine like protein 17
Cytokine receptor like 9
Death receptor 6
ephrin type A receptor 2
FASL decoy receptor 3
Ig light chain VJC region
Ig heavy chain VDJ region
Interleukin enhancer binding factor 3
MHC class I
MHC class I minor
MHC class II beta
opioid receptor sigma 1
Orphan chemokine receptor
platelet activating receptor
Regulator of cytokinesis 1
T cell receptor α
T cell receptor β
T cell receptor γ
T cell receptor ζ
Thymosin beta 4
Analysis of the immune clones
All the sequences of the clones on the array were subject to Blast homology searches against the SwissProt and TREMBL databases using a cut-off value of 1e-10. Using this means of detection, many known immune-related molecules were identified, including cytokines, interferons, interleukins, transcription factors, receptors, cell differentiation antigens, MHC molecules and genes for proteins belonging to the TOLL receptor pathway. Proteins homologous to hypothetical human proteins and mouse cDNAs were also identified.
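The E-value screen described above can be sketched in a few lines of Python. This is only an illustration, assuming the Blast results have already been parsed into (clone, subject, E-value) tuples and that a hit counts when its E-value is at or below the cut-off; the clone and protein names are hypothetical and are not taken from the array.

```python
from typing import Iterable, List, Set, Tuple

E_VALUE_CUTOFF = 1e-10  # cut-off value quoted in the text

Hit = Tuple[str, str, float]  # (clone id, database subject, E-value)


def split_by_homology(clones: Iterable[str], hits: Iterable[Hit],
                      cutoff: float = E_VALUE_CUTOFF) -> Tuple[List[str], List[str]]:
    """Return (clones with a hit at or below the cut-off, clones without one)."""
    with_hit: Set[str] = {clone for clone, _subject, e in hits if e <= cutoff}
    all_clones = list(clones)
    annotated = [c for c in all_clones if c in with_hit]
    unannotated = [c for c in all_clones if c not in with_hit]
    return annotated, unannotated


# Hypothetical example data, for illustration only.
clones = ["clone_A", "clone_B", "clone_C"]
hits = [
    ("clone_A", "protein_X", 3e-45),
    ("clone_B", "protein_Y", 2e-4),  # too weak: above the cut-off
]
annotated, unannotated = split_by_homology(clones, hits)
```

Clones that end up in the second list correspond to the "no Blast homology" class discussed in the next paragraph.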
Sequences that gave no Blast homology to anything in the nucleotide or protein databases accounted for about 38% of the clones. Either the search parameters were too stringent to identify these genes or the chicken sequences were too divergent to be detected in a standard Blast search. Such divergence is a common feature of immune-related genes, which are often very difficult to identify by sequence homology to their mammalian homologues. Some of these sequences may also represent non-conserved 3' UTR regions of genes. This set of clones may also include genes that have never been identified before and are not represented in the sequence databases. Further, more detailed analysis of these sequences can sometimes help elucidate the nature of the gene in question. Protein sequences can be predicted from the EST nucleotide sequence using programs such as ESTScan, which takes into account sequencing errors and thus the potential frame-shifts that are often present when only one EST sequence is available for study. Conserved motifs and domains can then sometimes be identified, for example using the Pfam database, which is a large collection of multiple sequence alignments and hidden Markov models covering many common protein domains and families. PSI-Blast searches can also help identify the family to which a gene belongs.
Genomic location of unique chicken ESTs as identified by the University of California, Santa Cruz (UCSC) Blat site http://genome.ucsc.edu/cgi-bin/hgBlat?command=start
GenBank Accession No.
Gene ontology (GO) annotations
To elucidate the function of the genes on the array further, we assigned as much annotation to the sequences as possible. GO annotations were assigned to some sequences after searching the GGGI and UMIST databases, while other annotation was derived from hits to orthologous human sequences from the Ensembl and GENSCAN databases, as described in the 'methods' section. Having annotation derived from orthologous human genes means that cross-species comparisons between chicken and human array data may be possible. A search of the Ensembl database provided information on 2,292 GO-term associations, the GGGI database 1,542 and GENSCAN 566, while the UMIST full-length cDNA database provided a further 365 annotations. The sequences on the array cover a total of 227 GO terms, with 73% of all the sequences having at least one GO entry assigned. The available annotation for the array sequences breaks down as follows: 52% of genes have a 'cellular component' term assigned, 60% have a 'molecular function' term and 56% have a 'biological process' term. 83% of all the genes on the array have some kind of gene description and, after searching each sequence against the sequences in the Ensembl chicken genome collection (July 2005 genebuild), 78% of sequences were found to have a known chromosomal location. Now that all these sequences have been added to GenBank, and thus have accession numbers that can be linked directly into the Ensembl databases (work currently underway), obtaining comprehensive, up-to-date annotation data will become much easier.
A file showing the complete annotation for all the sequences on the array is available as supplementary material (Additional file 1). Additional file 2 provides an overview of the broad functional classes represented by the genes on the array. These are based on more general GO annotations derived from the GO-slims database at EBI, and give an insight into the different classes of genes present on the array without the need to examine the detailed functional annotation of each individual gene.
Annotation is also available for some (9,137) of the ESTs in the UMIST collection. By comparing the relevant GO slims terms for the sequences in this collection with those present on our array, we can see which types of genes appear to be enriched in our set compared with a larger, more general collection of EST sequences. As shown in bold in Additional file 2, certain classes of gene appear to be more highly represented. For instance, genes involved in protein transport are more abundant in our set of clones, as are those involved in the response to stimulus. This is consistent with our attempts to pre-select for higher numbers of genes involved in the immune system.
Quality of the array
Using the array
This array is available from the ARK-Genomics resource facility at Roslin Institute. For anyone interested in immune research, an immune-focused array offers a more cost-effective and time-saving platform for gene expression experiments than the large oligo arrays, which carry thousands more genes, many of which will be of no interest. Analysis of the data is consequently easier and far less time-consuming. Information on the array has been deposited in ArrayExpress (Accession: A-MEXP-307) (Additional file 1), and all the sequences will shortly be submitted to the Ensembl database with links to all the GO annotation information in the GOA database.
We have constructed a 5 K chicken cDNA microarray that is highly selected for genes expressed in tissues with an immune function. This targeted array contains enough widely expressed genes (whose expression is not expected to change) to enable good normalization, as well as numerous known immune genes (from our novel libraries and from existing EST collections). The array also contains many genes of as yet unknown homology and function, as well as a few novel genes that are specific to the libraries from which the array was created. These genes of unknown function could well have a role in either the adaptive or the innate immune response, and thus provide a valuable resource for the analysis of gene expression changes occurring in birds that have been subjected to immune challenge. The array has been shown to provide highly reproducible results and is now available to the chicken and microarray communities as a whole.
Eight groups of 38 chickens (3 weeks old) were vaccinated using two different vaccine regimes. The eight groups were males and females of a commercial line of hybrid broiler (Ross 306, Aviagen, Newbridge, Midlothian, UK) and layer (Lohmann Brown, Lohmann Tierzucht, Cuxhaven, Germany) chicks given one of the two vaccination schemes. Group 1 was given vaccines for E. coli (0.5 ml in left breast muscle), ND and IBDV (0.5 ml in right breast muscle) formulated in alum-gel and oil-based immuno-potentiators. Intramuscular injections were given to ensure that all the birds received an equal dose. Group 2 vaccines consisted of Paracox 8 [Eimeria sp.] (0.1 ml in drinking water), Nobilis Rismavac-CA126 [MD] (0.2 ml intramuscularly in leg) and Salenovac [S. enteritidis] (0.5 ml intramuscularly in leg). Tissue samples were obtained from unvaccinated birds and at 5 hr, 24 hr, 72 hr and 7 days post vaccination. Samples from groups of 5 birds were pooled. The tissues collected were Bursa, spleen, Peyer's patch and thymus. Tissue from Bursa, spleen and Peyer's patch was pooled to make the 'B-cell' libraries and the thymus tissue was used to construct the 'T-cell' libraries. The tissues and time points were chosen to maximise the number of immune-related transcripts, including those which may only be expressed transiently. All experimental protocols were authorized under the UK Animals (Scientific Procedures) Act, 1986.
Six libraries were constructed at Incyte Genomics (Palo Alto, CA): one standard and two normalized Bursa/spleen/Peyer's patch libraries, and one standard and two normalized thymus libraries. cDNA synthesis was initiated using an oligo (dT) primer, using methylated C in the first strand synthesis reaction. Following this first strand reaction, double-stranded cDNA was blunted, ligated to NotI adapters, digested with EcoRI, size-selected, and cloned into the NotI and EcoRI compatible sites of a custom-modified multiple cloning site (MCS) of the pBluescript (KS+) vector. Normalization was done in two rounds using conditions adapted from Soares et al. and Bonaldo et al. (see reference list), except that a significantly longer re-annealing hybridization was used. Around 10,000 clones were then sequenced at the Sanger Institute according to their protocols. Using the T7 primer, sequence was generated from the 5' end of each clone by the dideoxy chain termination method using an ABI 3700 sequence analyser (Applied Biosystems, Foster City, CA).
EST sequence analysis
Bioinformatic analysis commenced with 10,173 sequences. After poor-quality sequence and repeats had been eliminated by screening with phred, RepeatMasker, Crossmatch and XNU, 9,434 sequences remained. Unwanted sequences were then identified by running the Blast algorithm and screening the results for specific keywords: 'ribosomal', 'mitochondrial', 'Newcastle', 'Mareks', 'Eimeria', 'Salmonella' and 'E. coli'. A total of 8,154 sequences passed these criteria. These sequences were then clustered against the existing UMIST and EMBL chicken EST sequences using TIGR's clustering tool, tgicl. This resulted in 3,845 clusters, each containing one or more sequences from our libraries, and 1,959 singletons. The following clones were chosen for inclusion on the array: 3,770 cluster representatives, 1,067 singletons and 157 reference immune genes (93 clones from the UMIST collection, 41 from our immune libraries, 21 clones from the Delaware set and 2 clones courtesy of R. Zoorob (CNRS, France)) (Table 2).
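The keyword screen and the clone counts above can be illustrated with a short Python sketch. It assumes that each sequence has a single best-hit description string and that matching is case-insensitive; both are assumptions, and the example descriptions are hypothetical. The tally at the end uses only the counts quoted in this section, together with the 196 control elements mentioned in the array construction section.

```python
UNWANTED_KEYWORDS = ("ribosomal", "mitochondrial", "Newcastle", "Mareks",
                     "Eimeria", "Salmonella", "E. coli")


def is_unwanted(description: str) -> bool:
    """True if a Blast hit description mentions any of the screened keywords."""
    text = description.lower()
    return any(keyword.lower() in text for keyword in UNWANTED_KEYWORDS)


# Hypothetical hit descriptions, for illustration only.
descriptions = {
    "clone_A": "60S ribosomal protein L7",
    "clone_B": "Interleukin enhancer binding factor 3",
    "clone_C": "Salmonella enteritidis genomic sequence",
}
kept = [clone for clone, desc in descriptions.items() if not is_unwanted(desc)]

# Tally of the array composition, using the counts quoted in the text.
reference_genes = 93 + 41 + 21 + 2          # UMIST, own libraries, Delaware, R. Zoorob
est_clones = 3770 + 1067 + reference_genes  # cluster representatives + singletons + references
total_elements = est_clones + 196           # plus control elements
```

The tally reproduces the 4,994 EST clones and 5190 array elements reported elsewhere in the article, which is a useful consistency check on the selection numbers.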
Construction of the array
The immune array was constructed from 4,994 chicken EST clones plus 196 control elements (landing lights (positional controls), GAPDH, gamma actin, salmon sperm DNA, calf thymus DNA, chicken and bovine genomic DNA and a variety of spotting buffers). Plasmid DNA was prepared using MagAttract 96 Miniprep chemistry on a Biorobot 8000 platform (Qiagen Ltd., Crawley, UK), and the cDNA inserts were amplified using CGATTAAGTTGGGTAACGC (fwd) and CAATTTCACACAGGAAACAG (rev) in 50 μl reactions using 1 μl of DNA as a template. Amplified DNA was purified on Multiscreen 384 well PCR purification plates (Millipore, Watford, UK) on a Multiprobe II liquid handling platform (Perkin Elmer, Beaconsfield, UK), and the reactions were confirmed by agarose gel electrophoresis and quantified by Picogreen assay (Molecular Probes, Invitrogen, Paisley, UK) on a Fluoroskan Ascent fluorescent plate reader (Thermo Life Science, Basingstoke, UK). DNA was resuspended to 150 ng/μl in spot buffer (150 mM sodium phosphate, 0.01% SDS) before being spotted in duplicate on to amino-silane coated slides (CMT-GAPSII, Corning, Schiphol-Rijk, The Netherlands) using a Biorobotics MicroGrid II spotter (Genomic Solutions, Huntingdon, UK). Slides were then treated with succinic anhydride and 1-methyl-2-pyrrolidinone (Sigma, Poole, UK) to block unbound amino groups, followed by a wash in 95°C MilliQ water before hybridisation.
RNA preparation and labelling
Total RNA was isolated from lung tissue using a Trizol extraction according to the manufacturer's protocol (Invitrogen, Paisley, UK) and subsequently purified using the RNeasy Midi RNA Purification kit (Qiagen Ltd., Crawley, UK). RNA concentration was determined spectrophotometrically and RNA quality was determined using an Agilent 2100 Bioanalyser (Agilent Technologies, Waldbronn, Germany). Cy3 or Cy5 was incorporated into each sample using the Fairplay labelling kit (Stratagene, La Jolla, CA) and the labelled cDNA was cleaned up by passage through DyeEx columns (Qiagen Ltd., Crawley, UK). Labelling efficiency was determined by running 0.5 μl of each sample on a 1% agarose gel and measuring the intensity of fluorescence on a GeneTac LS IV scanner (Genomic Solutions, Huntingdon, UK).
Microarray hybridizations were carried out overnight using a GeneTAC automated hybridization system (Genomic Solutions, Huntingdon, UK). Hybridizations (125 μl) were carried out in Genomic Solutions hybridization solution (Cat. no. RP#0025) in a stepped hybridization: 55°C for 3 hr, 50°C for 3 hr and then 45°C for 12 hr. Slides were then washed in Genomic Solutions wash buffers (Cat. nos. CS#0038, CS#0039 and CS#0040). Upon removal from the hybridization stations, slides were washed for 1 min in Post-Wash buffer (CS#0040) and a further minute in isopropanol, followed by centrifugation at 1000 rpm for 6 min. Dried slides were scanned in a Scanarray 5000 scanner (GSI Lumonics, Rugby, UK) fitted with Cy3 and Cy5 filters.
To assess the ability of the new array to discriminate between experimental treatments, hybridizations comparing samples with controls and controls with controls were performed. Control (vehicle treated) animals were compared with immunologically challenged animals (activated slides) and control animals were also compared with other control individuals (replicate slides). The same animal was also compared with itself (self/self). Each comparison was completed in duplicate and with a dye flip. Dye-swaps are carried out in order to deal with any residual dye-bias remaining after labelling. However, this is generally not a problem, due to the indirect labelling method employed. Data were extracted from the slides using Bluefuse software (BlueGnome, Cambridge, UK). Features with poor confidence information (confidence <0.30, flagged D and E) were eliminated from the analysis. M v A plots [where M = log2(Cy5/Cy3) and A = 1/2 * (log2(Cy5) + log2(Cy3))] of the data for each slide (data not shown) were sufficiently linear that only a simple global normalisation of the data was required. Data from slides of similar treatments were pooled and a boxplot produced for each comparison (Genstat v8.1, VSN International Ltd., Hemel Hempstead, Herts, UK).
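The M and A values and the confidence filter described above can be sketched as follows. The text does not state which global normalisation was applied, so subtracting the slide median of M is an assumption made here for illustration; the intensities and confidence scores are invented and are not data from the array.

```python
import math
from statistics import median
from typing import List, Sequence, Tuple

MIN_CONFIDENCE = 0.30  # features below this confidence are discarded (threshold from the text)

Feature = Tuple[float, float, float]  # (Cy5 intensity, Cy3 intensity, confidence)


def keep_confident(features: Sequence[Feature]) -> List[Tuple[float, float]]:
    """Drop features whose confidence is below the threshold."""
    return [(cy5, cy3) for cy5, cy3, conf in features if conf >= MIN_CONFIDENCE]


def ma_values(pairs: Sequence[Tuple[float, float]]) -> Tuple[List[float], List[float]]:
    """M = log2(Cy5/Cy3); A = 1/2 * (log2(Cy5) + log2(Cy3))."""
    m = [math.log2(cy5 / cy3) for cy5, cy3 in pairs]
    a = [0.5 * (math.log2(cy5) + math.log2(cy3)) for cy5, cy3 in pairs]
    return m, a


def global_normalise(m: Sequence[float]) -> List[float]:
    """Assumed global normalisation: centre the log ratios on their median."""
    centre = median(m)
    return [value - centre for value in m]


# Hypothetical feature data, for illustration only.
features = [
    (512.0, 256.0, 0.9),
    (1024.0, 512.0, 0.8),
    (2048.0, 2048.0, 0.5),
    (100.0, 900.0, 0.1),  # removed: confidence below 0.30
]
m, a = ma_values(keep_confident(features))
m_norm = global_normalise(m)
```

A single global shift of this kind is only appropriate when the M v A plot shows no intensity-dependent trend, which is the condition the authors report for these slides.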
Databases and sequence sources
Ensembl and Genscan predicted genes/peptide sequences for the chicken genome assembly (March 2004) were downloaded from the Ensembl database using Ensmart or the UCSC table browser. Chicken EST sequences were downloaded from the TIGR Gallus gallus gene index (GGGI) [release 10.0]. Chicken full-length cDNA sequences were downloaded from the UMIST www site (Sept 2004). Ensembl predicted peptide sequences for the human genome assembly (May 2004) were downloaded from the Ensembl database using Ensmart or the UCSC table browser.
Mapping array probes to chicken ESTs, cDNAs, genes and genome
Unique ESTs used to create the immune array were mapped to chicken cDNAs, ESTs, genes or the chicken genome assembly using NCBI Blastn (version 2.2.11). A match was accepted at > 95% sequence identity over 100 bp, and the top-scoring match to each EST was then taken to provide a unique sequence assignment. All repeats and low-complexity sequences were masked using RepeatMasker (version 3.1.0).
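The assignment rule above can be illustrated with a small Python sketch. It assumes the Blastn results have been parsed into records holding percent identity, alignment length and bit score, and it reads "over 100 bp" as an alignment length of at least 100 bp; that reading, the field names and the example hits are all assumptions for illustration.

```python
from typing import Dict, Iterable, NamedTuple


class BlastHit(NamedTuple):
    query: str           # array EST
    subject: str         # cDNA, EST, gene or genome sequence
    pct_identity: float
    align_len: int       # alignment length in bp
    bit_score: float


def best_assignments(hits: Iterable[BlastHit],
                     min_identity: float = 95.0,
                     min_length: int = 100) -> Dict[str, BlastHit]:
    """Keep hits with > min_identity over >= min_length bp; return the top-scoring hit per EST."""
    best: Dict[str, BlastHit] = {}
    for hit in hits:
        if hit.pct_identity <= min_identity or hit.align_len < min_length:
            continue
        current = best.get(hit.query)
        if current is None or hit.bit_score > current.bit_score:
            best[hit.query] = hit
    return best


# Hypothetical hits, for illustration only.
hits = [
    BlastHit("est_1", "cdna_A", 99.2, 450, 830.0),
    BlastHit("est_1", "cdna_B", 97.0, 300, 520.0),
    BlastHit("est_2", "cdna_C", 93.5, 600, 700.0),  # identity too low
    BlastHit("est_3", "cdna_D", 100.0, 60, 110.0),  # alignment too short
]
assignments = best_assignments(hits)
```

Taking a single top-scoring hit per EST gives each probe one unambiguous database record, which is what allows annotation to be transferred automatically in the next step.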
Definition of Gene Ontology terms and Gene Descriptions for array probes
Gene Ontology (GO) annotations were all based on database hits in sequence similarity searches using Blastn. GO annotations were automatically transferred from these database records to the array probe entries. GO annotations were available for GGGI and UMIST EST/cDNA sequences. For chicken Ensembl or Genscan gene predictions, GO annotations were based on orthologous human peptide sequences. Orthologues were defined using two cycles of Blastp between human and chicken proteins, with an E-value cut-off of less than 10^-4 and with the subject and query databases swapped between runs. By comparing E-values, mutually best protein pairs were selected as orthologues. When E-values were equal, bit score and sequence coverage were used as tiebreakers to select the top hit. For each array probe, the associated GO terms and a unique gene description were transferred from the orthologous database record. Finally, a Perl script was used to create a non-redundant set of probe-to-GO records.
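The mutual-best-hit definition of orthologues can be sketched as follows. The authors used Perl scripts; this Python version is only an illustration of the logic, assuming each Blastp run has been parsed into (query, subject, E-value, bit score, coverage) tuples. The protein identifiers are hypothetical.

```python
from typing import Dict, Iterable, Set, Tuple

BlastpHit = Tuple[str, str, float, float, float]  # (query, subject, E-value, bit score, coverage)


def best_hits(hits: Iterable[BlastpHit], max_evalue: float = 1e-4) -> Dict[str, str]:
    """Best subject per query: lowest E-value, then highest bit score, then highest coverage."""
    best: Dict[str, Tuple[Tuple[float, float, float], str]] = {}
    for query, subject, evalue, bits, coverage in hits:
        if evalue >= max_evalue:  # cut-off: E-value must be less than 10^-4
            continue
        key = (evalue, -bits, -coverage)
        if query not in best or key < best[query][0]:
            best[query] = (key, subject)
    return {query: subject for query, (_key, subject) in best.items()}


def reciprocal_best_hits(human_vs_chicken: Iterable[BlastpHit],
                         chicken_vs_human: Iterable[BlastpHit]) -> Set[Tuple[str, str]]:
    """Return (human, chicken) pairs that are each other's best hit."""
    forward = best_hits(human_vs_chicken)
    backward = best_hits(chicken_vs_human)
    return {(h, c) for h, c in forward.items() if backward.get(c) == h}


# Hypothetical Blastp results, for illustration only.
human_vs_chicken = [
    ("hsA", "ggA", 1e-50, 200.0, 0.9),
    ("hsA", "ggB", 1e-50, 180.0, 0.9),  # E-value tie, lower bit score
    ("hsB", "ggB", 1e-20, 90.0, 0.8),
    ("hsC", "ggA", 1e-3, 30.0, 0.2),    # fails the E-value cut-off
]
chicken_vs_human = [
    ("ggA", "hsA", 1e-48, 198.0, 0.9),
    ("ggB", "hsA", 1e-30, 150.0, 0.7),
    ("ggB", "hsB", 1e-22, 92.0, 0.8),
]
orthologues = reciprocal_best_hits(human_vs_chicken, chicken_vs_human)
```

In this example hsB's best chicken hit is ggB, but ggB's best human hit is hsA, so the pair is not reciprocal and is not reported as orthologous.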
Frequency of GO and GO-Slim terms
GO terms (version 3.2.16) were downloaded from the Gene Ontology www site. More general GO terms were assigned using GoaSlim_map (June 2005) available from the GOA www site at EBI. The GO-Slim terms allowed us to estimate e.g. the frequency of array probes associated with the biological process Metabolism (GO:0008152).
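The frequency estimate mentioned above can be illustrated with a short Python sketch. The probe names and the GO-to-slim mapping below are hypothetical stand-ins for the GoaSlim_map file, and using all probes (annotated or not) as the denominator is an assumption.

```python
from collections import Counter
from typing import Dict, Iterable, Set


def slim_frequencies(probe_to_go: Dict[str, Iterable[str]],
                     go_to_slim: Dict[str, Iterable[str]]) -> Dict[str, float]:
    """Fraction of probes associated with each GO-Slim term (each probe counted once per term)."""
    counts: Counter = Counter()
    for go_terms in probe_to_go.values():
        slims: Set[str] = set()
        for term in go_terms:
            slims.update(go_to_slim.get(term, ()))
        counts.update(slims)
    total = len(probe_to_go)
    return {slim: count / total for slim, count in counts.items()}


# Hypothetical annotation and mapping, for illustration only.
probe_to_go = {
    "probe_1": ["GO:0006096"],
    "probe_2": ["GO:0006955"],
    "probe_3": ["GO:0006096", "GO:0006955"],
    "probe_4": [],  # no GO annotation
}
go_to_slim = {
    "GO:0006096": ["GO:0008152"],  # assumed to map to Metabolism
    "GO:0006955": ["GO:0050896"],  # assumed to map to response to stimulus
}
frequencies = slim_frequencies(probe_to_go, go_to_slim)
```

Running the same function on a larger reference collection would give the comparison frequencies needed to see which GO-Slim classes are over-represented on the array.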
Perl scripts (version 5.8.5) and SQL were used throughout to manipulate and filter data sets.
The authors would like to thank Incyte Genomics (Palo Alto, CA) for construction of the normalized cDNA libraries and The Wellcome Trust Sanger Institute (Hinxton, UK) for sequencing 10,000 cDNA clones from the libraries. Thanks also to Frazer Murray of ARK-Genomics (Roslin) for invaluable technical assistance and to Theo Jansen (Intervet International B.V., Boxmeer, The Netherlands) for the preparation of the vaccine formulations. This project was funded by Intervet International B.V., Boxmeer, The Netherlands, the Biotechnology and Biological Sciences Research Council (BBSRC) and partly by a BSIK VIRGO consortium grant, the Netherlands (grant nr. 03012).
- Schmid M, Nanda I, Guttenbach M, Steinlein C, Hoehn M, Schartl M, Haaf T, Weigend S, Fries R, Buerstedde JM, Wimmers K, Burt DW, Smith J, A'Hara S, Law A, Griffin DK, Bumstead N, Kaufman J, Thomson PA, Burke T, Groenen MA, Crooijmans RP, Vignal A, Fillon V, Morisson M, Pitel F, Tixier-Boichard M, Ladjali-Mohammedi K, Hillel J, Maki-Tanila A, Cheng HH, Delany ME, Burnside J, Mizuno S: First report on chicken genes and chromosomes 2000. Cytogenet Cell Genet. 2000, 90: 169-218. 10.1159/000056772.
- Aerts J, Crooijmans R, Cornelissen S, Hemmatian K, Veenendaal T, Jaadar A, van der Poel J, Fillon V, Vignal A, Groenen M: Integration of chicken genomic resources to enable whole-genome sequencing. Cytogenet Genome Res. 2003, 1024: 297-303. 10.1159/000075766.
- Ren C, Lee MK, Yan B, Ding K, Cox B, Romanov MN, Price JA, Dodgson JB, Zhang HB: A BAC-based physical map of the chicken genome. Genome Res. 2003, 13: 2754-2758. 10.1101/gr.1499303.
- Morisson M, Lemiere A, Bosc S, Galan M, Plisson-Petit F, Pinton P, Delcros C, Feve K, Pitel F, Fillon V, Yerle M, Vignal A: ChickRH6: a chicken whole-genome radiation hybrid panel. Genet Sel Evol. 2002, 34: 521-533. 10.1051/gse:2002021.
- Boardman PE, Sanz-Ezquerro J, Overton IM, Burt DW, Bosch E, Fong WT, Tickle C, Brown WR, Wilson SA, Hubbard SJ: A comprehensive collection of chicken cDNAs. Curr Biol. 2002, 12: 1965-1969. 10.1016/S0960-9822(02)01296-4.
- International Chicken Genome Sequencing Consortium (ICGSC): Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature. 2004, 432: 695-716. 10.1038/nature03154.
- van Hemert S, Ebbelaar BH, Smits MA, Rebel JM: Generation of EST and microarray resources for functional genomic studies on chicken intestinal health. Anim Biotechnol. 2003, 13: 133-143. 10.1081/ABIO-120026483.
- Bliss TW, Dohms JE, Emara MG, Keeler CL: Gene expression profiling of avian macrophage activation. Vet Immunol Immunopath. 2005, 105: 289-299. 10.1016/j.vetimm.2005.02.013.
- Neiman PE, Ruddell A, Jasoni C, Loring G, Thomas SJ, Brandvold KA, Lee RM, Burnside J, Delrow J: Analysis of gene expression during myc oncogene-induced lymphomagenesis in the bursa of Fabricius. Proc Natl Acad Sci (USA). 2001, 98: 6378-6383. 10.1073/pnas.111144898.
- Afrakhte M, Schultheiss TM: Construction and analysis of a subtracted library and microarray of cDNAs expressed specifically in chicken heart progenitor cells. Dev Dynam. 2004, 230: 290-298. 10.1002/dvdy.20059.
- ARKGenomics. [http://www.ark-genomics.org]
- Burnside J, Neiman P, Tang J, Bascom R, Aronszajn M, Talbot R, Burt DW, Delrow J: Development of a cDNA array for chicken gene expression analysis. BMC Genomics. 2005, 6: 13. 10.1186/1471-2164-6-13.
- Smith J, Speed D, Law AS, Glass EJ, Burt DW: In silico identification of chicken immune-related genes. Immunogenetics. 2004, 56: 122-133. 10.1007/s00251-004-0669-y.
- Gene ontology www site. [http://www.geneontology.org/]
- Blast at NCBI. [http://www.ncbi.nlm.nih.gov/BLAST/]
- Iseli C, Jongeneel CV, Bucher P: ESTScan: a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences. Proc Int Conf Intell Syst Mol Biol. 1999, 138-148.
- ESTscan. [http://www.ch.embnet.org/software/ESTScan.html]
- Pfam. [http://www.sanger.ac.uk/Software/Pfam/]
- BBSRC chicken EST database. [http://chick.umist.ac.uk/]
- Ensembl genome databases. [http://www.ensembl.org/]
- Genscan. [http://genes.mit.edu/GENSCAN.html]
- Ensembl 2005 chicken genebuild. [ftp://ftp.ensembl.org/pub/chicken-32.1h/data/fasta/dna/]
- GO slims. [http://www.geneontology.org/GO.slims.shtml]
- Brazma A, Parkinson H, Sarkans U, Shojatalab M, Vilo J, Abeygunawardena N, Holloway E, Kapushesky M, Kemmeren P, Lara GG, Oezcimen A, Rocca-Serra P, Sansone S: ArrayExpress – a public repository for microarray gene expression data at the EBI. Nucleic Acids Res. 2003, 31: 68-71. 10.1093/nar/gkg091.
- ArrayExpress. [http://www.ebi.ac.uk/arrayexpress]
- Gene ontology annotation at EBI. [http://www.ebi.ac.uk/GOA/]
- Soares MB, Bonaldo MF, Jelene P, Su L, Lawton L, Efstratiadis A: Construction and characterization of a normalized cDNA library. Proc Natl Acad Sci U S A. 1994, 91: 9228-9232.
- Bonaldo MF, Lennon G, Soares MB: Normalization and subtraction: two approaches to facilitate gene discovery. Genome Res. 1996, 6: 791-
- Ewing B, Hillier L, Wendl MC, Green P: Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 1998, 8: 175-185.
- RepeatMasker. [http://www.repeatmasker.org/]
- Crossmatch. [http://www.genome.washington.edu/UWGC/analysistools/Swat.cfm]
- Claverie JM, States D: Information enhancement methods for large scale sequence analysis. Computers Chem. 1993, 17: 191-201. 10.1016/0097-8485(93)85010-A.
- Blast at NCBI. [http://www.ncbi.nlm.nih.gov/BLAST/]
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410. 10.1006/jmbi.1990.9999.
- Pertea G, Huang X, Liang F, Antonescu V, Sultana R, Karamycheva S, Lee Y, White J, Cheung F, Parvizi B, Tsai J, Quackenbush J: TIGR gene indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics. 2003, 19: 651-652. 10.1093/bioinformatics/btg034.
- University of Delaware EST collection. [http://www.chickest.udel.edu/]
- GenomicSolutions. [http://www.genomicsolutions.com/showPage.php?title=GeneMachines%20HybStation]
- UCSC genome bioinformatics site. [http://genome.ucsc.edu/]
- TIGR gene indices. [http://www.tigr.org/tdb/tgi/]
- Quackenbush J, Cho J, Lee D, Liang F, Holt I, Karamycheva S, Parvizi B, Pertea G, Sultana R, White J: The TIGR Gene Indices: analysis of gene transcript sequences in highly sampled eukaryotic species. Nucleic Acids Res. 2001, 29: 159-64. 10.1093/nar/29.1.159.
- Gene Ontology Consortium: Gene Ontology: tool for the unification of biology. Nat Genet. 2000, 25: 25-29. 10.1038/75556.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.