Microbes, metagenomes and marine mammals: enabling the next generation of scientist to enter the genomic era
© Edwards et al.; licensee BioMed Central Ltd. 2013
Received: 7 January 2013
Accepted: 28 August 2013
Published: 4 September 2013
The revolution in DNA sequencing technology continues unabated, and is affecting all aspects of the biological and medical sciences. The training and recruitment of the next generation of researchers who are able to use and exploit the new technology is severely lacking and potentially negatively influencing research and development efforts to advance genome biology. Here we present a cross-disciplinary course that provides undergraduate students with practical experience in running a next generation sequencing instrument through to the analysis and annotation of the generated DNA sequences.
Many labs across the world are installing next generation sequencing technology, and we show that undergraduate students produced quality sequence data and were excited to participate in cutting-edge research. The students conducted the workflow from DNA extraction and library preparation, through running the sequencing instrument, to the extraction and analysis of the data. They sequenced microbes, metagenomes, and a marine mammal, the Californian sea lion, Zalophus californianus. The students met sequencing quality controls, had no detectable contamination in the targeted DNA sequences, provided publication quality data, and became part of an international collaboration to investigate carcinomas in carnivores.
Students learned important skills for their future education and career opportunities, and a perceived increase in students’ ability to conduct independent scientific research was measured. DNA sequencing is rapidly expanding in the life sciences. Teaching undergraduates to use the latest technology to sequence genomic DNA ensures they are ready to meet the challenges of the genomic era and allows them to participate in annotating the tree of life.
Keywords: Undergraduate education; DNA sequencing; Sea lion; Metagenome
The sequencing of the human genome in 2001 marked the beginning of the genomic era [1, 2], and since then sequencing technology has undergone major improvements and cost reductions [3, 4]. The “next generation of sequencers” enables the sequencing of an ever increasing range of genomes quickly, cheaply and with a high degree of accuracy. Bold sequencing projects, such as the 1,000 bacterial genomes and the 10,000 vertebrate genomes projects, are revolutionizing life science research and medicine. In medicine, the community is preparing for personal, whole human genomes to become a part of routine care, while the sequencing of human gene panels is increasing in the meantime. Even the effects of the human microbial community on human health have been described by DNA sequencing [5–7]. In the environmental sciences, microbes have been identified that are associated with different ecological processes, and the functional profile of microbial communities can be compared across environments [8, 9]. In the pharmaceutical industry, sequencing is used in all aspects of research and development. Graduates competent in next generation sequencing technologies are needed to support each of these research endeavors, as highlighted in the National Research Council discussion of metagenomics, the clinical pathologists’ call to action, and Nature’s discussion of the requisites in genome jobs [10–12].
While the potential application for genomics is extensive, accelerating our scientific discoveries and simultaneously revolutionizing human lives, the training of the next generation of researchers is lagging. Genomic courses at undergraduate level have been taught at a small number of institutions; however, the opportunity for students to gain hands-on experience of preparing samples and operating the sequencers is rare. A key aspect of a young scientist’s development is to learn good experimental design practices, which is best achieved by providing experiences across the entire project workflow. In many courses, DNA sequences are obtained from projects available on the web or third party resources, and the students annotate new genes but do not take part in any of the sequencing process. Other courses enable the students to extract the DNA, which is sent to a genome center for technicians to sequence [15, 16], and the students annotate the new genomes. While annotation has been shown to engage students in analytical thinking, and can allow significant numbers of students to participate in the scientific process [14, 17, 18], there could be pedagogical and practical value in providing students with opportunities to participate in the whole process, including the sequencing per se. Here we test a new way to engage students, having them work directly with next-generation instrumentation to conduct the DNA sequencing process from the beginning and then annotate the novel genome they sequenced. We invite the scientific community to consider what might be accomplished by the distributed community of undergraduate scientists using this approach.
The most effective way to teach science is to have students participate in the scientific process. Molecular biology has proven adaptable to educational settings. Cloning projects have allowed students to become technically proficient and learn other important skills of science, such as critical thinking, troubleshooting and adapting protocols, to become independent researchers. The development of the “phage hunter” course, where students isolate new phages, obtain sequence data, explore the genomic data, and get to name their phage, has been highly successful in training students in scientific discovery and providing new data to science [19, 21]. We have built on this excitement of discovery and developed a course that allows undergraduate students to extract, sequence, and analyze novel genomes to become part of sequencing and annotating the tree of life.
The first series of courses in ecological genomics was taught in 2010 at San Diego State University. In the Ecological Metagenomics course, 21 students sequenced novel DNA from microbes, metagenomes and marine mammals. The students were provided with interdisciplinary training in genomics, experience in research, and generated data that is being used by an international consortium to investigate the genomic signature of cancers in the California sea lions. As a template for others to generate next generation DNA sequencing courses, here we describe the ecological metagenomics course, results of student affective surveys, learning outcomes, data quality, and initial findings of the first marine mammal genome sequenced and annotated by undergraduate students.
Ecological metagenomics courses
A practical course in DNA sequencing and annotating novel genomes from start to finish with a next-generation sequencer was offered to upper division undergraduates and graduate students as a lecture and laboratory course and was open to students across biology and computer sciences. The syllabus is provided in Additional file 1: Table S1. The goals of the course were to: 1) introduce and use a next generation sequencer and analyse the data, 2) engage the students in research projects sequencing novel genomes, and 3) understand the importance of genomics to areas of biology and ecology. The students were novices in the genomics field as measured by an introductory quiz of the students’ knowledge (Additional file 1: Table S2). None of the students knew when the human genome was sequenced, how much it cost, or how long it took to complete. The students had not been introduced to genomics in earlier classes and had not considered genomics as a research or career area.
The course focused on sequencing the California sea lion (Zalophus californianus) and the microbial communities from the kelp forest where the sea lion hunts. The different organisms piqued each student’s interest, and the range of genome sizes provided different characteristics that were useful in practical conduct of the course. For example, the large genomes provided a control sample that could be sequenced multiple times at the same titration level throughout the course, facilitating the students’ introduction to the sequencer, and providing the instructor a tool to evaluate each student’s progress. The microbial samples allowed the students to culture organisms and extract DNA for sequencing. Because each microbial genome library was different, the students were required to calculate the titration level for each library. By calculating the titration levels (i.e. the amount of DNA per capture bead that is required for a successful sequencing run), the students understood the effect of varying DNA quantity on the sequencing quality.
Several metrics were used to determine the success of the ecological metagenomics course, including quality of sequence data, course evaluation, student self-confidence, and student learning. The quality of the data produced by the students was assessed using the quality control targets set by the manufacturer, the number and average length of sequences, and contamination levels. A course evaluation was administered at the end to assess whether the students perceived that the course had met its educational goals. Self-confidence and learning surveys were administered at the beginning and end of the course. Student learning was measured by changes in responses to 10 open-ended questions (Additional file 1: Table S2). Changes in students’ self-confidence in their ability to sequence DNA and to conduct scientific research were measured using scales adapted for this particular course [22, 23]. Both scales were found to be reliable (Cronbach’s α = 0.96, 0.92) and changes were measured using a matched-pairs t-test.
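As an illustration of the reliability statistic reported above, the following is a minimal Python sketch of how Cronbach’s α can be computed from a table of survey responses. It is not the analysis script used for the course; the function name `cronbach_alpha` and the example scores are hypothetical, and the layout (one row per student, one column per survey item) is an assumption.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of rows (one row per student, one score per item)."""
    n_items = len(responses[0])
    if n_items < 2:
        raise ValueError("at least two survey items are required")
    # Transpose so that each tuple holds all students' scores for one item.
    items = list(zip(*responses))
    item_variance_sum = sum(variance(scores) for scores in items)
    # Variance of each student's total score across all items.
    total_variance = variance([sum(row) for row in responses])
    if total_variance == 0:
        raise ValueError("total scores do not vary; alpha is undefined")
    return (n_items / (n_items - 1)) * (1 - item_variance_sum / total_variance)


# Hypothetical example: three students answering three perfectly consistent items.
example_scores = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
alpha = cronbach_alpha(example_scores)
```

Values of α close to 1 indicate that the items on a scale vary together, which is how the reported values of 0.96 and 0.92 are usually interpreted.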
The California sea lion genome case study
The California sea lion, Zalophus californianus, is a coastal sea lion that ranges from the west coast of southern Alaska to the Baja peninsula in Mexico. The California sea lion population is growing and, while the species is iconic for California students, the animals are often in conflict with humans because they exploit prized fisheries such as swordfish and salmon. The California sea lion is most closely related to the Galapagos sea lion and the extinct Japanese sea lion. Sequencing mammal genomes provides information on evolution and identifies genes that are responsible for specific traits, in this case the return of a land mammal to a semi-aquatic habitat. Many mammalian diseases have a genetic component, and identifying lineage-specific genomic changes may shed light on defects in related organisms. Understanding structural and functional features that influence genome size and evolution may be important in ecological and population studies designed to address issues relating to coastal conservation. The California sea lion is the first marine mammal genome, the first from the suborder Pinnipedia, and the fourth carnivore to be sequenced. The assembled DNA totals 1,951,532,210 bp, with 13,352,265 bp in contigs > 10 kb and 972,007 bp in contigs > 15 kb. The N50 sizes are 2,127 bp for all contigs, 11,249 bp for contigs > 10 kb, and 16,472 bp for contigs > 15 kb, suggesting high quality sequencing. The sea lion data are available from http://www.sealiongenome.org upon request and will be released from NCBI after publication.
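For readers unfamiliar with the N50 statistic quoted above, the sketch below shows one common way to compute it in Python: contigs are sorted from longest to shortest, and the N50 is the length of the contig at which the running total first reaches half of the assembled bases. The function name `n50`, the `min_length` parameter and the example contig lengths are illustrative assumptions; the assembler’s own report may define its subsets differently.

```python
def n50(contig_lengths, min_length=0):
    """Return the N50 of the contigs longer than min_length (in bp)."""
    lengths = sorted((n for n in contig_lengths if n > min_length), reverse=True)
    if not lengths:
        raise ValueError("no contigs pass the length filter")
    total = sum(lengths)
    running = 0
    for length in lengths:
        running += length
        # Stop at the first contig that takes the running total to half the assembly.
        if 2 * running >= total:
            return length


# Hypothetical contig lengths, not taken from the sea lion assembly.
example_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
all_contigs_n50 = n50(example_lengths)
long_contigs_n50 = n50(example_lengths, min_length=2)
```

Restricting the calculation to longer contigs, as in the > 10 kb and > 15 kb subsets reported above, necessarily raises the N50, so the three values are best read side by side rather than compared directly.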
Human contamination is a potential problem with novice users. Human contamination is difficult to discern in eukaryotic DNA, because there is an overwhelming bias of human DNA sequences in the databases. Therefore, the amount of contamination in the metagenomic samples was calculated using BLAST. A metagenome is a random sample taken from a microbial community and contains short sequences that are generated from different microbial taxa and different genes. The metagenomes are microbial and therefore should contain very few sequences similar to human DNA (some sequences show similarity to human genes because of evolutionary history and the bias towards human sequences in the databases, and some metagenomes may contain eukaryotic sequences). As shown in Additional file 1: Table S5, almost no human DNA matches were found, suggesting the students were not sequencing themselves. Another test of the quality of the sequences generated by the students is to see whether the proportions of sequences in the metagenomes that show similarity to microbes or functional genes are similar to those described for metagenomes in the literature. In the microbial communities sequenced by the students, 37–76% of the sequences showed similarity to various Bacteria and Archaea and 24–53% of the sequences were similar to known functional genes, comparable to an externally sequenced metagenome (Additional file 1: Table S5). The proportion of sequences similar to known organisms in the student-sequenced metagenomes is similar to those described for other marine samples in the literature [9, 32, 33], further suggesting that the students were generating usable sequence data.
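The contamination check described above can be sketched as follows. This is a hedged illustration rather than the procedure used in the course: it assumes the metagenome reads were compared against human sequences with BLAST and the results written in the standard 12-column tabular format (query ID in column 1, e-value in column 11). The function names, the e-value cut-off of 1e-5 and the example lines are assumptions made for the example.

```python
def reads_with_hits(blast_lines, max_evalue=1e-5):
    """Return the set of query read IDs with at least one hit at or below max_evalue."""
    hits = set()
    for line in blast_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment headers
        fields = line.split("\t")
        if len(fields) < 12:
            continue  # not a standard 12-column tabular record
        if float(fields[10]) <= max_evalue:
            hits.add(fields[0])
    return hits


def contamination_fraction(blast_lines, total_reads, max_evalue=1e-5):
    """Fraction of all reads in the sample that match the human database."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return len(reads_with_hits(blast_lines, max_evalue)) / total_reads


# Hypothetical tabular records: read1 has two strong hits, read2 only a weak one.
example_lines = [
    "read1\tchr1\t98.0\t200\t4\t0\t1\t200\t500\t699\t1e-90\t350",
    "read1\tchr7\t95.0\t180\t9\t0\t1\t180\t100\t279\t1e-70\t290",
    "read2\tchr2\t70.0\t50\t15\t0\t1\t50\t10\t59\t0.5\t30",
]
fraction = contamination_fraction(example_lines, total_reads=4)
```

Counting each read once, however many hits it has, keeps the fraction comparable between samples with different read numbers.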
The students’ final project was a formal report in which they described the characteristics of the genomes and specific metabolic pathways, or suggested how the features of the genome contribute to the activity of the organism. The students investigated viral, bacterial, archaeal and eukaryotic genomes using 660 billion bp of sequence data; some of their project titles, the amount of sequence examined and brief findings are shown in Additional file 1: Table S6. Several of the ecological student reports are in the final stages of manuscript preparation for submission to peer reviewed journals, and sequences generated by the class have contributed to two publications [34, 35] and two new genome descriptions [36, 37].
Comments provided by the students describing their thoughts about the ecological metagenomics course
This next generation sequencing experience has been educational beyond any class I have taken. It would be a mistake if this class were terminated. It would be a mistake if Dr Dinsdale was not given props for organizing this class. She needs to teach it again – it was fun!
I thought this class was very interesting and needs better advertising. I got into the course by accident but the quality of the course deserves more student interest. Being an ecology guy, I would have liked to have more background on how these microbial communities can affect the larger environment.
This was a very exciting course that introduced what I think is the next big thing in science. Being able to sequence essentially on demand is going to enhance a lot of research. It was a lot of stuff to take in.
This was one of my favourite courses I’ve ever taken at SDSU. I feel like I’ve learned so much and this class has spawned my interest in genomics. I am really excited about the new technology this class offers and I would highly recommend this course to others. I am really glad I had the opportunity to take this course.
As a student, I feel that doing the labs I was given enough independence to feel I was doing the work on my own and with my lab mates. This is a very important part of a lab course and I believe it should be preserved.
I liked the course and learned a lot. I feel confident about the Next Generation Sequencing, but would suggest more time reading and understanding the flow grams and analysing the data.
This course has been very useful to me with every aspect of sequencing touched and explained.
Overall, it was a good course especially to me, who didn’t have any lab-experience. It taught me the lab side of the sequencing, the process and the chemistry involved.
I loved the class. I think designating 10/15 min at the end of the lecture to talk about what’s going on in lab for the week would have been useful, that way we can come to lab feeling more prepared.
This was a great course to take at SDSU and I am grateful for the opportunity to be one of the small numbers of students to take this course. All labs were hands on and very educational.
Excellent course! This course has opened doors for me in the industry! I have got 2 calls and had an interview by saying that I’ve taken the course.
In a time when education and research are suffering budgetary constraints, introducing a sequencing-based course into undergraduate training was high risk, but has returned high rewards. Publishable quality data have been generated and the students were provided with state of the art training. New technology engages students, and the genomics course merges the new technologies of metagenomics and next generation sequencing. While the sequencing technology is changing rapidly, having conducted the process on one instrument the students will be able to understand new developments, and the gains that the students made in terms of thinking like a scientist will last them a lifetime. The course inspired students to follow a genomic career path, and several are employed in related industries or have continued their education in the genomic arena. These career pathways are not only highly relevant in today’s society, but are ones the students had not considered prior to taking the course. The students have gained knowledge and skills that are not offered in traditional lecture- and laboratory-based courses, which follow a cook book approach. Instead, these students are engaged in real research and are generating data that are useful to researchers across the world. Data will be released through SEED, MG-RAST and NCBI upon publication.
The course is cross-disciplinary, bringing together biology and computer science students. In genomics, bioinformatics and analysis of the resultant data have now become the bottleneck of most sequencing projects. Part of the problem arises from a lack of training in both biology and computer science. This course had the two groups of students working side by side: the computer scientists learned biology, the biologists learned some of the computational constraints, and both groups of students learned research techniques. Enabling collaboration between these students at an early stage will help the progress of bioinformatics in the future.
The lab is costly, and time and instructor intensive, but the rewards are large as it provides students with research experience in a technology of the future and acts as a recruitment tool for the life sciences. There are several problems inherent in teaching students DNA sequencing: 1) the potential for human contamination, 2) contamination of the environmental DNA with linkers or other cross-contamination of samples, 3) damage to the equipment by inexperienced researchers, and 4) establishing metrics to enable the assessment of good laboratory practices. Using basic laboratory sterile techniques successfully limited contamination issues. Setting the course up on a rotational basis, with students working in different rooms and leaving after each step, stopped cross-contamination with linkers. The students were closely guided when operating the sequencer and recognized the opportunity; they therefore respected the equipment, and no damage to the sequencer occurred during the course. Because the protocols were divided into blocks timed to fit the lab sessions, the whole sequencing process took longer than would be recommended by the manufacturer, but the time lag did not lead to a noticeable reduction in yield. The time lag did make it difficult to respond to any sequencing issues, such as over- or under-enrichment. Keeping track of each part of the process being conducted by each group of students was initially difficult, and therefore a new online database that could be accessed by students and instructors was developed and is available to other researchers on request. As with any course, pedagogical goals were reinforced by repeatedly covering material and using assessments to reinforce learning outcomes.
DNA sequencing is one of the fastest growing fields in the life sciences; however, students have problems relating to the concepts because of the complexity, the amounts of data, and the cross-disciplinary and microscopic nature of the process. By providing students with the opportunity to use a sequencer and sequence novel organisms, some of the mystery of sequencing was removed and the students were motivated to explore the complex data. Students attended capstone courses, became part of many research projects, including an international consortium, and were provided training to enter the genomic era. Many students have continued their scientific careers in either the academic or industry side of the business, suggesting the power of DNA sequencing to recruit much needed talent to the life sciences and extend the capacity and use of DNA sequencing. The best summary of the course comes from the students: “This course is a ‘must have’ in the resume of any molecular biologists, graduate or undergraduate. The technology we were able to use and the research projects we have been part of constitute an unbelievable asset that without any doubt will be very useful in our professional futures”.
To conduct a sequencing run on a 454 FLX Titanium sequencer takes a single person approximately 3 days, but for the course the procedure needed to be divided into 2 hour 40 minute modules and conducted by 20+ students. Therefore, to organize the course to make the best use of the equipment and meet the learning objectives, some modules of the class were taught to all students at once and other parts of the course were taught to groups of students on a rotational basis (Figure 1). Written consent was obtained from the students to display their photographs. The rotation allowed the students to work in small groups and obtain practice in all areas of the process. The sequencer could be run on a weekly basis, ensuring sequences were available for students to analyse. The 454 protocol was broken up into seven modules (Figure 1): 1) library preparation, 2) emPCR, 3) breaking the emulsion, 4) loading the beads into the picotitre plate, 5) running the sequencer, 6) enrichment and 7) analysing the data. Module 6, enrichment, is out of order because it is too lengthy to put into the class schedule more than once; therefore, for most of the semester a teaching assistant did this step in the rotation. Three lab sections were devoted to analysis of the data and the students had a further two weeks to finish their reports (Additional file 1: Table S1). Some extra classes that could be included to round out student education are collecting microbes, extracting DNA and quantifying the DNA. The extra classes provide students with practice at several of the techniques prior to working with the 454 sequencing chemicals. For the whole-class modules, the students worked in pairs; in the rotation modules, the students worked in groups of four, and each module was often subdivided such that the students worked on half of the protocol and then brought their products together at the end to complete the section.
For example, in the emPCR module, one pair of students would organize the DNA capture and the second pair would prepare the emulsion oil; at the end, the DNA would be combined into the oil and both pairs would pipette the oil into the PCR plates. Each module was conducted several times during the semester to enable each student to conduct each part of the process. The 454 was always run with the picotitre plate divided into four lanes, thereby giving the students more practice at the various steps. Each step had important targets that the students had to meet as part of their grades. For example, in the library preparation the amount of DNA in the library and the length of the fragments were measured using a bioanalyzer (Agilent 2100), and these needed to meet the manufacturer’s requirements (> 7.3 × 10^8 molecules of DNA, with the peak of the DNA sample migrating to between 500 and 1,250 bp). A 2-hour lecture section was held in conjunction with the practical course and provided theoretical background for understanding the sequencing technology and analysing the data. The lectures were divided into four sections that described: 1) next generation sequencing, 2) metagenomics, 3) eukaryotic genomes and 4) bacterial and archaeal genomes. The lectures relied on journal articles that were presented by both the professor and students (Additional file 1: Table S1). The presentation format increased student participation and provided examples of how to analyse the sequence data, which students would need to use in their final report.
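The library target above can be checked with a short calculation. The Python sketch below converts a measured DNA mass and fragment length into an approximate number of molecules and compares it with the two thresholds quoted in the text. It assumes an average mass of about 650 g/mol per base pair of double-stranded DNA, a widely used approximation that may differ slightly from the constant in the manufacturer’s own worksheet; the function names and the example measurements are hypothetical.

```python
AVOGADRO = 6.02214076e23          # molecules per mole
GRAMS_PER_MOL_PER_BP = 650.0      # assumed average for double-stranded DNA


def library_molecules(mass_ng, fragment_bp):
    """Approximate number of DNA molecules in mass_ng of fragments of fragment_bp."""
    if mass_ng < 0 or fragment_bp <= 0:
        raise ValueError("mass must be non-negative and fragment length positive")
    grams = mass_ng * 1e-9
    moles = grams / (fragment_bp * GRAMS_PER_MOL_PER_BP)
    return moles * AVOGADRO


def meets_library_targets(mass_ng, peak_bp, min_molecules=7.3e8, peak_range=(500, 1250)):
    """True if the library exceeds the molecule target and the peak is in range."""
    low, high = peak_range
    enough_dna = library_molecules(mass_ng, peak_bp) > min_molecules
    return enough_dna and low <= peak_bp <= high


# Hypothetical bioanalyzer reading: 1 ng of library with a 650 bp peak.
example_count = library_molecules(1.0, 650)
example_ok = meets_library_targets(1.0, 650)
```

Using the peak fragment length as a stand-in for the mean length is a simplification; a broad or skewed size distribution would shift the estimate.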
Sequencing the California sea lion
DNA from a male sea lion was provided by Y. Schramm and G. Heckel. DNA was cleaned using a high template PCR cleaning kit (Roche), and 70 μg of DNA was obtained. The students sequenced the sea lion genome in the courses held in Spring 2010–2012. Sequences with homology to the mitochondria were identified by comparison to a local version of the mitochondrial sequence database (http://megasun.bch.umontreal.ca/ogmp/projects/other/mtcomp.html) and separated prior to assembly. The mitochondrial sequences and the remaining (chromosomal) sequences were assembled independently using Newbler version 2.6 (454/Roche Life Sciences, Branford, CT). Sequences related to the mitochondrial genome were identified by BLASTN at the NCBI website, and similar sequences were downloaded and aligned using ClustalX. A distance matrix was computed from the alignment using PHYLIP and visualized using FigTree (http://tree.bio.ed.ac.uk/software/figtree/). Mitochondrial genome alignments were also compared using Mauve. Interspersed repeats and low complexity sequences were identified using RepeatMasker v. 2.3.8. This program also provided GC skew information. These programs were run by the students.
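GC skew, mentioned above, is commonly defined as (G − C)/(G + C) within a window of sequence. The Python sketch below computes it over non-overlapping windows so that the quantity can be inspected independently of any particular program. It is an illustration only and does not reproduce RepeatMasker’s output; the function name `gc_skew`, the window size and the example sequence are assumptions.

```python
def gc_skew(sequence, window=1000):
    """Return (window_start, skew) pairs, where skew = (G - C) / (G + C)."""
    if window <= 0:
        raise ValueError("window must be a positive number of bases")
    seq = sequence.upper()
    results = []
    for start in range(0, len(seq), window):
        chunk = seq[start:start + window]
        g = chunk.count("G")
        c = chunk.count("C")
        # Windows with no G or C are reported as 0.0 rather than dividing by zero.
        skew = (g - c) / (g + c) if (g + c) else 0.0
        results.append((start, skew))
    return results


# Hypothetical eight-base sequence split into two four-base windows.
example_skew = gc_skew("GGGGCCAA", window=4)
```

Positive values indicate an excess of G over C on the strand examined, and negative values the reverse; the final window may be shorter than the others when the sequence length is not a multiple of the window size.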
Preparation and analysis of microbial genomes
Marine microbes were obtained by the students by plating 100 μl of seawater on three different growth media: TCBS, MacConkey, and Marine Broth with 15 g/l agar added. All cultures were incubated at room temperature overnight and individual colonies were re-streaked until a single colony was obtained. Bacterial DNA was extracted from 1 ml of a liquid overnight marine broth culture inoculated with a single colony. The overnight culture was pelleted, re-suspended in 600 μl of nuclei lysis buffer and incubated at 80°C for 5 min. The samples were cooled to room temperature and 200 μl of protein precipitation solution was added. The mixture was vortexed at high speed for 20 s and incubated on ice for 5 min. After centrifugation (13,000-16,000 × g, 10 min), the supernatant containing the DNA was transferred into 600 μl of room temperature isopropanol. The tubes were gently inverted until thread-like strands of DNA were visible. The DNA was pelleted (13,000-16,000 × g, 5 min), washed with room temperature 70% ethanol, centrifuged again, air-dried overnight and then rehydrated in 100 μl of ultrapure water. Each microbial genome was sequenced on a quarter of a picotitre plate on the 454 FLX and assembled using Newbler version 2.6 (454/Roche Life Sciences, Branford, CT). The assembled genomes were uploaded to RAST and annotated using subsystem technology. The students examined each genome and obtained the required data for their reports.
Preparation and analysis of the metagenomes
Metagenomes were prepared by concentrating approximately 60 l of seawater using a tangential flow filter (TFF). A demonstration of the TFF was provided to the class, because the concentration process allows the students to see the microbes. Once concentrated, the microbes were collected by filtering through a 0.2 μm Sterivex filter. The DNA was extracted using phenol chloroform extractions [9, 32, 33]. Because of the long lag time in the metagenomic DNA preparation, this part was conducted by the class teaching assistant. The metagenomes were analyzed without assembly using MG-RAST [45, 46]. Sequence similarity was set at an e-value of 10^-5 and a percent identity of 60% [9, 32, 33]. The students examined and compared metagenomes within the MG-RAST platform for their reports.
Evaluation of student learning outcomes and ability to conduct STEM research
These surveys were adapted to meet the needs of our specific course and the revised versions were found to be reliable (Cronbach’s α = 0.96, 0.92, respectively). These measures were administered at the beginning and end of the course. The study was presented to the 21 students who took the 2011 ecological metagenomics course, and 19 participated in the survey. A paired-samples test was used to identify significant changes between the pre- and post-course surveys.
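A paired-samples comparison of this kind can be sketched in a few lines of Python. The example below computes the paired t statistic and its degrees of freedom from matched pre- and post-course scores using only the standard library; it is an illustration under stated assumptions, not the analysis run for the study. The function name `paired_t_statistic` and the scores are hypothetical, and obtaining a p-value would additionally require the t distribution (for example through a statistics package such as SciPy’s `scipy.stats.ttest_rel`).

```python
from math import sqrt
from statistics import mean, stdev


def paired_t_statistic(pre_scores, post_scores):
    """Return (t, degrees_of_freedom) for matched pre/post scores from the same students."""
    if len(pre_scores) != len(post_scores):
        raise ValueError("each student needs both a pre and a post score")
    n = len(pre_scores)
    if n < 2:
        raise ValueError("at least two matched pairs are required")
    # Work on the per-student change so that between-student differences cancel out.
    differences = [post - pre for pre, post in zip(pre_scores, post_scores)]
    spread = stdev(differences)
    if spread == 0:
        raise ValueError("all differences are identical; t is undefined")
    t = mean(differences) / (spread / sqrt(n))
    return t, n - 1


# Hypothetical self-confidence scores for four students before and after a course.
pre = [1, 2, 3, 4]
post = [2, 4, 5, 4]
t_value, dof = paired_t_statistic(pre, post)
```

Pairing each student’s two responses is what distinguishes this test from a comparison of two independent groups, and it is why only students who completed both surveys can be included.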
We acknowledge Roche 454 Life Sciences for providing the backing to conduct the course. The course and EAD were supported by NSF Transforming Undergraduate Education in Science grant 1044453 from the Division of Undergraduate Training. RAE is supported by NSF grants DBI: 0850356 from the Division of Biological Infrastructure and DEB: 1046413 from the Division of Environmental Biology. We acknowledge Robert Olson for help with using the Argonne National Laboratory’s supercomputer to assemble the California sea lion genome. We thank the students from San Diego State University for taking the course, sequencing the California sea lion, answering the surveys and showing enthusiasm for knowledge. We acknowledge Yolanda Schramm and Gisela Heckel from Universidad Autónoma de Baja California for providing the California sea lion DNA. The research was conducted under the San Diego State University Institutional Review Board application # 604061. Photographs were taken by Lorena Nava Ruggero.
- Collins FS, Green ED, Guttmacher AE, Guyer MS: A vision for the future of genomics research. Nature. 2003, 422: 835-847. 10.1038/nature01626.
- Schuster SC: Next-generation sequencing transforms today’s biology. Nat Methods. 2008, 5: 16-18.
- Metzker ML: Applications of next-generation sequencing technologies - the next generation. Nat Rev Genet. 2010, 11: 31-46. 10.1038/nrg2626.
- von Bubnoff A: Next-generation sequencing: the race is on. Cell. 2008, 132: 721-723. 10.1016/j.cell.2008.02.028.
- Goya R, Sun MG, Morin RD, Leung G, Ha G, Wiegand KC, Senz J, Crisan A, Marra MA, Hirst M: SNVMix: predicting single nucleotide variants from next-generation sequencing of tumors. Bioinformatics. 2010, 26: 730-736. 10.1093/bioinformatics/btq040.
- Walter MJ, Graubert TA, Dipersio JF, Mardis ER, Wilson RK, Ley TJ: Next-generation sequencing of cancer genomes: back to the future. Pers Med. 2009, 6: 653. 10.2217/pme.09.52.
- Alkan C, Kidd JM, Marques-Bonet T, Aksay G, Antonacci F, Hormozdiari F, Kitzman JO, Baker C, Malig M, Mutlu O: Personalized copy number and segmental duplication maps using next-generation sequencing. Nat Genet. 2009, 41: 1061-1067. 10.1038/ng.437.
- Uchiyama T, Miyazaki K: Functional metagenomics for enzyme discovery: challenges to efficient screening. Curr Opin Biotech. 2009, 20: 616-622. 10.1016/j.copbio.2009.09.010.
- Dinsdale EA, Edwards RA, Hall D, Angly F, Breitbart M, Brulc JM, Furlan M, Desnues C, Haynes M, Li L: Functional metagenomic profiling of nine biomes. Nature. 2008, 452: 629-632. 10.1038/nature06810.
- Chi KR: The sequencer: rapid technological developments have spurred big changes in the requisite genome-sequencing jobs. Nature. 2010, 456: 256-257.
- Haspel RL, Arnaout R, Briere L, Kantarci S, Marchand K, Tonellato P, Connolly J, Boguski MS, Saffitz JE: A call to action: training pathology residents in genomics and personalized medicine. Am J Clin Pathol. 2010, 133: 832-834. 10.1309/AJCPN6Q1QKCLYKXM.
- National Research Council: The new science of metagenomics: revealing the secrets of our microbial planet. 2007, Washington, DC: National Academies Press.
- Fambrough D: Review of Bio2010: transforming undergraduate education for future research biologists, by the National Research Council. Cell Biol Educ. 2003, 2: 92-93. 10.1187/cbe.03-03-0015.
- Lopatto D, Alvarez C, Barnard D, Chandrasekaran C, Chung HM, Du C, Eckdahl T, Goodman AL, Hauser C, Jones CJ: Undergraduate research. Genomics education partnership. Science. 2008, 322: 684-685. 10.1126/science.1165351.
- Temple L, Cresawn SG, Monroe JD: Genomics and bioinformatics in undergraduate curricula: contexts for hybrid laboratory/lecture courses for entering and advanced science students. Biochem Molecul Biol Educ. 2010, 38: 23-28. 10.1002/bmb.20359.
- Lau JM, Robinson DL: Effectiveness of a cloning and sequencing exercise on student learning with subsequent publication in the National Center for Biotechnology Information, GenBank. CBE Life Sci Educ. 2009, 8: 326-337. 10.1187/cbe.09-05-0036.
- Kerfeld CA, Simons RW: The undergraduate genomics research initiative. PLoS Biol. 2007, 5: e141. 10.1371/journal.pbio.0050141.
- Shaffer CD, Alvarez C, Bailey C, Barnard D, Bhalla S, Chandrasekaran C, Chandrasekaran V, Chung HM, Dorer DR, Du C: The genomics education partnership: successful integration of research into laboratory classes at a diverse group of undergraduate institutions. CBE Life Sci Educ. 2010, 9: 55-69. 10.1187/09-11-0087.
- Hanauer DI, Jacobs-Sera D, Pedulla ML, Cresawn SG, Hendrix RW, Hatfull GF: Teaching scientific inquiry. Science. 2006, 314: 1880-1881. 10.1126/science.1136796.
- Dymond JS, Scheifele LZ, Richardson S, Lee P, Chandrasegaran S, Bader JS, Boeke JD: Teaching synthetic biology, bioinformatics and engineering to undergraduates: the interdisciplinary build-a-genome course. Genet. 2009, 181: 13-21.
- Hatfull GF, Pedulla ML, Jacobs-Sera D, Cichon PM, Foley A, Ford ME, Gonda RM, Houtz JM, Hryckowian AJ, Kelchner VA: Exploring the mycobacteriophage metaproteome: Phage genomics as an educational platform. PLoS Genet. 2006, 2: 835-847.View ArticleGoogle Scholar
- Denofrio LA, Russell B, Lopatto D, Lu Y: Linking student interests to science curricula. Science. 2007, 318: 1872-1873. 10.1126/science.1150788.View ArticlePubMedGoogle Scholar
- Howard DR, Miskowski JA, Grunwald SK, Abler ML: Assessment of a bioinformatics across life science curricula initiative. Biochem Molecul Biol Educ. 2007, 35: 16-23. 10.1002/bmb.13.View ArticleGoogle Scholar
- Rice DW, Lawrence SL: Marine mammals of the world: systematics and distribution. Special publications of the society for marine mammals. The Society for Marine Mammalogy. 1998, Lawrence KS, USA: Alan Press, 1-891276-03-04, vol. 4Google Scholar
- Department of Commerce, NOAA, National Marine Fisheries Service: Impacts of california sea lions and pacific harbor seals on salmonids and west coast ecosystems. Report by dept. Commerce. 1999, U.S: Department of CommerceGoogle Scholar
- Wolf JB, Tautz D, Trillmich F: Galapagos and Californian sea lions are separate species: genetic analysis of the genus zalophus and its implications for conservation management. Frontiers Zool. 2007, 4: 20-10.1186/1742-9994-4-20.View ArticleGoogle Scholar
- Kirkness EF, Bafna V, Halpern AL, Levy S, Remington K, Rusch DB, Delcher AL, Pop M, Wang W, Fraser CM: The dog genome: survey sequencing and comparative analysis. Science. 2003, 301: 1898-1903. 10.1126/science.1086432.View ArticlePubMedGoogle Scholar
- Li RQ, Fan W, Tian G, Zhu HM, He L, Cai J, Huang QF, Cai QL, Li B, Bai YQ: The sequence and de novo assembly of the giant panda genome. Nature. 2010, 463: 311-317. 10.1038/nature08696.PubMed CentralView ArticlePubMedGoogle Scholar
- Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W: Initial sequencing and analysis of the human genome. Nature. 2001, 409: 860-921. 10.1038/35057062.View ArticlePubMedGoogle Scholar
- Batzer MA, Deininger PL: Alu repeats and human genomic diversity. Nat Rev Genet. 2002, 3: 370-379. 10.1038/nrg798.View ArticlePubMedGoogle Scholar
- Roy-Engel AM, Carroll ML, El-Sawy M, Salem AH, Garber RK, Nguyen SV, Deininger PL, Batzer MA: Non-traditional Alu evolution and primate genomic diversity. J Mol Biol. 2002, 316: 1033-1040. 10.1006/jmbi.2001.5380.View ArticlePubMedGoogle Scholar
- Dinsdale EA, Edwards RA, Bailey BA, Tuba I, Akhter S, McNair K, Schmieder R, Apkarian N, Creek M, Guan E: Multivariate analysis of functional metagenomes. Frontiers Genet Stat Analysis. 2013, 4: 41-10.3389/fgene.2013.00041.Google Scholar
- Dinsdale EA, Pantos O, Smriga S, Edwards RA, Angly F, Wegley L, Hatay M, Hall D, Brown E, Haynes M: Microbial ecology of four coral atolls in the Northern Line Islands. Plos One. 2008, 3: e1584-10.1371/journal.pone.0001584.PubMed CentralView ArticlePubMedGoogle Scholar
- Hanna LF, Matthews TD, Dinsdale EA, Hasty D, Edwards RA: Characterization of the ELPhiS Prophage from Salmonella enterica serovar Enteritidis Strain LK5. Appl Environ Microb. 2012, 78: 1785-1793. 10.1128/AEM.07241-11.View ArticleGoogle Scholar
- Bruce T, Meirelles P, Garcia G, Paranhos R, Rezende C, de Moura R, Francini Filho R, Tereza Vasconcelos A, Amado Filho G, Hatay M: Abrolhos reef Bank health evaluated by means of water quality, microbial diversity, benthic cover, and fish biomass data. PLoS One. 2012, 7: e36687-10.1371/journal.pone.0036687.PubMed CentralView ArticlePubMedGoogle Scholar
- Hagen M, McNair K, Carolina A, Adegbemle F, Agarwal N, Aguinaldo K, Andrews J, Arciniega J, Arnoult M, Avila V: Complete genome sequences of Vibrio alginolyticus spp. 2013, Stand Genomic Sci: Miyuksis, in reviewGoogle Scholar
- Carolino A, McNair K, Adegbemle F, Agarwal N, Aguinaldo K, Andrews J, Arciniega J, Arnoult M, Avila V, Badeanlou L: Draft genome sequence of Vibrio sp strain 624788. Stand Genomic Sci. 2013, in reviewGoogle Scholar
- Jurkowski A, Reid AH, Labov JB: Metagenomics: a call for bringing a new science into the classroom (while it’s still new). CBE Life Sci Educ. 2007, 6: 260-265. 10.1187/cbe.07-09-0075.PubMed CentralView ArticlePubMedGoogle Scholar
- Tibell LA, Rundgren CJ: Educational challenges of molecular life science: characteristics and implications for education and research. CBE Life Sci Educ. 2010, 9: 25-33. 10.1187/cbe.08-09-0055.PubMed CentralView ArticlePubMedGoogle Scholar
- Thompson JD, Higgins DG, Gibson TJ: Clustal-W - Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680. 10.1093/nar/22.22.4673.PubMed CentralView ArticlePubMedGoogle Scholar
- Felsenstein J: PHYLIP -- Phylogeny Inference Package (Version 3.2). Cladistics. 1989, 5: 164-166.Google Scholar
- Darling AC, Mau B, Blattner FR, Perna NT: Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004, 14: 1394-1403. 10.1101/gr.2289704.PubMed CentralView ArticlePubMedGoogle Scholar
- Tempel S: Using and understanding RepeatMasker. Methods Mol Biol. 2012, 859: 29-51. 10.1007/978-1-61779-603-6_2.View ArticlePubMedGoogle Scholar
- Aziz RK, Devoid S, Disz T, Edwards RA, Henry CS, Olsen GJ, Olson R, Overbeek R, Parrello B, Pusch GD: SEED servers: high-performance access to the SEED genomes, annotations, and metabolic models. Plos One. 2012, 7 (10): 10.1371/journal.pone.0048053.
- Meyer F, Paarmann D, D’Souza M, Olson R, Glass EM, Kubal M, Paczian T, Rodriguez A, Stevens R, Wilke A: The metagenomics RAST server - a public resource for the automatic phylogenetic and functional analysis of metagenomes. BMC Bioinformatics. 2008, 9: 386-10.1186/1471-2105-9-386.PubMed CentralView ArticlePubMedGoogle Scholar
- Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M: The RAST server: rapid annotations using subsystems technology. BMC Genomics. 2008, 9: 75-10.1186/1471-2164-9-75.PubMed CentralView ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.