- Open Access
Development of a database system for mapping insertional mutations onto the mouse genome with large-scale experimental data
BMC Genomics volume 10, Article number: S7 (2009)
Insertional mutagenesis is an effective method for functional genomic studies in various organisms. It can rapidly generate easily tractable mutations. A large-scale insertional mutagenesis with the piggyBac (PB) transposon is currently performed in mice at the Institute of Developmental Biology and Molecular Medicine (IDM), Fudan University in Shanghai, China. This project is carried out via collaborations among multiple groups overseeing interconnected experimental steps and generates a large volume of experimental data continuously. Therefore, the project calls for an efficient database system for recording, management, statistical analysis, and information exchange.
This paper presents a database application called MP-PBmice (insertional mutation mapping system of PB Mutagenesis Information Center), which is developed to serve the on-going large-scale PB insertional mutagenesis project. A lightweight enterprise-level development framework Struts-Spring-Hibernate is used here to ensure constructive and flexible support to the application. The MP-PBmice database system has three major features: strict access-control, efficient workflow control, and good expandability. It supports the collaboration among different groups that enter data and exchange information on daily basis, and is capable of providing real time progress reports for the whole project. MP-PBmice can be easily adapted for other large-scale insertional mutation mapping projects and the source code of this software is freely available at http://www.idmshanghai.cn/PBmice.
MP-PBmice is a web-based application for large-scale insertional mutation mapping onto the mouse genome, implemented with the widely used framework Struts-Spring-Hibernate. This system is already in use by the on-going genome-wide PB insertional mutation mapping project at IDM, Fudan University.
Insertional mutations have been playing an important role in biological studies ever since the early development of genetic engineering in the late 1960s (reviewed extensively in ). It was first used by bacterial genetics . Later on, its usage was expanded to other organisms by modification of the bacterial system or the discovery of other appropriate natural insertional agents. It has already been successfully applied to the study of gene functions of a number of model organisms such as yeast [2–5], fly [6–10], worm [11–13], plant , fish [15–18], and mouse [19–28], and also organisms of economic and/or medical importance such as rice , silkworm , and mosquito . Insertional mutagenesis has proven itself to be one of the most efficient means for large-scale functional characterization of a number of genomes, including yeast , fly [9, 33], fish , plant , and mammal [21, 22, 28, 35–37]. When an appropriate insertion vector is used, a large number of mutations can be produced at low cost and with a high speed, and expression patterns of the inserted endogenous genes are readily revealed. For example, new insertional mutations could be produced from mice carrying transposable elements and are capable of transposition in the germ cells. Progeny harboring individual transposition events can be subsequently mapped and maintained in the absence of the transposase.
More and more large-scale insertional mutagenesis projects are underway to decode genomes of interest. These projects are producing large quantities of experimental data. Simple spreadsheets on personal computers would not meet the demand generated by the current volume of experimental data. Large-scale experimental projects are not handled by isolated individuals/groups, but are instead managed with the same flow production commonly used by car manufacturers. In this method, the whole process is divided into multiple interconnected steps with each step under the charge of a specific research group. This kind of management generates two urgent needs: 1) recording data and exchanging information on a daily basis among the collaborating groups, and 2) extracting statistical information in a timely manner for researchers to grasp the current progress of the project. In other words, an efficient way to record and process data is required. A well-designed and implemented management database is the right choice to meet these needs.
Recently, two databases have been published to facilitate lab management in large-scale mutagenesis projects: the ENU-mutagenesis system for mice  and the PACLIMS system used for a rice fungus genome insertional mutagenesis project . Both have been used in their own projects respectively. However, the ENU mutagenesis system does not involve an insertional mutation mapping procedure needed in our project, while the PACLIMS system is only applicable for handling large-scale insertional mutagenesis project carried out in 24- or 96-well plates for cell culture system and microorganisms. In addition, neither satisfies the need of a large-scale project carried out with the flow-control production system.
Currently, a genome-wide piggyBac (PB) mutagenesis mapping project managed with the flow control system is underway in the Institute of Developmental Biology and Molecular Medicine (IDM) at Fudan University in Shanghai, China. PB transposon, originally found in the cabbage looper moth Trichoplusia ni, is a DNA transposon shown to be an important genetic manipulation tool in multiple organisms including mice [22, 40–43]. We have established the PBmice system (PB Mutagenesis Information Center) for archiving, retrieving, and analyzing the resulting data from this project , but the published version did not handle either the raw data or experimental flow. This paper presents the MP-PBmice system (insertional mutations mapping system of PBmice), which is developed to meet the demands mentioned above for the efficient management of large-scale projects performed in the flow control fashion: efficient data recording, processing, and exchange. The MP-PBmice system can also be easily adapted to other insertional mutation mapping projects using the flow production mode with large-scale experimental data.
Inverse PCR (iPCR)  was chosen in the large-scale PB insertional mutagenesis project. The mapping process of this project runs in the efficient flow-production mode (Figure 1). The whole process is divided into five steps with each step overseen by a designated research group.
Mouse tissue and iPCR data
The first step is sampling. In the PB project, mice tissues served as the samples from which the mice genomic DNA was generated. Mice are taken care of by the animal facility group. This group collects the tips of the tails of mice potentially carrying new PB insertions (Inserts) in the genome. Then the iPCR group performs iPCR experiments using these tail tips to obtain amplified fragments carrying the mouse genomic fragment flanking the PB insertion site. These amplified fragments are sequenced subsequently. In this step, the following data are gathered: sample information, experiment status information (fail or successful), DNA electrophoresis gel photos, and the information of the DNA fragment to be sequenced.
DNA sequence data
The sequencing of the PB flanking genomic DNA is performed by the sequencing group. For each DNA fragment sequenced, three files are generated to describe the contents and quality of the sequence. These results are passed onto the bioinformatics group to determine the location of the Inserts on the chromosomes.
Insertional mutation localization data
The Insert-flanking genomic sequences are obtained from the reliable sequences and submitted to Ensembl and/or UCSC genome browser and/or NCBI for mapping those insertions onto the mouse genome. One example is shown in Figure 1. The locations of the Inserts on the genome and the genomic features of those locations are obtained. For each Insert, a 4 kb sequence with the insertion site in the middle is sent to the Genotyping (GT) group for experimental GT verification.
The GT group designs GT primers using the above 4 kb sequence as template with the MacVector Software (MacVector Inc.). Suitable primers predicted by the software are submitted to UCSC genome browser to determine its uniqueness in the whole mouse genome. To ensure that there is no accidental sample switch, the animal facility group will collect a piece of the ear for GT verification. The ear tissue comes from the same mouse whose tail tissue was sent for iPCR in the beginning. If the GT verification of one insertion generates a positive result, the mapping process comes to an end for this specific Insert. If the experiment fails, new primers will be designed for a new round of GT verification. Primer information, experiment status information (fail or successful), and DNA electrophoresis gel photos are the three types of data gathered in this step.
Design and implementation of the MP-PBmice system
The previously published PBmice collect and disseminate the large quantities of the genetic and phenotypic information from this on-going large-scale PB insertional mutagenesis project . It provides a user interface to query and display data related of the published PB insertions in the mouse genome and their characterizations. However, in this published version, neither the raw data nor the experimental flow is touched. In the process of the production of large numbers of insertional mutants and the characterization of their phenotype, large amounts of raw data are produced and recorded daily. There is an urgent need of a mapping system to serve the mutant mapping procedure to ensure detailed and correct data recording and information exchange on a daily basis. We designed and developed the current MP-PBmice system to meet these needs.
Experimental data gathered from every step of the project, including future updates, need to be stored in the database. We employ the relational data schema to model these data. Four kinds of tables (one example is shown in Figure 2) are designed in the MP-PBmice to ensure correct daily data recording and exchange.
1. Access-control tables. The function of accessing control in our system is realized through access-control tables including a user table, a role table, a group table, a permission table, and a user-role relation table. The user, role, and group table are used to store the information of every registered user, every defined role, and every designated group respectively. In the permission table we keep the permission status of each data table, such as which table can be read and which table can be updated by a specific role of a specific group. Each user can be allocated multiple roles to carry out corresponding tasks. The user-role relation table archives this relationship between each user and the role(s) allocated to the user.
2. Experimental stage tables. Data created from one step of the whole procedure are often stored in a single table. Data from an earlier stage can be easily accessed in a later stage through foreign keys. For example, if several versions of experimental data of one specific insertion in the same stage are created, foreign keys are able to distinguish the newest version from the old version(s).
3. Constant tables. Certain categories of data are not unique to either each mouse or each Insert generated. For example, the coat color of a mouse falls into a few types: black, white, and others. The experimenters only need to fill in the data via a simple pull-down menu to make the right choice. Constant tables are applied to store these values instead of fixing these options into codes. This designing strategy allows for easy potential updates by updating the data in corresponding constant tables rather than the need to rebuild the whole system.
4. Status table. In the whole experimental flow, mice and Inserts are the key threads. All steps of the experiments are carried out around them and the current status of an Insert is critical to the workflow. The Status table is designed to record the current experimental stage and the status of each Insert in this stage. This strategy smoothens the experimental flow and creates a more efficient query of the stages and statuses.
The Model-View-Controller pattern
The three-layer architectural pattern Model-View-Controller (MVC) http://heim.ifi.uio.no/~trygver/1979/mvc-2/1979-12-MVC.pdf is the choice for the system design in MP-PBmice. The MVC pattern is widely used in software engineering and often used by applications that deal with multiple views of the same data. The basic idea of this mainstream design pattern is to isolate the business logic from user interface considerations. This results in an application that modifies the visual appearance or the underlying business rules in such a way that they will not affect each other (see Figure 3).
In MVC, the Model layer stores the data or information on which the application operates, provides application-specific methods to manage the data, and operates all of the business process of an application. It receives and responds to the requests from the Controller layer and the View layer.
The View layer is a visual representation of the information contained in the Model layer through listening for and reflecting the state updates in the Model layer. A view may change when the information in the Model layer changes, so there can be more than one view for a model.
The Controller layer helps the communication between the Model layer and the View layer by listening to the user inputs and translating these inputs into the data changes in the Model layer. It also instructs the View layer to make changes in the user interface when the Model layer updates.
The Struts-Spring-Hibernate framework
We designed and developed the MP-PBmice system with the lightweight enterprise-level development framework Struts-Spring-Hibernate to implement the MVC pattern [46, 47]. This development framework is widely used in commerce and would meet the demands from the large volume of experimental data.
In our implementation, the Model layer is also called the Data Access Object (DAO) layer, which is formed by the database models generated by Middlegen http://boss.bekk.no/boss/middlegen/. These database models are composed of Java objects describing the models, operations, and model relationships of the database. DAO performs queries on the database models to obtain the information using Hibernate Query Language supported by Hibernate (v3.0.5) . This framework controls mapping of every schema (table) in the database of MP-PBmice to the underlying relational database MySQL (v5.0.22). Every operation on an object will be automatically persisted into the database. As a transparent back-end layer, it is easy to execute data queries and feature extractions.
Based on the above architecture, the application was implemented to retrieve, store and operate Mice, Inserts, data-input, and user-management related information as a set of database models defined by Java objects.
Results and discussion
The MP-PBmice has been developed and is already used for the mapping of the new insertional mutants produced in the large-scale PB project.
This MP-PBmice system allows users to log in their different roles to carry out corresponding tasks. Here we give one example about how this system works. The complete user menu is available with the source code.
The main entry page is the login page (Figure 4a). A pull-down menu provides the choice of the roles for the specific user. After the Experimenter logs into the system, he/she can fill in his/her own experimental data on the form-filling page (Figure 4b). Once the Experimenter has submitted the data, the Group_leader will allocate a different user as the Checker for this Experimenter to confirm the data (Figure 5). IDM internal users are able to perform quick searches to obtain the detailed information of the experimental data related to his/her work easily (Figure 6a). The search result will be listed and the detailed information will be retrieved through the "viewDetail" linkage (Figure 6b). In the detailed information page (Figure 6c), there is a comment box for remarks and opinions and an email message will be sent to the corresponding PI to report any mistake for correction.
Strict access control
The MP-PBmice is destined to be a web-based application to meet the needs of concurrent operation of the database by multiple users. An entire strict access control system is introduced into this system to ensure data accuracy. This access control system is divided into three units: 1) user registration, 2) user information management, and 3) user permission management. Each user has only one entry in the user information unit, while each user can be allocated multiple roles in the user permission unit after registration (Figure 7). Different roles in different groups are allocated different permission to perform designated tasks. In each group, the Experimenters perform the allocated experiments, fill in their own data, and submit the data to the database. For each step of the experiment, the Checkers validate the data and are allowed to make modifications only once. If anybody finds an error, she or he may report the mistake through the mistake tracking system. Only the person with the PI role is able to allocate the wrong experiment record to a specific experimenter to correct the data and/or repeat the experiment. The Director role decides to publish the experimental records to the world. All the internet-published data can be accessed without registration. Each operation and the time the operation was performed are recorded into the database. Every error record is also kept in the database as an old version, rather than being replaced. All of these records can be queried by specific roles to show the rate at which error occurred. The strict division of the work and step-by-step checking may reduce the number of mistakes in experimental data submitted, ensure timely data exchange, and improve the efficiency of the whole process.
This control will be developed further to meet more complicated requirements of each role and will generate new roles needed throughout the progress of the project.
MP-PBmice is developed to have a strict control of the experimental procedure. In practice, each step of the experiment carried out would result in various situations, and the mapping process won't be smooth for every new Insert from the start to the end. Figuring out the complicated branches of the process is the most important part in making the whole project methodically organized. The detailed experimental process and situations that may happen along the way are recorded, discussed, and checked by Directors and PIs of the project. The purpose of this discrete consideration is to work out an approach that avoids the loss of important data or resources as a result of inattentive operation, as well as minimizing the cost of resources and time. A workflow is designed to deal with all kinds of situations that would happen in the experimental process (Figure 8). In each step, an Insert with a specific status defined by the previous experimental result is automatically passed to the corresponding experimental pathway, which notifies the downstream user of his/her coming workloads. Automatic workflow control streamlines the experiments and benefits the process of large quantities of batch analysis. This workflow control also allows a timely report generated by the system for users to visualize the progress of each sample once it is in the procedure. This offers project leaders to analyze the quality and quantity of the experiments so as to improve the protocol in time. With the development of experimental research process, we will improve the system to better support researchers to finish tasks intelligently and efficiently. For example, the system will be able to give suggestions or assessments in certain key functions to researchers through a data analysis system.
The MP-PBmice system is developed to record all kinds of experimental data generated in a large-scale project of mapping insertional mutations onto the mouse genome. Initially used for mapping onto the mouse genome, it can easily be used for mapping onto any genome, because the only difference will be in the steps of sampling and blasting. For any other genome, the sample will be the corresponding tissue of the organism of interest, and the genome to be blasted is the genome of interest. Either step would generate the same data types as the experimental procedure discussed in this paper. Only the title of sample, location of the Insert (chromosome No. and position) of the table columns in the database need to be changed for other projects. If the thermal asymmetric interlaced PCR (Tail-PCR)  or the ligation-mediated PCR  is used for mapping the mutants instead of iPCR, only the titles of the table used to record iPCR results need to be changed to the one in use. In both cases mentioned above, the changes are minimal. The MP-PBmice system is easily modified to archive the information generated from other labs for mapping insertional mutations onto a particular genome of interest.
The large-scale PB insertional mutagenesis project asks for efficient management of the experimental process and data accumulated. The flow production control is more efficient than the mere collaboration among isolated groups, with each group covering the whole process of new mutant production. A database targeting the lab management specifically for large-scale insertional mutagenesis project with flow production control is the key to meeting the demand. In collaboration among three research groups from Fudan University, we constructed the MP-PBmice system to allow researchers to submit data and pass on tasks daily and electronically. The MP-PBmice system was designed and developed with the widely used MPV pattern and lightweight enterprise-level development framework Struts-Spring-Hibernate to ensure the applicability. This system incorporates a strict access-control system and a tight workflow control to ensure the data accuracy and timely information exchange, which are key to the efficiency and success of the project. This system has been in use for five months, and is easily expandable for other large-scale insertional mutation mapping projects.
Availability and requirements
The software MP-PBmice is freely available at http://www.idmshanghai.cn/PBmice. For other information, please contact the corresponding author.
Other papers from the meeting have been published as part of BMC Bioinformatics Volume 10 Supplement 15, 2009: Eighth International Conference on Bioinformatics (InCoB2009): Bioinformatics, available online at http://www.biomedcentral.com/1471-2105/10?issue=S15.
Kleckner N, Roth J, Botstein D: Genetic engineering in vivo using translocatable drug-resistance elements. New methods in bacterial genetics. J Mol Biol. 1977, 116 (1): 125-159. 10.1016/0022-2836(77)90123-1.
Seifert HS, Chen EY, So M, Heffron F: Shuttle mutagenesis: a method of transposon mutagenesis for Saccharomyces cerevisiae. Proc Natl Acad Sci USA. 1986, 83 (3): 735-739. 10.1073/pnas.83.3.735.
Snyder M, Elledge S, Davis RW: Rapid mapping of antigenic coding regions and constructing insertion mutations in yeast genes by mini-Tn10 "transplason" mutagenesis. Proc Natl Acad Sci USA. 1986, 83 (3): 730-734. 10.1073/pnas.83.3.730.
Garfinkel DJ, Mastrangelo MF, Sanders NJ, Shafer BK, Strathern JN: Transposon tagging using Ty elements in yeast. Genetics. 1988, 120 (1): 95-108.
Roeder GS, Fink GR: DNA rearrangements associated with a transposable element in yeast. Cell. 1980, 21 (1): 239-249. 10.1016/0092-8674(80)90131-2.
Spradling AC, Rubin GM: Drosophila genome organization: conserved and dynamic aspects. Annu Rev Genet. 1981, 15: 219-264. 10.1146/annurev.ge.15.120181.001251.
Rubin GM, Spradling AC: Genetic transformation of Drosophila with transposable element vectors. Science. 1982, 218 (4570): 348-353. 10.1126/science.6289436.
Spradling AC, Rubin GM: Transposition of cloned P elements into Drosophila germ line chromosomes. Science. 1982, 218 (4570): 341-347. 10.1126/science.6289435.
Cooley L, Kelley R, Spradling A: Insertional mutagenesis of the Drosophila genome with single P elements. Science. 1988, 239 (4844): 1121-1128. 10.1126/science.2830671.
Bingham PM, Levis R, Rubin GM: Cloning of DNA sequences from the white locus of D. melanogaster by a novel and general method. Cell. 1981, 25 (3): 693-704. 10.1016/0092-8674(81)90176-8.
Plasterk RH: Reverse genetics: from gene sequence to mutant worm. Methods Cell Biol. 1995, 48: 59-80. full_text.
Bessereau JL, Wright A, Williams DC, Schuske K, Davis MW, Jorgensen EM: Mobilization of a Drosophila transposon in the Caenorhabditis elegans germ line. Nature. 2001, 413 (6851): 70-74. 10.1038/35092567.
Gally C, Eimer S, Richmond JE, Bessereau JL: A transmembrane protein required for acetylcholine receptor clustering in Caenorhabditis elegans. Nature. 2004, 431 (7008): 578-582. 10.1038/nature02893.
Koncz C, Nemeth K, Redei GP, Schell J: T-DNA insertional mutagenesis in Arabidopsis. Plant Mol Biol. 1992, 20 (5): 963-976. 10.1007/BF00027166.
Amsterdam A, Becker TS: Transgenes as screening tools to probe and manipulate the zebrafish genome. Dev Dyn. 2005, 234 (2): 255-268. 10.1002/dvdy.20541.
Kawakami K: Transposon tools and methods in zebrafish. Dev Dyn. 2005, 234 (2): 244-254. 10.1002/dvdy.20516.
Amsterdam A, Burgess S, Golling G, Chen W, Sun Z, Townsend K, Farrington S, Haldi M, Hopkins N: A large-scale insertional mutagenesis screen in zebrafish. Genes Dev. 1999, 13 (20): 2713-2724. 10.1101/gad.13.20.2713.
Amsterdam A, Yoon C, Allende M, Becker T, Kawakami K, Burgess S, Gaiano N, Hopkins N: Retrovirus-mediated insertional mutagenesis in zebrafish and identification of a molecular marker for embryonic germ cells. Cold Spring Harb Symp Quant Biol. 1997, 62: 437-450.
Jaenisch R, Harbers K, Schnieke A, Lohler J, Chumakov I, Jahner D, Grotkopp D, Hoffmann E: Germline integration of moloney murine leukemia virus at the Mov13 locus leads to recessive lethal mutation and early embryonic death. Cell. 1983, 32 (1): 209-216. 10.1016/0092-8674(83)90511-1.
Schnieke A, Harbers K, Jaenisch R: Embryonic lethal mutation in mice induced by retrovirus insertion into the alpha 1(I) collagen gene. Nature. 1983, 304 (5924): 315-320. 10.1038/304315a0.
Ivics Z, Kaufman CD, Zayed H, Miskey C, Walisko O, Izsvak Z: The Sleeping Beauty transposable element: evolution, regulation and genetic applications. Curr Issues Mol Biol. 2004, 6 (1): 43-55.
Ding S, Wu X, Li G, Han M, Zhuang Y, Xu T: Efficient transposition of the piggyBac (PB) transposon in mammalian cells and mice. Cell. 2005, 122 (3): 473-483. 10.1016/j.cell.2005.07.013.
Ostertag EM, DeBerardinis RJ, Goodier JL, Zhang Y, Yang N, Gerton GL, Kazazian HH: A mouse model of human L1 retrotransposition. Nat Genet. 2002, 32 (4): 655-660. 10.1038/ng1022.
Dupuy AJ, Fritz S, Largaespada DA: Transposition and gene disruption in the male germline of the mouse. Genesis. 2001, 30 (2): 82-88. 10.1002/gene.1037.
Fischer SE, Wienholds E, Plasterk RH: Regulated transposition of a fish transposon in the mouse germ line. Proc Natl Acad Sci USA. 2001, 98 (12): 6759-6764. 10.1073/pnas.121569298.
Luo G, Ivics Z, Izsvak Z, Bradley A: Chromosomal transposition of a Tc1/mariner-like element in mouse embryonic stem cells. Proc Natl Acad Sci USA. 1998, 95 (18): 10769-10773. 10.1073/pnas.95.18.10769.
Zagoraiou L, Drabek D, Alexaki S, Guy JA, Klinakis AG, Langeveld A, Skavdis G, Mamalaki C, Grosveld F, Savakis C: In vivo transposition of Minos, a Drosophila mobile element, in mammalian tissues. Proc Natl Acad Sci USA. 2001, 98 (20): 11474-11478. 10.1073/pnas.201392398.
Wu SC, Meir YJ, Coates CJ, Handler AM, Pelczar P, Moisyadi S, Kaminski JM: piggyBac is a flexible and highly active transposon as compared to sleeping beauty, Tol2, and Mos1 in mammalian cells. Proc Natl Acad Sci USA. 2006, 103 (41): 15008-15013. 10.1073/pnas.0606979103.
Hirochika H: Contribution of the Tos17 retrotransposon to rice functional genomics. Curr Opin Plant Biol. 2001, 4 (2): 118-122. 10.1016/S1369-5266(00)00146-1.
Uchino K, Sezutsu H, Imamura M, Kobayashi I, Tatematsu K, Iizuka T, Yonemura N, Mita K, Tamura T: Construction of a piggyBac-based enhancer trap system for the analysis of gene function in silkworm Bombyx mori. Insect Biochem Mol Biol. 2008, 38 (12): 1165-1173. 10.1016/j.ibmb.2008.09.009.
Adelman ZN, Jasinskiene N, James AA: Development and applications of transgenesis in the yellow fever mosquito, Aedes aegypti. Mol Biochem Parasitol. 2002, 121 (1): 1-10. 10.1016/S0166-6851(02)00028-2.
Burns N, Grimwade B, Ross-Macdonald PB, Choi EY, Finberg K, Roeder GS, Snyder M: Large-scale analysis of gene expression, protein localization, and gene disruption in Saccharomyces cerevisiae. Genes Dev. 1994, 8 (9): 1087-1105. 10.1101/gad.8.9.1087.
Zhang P, Spradling AC: Insertional mutagenesis of Drosophila heterochromatin with single P elements. Proc Natl Acad Sci USA. 1994, 91 (9): 3539-3543. 10.1073/pnas.91.9.3539.
Parinov S, Sundaresan V: Functional genomics in Arabidopsis: large-scale insertional mutagenesis complements the genome sequencing project. Curr Opin Biotechnol. 2000, 11 (2): 157-161. 10.1016/S0958-1669(00)00075-6.
Schnutgen F, De-Zolt S, Van Sloun P, Hollatz M, Floss T, Hansen J, Altschmied J, Seisenberger C, Ghyselinck NB, Ruiz P, et al: Genomewide production of multipurpose alleles for the functional analysis of the mouse genome. Proc Natl Acad Sci USA. 2005, 102 (20): 7221-7226. 10.1073/pnas.0502273102.
Stanford WL, Cohn JB, Cordes SP: Gene-trap mutagenesis: past, present and beyond. Nat Rev Genet. 2001, 2 (10): 756-768. 10.1038/35093548.
Nguyen D, Xu T: The expanding role of mouse genetics for understanding human biology and disease. Dis Model Mech. 2008, 1 (1): 56-66. 10.1242/dmm.000232.
Masuya H, Nakai Y, Motegi H, Niinaya N, Kida Y, Kaneko Y, Aritake H, Suzuki N, Ishii J, Koorikawa K, et al: Development and implementation of a database system to manage a large-scale mouse ENU-mutagenesis program. Mamm Genome. 2004, 15 (5): 404-411. 10.1007/s00335-004-2265-8.
Donofrio N, Rajagopalon R, Brown D, Diener S, Windham D, Nolin S, Floyd A, Mitchell T, Galadima N, Tucker S, et al: 'PACLIMS': a component LIM system for high-throughput functional genomic analysis. BMC Bioinformatics. 2005, 6: 94-10.1186/1471-2105-6-94.
Fraser MJ, Cary L, Boonvisudhi K, Wang HG: Assay for movement of Lepidopteran transposon IFP2 in insect cells using a baculovirus genome as a target DNA. Virology. 1995, 211 (2): 397-407. 10.1006/viro.1995.1422.
Fraser MJ, Ciszczon T, Elick T, Bauser C: Precise excision of TTAA-specific lepidopteran transposons piggyBac (IFP2) and tagalong (TFP3) from the baculovirus genome in cell lines from two species of Lepidoptera. Insect Mol Biol. 1996, 5 (2): 141-151. 10.1111/j.1365-2583.1996.tb00048.x.
Cary LC, Goebel M, Corsaro BG, Wang HG, Rosen E, Fraser MJ: Transposon mutagenesis of baculoviruses: analysis of Trichoplusia ni transposon IFP2 insertions within the FP-locus of nuclear polyhedrosis viruses. Virology. 1989, 172 (1): 156-169. 10.1016/0042-6822(89)90117-7.
Wu S, Ying G, Wu Q, Capecchi MR: Toward simpler and faster genome-wide mutagenesis in mice. Nat Genet. 2007, 39 (7): 922-930. 10.1038/ng2060.
Sun LV, Jin K, Liu Y, Yang W, Xie X, Ye L, Wang L, Zhu L, Ding S, Su Y: PBmice: an integrated database system of piggyBac (PB) insertional mutations and their characterizations in mice. Nucleic Acids Res. 2008, D729-734. 36 Database
Ochman H, Gerber AS, Hartl DL: Genetic applications of an inverse polymerase chain reaction. Genetics. 1988, 120 (3): 621-623.
Bauer C, King G: Java Persistence with Hibernate. 2006, Greenwich, CT: Manning Publications
Walls C, Breidenbach R: Spring in Action. 2007, Greenwich, CT.: Manning Publications, 2
Cavaness C: Programming Jakarta Struts. 2004, Sebastopol, CA.: O'Reilly Media Inc, 2
Liu YG, Whittier RF: Thermal asymmetric interlaced PCR: automatable amplification and sequencing of insert end fragments from P1 and YAC clones for chromosome walking. Genomics. 1995, 25 (3): 674-681. 10.1016/0888-7543(95)80010-J.
Johnson DH: Molecular cloning of DNA from specific chromosomal regions by microdissection and sequence-independent amplification of DNA. Genomics. 1990, 6 (2): 243-251. 10.1016/0888-7543(90)90563-A.
This research was supported by grants from Chinese Key Projects for Basic Research (973) (Grant No. 2006CB806700, and 2002CB512801), Hi-tech Research and Development Project (863) (Grant No. 2007AA022102), Program for New Century Excellent Talents in University, Shanghai Leading Academic Discipline Project under Grant Nos. B114 and B111, and AMD University Cooperation Project. We thank Dr. Yun Zheng and the software development group of IDM for technical assistance and constructive reading of the manuscript, research groups involved in the PB project for system testing, Drs. Tian Xu, Min Han, Yuan Zhuang, and Xiaohui Wu for help in needs-analysis and management. We thank Mr. Ian Knisely from University of Toronto for reading the revised version of the manuscript.
This article has been published as part of BMC Genomics Volume 10 Supplement 3, 2009: Eighth International Conference on Bioinformatics (InCoB2009): Computational Biology. The full contents of the supplement are available online at http://www.biomedcentral.com/1471-2164/10?issue=S3.
The authors declare that they have no competing interests.
WY, JK, and XX designed the architecture of the system. XX, DL, JY and LW designed and implemented the database and all functionalities of the whole system. WY organized the implementation and edited the user guide. YZ, NG, and LS directed the project. YZ conceived the structure of this paper. LS wrote the paper in collaboration with all of the other authors. All authors read and approved the final manuscript.
Wenwei Yang, Ke Jin, Xing Xie contributed equally to this work.
About this article
Cite this article
Yang, W., Jin, K., Xie, X. et al. Development of a database system for mapping insertional mutations onto the mouse genome with large-scale experimental data. BMC Genomics 10, S7 (2009). https://doi.org/10.1186/1471-2164-10-S3-S7
- Mouse Genome
- Insertional Mutation
- Insertional Mutagenesis
- UCSC Genome Browser
- View Layer