- Research article
- Open Access
Novel motifs distinguish multiple homologues of Polycomb in vertebrates: expansion and diversification of the epigenetic toolkit
© Senthilkumar and Mishra; licensee BioMed Central Ltd. 2009
- Received: 26 August 2009
- Accepted: 20 November 2009
- Published: 20 November 2009
Polycomb group (PcG) proteins maintain expression pattern of genes set early during development. Although originally isolated as regulators of homeotic genes, PcG members play a key role in epigenetic mechanism that maintains the expression state of a large number of genes. Polycomb (PC) is conserved during evolution and while invertebrates have one PC gene, vertebrates have five or more homologues. It remains unclear if different vertebrate PC homologues have distinct or overlapping functions. We have identified and compared the sequence of PC homologues in various organisms to analyze similarities and differences that shaped the evolutionary history of this key regulatory protein.
All PC homologues have an N-terminal chromodomain and a C-terminal Polycomb Repressor box. We searched the protein and genome sequence database of various organisms for these signatures and identified ~100 PC homologues. Comparative analysis of these sequences led to the identification of a novel insect specific motif and several novel and signature motifs in the vertebrate homologue: two in CBX2 (Cx2.1 and Cx2.2), four in CBX4 (Cx4.1, Cx4.2, Cx4.3 and Cx4.4), three in CBX6 (Cx6.1, Cx6.2 and Cx6.3) and one in CBX8 (Cx8.1). Additionally, adjacent to the chromodomain, all the vertebrate homologues have a DNA binding motif - AT-Hook in case of CBX2, which was known earlier, and 'AT-Hook Like' motif, from this study, in other PC homologues.
Our analysis shows that PC is an ancient gene dating back to pre bilaterian origin that has not only been conserved but has also expanded during the evolution of complexity. Unique motifs acquired by each homologue have been maintained for more than 500 millions years indicating their functional relevance in boosting the epigenetic 'tool kit'. We report the presence of a DNA interaction motif adjacent to chromodomain in all vertebrate PC homologues and suggest a three-way 'PC-histoneH3-DNA' interaction that can restrict nucleosome dynamics. The signature motifs of PC homologues and insect specific motif identified in this study pave the way to understand the molecular basis of epigenetic mechanisms.
- Homeotic Gene
- Vertebrate Homologue
- trxG Protein
- Histone Tail Modification
Cell type specific expression pattern of genes is set during development. This complex process is accompanied by differential packaging of the genome in a cell and tissue specific manner that involves post-translational modifications, like methylation and acetylation of histones, and subsequent interaction of other regulatory proteins. The differential organization of chromatin, once set early during development, is maintained by Polycomb group (PcG) and trithorax group (trxG) proteins. Maintenance of chromatin structure, and thereby the expression state, is referred to as epigenetic cellular memory that provides continuity of specific pattern of expression states in daughter cells when a differentiated cell divides and also throughout the life span of organisms. PcG proteins maintain the repressed state, while trxG proteins maintain genes in the active state. Both PcG and trxG proteins function as multi-protein complexes. In Drosophila, PcG proteins form two major complexes. PC, Ph, Psc and dRing form Polycomb Repressive Complex1 (PRC1) , while PRC2 consists of Esc, E(z), Su(z)12 and P55 [2, 3]. It has been observed that PCL also interact with a subset of PRC2, making a highly active and distinct complex [4, 5]. A third complex consists of a DNA binding protein Pleiohomeotic (PHO) that interacts with PC and directs its binding to the specific sites of recruitment . Similarly, trxG proteins form specific complexes . The DNA sequences that function as sites for the recruitment of the PcG/trxG proteins are called the cellular memory elements or Polycomb response elements (PREs) . Often common elements function to recruit both PcG and trxG proteins. It is generally believed that a balance between the two opposing functions on PREs maintains the precise level of expression state of a particular genomic region. Expression state of the locus is interpreted and maintained by distinct set of PcG and trxG complexes that bind to PREs and establish a chromatin state marked by specific histone modifications [9–14].
PC, the core member of PRC1, was first identified in Drosophila. The PC mutation causes a dominant phenotype of extra sex comb and is an essential gene. Many members of this group show similar phenotype due to the defect in the expression pattern of homeotic genes . In Drosophila, the initial expression of homeotic genes is determined by segmentation genes  and subsequently this expression pattern is maintained by PcG and trxG proteins . Changes in the expression pattern leads to homeotic transformation and/or lethality . Insects have one PC gene, where as mammals have five homologues. Vertebrate homologues of PC contain Chr omatin o rganizer mo difier domain, chromodomain, and are referred to as c hromob ox, CBX, proteins. These include CBX2, CBX4, CBX6, CBX7 and CBX8. The other CBX proteins CBX1, CBX3 and CBX5 are the homologues of h eterochromatin p rotein (HP1). Here we refer PC proteins of vertebrates as CBX proteins.
Several lines of evidence suggest that homeotic genes are not the only targets of PcG genes [19, 20]. More recently, genome wide ChIP on Chip analysis of PcG proteins and its associated histone methylation marks in fly, human and mouse cells have identified large number of targets of these proteins [21–23]. PcG members are essential for maintenance and normal proliferation of cells and have been implicated in the maintenance of stem cells . Genome wide mapping of H3K27Me3 in various prostate cancer tissues shows the PcG mediated repression of several genes which are down regulated in cancer . The abnormal expression of PcG genes cause misregulation on its target loci and subsequently to abnormal proliferation of cells and cancer [24, 26]. CBX7 and CBX8 are involved in maintaining the repressive state of INK4A-ARF locus which is involved in the regulation of cellular proliferation and senescence [27, 28]. CBX7 knockdown increases the ARF and INK4A expression which causes impairment in cell growth . CBX4 is the repressor of C-MYC and mutation in its C-terminal region leads to enhanced expression of this proto oncogene and cellular transformation . Genome wide mapping of CBX8 target shows that this PcG protein is predominantly associated with genes that are involved in developmental and differentiation processes . CBX2 and CBX7 have also been implicated in maintenance of the inactive X-chromosome in mouse [32, 33].
The N-terminal end of PC has chromodomain, which binds to the histone methylation marks created by PRC2 on PREs . Chromodomain is a three beta strands and a helix containing domain present in proteins that are involved in chromatin organization, viz., HP1, SU(var)3-9, Swi6, CHD1, MSL-3, MOF, etc. Chromodomain is involved in targeting the protein to specific regions of chromatin. The chromodomain of Drosophila PC exhibits preferential binding to tri-methylated histone H3 at lysine 27 (H3K27Me3)  whereas chromodomain of HP1 recognizes H3K9Me3 mark. Mutation in the chromodomain of PC results in the disintegration of PRC1 and subsequently loss of its silencing activity . In Drosophila, a chimeric protein generated by replacing chromodomain of HP1 with PC chromodomain localizes HP1 to euchromatic PC binding sites indicating that chromodomain is essential for recognizing specific histone methyl marks [37, 38]. This suggests that subtle differences in the sequence/structure of chromodomains may confer differential affinity to different histone methylation patterns, for example, H3K9Me3 and H3K27Me3. Unlike the fly PC that recognizes only H3K27Me3, mammalian PC homologues show differential binding to methylated histone. CBX2 and CBX7 bind to both H3K9Me3 and H3K27Me3 whereas CBX4 shows strong affinity for H3K9Me3 .
Significance in PcG system is apparent from the observation that these genes are not only conserved from plants to animals, but also highly evolved animals have more homologues of PcG genes. For example, while insects have only one copy of PC, vertebrates have at least five homologues. The importance of having more PC homologues in an organism remains elusive. Identification of uniquely conserved regions in each CBX protein will help us understand the function of homologues. In this study, we carried out extensive mining and analysis of PC homologues to understand their evolution and sequence-structure-function relationship in the context of motif organization.
Search for PC homologues and comparative sequence analysis
Chromodomain in PC homologues
The crystal structure of fly PC chromodomain indicates that W(36) and Y(40) residues are essential for its interaction with H3K27Me3 . W is involved in H3K27Me3 recognition and Y is involved in stabilization of the interaction. W is conserved in all the PC homologues whereas Y is not conserved as some homologues have His residue at this position (Figures 4 & 5). The Drosophila PC chromodomain also forms hydrophobic pocket, involving V(11), Y(12) and A(13), that is essential for its interaction with H3. CBX6 has V or I at residue 11 whereas the other homologues have highly conserved V at this position. Similarly, A(13) is replaced by other residues only in CBX2. Y(12) is conserved across insects whereas vertebrate homologues have F at this position. D(51) and R(53), also involved in PC and H3 interaction, are conserved across the species. PC chromodomain functions as dimer [35, 39]. Of the two key residues shown to be essential for PC dimerization, L(50) is conserved while R(52) is not. These important differences between insect PC chromodomain and among that of vertebrates, point to different preferences for histone modifications by the vertebrate PC homologues. The residues essential for differential affinity of PC homologues to different methylation marks, however, remain elusive. Interestingly we find several residues that are uniquely conserved in paralogue specific manner (Figure 4, highlighted in gray). Further studies will be needed to understand functional significance of these residues.
Polycomb repressor (PcR) domain
AT-Hook and AT-Hook Like motifs in PC homologues
CBX2 has a conserved AT-hook motif adjacent to N-terminal chromodomain. This motif is present in many DNA binding proteins and interacts with the minor groove of AT rich region and enhances the interaction of the protein with chromatin and/or DNA . In CBX4, 6, 7 & 8 we found a highly conserved motif with close similarity to AT-Hook and named it as 'AT-Hook Like' (ATHL) motif. Like AT-Hook, this motif is also not specific to Cbx proteins and found in several other nuclear proteins that are likely to interact with DNA (see Additional file 6). The core region of AT-Hook motif has highly conserved P, G and two basic amino acids, K and R. Similarly the ATHL motif is also rich in basic amino acids and has P and G in the core region (Figure 3, see Additional file 5). This indicates a possible alternative to the AT-Hook in other CBX protein for their chromatin binding. Interestingly, except in Xenopus, CBX8 has two ATHL motifs. Although the functional importance of this motif in PC homologues is not clear yet, it is tempting to suggest that it may be involved in a cooperative three way interaction: "H3K27Me3_chromodomain+AT-Hook/ATHL_DNA" on the nucleosome (see below).
Novel motifs in PC homologues
The conserved motifs of PC homologues
Twisted antiparallel β sheet (3 β strands and a helix)
Binds with H3K27me3
P olyc omb R epressor
Interacts with dRING
CBX4, CBX6, CBX7 and CBX8
Present in several nuclear proteins (DNA binding?)
Poly histidine site
CBX4 of mammals
CBX8 of mammals
Present in several proteins
CtBP interaction motif
CBX4 (Except fishes)
Serine rich region
I nsect Pc box
Present in insect Polycomb
CBX2 specific motifs
In addition to AT-Hook motif, CBX2 has three conserved motifs (Figure 2). A Serine rich region (SRR) is present next to the AT-Hook motif. SRR is found in proteins with wide range of function and thought to be the site for phosphorylation and associated functions . The length of SRR has reduced during evolution from fish to human. Fishes have 24 Serine residues whereas human have a stretch of 16 serine residues (see Additional file 5). An exception is the CBX2 of Tetraodon, where one of the two homologues identified does not have SSR and the size of the motif in other homologue is smaller than that in human. Two more unique motifs present in CBX2 are named as Cx2.1 and Cx2.2. In the 74 aa long Cx2.1 motif, the highly conserved core is rich in basic residues and Proline. Cx2.2 is present in C terminal region; closer to PcR box and has highly conserved Serine residues and acidic amino acids D and E.
CBX4 specific motifs
The CBX4 protein functions as a SUMO E3 ligase and is involved in SUMOylation of CtBP, SIP1, HIPK2, CTCF and Dnmt3a [47–50]. The CBX4 is highly conserved across vertebrates. It has four conserved motifs (Cx4.1, Cx4.2, Cx4.3 and Cx4.4) (Figure 2, see Additional file 5). Cx4.1, a 88aa conserved motif, has a duplicated and highly conserved shorter stretch of Y [QE]LNSKKHH [QHP]YQP separated by a stretch of poorly conserved 19aa residues (Figure 3). A SUMO binding site is present in motif Cx4.2 and the region flanking this site is rich in Lysine with a highly conserved (KN)3 repeat within it. The motif Cx4.3, which has highly conserved hydrophilic residues (Q, S and T), is absent in Danio rerio but present in Fugu. CBX4 has a conserved CtBP binding site and a SUMOylation site as well. These are absent in other PC homologues  which indicates the functional specificity of CBX4.
CBX6 specific motifs
CBX6 has three conserved regions between chromodomain and PcR box (Figure 2, see Additional file 5). Cx6.1 is a 88 aa highly conserved region unique to CBX6 and is rich in S, T, N, P and basic amino acids. One end of the motif is rich in Serine, followed by basic amino acid rich region, Proline rich region and basic amino acid rich region, indicating possible multiple functional domains present within this motif. Cx6.2 is rich in basic amino acids, of the 36 residues 10 are basic amino acids. Cx6.3 is a small conserved motif present closer to PcR box and is rich in acidic amino acids and Proline.
CBX8 specific motifs
CBX8 has one conserved motif unique to it, Cx8.1 following its first AT-Hook like motif (Figure 2, see Additional file 5). CBX8 of mammals also has a RED/RD repeats, present in several nuclear proteins although the function of this acid-base dipeptide repeat is not known (see Additional file 5). Unlike SRR repeat in CBX2, which shows reduction in the repeat length as we move from fish to human, the length of the RED repeat has expanded from mouse to human - mouse has 4 while human has 16 repeat units. In all the species studied, the repeat ends with RG residues. The less conserved region of CBX8 (conserved from Xenopus to mammals but not in fishes), 202-333 amino acids, was shown to interact with the nuclear protein AF9 in mouse . Although the function of AF9 protein is not well understood, its human homologue, ENL, is involved in transcriptional activation  and interacts with CBX8 . The AF9 locus is prone to gene rearrangement involving MLL and the MLL-AF9 fusion protein is associated with acute leukaemia in mice . Further studies will be needed to understand the precise role of CBX8 in these events.
Insect specific conserved region
In sequence comparison analysis along with the vertebrate homologues of PC, insect PC always clusters in one group indicating its highly conserved nature compared to the vertebrate counter parts. In this analysis we identified a novel insect specific highly conserved motif, c onserved i nset specific P oly c omb (CIPC) box (Figure 2). This unique motif is mainly composed of basic, hydroxyl and acidic residues (see Additional file 5). Vertebrate homologues of PC do not have this extremely conserved insect specific motif. It appears as though CIPC was replaced by different conserved motifs that are specific to each homologue in order to acquire functional uniqueness.
Evolution of Polycomb homologues
The members of PRC2 are conserved across the species including plants . Earlier studies, although suggested that members of PRC1 are absent in plants , recent findings indicate that plant do have distant homologues of these genes. Arabidopsis protein LHP1/TFL2 that has chromodomain and a chromoshadow domain, like HP1 in animals, does recognize H3K27Me3 modification which is a major repressive mark in plants [57–59]. LHP1/TFL2 is, therefore, considered as the functional homologue of PC in plants [58, 60]. More recently, Arabidopsis homologues of ring finger proteins have been identified that also interact with LHP1 [61, 62]. In our analysis we found PC homologues in cnidarians Nematostella, Hydra and Podocoryne. This shows that PC is pre bilaterian origin. The PC homologues of sea anemone show high sequence homology with the chromodomain and the PcR box of other PC homologues. PC of Podocoryne carnea interacts with mouse RING1B  which is a PRC1 member. This shows that PC appeared in the common ancestor of bilaterians and cnidarians, 570-700 millions years ago.
In nematode Caenorhabditis elegans, Cec-1 is predicted as a homologue of PC . Functional analysis has shown that expression profile of cec-1 is similar to that of Drosophila Pc . It has N-terminal chromodomain but the predicted PcR box of CEC-1 shows a poor sequence homology . In Drosophila it has been shown that both N-terminal chromodomain and C-terminal PcR box is essential for PC function . PcR box of vertebrates shows much higher sequence homology with PcR box of cnidarians compared to that of C. elegans. This raises the possibility that CEC-1 may be at best a highly diverged PC homologue. Function of PcR box includes the assembly of PRC1. In C. elegans, where all homologues of PRC2 members are present, only a few homologues of PRC1 members have been identified [61, 66]. It remains possible that CEC-1 may not be a true PC homologue or it may be a remote homologue of PC that interacts with different class of proteins for its repressive function.
Invertebrates have one PC where as mammals have 5 homologues. The teleosts like Zebrafish, Fugu and Tetraodon have up to 7 PC homologues. This expansion of PC homologue in vertebrate shows remarkable parallel with the expansion of Hox gene complex. In teleosts PC genes are duplicated in species-specific manner. CBX8 is duplicated in Zebrafish and Tetraodon. CBX2, CBX4, CBX7 are duplicated in Tetraodon, Fugu, Zebrafish, respectively (see Additional file 1). Interestingly, teleosts have more copies of Hox clusters than mammals. Invertebrates have one Hox cluster where as tetrapods have four Hox clusters. In teleosts lineage, Hox cluster duplicated further and ray finned fishes have 7-8 Hox clusters. During vertebrate evolution, genome of the common ancestor of vertebrates underwent two rounds of duplication events that led to increase in the copy number of several genes  and many of the extra copies acquired novel functions. It is believed to have paved the way for increase in the system complexity. In teleosts lineage, genome underwent an additional round of species-specific duplication event that led to duplication of several genes.
PC is the key protein in the epigenetic control of gene expression from early development to adult stages. Precise expression of Hox genes along the anterior-posterior body axis and its maintenance in subsequent cell divisions is dependent on PcG and trxG of genes. PC homologues begin to appear in cnidarians. This early origin of PC and its expansion in vertebrates shows a remarkable parallel with the pre bilaterian-cnidarian origin of Hox complexes and their expansion in vertebrates . While most vertebrates have five PC homologues, we found more PC homologues in fishes. This may be due to the fish specific genome duplications. Interestingly, fishes have more Hox clusters too [69, 70]. Genome duplication event that occurred in vertebrate lineage  caused gene duplications that lead to increase in complexity-associated acquisition of novel functions in these additional copies. We show that different PC homologues carry distinct motifs and, therefore, are likely to have functional distinctions. This also suggests that common ancestor of vertebrate - the link between invertebrate and vertebrates - acquired novel features in various duplicated copies of the PC gene. These novelties have been conserved for more than 500 million years indicating their functional relevance in vertebrate diversity. Expansion of PC gene itself and that of several other members of PcG/trxG genes vis-à-vis expansion of genome size, in particular, the increasing proportion of non-coding part of genome indicates enforcement of epigenetic regulatory toolkit and its important role in vertebrate evolution. It is known that vertebrates that have four or more Hox clusters also have similarly higher number of PC homologues [69, 70]. This parallel expansion may indicate that the expanded group of the master control Hox genes also need a matching degree of expansion of the epigenetic tools including PC. This may have been one of the selection pressures behind the amplification and conservation of PC genes in vertebrates.
The vertebrate homologues of Polycomb gene, CBX2, 4, 6, 7 and 8 acquired distinct combination of motifs that separates them from one another and have been remarkably well conserved. We also found that each vertebrate PC homologue has a putative DNA binding domain AT-Hook or ATHL, a newly identified variant. Since these motifs are present adjacent to the chromodomain that recognizes specific histone tail modification, it raises the possibility that PC contact with the target site may involve bimodal recognition and binding. In addition to chromodomain binding to the histone tail, AT-Hook/ATHL can bind to the adjacent site in the minor groove of DNA, sandwiching the DNA between PC and histone surface (see Additional file 8). It is interesting to note that this interaction will 'lock' the nucleosome from shuffling or remodelling and may explain why PRC1 makes nucleosome more stable and the entire region repressive for transcription . While several DNA binding proteins are known as the members of the PcG in flies, our finding that all CBX counterparts in vertebrates contain their own DNA binding motif, indicates that at least at some loci in vertebrate genomes PRC1 by itself can read the epigenetic histone tail mark and associate with it.
Why vertebrates have more PC homologues? In this study we report for the first time that each homologue has its own signature. We think that the expansion, which allowed addition of novel features to different PC homologues, created greater possibilities in the variety of factors interacting with this protein. This may also allow distinct kind of PRC1 complexes depending on the CBX homologue associated with it. Very recently CBX7 and CBX8 have been shown to exist in distinct complexes , it remains to be seen, however, if other homologues follow the same trend. In mouse embryonic stem cells, PC homologues follow distinct sub nuclear localization patterns and show differential dynamics during ES cell differentiation [72, 73]. It is possible that novel motifs identified in our study may contribute to these distinct properties of the PC homologues by themselves or with the help of different sets of interactors. It also remains possible that some CBX proteins may function as individuals and not as components of a complex. Detailed biochemical analysis will be needed to explore these possibilities.
We also identified a novel insect specific conserved motif, CIPC, absent in vertebrate homologues. While the function of this novel domain remains to be studied, it seems likely that expansion of Polycomb gene and acquisition of novel motifs have come at the expense of the CIPC. Expansion of epigenetic mechanisms has been one of the significant events in the evolution of complexity. Since PcG and trxG are the key players in regulating the differential expression of genes, having more homologues of these groups of proteins may have provided regulatory capabilities needed for the evolution of complex organisms. Diversification of PC function (recognition of histone tail modification by its chromodomain and recruitment of repressor factor by PcR box) coincides with the trade off between insect specific CIPC and vertebrate homologue specific motifs including the DNA binding motif AT-Hook/ATHL. We suggest that CIPC may be restricting PC to selected loci or processes and that its absence allows the new protein architecture to be recruited for diverse functions depending on the newly acquired motifs. It will be important to dissect out the function of these new motifs to understand the epigenetic mechanisms of developmental gene regulation controlled by PC homologues and various disease conditions where these proteins are involved.
Using genome wide scanning approach we report an efficient strategy to mine homologues in the genome databases. We find that PC is of pre-bilaterian origin and it is evolving from the common ancestor of bilaterians and cnidarians. We find that multiple PC homologues present in vertebrates, CBX2, 4, 6, 7 & 8, have acquired CBX protein specific conserved motifs, including signature motifs for each homologue. This indicates that these homologues emerged early during the vertebrate evolution and have been conserved under positive selection pressure. The chromodomain and PcR domains of vertebrate PC proteins also show orthologue specific features indicating functional uniqueness of these domains as well within the globally conserved constraints. We also see two major features in the PC that sets apart the vertebrate and insect proteins - presence of CIPC motif in the insect protein and that of DNA binding AT-Hook or ATHL (identified in this study) motif in all vertebrate homologues. We also suggest a model in which chromdomain-H3K27Me3 interaction on the one hand and AT-Hook/ATHL - DNA interaction on the other, locks nucleosomes in repressed state.
Several lines of studies indicate that complexity of the higher vertebrates evolved by increase in the non-coding part of the genome that is likely to have functional elements where a set of trans-acting factors interact. One of such set of factors is the PcG system. Our study shows a quick expansion and acquisition of novelties in vertebrate PC homologues. We suggest that increase in the number of homologues of these proteins resulted into the expansion of epigenetic mechanisms in the common ancestor of vertebrates that led to the evolution of complexity and diversity in these organisms.
Mining of the PC homologues
One PC is present in fly and other insects. The Drosophila PC protein was used as a seed for mining its homologues. The overall schema for mining PC homologues is shown in Figure 1. PC homologues were searched in NCBI Nr Genpept (release156, 3,570,920 sequences) database using PSI BLAST . It is based on iterative profile based search and constructs position specific weight matrices on the basis of the alignment generated by the blast search. This matrix was used to score the next iteration. The availability of protein sequence information is lesser than the possible coding regions in genome. So it is essential to search in genome to find the homologues, which are not reported in protein databases. For this purpose the genome sequence of model organisms (see Additional file 9) were also searched by using tBLASTN . The genomic region showing hits with any length irrespective of their score and identity were considered as probable candidates for our analysis. The BLAST hits were parsed and the regions showing hits were extended 5 kb upstream and 5 kb downstream. After extending their length if any hit over laps with another hit, they were combined together and extended 5 kb further from the overlapping regions. This approach will be useful to extract larger size gene sequence and homologues with less sequence homology. The extended blast hits were subjected for gene prediction. Genscan  was used to predict genes in the genomic regions showing hits with PC of Drosophila. The sequences showing similarity >98% over the length of the sequences in same organism were considered as redundant set. The predicted genes of same organism were aligned using Blastclust  and clustered with above cut off and redundant data were removed.
The sequence homology among PC homologues is less. Conventional methods to find sequence homologues based on sequence homology are not efficient enough to pick up conserved regions in sequences having less sequence homology in larger dataset. The conserved motifs among our training dataset were predicted and used for picking true homologues. The motifs were predicted by using the program MEME . It uses expectation maximization algorithm to identify motifs with best width, in a group of protein or DNA sequences. The predicted motifs were assigned to Genpept hits and predicted proteins of our genome wide search approach. The sequences were aligned based on motif conservation by MAST (Motif Alignment Search Tool) . The conserved regions in homologues were extracted from predicted motifs and HMM profile was created by using HMMER . The profiles were searched in Swissprot protein sequence database to identify the existence of these conserved regions in other proteins.
Multiple sequence alignment and phylogenetic analysis
The multiple sequence alignment was done by using ClustalW  and T-Coffee . The alignments were visualized by using ClustalX . The consensus pattern was drawn in logo format using WebLogo . The phylogenetic analysis was performed by maximum parsimony method using MEGA . The sequences were bootstrapped 1000 times, to check the chance for occurrence of each internal branch of the tree. This method is helpful to know the reliability of tree.
Secondary structure analysis
The secondary structure prediction was done by using PSIPRED . It uses a neural network method to predicts the secondary structure based on PSI BLAST result of the given query sequence.
RS thanks IIIT financial assistance in the early stages and CSIR for a senior research fellowship. Research work in RKM laboratory is supported by CSIR, CEFIRA, DBT and DST. We acknowledge RKM lab members particularly Ram Parikshan Kumar for discussions and suggestions and Krishnaveni Mishra for critical reading of the manuscript.
- Shao Z, Raible F, Mollaaghababa R, Guyon JR, Wu CT, Bender W, Kingston RE: Stabilization of chromatin structure by PRC1, a Polycomb complex. Cell. 1999, 98 (1): 37-10.1016/S0092-8674(00)80604-2.View ArticlePubMedGoogle Scholar
- Kuzmichev A, Nishioka K, Erdjument-Bromage H, Tempst P, Reinberg D: Histone methyltransferase activity associated with a human multiprotein complex containing the Enhancer of Zeste protein. Genes Dev. 2002, 16 (22): 2893-10.1101/gad.1035902.PubMed CentralView ArticlePubMedGoogle Scholar
- Tie F, Furuyama T, Prasad-Sinha J, Jane E, Harte PJ: The Drosophila Polycomb Group proteins ESC and E(Z) are present in a complex containing the histone-binding protein p55 and the histone deacetylase RPD3. Development. 2001, 128 (2): 275-PubMedGoogle Scholar
- Tie F, Prasad-Sinha J, Birve A, Rasmuson-Lestander A, Harte PJ: A 1-megadalton ESC/E(Z) complex from Drosophila that contains polycomblike and RPD3. Mol Cell Biol. 2003, 23 (9): 3352-10.1128/MCB.23.9.3352-3362.2003.PubMed CentralView ArticlePubMedGoogle Scholar
- Nekrasov M, Klymenko T, Fraterman S, Papp B, Oktaba K, Kocher T, Cohen A, Stunnenberg HG, Wilm M, Muller J: Pcl-PRC2 is needed to generate high levels of H3-K27 trimethylation at Polycomb target genes. EMBO J. 2007, %19;26 (18): 4078-10.1038/sj.emboj.7601837.View ArticleGoogle Scholar
- Mohd-Sarip A, Venturini F, Chalkley GE, Verrijzer CP: Pleiohomeotic can link polycomb to DNA and mediate transcriptional repression. Mol Cell Biol. 2002, 22 (21): 7473-10.1128/MCB.22.21.7473-7483.2002.PubMed CentralView ArticlePubMedGoogle Scholar
- Kal AJ, Mahmoudi T, Zak NB, Verrijzer CP: The Drosophila brahma complex is an essential coactivator for the trithorax group protein zeste. Genes Dev. 2000, 14 (9): 1058-PubMed CentralPubMedGoogle Scholar
- Mihaly J, Mishra RK, Karch F: A conserved sequence motif in Polycomb-response elements. Mol Cell. 1998, 1 (7): 1065-10.1016/S1097-2765(00)80107-0.View ArticlePubMedGoogle Scholar
- Levine SS, King IF, Kingston RE: Division of labor in polycomb group repression. Trends BiochemSci. 2004, 29 (9): 478-10.1016/j.tibs.2004.07.007.View ArticleGoogle Scholar
- Schwartz YB, Pirrotta V: Polycomb silencing mechanisms and the management of genomic programmes. Nat Rev Genet. 2007, 8 (1): 9-10.1038/nrg1981.View ArticlePubMedGoogle Scholar
- Simon JA, Tamkun JW: Programming off and on states in chromatin: mechanisms of Polycomb and trithorax group complexes. CurrOpinGenet Dev. 2002, 12 (2): 210-Google Scholar
- Muller J, Kassis JA: Polycomb response elements and targeting of Polycomb group proteins in Drosophila. CurrOpinGenet Dev. 2006, 16 (5): 476-Google Scholar
- Cao R, Zhang Y: The functions of E(Z)/EZH2-mediated methylation of lysine 27 in histone H3. CurrOpinGenet Dev. 2004, 14 (2): 155-Google Scholar
- Lund AH, Van LM: Polycomb complexes and silencing mechanisms. CurrOpinCell Biol. 2004, 16 (3): 239-Google Scholar
- Lewis EB: A gene complex controlling segmentation in Drosophila. Nature. 1978, 276 (5688): 565-10.1038/276565a0.View ArticlePubMedGoogle Scholar
- Ingham PW, Martinez AA: Boundaries and fields in early embryos. Cell. 1992, 68 (2): 221-10.1016/0092-8674(92)90467-Q.View ArticlePubMedGoogle Scholar
- Orlando V, Jane EP, Chinwalla V, Harte PJ, Paro R: Binding of trithorax and Polycomb proteins to the bithorax complex: dynamic changes during early Drosophila embryogenesis. EMBO J. 1998, 17 (17): 5141-10.1093/emboj/17.17.5141.PubMed CentralView ArticlePubMedGoogle Scholar
- Kennison JA: Introduction to Trx-G and Pc-G genes. Methods Enzymol. 2004, 377: 61-70. full_text.View ArticlePubMedGoogle Scholar
- Zink B, Paro R: In vivo binding pattern of a trans-regulator of homoeotic genes in Drosophila melanogaster. Nature. 1989, 337 (6206): 468-10.1038/337468a0.View ArticlePubMedGoogle Scholar
- Ringrose L, Rehmsmeier M, Dura JM, Paro R: Genome-wide prediction of Polycomb/Trithorax response elements in Drosophila melanogaster. DevCell. 2003, 5 (5): 759-Google Scholar
- Tolhuis B, de WE, Muijrers I, Teunissen H, Talhout W, van SB, Van LM: Genome-wide profiling of PRC1 and PRC2 Polycomb chromatin binding in Drosophila melanogaster. Nat Genet. 2006, 38 (6): 694-10.1038/ng1792.View ArticlePubMedGoogle Scholar
- Schwartz YB, Kahn TG, Nix DA, Li XY, Bourgon R, Biggin M, Pirrotta V: Genome-wide analysis of Polycomb targets in Drosophila melanogaster. Nat Genet. 2006, 38 (6): 700-10.1038/ng1817.View ArticlePubMedGoogle Scholar
- Negre N, Hennetin J, Sun LV, Lavrov S, Bellis M, White KP, Cavalli G: Chromosomal distribution of PcG proteins during Drosophila development. PLoS Biol. 2006, 4 (6): e170-10.1371/journal.pbio.0040170.PubMed CentralView ArticlePubMedGoogle Scholar
- Sparmann A, Van LM: Polycomb silencers control cell fate, development and cancer. Nat Rev Cancer. 2006, 6 (11): 846-10.1038/nrc1991.View ArticlePubMedGoogle Scholar
- Yu J, Yu J, Rhodes DR, Tomlins SA, Cao X, Chen G, Mehra R, Wang X, Ghosh D, Shah RB, et al: A polycomb repression signature in metastatic prostate cancer predicts cancer outcome. Cancer Res. 2007, 67 (22): 10657-10.1158/0008-5472.CAN-07-2498.View ArticlePubMedGoogle Scholar
- Gil J, Bernard D, Peters G: Role of polycomb group proteins in stem cell self-renewal and cancer. DNA Cell Biol. 2005, 24 (2): 117-10.1089/dna.2005.24.117.View ArticlePubMedGoogle Scholar
- Dietrich N, Bracken AP, Trinh E, Schjerling CK, Koseki H, Rappsilber J, Helin K, Hansen KH: Bypass of senescence by the polycomb group protein CBX8 through direct binding to the INK4A-ARF locus. EMBO J. 2007, 26 (6): 1637-10.1038/sj.emboj.7601632.PubMed CentralView ArticlePubMedGoogle Scholar
- Bernard D, Martinez-Leal JF, Rizzo S, Martinez D, Hudson D, Visakorpi T, Peters G, Carnero A, Beach D, Gil J: CBX7 controls the growth of normal and tumor-derived prostate cells by repressing the Ink4a/Arf locus. Oncogene. 2005, 24 (36): 5543-10.1038/sj.onc.1208735.View ArticlePubMedGoogle Scholar
- Gil J, Bernard D, Martinez D, Beach D: Polycomb CBX7 has a unifying role in cellular lifespan. Nat Cell Biol. 2004, 6 (1): 67-10.1038/ncb1077.View ArticlePubMedGoogle Scholar
- Satijn DP, Olson DJ, van dV, Hamer KM, Lambrechts C, Masselink H, Gunster MJ, Sewalt RG, van DR, Otte AP: Interference with the expression of a novel human polycomb protein, hPc2, results in cellular transformation and apoptosis. Mol Cell Biol. 1997, 17 (10): 6076-PubMed CentralView ArticlePubMedGoogle Scholar
- Bracken AP, Dietrich N, Pasini D, Hansen KH, Helin K: Genome-wide mapping of Polycomb target genes unravels their roles in cell fate transitions. Genes Dev. 2006, 20 (9): 1123-10.1101/gad.381706.PubMed CentralView ArticlePubMedGoogle Scholar
- Plath K, Talbot D, Hamer KM, Otte AP, Yang TP, Jaenisch R, Panning B: Developmentally regulated alterations in Polycomb repressive complex 1 proteins on the inactive X chromosome. J Cell Biol. 2004, %20;167 (6): 1025-10.1083/jcb.200409026.View ArticleGoogle Scholar
- Bernstein E, Duncan EM, Masui O, Gil J, Heard E, Allis CD: Mouse polycomb proteins bind differentially to methylated histone H3 and RNA and are enriched in facultative heterochromatin. Mol Cell Biol. 2006, 26 (7): 2560-10.1128/MCB.26.7.2560-2569.2006.PubMed CentralView ArticlePubMedGoogle Scholar
- Hansen KH, Bracken AP, Pasini D, Dietrich N, Gehani SS, Monrad A, Rappsilber J, Lerdrup M, Helin K: A model for transmission of the H3K27me3 epigenetic mark. Nat Cell Biol. 2008, 10 (11): 1291-10.1038/ncb1787.View ArticlePubMedGoogle Scholar
- Fischle W, Wang Y, Jacobs SA, Kim Y, Allis CD, Khorasanizadeh S: Molecular basis for the discrimination of repressive methyl-lysine marks in histone H3 by Polycomb and HP1 chromodomains. Genes Dev. 2003, 17 (15): 1870-10.1101/gad.1110503.PubMed CentralView ArticlePubMedGoogle Scholar
- Messmer S, Franke A, Paro R: Analysis of the functional role of the Polycomb chromo domain in Drosophila melanogaster. Genes Dev. 1992, 6 (7): 1241-10.1101/gad.6.7.1241.View ArticlePubMedGoogle Scholar
- Platero JS, Hartnett T, Eissenberg JC: Functional analysis of the chromo domain of HP1. EMBO J. 1995, 14 (16): 3977-PubMed CentralPubMedGoogle Scholar
- Fanti L, Perrini B, Piacentini L, Berloco M, Marchetti E, Palumbo G, Pimpinelli S: The trithorax group and Pc group proteins are differentially involved in heterochromatin formation in Drosophila. Chromosoma. 2008, 117 (1): 25-10.1007/s00412-007-0123-7.View ArticlePubMedGoogle Scholar
- Min J, Zhang Y, Xu RM: Structural basis for specific binding of Polycomb chromodomain to histone H3 methylated at Lys 27. Genes Dev. 2003, 17 (15): 1823-10.1101/gad.269603.PubMed CentralView ArticlePubMedGoogle Scholar
- Breiling A, Bonte E, Ferrari S, Becker PB, Paro R: The Drosophila polycomb protein interacts with nucleosomal core particles In vitro via its repression domain. Mol Cell Biol. 1999, 19 (12): 8451-PubMed CentralView ArticlePubMedGoogle Scholar
- Muller J: Transcriptional silencing by the Polycomb protein in Drosophila embryos. EMBO J. 1995, 14 (6): 1209-PubMed CentralPubMedGoogle Scholar
- Bunker CA, Kingston RE: Transcriptional repression by Drosophila and mammalian Polycomb group proteins in transfected mammalian cells. Mol Cell Biol. 1994, 14 (3): 1721-PubMed CentralView ArticlePubMedGoogle Scholar
- Schoorlemmer J, Marcos-Gutierrez C, Were F, Martinez R, Garcia E, Satijn DP, Otte AP, Vidal M: Ring1A is a transcriptional repressor that interacts with the Polycomb-M33 protein and is expressed at rhombomere boundaries in the mouse hindbrain. EMBO J. 1997, 16 (19): 5930-10.1093/emboj/16.19.5930.PubMed CentralView ArticlePubMedGoogle Scholar
- Bardos JI, Saurin AJ, Tissot C, Duprez E, Freemont PS: HPC3 is a new human polycomb orthologue that interacts and associates with RING1 and Bmi1 and has transcriptional repression properties. J Biol Chem. 2000, 275 (37): 28785-10.1074/jbc.M001835200.View ArticlePubMedGoogle Scholar
- Aravind L, Landsman D: AT-hook motifs identified in a wide variety of DNA-binding proteins. Nucleic Acids Res. 1998, 26 (19): 4413-10.1093/nar/26.19.4413.PubMed CentralView ArticlePubMedGoogle Scholar
- O'Mahony DJ, Smith SD, Xie W, Rothblum LI: Analysis of the phosphorylation, DNA-binding and dimerization properties of the RNA polymerase I transcription factors UBF1 and UBF2. Nucleic Acids Res. 1992, 20 (6): 1301-1308. 10.1093/nar/20.6.1301.PubMed CentralView ArticlePubMedGoogle Scholar
- Long J, Zuo D, Park M: Pc2-mediated sumoylation of Smad-interacting protein 1 attenuates transcriptional repression of E-cadherin. J Biol Chem. 2005, 280 (42): 35477-10.1074/jbc.M504477200.View ArticlePubMedGoogle Scholar
- Li B, Zhou J, Liu P, Hu J, Jin H, Shimono Y, Takahashi M, Xu G: Polycomb protein Cbx4 promotes SUMO modification of de novo DNA methyltransferase Dnmt3a. BiochemJ. 2007, 405 (2): 369-10.1042/BJ20061873.View ArticleGoogle Scholar
- Roscic A, Moller A, Calzado MA, Renner F, Wimmer VC, Gresko E, Ludi KS, Schmitz ML: Phosphorylation-dependent control of Pc2 SUMO E3 ligase activity by its substrate protein HIPK2. Mol Cell. 2006, 24 (1): 77-10.1016/j.molcel.2006.08.004.View ArticlePubMedGoogle Scholar
- MacPherson MJ, Beatty LG, Zhou W, Du M, Sadowski PD: The CTCF insulator protein is posttranslationally modified by SUMO. Mol Cell Biol. 2009, 29 (3): 714-10.1128/MCB.00825-08.PubMed CentralView ArticlePubMedGoogle Scholar
- Wotton D, Merrill JC: Pc2 and SUMOylation. BiochemSocTrans. 2007, 35 (Pt 6): 1401-Google Scholar
- Hemenway CS, de Erkenez AC, Gould GC: The polycomb protein MPc3 interacts with AF9, an MLL fusion partner in t(9;11)(p22;q23) acute leukemias. Oncogene. 2001, 20 (29): 3798-10.1038/sj.onc.1204478.View ArticlePubMedGoogle Scholar
- Mueller D, Bach C, Zeisig D, Garcia-Cuellar MP, Monroe S, Sreekumar A, Zhou R, Nesvizhskii A, Chinnaiyan A, Hess JL, et al: A role for the MLL fusion partner ENL in transcriptional elongation and chromatin modification. Blood. 2007, 110 (13): 4445-10.1182/blood-2007-05-090514.PubMed CentralView ArticlePubMedGoogle Scholar
- Garcia-Cuellar MP, Zilles O, Schreiner SA, Birke M, Winkler TH, Slany RK: The ENL moiety of the childhood leukemia-associated MLL-ENL oncoprotein recruits human Polycomb 3. Oncogene. 2001, 20 (4): 411-10.1038/sj.onc.1204108.View ArticlePubMedGoogle Scholar
- Grossniklaus U, Vielle-Calzada JP, Hoeppner MA, Gagliano WB: Maternal control of embryogenesis by MEDEA, a polycomb group gene in Arabidopsis. Science. 1998, 280 (5362): 446-10.1126/science.280.5362.446.View ArticlePubMedGoogle Scholar
- Pien S, Grossniklaus U: Polycomb group and trithorax group proteins in Arabidopsis. BiochimBiophysActa. 2007, 1769 (5-6): 375-Google Scholar
- Zhang X, Clarenz O, Cokus S, Bernatavichute YV, Pellegrini M, Goodrich J, Jacobsen SE: Whole-genome analysis of histone H3 lysine 27 trimethylation in Arabidopsis. PLoS Biol. 2007, 5 (5): e129-10.1371/journal.pbio.0050129.PubMed CentralView ArticlePubMedGoogle Scholar
- Turck F, Roudier F, Farrona S, Martin-Magniette ML, Guillaume E, Buisine N, Gagnot S, Martienssen RA, Coupland G, Colot V: Arabidopsis TFL2/LHP1 specifically associates with genes marked by trimethylation of histone H3 lysine 27. PLoS Genet. 2007, 3 (6): e86-10.1371/journal.pgen.0030086.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhang X, Germann S, Blus BJ, Khorasanizadeh S, Gaudin V, Jacobsen SE: The Arabidopsis LHP1 protein colocalizes with histone H3 Lys27 trimethylation. Nat StructMol Biol. 2007, 14 (9): 869-10.1038/nsmb1283.View ArticleGoogle Scholar
- Hennig L, Derkacheva M: Diversity of Polycomb group complexes in plants: same rules, different players?. Trends Genet. 2009, 25 (9): 414-423. 10.1016/j.tig.2009.07.002.View ArticlePubMedGoogle Scholar
- Sanchez-Pulido L, Devos D, Sung ZR, Calonje M: RAWUL: a new ubiquitin-like domain in PRC1 ring finger proteins that unveils putative plant and worm PRC1 orthologs. BMC Genomics. 2008, 9: 308-10.1186/1471-2164-9-308.PubMed CentralView ArticlePubMedGoogle Scholar
- Xu L, Shen WH: Polycomb silencing of KNOX genes confines shoot stem cell niches in Arabidopsis. CurrBiol. 2008, 18 (24): 1966-View ArticleGoogle Scholar
- Lichtneckert R, Muller P, Schmid V, Reber-Muller S: Evolutionary conservation of the chromatin modulator Polycomb in the jellyfish Podocoryne carnea. Differentiation. 2002, 70 (8): 422-10.1046/j.1432-0436.2002.700804.x.View ArticlePubMedGoogle Scholar
- Whitcomb SJ, Basu A, Allis CD, Bernstein E: Polycomb Group proteins: an evolutionary perspective. Trends Genet. 2007, 23 (10): 494-10.1016/j.tig.2007.08.006.View ArticlePubMedGoogle Scholar
- Agostoni E, Albertson D, Wittmann C, Hill F, Tobler H, Muller F: cec-1, a soma-specific chromobox-containing gene in C. elegans. DevBiol. 1996, 178 (2): 316-Google Scholar
- Ross JM, Zarkower D: Polycomb group regulation of Hox gene expression in C. elegans. DevCell. 2003, 4 (6): 891-Google Scholar
- Meyer A, Van de PY: From 2R to 3R: evidence for a fish-specific genome duplication (FSGD). Bioessays. 2005, 27 (9): 937-10.1002/bies.20293.View ArticlePubMedGoogle Scholar
- Ryan JF, Mazza ME, Pang K, Matus DQ, Baxevanis AD, Martindale MQ, Finnerty JR: Pre-bilaterian origins of the Hox cluster and the Hox code: evidence from the sea anemone, Nematostella vectensis. PLoS ONE. 2007, 2 (1): e153-10.1371/journal.pone.0000153.PubMed CentralView ArticlePubMedGoogle Scholar
- Amores A, Force A, Yan YL, Joly L, Amemiya C, Fritz A, Ho RK, Langeland J, Prince V, Wang YL, et al: Zebrafish hox clusters and vertebrate genome evolution. Science. 1998, 282 (5394): 1711-10.1126/science.282.5394.1711.View ArticlePubMedGoogle Scholar
- Crow KD, Stadler PF, Lynch VJ, Amemiya C, Wagner GP: The "fish-specific" Hox cluster duplication is coincident with the origin of teleosts. Mol Biol Evol. 2006, 23 (1): 121-10.1093/molbev/msj020.View ArticlePubMedGoogle Scholar
- Maertens GN, El Messaoudi-Aubert S, Racek T, Stock JK, Nicholls J, Rodriguez-Niedenfuhr M, Gil J, Peters G: Several distinct polycomb complexes regulate and co-localize on the INK4a tumor suppressor locus. PLoS One. 2009, 4 (7): e6380-10.1371/journal.pone.0006380.PubMed CentralView ArticlePubMedGoogle Scholar
- Ren X, Vincenz C, Kerppola TK: Changes in the distributions and dynamics of polycomb repressive complexes during embryonic stem cell differentiation. Mol Cell Biol. 2008, 28 (9): 2884-10.1128/MCB.00949-07.PubMed CentralView ArticlePubMedGoogle Scholar
- Vincenz C, Kerppola TK: Different polycomb group CBX family proteins associate with distinct regions of chromatin using nonhomologous protein sequences. ProcNatlAcadSciUSA. 2008, 105 (43): 16572-View ArticleGoogle Scholar
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25 (17): 3389-3402. 10.1093/nar/25.17.3389.PubMed CentralView ArticlePubMedGoogle Scholar
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215 (3): 403-410.View ArticlePubMedGoogle Scholar
- Burge C, Karlin S: Prediction of complete gene structures in human genomic DNA. J Mol Biol. 1997, 268 (1): 78-10.1006/jmbi.1997.0951.View ArticlePubMedGoogle Scholar
- Bailey TL, Williams N, Misleh C, Li WW: MEME: discovering and analyzing DNA and protein sequence motifs. Nucleic Acids Res. 2006, W369-373. 10.1093/nar/gkl198. 34 Web ServerGoogle Scholar
- Bailey TL, Gribskov M: Combining evidence using p-values: application to sequence homology searches. Bioinformatics. 1998, 14 (1): 48-10.1093/bioinformatics/14.1.48.View ArticlePubMedGoogle Scholar
- Eddy SR: Profile hidden Markov models. Bioinformatics. 1998, 14 (9): 755-10.1093/bioinformatics/14.9.755.View ArticlePubMedGoogle Scholar
- Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22 (22): 4673-10.1093/nar/22.22.4673.PubMed CentralView ArticlePubMedGoogle Scholar
- Notredame C, Higgins DG, Heringa J: T-Coffee: A novel method for fast and accurate multiple sequence alignment. J Mol Biol. 2000, 302 (1): 205-10.1006/jmbi.2000.4042.View ArticlePubMedGoogle Scholar
- Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997, 25 (24): 4876-10.1093/nar/25.24.4876.PubMed CentralView ArticlePubMedGoogle Scholar
- Crooks GE, Hon G, Chandonia JM, Brenner SE: WebLogo: a sequence logo generator. Genome Res. 2004, 14 (6): 1188-10.1101/gr.849004.PubMed CentralView ArticlePubMedGoogle Scholar
- Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24 (8): 1596-10.1093/molbev/msm092.View ArticlePubMedGoogle Scholar
- McGuffin LJ, Bryson K, Jones DT: The PSIPRED protein structure prediction server. Bioinformatics. 2000, 16 (4): 404-10.1093/bioinformatics/16.4.404.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.