Genome wide expression analysis of CBS domain containing proteins in Arabidopsis thaliana (L.) Heynh and Oryza sativa L. reveals their developmental and stress regulation
© Kushwaha et al; licensee BioMed Central Ltd. 2009
Received: 02 December 2008
Accepted: 28 April 2009
Published: 28 April 2009
In Arabidopsis thaliana (L.) Heynh and Oryza sativa L., a large number of genes encode proteins of unknown functions, whose characterization still remains one of the major challenges. With an aim to characterize these unknown proteins having defined features (PDFs) in plants, we have chosen to work on proteins having a cystathionine β-synthase (CBS) domain. CBS domain as such has no defined function(s) but plays a regulatory role for many enzymes and thus helps in maintaining the intracellular redox balance. Its function as sensor of cellular energy has also been widely suggested.
Our analysis has identified 34 CBS domain containing proteins (CDCPs) in Arabidopsis and 59 in Oryza. In most of these proteins, CBS domain coexists with other functional domain(s), which may indicate towards their probable functions. In order to investigate the role(s) of these CDCPs, we have carried out their detailed analysis in whole genomes of Arabidopsis and Oryza, including their classification, nomenclature, sequence analysis, domain analysis, chromosomal locations, phylogenetic relationships and their expression patterns using public databases (MPSS database and microarray data). We have found that the transcript levels of some members of this family are altered in response to various stresses such as salinity, drought, cold, high temperature, UV, wounding and genotoxic stress, in both root and shoot tissues. This data would be helpful in exploring the so far obscure functions of CBS domain and CBS domain-containing proteins in plant stress responses.
We have identified, classified and suggested the nomenclature of CDCPs in Arabidopsis and Oryza. A comprehensive analysis of expression patterns for CDCPs using the already existing transcriptome profiles and MPSS database reveals that a few CDCPs may have an important role in stress response/tolerance and development in plants, which needs to be validated further through functional genomics.
All eukaryotic genomes sequenced so far, contain a number of genes that encode for proteins whose functions are still unknown. These proteins have been documented to be induced under specific set of conditions and participate in protein-protein interactions and/or sometimes are also associated with mutant phenotypes . These proteins with unknown functions are either called p roteins with o bscure f eatures (POFs) when they contain no previously defined domains/motifs, or p roteins with d efined f eatures (PDFs) when they contain at least one previously defined domain/motif [1, 2]. A protein domain is an evolutionarily conserved unit of protein sequence that can evolve, function and exist independently of the rest of the protein chain. In general, each domain is assumed to perform a specific function. An identical domain may appear in evolutionarily and functionally unrelated proteins, and therefore it is challenging to relate the presence of a domain with overall functionality of the protein. One of the possible approaches to address this important issue is to use the microarray data as a tool to predict the function of proteins having unknown functions, as suggested . Recently, a number of these kinds of proteins have been characterized in Arabidopsis and Oryza using transcriptome studies as well as functional genomics tools, by raising transgenic plants. It has been reported that some of these proteins of unknown function(s) can indeed improve tolerance of transgenic plants to oxidative stress .
To understand the probable mechanism of abiotic stress tolerance in Oryza sativa, we have made an attempt to characterize several unknown members of the stress responsive machinery . A group of these proteins of unknown functions were found to have cystathionine-β-synthase (CBS) domain and were differentially regulated in the contrasting genotypes of rice indicating towards their probable role in salinity tolerance. Thus, we assume that these proteins may be participating in known pathways and networks and/or be involved in basic or specialized processes and also might comprise new and undiscovered pathways.
In CBS protein, C-terminal CBS domain exerts an autoinhibitory effect on the CBS activity, while binding of SAM (S-adenosylmethionine) with CBS domain induces a conformational change which relieves the autoinhibitory effect. Mutation in CBS domains abolish or strongly reduce activation by SAM and cause homocystinuria. CBS domain of γ-subunit of AMPK acts as sensor of cellular energy status and mutations cause a glycogen storage disease, which is clinically expressed as a familial hypertrophic cardiomyopathy (Wolff-Parkinson-White syndrome) [9–12]. Scott et al.  reported that CBS domain of IMPDH binds to ATP in a positive cooperative way and activates IMPDH. ATP binding and activation was abolished by a point mutation which corresponds to the mutation causing retinitis pigmentosa . Earlier research shows that the CBS module in ATP-binding cassette transporter OpuA constitutes the ionic strength sensor whose activity is modulated by the C-terminal anionic tail . In CLCs, function of CBS domain remains unresolved and controversial. However, it has been shown that CBS domains in human CLCs are required for its function and/or expression because mutations in the CBS domain of CLCs cause diseases due to CLC dysfunction [16–21]. However, in plants information available on CLCs is very limited. The first CLC gene (CLC-Nt1) was identified from tobacco , thereafter CLC genes were identified and characterized from Arabidopsis [23–25] and rice , but function of CBS domain in these CLC channel genes has not been resolved.
The availability of complete genome sequence along with the microarray expression and Massively Parallel Signature Sequencing (MPSS) data makes Arabidopsis an ideal plant for study of newly identified protein family . In the present work, we have performed genome wide analysis in the two highly finished plant genomes i.e. Arabidopsis thaliana and Oryza sativa where we have identified and classified the CDCPs based on their conserved features. To further establish their possible involvement in development and abiotic stresses, we have analyzed the expression of genes encoding CDCPs using MPSS database and already existing Arabidopsis microarray database http://www.arabidopsis.org.
Results and Discussion
Identification and classification of CBS domain containing proteins
In the present study, we provide information on the CDCPs in Arabidopsis thaliana and Oryza sativa. For this, the CDCPs were searched against the protein sequence database for Arabidopsis and Oryza, using the HMM profile of CBS domain obtained from Pfam database as described earlier . We propose a systematic classification of all the CDCPs based on their structural features, as there are no pre-existing reports on classification of CDCPs, till date.
CBS domain containing proteins in Oryza sativa and Arabidopsis thaliana. In O. sativa, the genes encoding CBS domain proteins were named according to their classification prefixed by 'Os' while in A. thailana genes were prefixed by 'At'. The alternative spliced forms were post fixed with the alphabets like 'a', 'b' and so on.
Oryza sativa sub japonica one CBS domain containing proteins
13226547 – 13234580
1223593 – 1220510
35086529 – 35087614
35086529 – 35087614
35086529 – 35087614
35086529 – 35087614
35086529 – 35087614
30158802 – 30157593
30158802 – 30157394
30158802 – 30157394
30158802 – 30157394
30158802 – 30157593
30203267 – 30199873
2416552 – 2414325
25775244 – 25777361
23148281 – 23147344
23148281 – 23147344
36063946 – 36070376
3204142 – 3206178
25695370 – 25697808
25640848 – 25642222
34488888 – 34487614
38344981 – 38348540
29546209 – 29548214
21150763 – 21145109
21150763 – 21145109
27815829 – 27811366
32601607 – 32606015
12344418 – 12350373
12349765 – 12355720
14487588 – 14485027
24506315 – 24512370
29898156 – 29893927
29898156 – 29894593
21892144 – 21886597
21892144 – 21886597
21984229 – 21972448
22028109 – 22016078
3179627 – 3182495
19149758 – 19156919
19149758 – 19155394
26601005 – 26605975
1454844 – 1461230
15789767 – 15799283
32304776 – 32309729
32304776 – 32308874
Arabidopsis thaliana one CBS domain containing proteins
Oryza sativa sub japonica two CBS domain containing proteins
23148281 – 23147344
40567292 – 40565692
18564463 – 18559754
19671270 – 19664088
19671270 – 19664088
19668390 – 19664088
19671270 – 19665094
19686702 – 19679520
40484031 – 40480347
40712332 – 40715773
3419765 – 3414996
42704156 – 42708054
3535282 – 3530718
Arabidopsis thaliana two CBS domain containing proteins
Analysis of CBS domain containing proteins
Whole genome analysis of CDCPs in Arabidopsis reveals the presence of 24 genes which code for 25 proteins containing only single CBS domain, while in Oryza 29 genes coding for 46 proteins containing only single CBS domain were found. In case of two CBS domain proteins, 9 genes code for 9 proteins in Arabidopsis, while in Oryza 8 genes code for 13 proteins [Table 1]. These proteins were further classified on the basis of additional domain(s) present within the sequence. The different sub-classes of these proteins are as follows:
A fraction of predicted CBS domain-containing proteins harbor only single CBS domain (PF00571). CDCPs have been reported to have the regulatory functions http://www.sanger.ac.uk/Users/agb/CBS/CBS.html, however the biological significance of these domains remains to be elucidated. These CDCPs can act as binding domains for adenosine derivatives and may regulate the activity of attached enzymatic or other domains . In some cases, these proteins may act as sensors of cellular energy status as they are activated by AMP and inhibited by ATP . Recently, one of the Arabidopsis CBSX proteins (CDCP2) has been purified and crystallized . In case of Arabidopsis, 6 genes were found to encode for 6 CBSX proteins, while in Oryza, a total of 22 CBSX proteins were categorized in this subgroup, which were encoded by 12 genes. In Oryza, all the CBSX genes code for only one protein, except for OsCBSX3, OsCBSX4 and OsCBSX7, which code for 5, 6 and 2 proteins, respectively through the alternate splicing mechanism.
In case of Arabidopsis, this subgroup contains two proteins which are encoded by two genes, while in case of Oryza, two proteins are encoded by a single gene. These proteins contain a domain of unknown function (DUF21) (PF01595) at the N-terminus adjacent to a CBS domain and a CorC_HlyC domain (PF03471) at the C-terminus. DUF21 domain has no known function and is usually present at the N-terminus of the proteins adjacent to a CBS domain. The CorC_HlyC is a transport associated domain and is found at the C-terminus of the proteins. CorC_HlyC domain is also found in magnesium and cobalt efflux protein CorC and some of the Na+/H+ antiporters. The function of this domain is unknown but it might be involved in modulating the transport of ion substrates http://pfam.sanger.ac.uk.
Proteins classified in this subgroup contain a DUF21 domain at the N-terminus, along with a CBS domain. In case of Oryza, this subgroup contains four proteins, which are encoded by three genes, as OsCBSDUF1 gene codes for two proteins whereas, in case of Arabidopsis, this subgroup contains seven proteins, which are encoded by seven genes.
These proteins belong to chloride channel protein (CLC) family which sustains a wide variety of cellular functions, including membrane excitability, synaptic communication, transepithelial transport, cell volume recognition, cell proliferation, and acidification of endosomes and lysosomes. Earlier chimeric and deletion approaches had suggested that CBS domain may influence gating of CLCs. Past studies have suggested that mutation in the CBS domain affects protein-protein interaction within CLC protein subunits as well as between two subunits of the dimer and that they influence the voltage dependence of gating through the common gate . Role of CBS domain in the correct targeting (possibly related to correct folding) of the CLC is in accordance with previous studies with yeast CLC protein where mutation in the CBS domain abolished its localization to the late golgi that is seen upon its overexpression . Earlier experiments have shown that certain mutations in CBS domain affect chloride channel gating but physiologically relevant regulatory role of CBS domains in CLC channels is yet to be established . In an earlier report, 7 CLC genes were identified each from Oryza and Arabidopsis . However, in the present study we have identified total 10 genes encoding 14 CLC proteins in Oryza. Whereas in case of Arabidopsis, our results are in accordance with the previous report as we found 8 CLC proteins encoded by 7 genes.
Proteins containing sugar isomerase (SIS) domain along with a CBS domain have been classified in this subgroup. We have identified one CBSSIS gene encoding only one protein in cases of both, Arabidopsis and Oryza. The SIS domain is widespread and found in all species including, prokaryotic, archaebacterial and eukaryotic proteins. In general SIS domain is found in proteins that have a common role in phosphosugar isomerization as SIS domain functions by binding to the phosphosugars. The SIS domains are also found in a family of bacterial transcriptional regulators  as well as in a family of Escherichia coli iron transporters .
In both Arabidopsis and Oryza, this subgroup is composed of a single gene encoding for a single protein, containing a pentatricopeptide repeat (PPR) motif and a CBS domain. The PPR motif, first described by Small et al.  is a degenerate 35 amino acid sequence closely related to the 34 amino acid tetratricopeptide repeats (TPR) motif. PPR repeat motifs are structural motifs encoded by a large number of genes in plants and other organisms, although the PPR gene family is greatly expanded in plants. It was hypothesized that this could be due to novel functions served by PPR proteins in plants that are not required in other organisms, or that PPR proteins replace functions performed by other genes in other organisms. Also, restoration of male fertility is a plant specific function encoded by PPR genes . A genome-wide analysis of Arabidopsis PPR family proteins has identified 441 members and their further analysis revealed that PPR proteins play constitutive, often essential roles in mitochondria and chloroplast, probably via binding to organeller transcripts . Many plant PPR genes that have been functionally annotated so far, are involved in either male fertility restoration through modification or silencing of cytotoxic mitochondrial transcripts, or post transcriptional modulation of plastid gene expression or plant embryogenesis and other plant developmental processes . Interestingly, among large number of PPR proteins found in plants, only one PPR protein, each in Oryza and Arabidopsis, contains a CBS domain in the same reading frame. Occurrence of CBS domain along with PPR repeat in a protein suggests that this protein might be involved in the various cellular processes by sensing the cell energy status . Structurally characterizing the full length protein (PPR + CBS domain) may shed light on this regulatory mechanism in plants.
Proteins containing inosine-5'-monophosphate dehydrogenase (IMPDH) domain (PF00478) along with CBS domain has been classified in this subgroup. In Oryza, only one CBSIMPDH gene has been identified, which code for two CBSIMPDH proteins while in Arabidopsis, no member of this subgroup has been identified. Interestingly, in Oryza CBSIMPDH proteins, CBS domain lies within the IMPDH domain. IMPDH domain has been identified in the sequence of IMPDH enzyme, which is a key enzyme in the de novo guanosine nucleotide biosynthesis. Scott et al.  have shown that the CBS domain of IMPDH binds to ATP in vitro and that the tetrameric IMPDH binds ATP in a positive, cooperative way. They have also observed that IMPDH was activated by ATP, which was never reported earlier. This observation strongly supported the verity that ATP binding to the CBS domain allosterically activates IMPDH and consequently XMP synthesis. If so, this mechanism would couple the GTP/dGTP biosynthesis to the cellular energy status i.e., high ATP levels .
In this subgroup, proteins containing only two CBS domains have been classified. In case of Oryza, 8 CBSCBS proteins have been identified, which are encoded by 5 different genes. The OsCBSX7 gene, which has been classified in CBSX subgroup, also codes for a protein OsCBSCBS1 containing two CBS domains. The OsCBSCBS4 gene encodes for four proteins through an alternative splicing mechanism whereas, in case of Arabidopsis, AtCBSCBS subgroup contains four genes, which encode four proteins.
These proteins contain Phox/Bemp1 (PB1) domain along with the CBS domain. In case of Oryza, this subgroup possess 4 genes, which encode for 4 proteins, while in Arabidopsis 5 CBSCBSPB genes encode for 5 proteins. The PB1 domains are present in many eukaryotic cytoplasmic signaling proteins. They are dimerization/oligomerization domains present in adaptor and scaffold proteins and kinases that serve to organize platforms that ensure specificity and fidelity during cellular signaling. Recently, a number of studies have provided valuable information on the structural details that govern binding between the different PB1 modules and explain how they direct the formation of different macromolecular signaling complexes . Proteins containing the PB1 domain are conserved in animals, fungi, amoebae and plants, which participate in various biological processes . The function of PB1 domain containing proteins in plants has not been reported so far. Presence of PB1 domain, along with a pair of CBS domains, in a single protein suggests that these proteins might be involved in cellular signaling processes through interaction with other proteins and/or ligands (ATP, ADP or SAM). Characterization of these proteins at physiological, molecular and structural level might shed some light on their functionality.
Chromosomal distribution of CBS domain containing proteins
The distribution of the CDCP genes on the 5 chromosome of Arabidopsis and 9 chromosome of Oryza, at which they were found to be located, is not uniform. Their chromosomal distribution pattern reveals that some CDCP genes are found in clusters on certain chromosomes at various chromosomal regions. Occurrence of cluster of genes belonging to a family at certain chromosomes and chromosomal regions is common. Jain et al.  have reported that among the 19 auxin responsive (SAURs) genes present on chromosome IX, 17 are clustered together at a single locus in tandem. Similarly, genes encoding basic leucine zipper transcription factors (OsbZIP) have also been reported to be present in clusters at certain chromosomes and chromosomal regions . In Oryza, 4 genes encoding two CBS domain containing proteins were found in close vicinity at chromosome I. Percent identity amongst all the four genes clustered was found to be in the range of 30 to 60%.
The sequence information and analysis of the Arabidopsis [57–60] and Oryza  whole genomes have revealed numerous large-scale segmental duplications. Several studies conclude that at least two rounds of duplications might have probably occurred in the Arabidopsis genome, with many losses and rearrangements, leaving a mosaic of "segmental duplications" or "duplication blocks" [61–65]. Analysis of CDCP genes has revealed that some of these genes have been duplicated during the process of evolution in both, Arabidopsis and Oryza. Figure 3a shows the duplicated CDCP genes in Arabidopsis and Figure 3b shows the duplicated CDCP genes in Oryza. In Arabidopsis, AtCBSX4 at chromosome I seem to be duplicated segmentally, and a domain duplication event might have occurred, which have resulted in the appearance of AtCBSCBS4 gene on the same chromosome. AtCBSDUFCH2 gene at chromosome I was observed to be duplicated to chromosome III as AtCBSDUFCH1. These duplicated genes show 85% identity at the nucleotide level and 79% identity at the protein level. AtCBSCLC8, present on the chromosome III seems to be duplicated segmentally to chromosome V as AtCBSCLC6 and at the same time the gene observed to have undergone inversion. AtCBSCBSPB2 present at chromosome II seems to be segmentally duplicated at chromosome III as AtCBSCBSPB3. These genes share 86% identity at the nucleotide level and 67% identity at the protein level. Interestingly in Oryza, it was observed that during the course of duplication the structural domains accumulate variations in order to adapt to new function. For example OsCBSDUF2 gene present on chromosome III got duplicated on chromosome I as OsCBSCBS5. In other instances the CDCP genes were duplicated but retained their structural domains, like OsCBSCBSPB2 present on chromosome XI got duplicated as OsCBSCBSPB4 on chromosome XII. OsCBSCLC9 on chromosome II was found to be duplicated as OsCBSCLC8 on chromosome VIII. OsCBSCBS2 located on chromosome I was observed to be duplicated as OsCBSCBS3 on chromosome IV.
Sequence Analysis of CBS domain containing proteins
Sequence analysis of all the CDCPs shows an overall homology amongst their own respective groups (see Additional files: 1 and 2). Alignment of CBS domain sequences shows that the conserved domain also possesses some variations within themselves. Single CBS domain proteins of Oryza have been observed to have a percent identity of 55% to 60% within themselves, and 30% with that of Arabidopsis except for the AtCBSX1 which showed identity of more than 80% with OsCBSX1 and OsCBSX2. Similar identity pattern was also observed for the OsCBSCLC proteins, which showed identity ranging from 30% to 80% within their own subgroups and also with AtCBSCLC proteins. The OsCBSDUF proteins were observed to have identity ranging from 50% to 60% with the AtCBSDUF proteins. The OsCBSSIS1 protein shares 80% identity with AtCBSSIS1 and OsCBSPPR1 protein shares 60% identity with AtCBSPPR1 protein. OsCBSCBS proteins share 40% to 75% identity among themselves. The two CBS domain proteins also showed 50% to 70% identity with the other two CBS domain members. Apart from having variations, the CBS domain also accumulates large insertions. The large insertion or deletion might have helped the CBS domain proteins to evolve in order to perform specific functions. Results from the sequence alignment suggested that the sequences might have evolved according to the other functional domains present in the sequences which might have led to their specialized functions.
Phylogenetic analysis of CBS domain containing proteins
MPSS analysis of genes encoding CBS domain containing proteins
Analysis of CDCP genes in Arabidopsis and Oryza showed that AtCBSX4 has no expression value in any of the tissues under any condition according to the MPSS database while in Oryza OsCBX12 has no expression values in the MPSS database. In Arabidopsis, genes encoding for proteins with CorC_HlyC functional domain, which has a major role in magnesium and cobalt efflux, along with CBS domain seems to be expressed more in siliqua, leaves and inflorescence. In Oryza, OsCBSDUFCH1 gene was observed to show more expression in leaves and other plant tissues, except roots. The gene AtCBSDUFCH1 is expressed more in inflorescence of ap1-10, ap3-6 mutants than in the normal condition of inflorescence. AtCBSDUFCH2, which is duplicated from AtCBSDUFCH1 on chromosome III, is expressed more in leaves and seed. In case of Arabidopsis, CBSCLC genes showed more expression in callus, roots and leaves overall in the MPSS analysis, while in case of Oryza, OsCBSCLC7 seems to be more expressed in root tissues, while other members were found to be expressed more in aerial part of the plant. It was observed that in TIGR ver 6, the data for OsCBSCLC10 was not available in the MPSS database. The CBSSIS1 gene seems to be expressed more in callus, normal inflorescence and leaves, while CBSPPR1 gene showed more expression in leaves, seed, siliqua and roots in Arabidopsis. In Oryza CBSSIS1, CBSPPR1 and CBSIMPDH1 genes were observed to maintain a constant level of expression in all the plant tissues.
Among the groups of genes having two CBS domains, MPSS analysis showed higher expression for all the CBSCBS genes in all the plant parts in Oryza, whereas in case of Arabidopsis, AtCBSCBS4 does not show expression in any of the plant part considered for the MPSS analysis. AtCBSCBS2 showed enhanced levels of expression in normal inflorescence, leaves and callus. While, AtCBSCBS3 showed expression in other inflorescence such as ap1-10, ap3-6 and agamous mutant conditions but does not show any expression in the normal inflorescence.
In Arabidopsis, AtCBSCBSPB1, AtCBSCBSPB2 and AtCBSCBSPB3 showed expression in MPSS analysis. Whereas in Oryza, OsCBSCBSPB3 was observed to show expression in the meristematic tissues and root while rest of the genes of this subclass were found to have been expressed in leaves also. In Arabidopsis, AtCBSCBSPB1 showed more expression in callus, root and normal inflorescence, while AtCBSCBSPB2 showed expression only in normal inflorescence and siliqua. This data indicates that CDCP genes exhibit a strict tissue specific expression which is also developmentally controlled.
Expression profiles of genes encoding CBS domain containing proteins under various stresses
Thomashow  has shown that model plant Arabidopsis may express as many as hundreds of cold shock proteins under cold stress condition. CDCP genes showed altered expression at 24 hr cold stress in root with respect to the control, while in shoot some of the CDCP genes showed early upregulation within 30 min and 1 hr of cold stress. Some of the CDCP genes also showed up regulation at 12 hr to 24 hr cold stress conditions. AtCBSSIS1 gene was found to be upregulated in shoots under cold and wound stresses, while in roots, upregulation was observed only under osmotic and salt stresses. In roots, AtCBSDUF3 and AtCBSCLC7 showed upregulation in cold stress condition at 12 hour and 24 hour. Under cold stress condition AtCBSCBSPB4 and AtCBSCBSPB5 showed upregulation within 24 hrs in roots but in shoots they showed biphasic upregulation i.e. their expression was upregulated within 30 min of stress, followed by downregulation of expression and it was again upregulated within 24 hr of the stress exposure. High (intense) light stress causes the formation of oxygen radicals in chloroplasts and has the potential to damage them. However, plants are able to respond to this stress and protect chloroplasts by various means, including transcriptional regulation in the nucleus. Although the corresponding signaling pathway is unknown, there have been attempts to study its regulation . When exposed to UV light, CDCP genes showed upregulation at 24 hr exposure in roots (also expressed at early timepoints), while in shoot, UV light exposure from 1 to 3 hrs is found to be sufficient to induce the expression of CDCP genes. Most of the CDCP genes maintain a constant level of expression during all the time period of the UV exposure suggesting that these genes might play crucial role(s) in the light sensing mechanism. The complex responses of plants to wounding have been extensively studied in recent years, and numerous wound-responsive genes have been identified in Arabidopsis . Wounding stress in shoot showed comparable expression at all the time points while in roots CDCP genes showed expression at 1 hr and 24 hr of exposure to stress. When exposed to the genotoxic stress, the cell cycle is halted to gain necessary time for repairing DNA and genes required for repair and protection of other cellular components endangered by the genotoxic stress are activated. Some organisms do respond to the abiotic stresses by way of apoptosis i.e. eliminating the damaged cell . On exposure to bleomycin (genotoxic), CDCP genes showed differential expression in root and shoot at all the time points except for the 6 hr in shoot, where none of the gene showed enhanced expression. This data indicates that the CDCPs might also play role in protecting the cells from the genotoxic stress. Under drought conditions, plants adapt themselves to maintain the cellular homeostasis. It has been observed that sensitive plants suffer rapid irreversible cell damage, essentially due to degradation of their membranes . Membranes are main targets of degradative processes induced by drought and it has been shown that, under water stress, a decrease in membrane lipid content is correlated to the inhibition of lipid biosynthesis [72, 73] and a stimulation of lipolytic and peroxidative activities [74–77]. When observed under drought conditions, all CDCP genes showed comparable expression at different time points in root, while in shoot approximately all the CDCP genes were found to be upregulated at 24 hr drought stress, which indicates that plant takes some time to adapt to the drought conditions, suggesting that CDCPs might also help plants to adapt to drought stress. Salt and osmotic stress result in transient increase in a cytoplasmic free calcium concentration, and disruption of this calcium gradient affects downstream gene expression . Earlier, some attempts have been made to identify the osmotic stress related genes in Arabidopsis . Under the condition of osmotic stress, expression of some of the CDCP genes was found to be altered with respect to the control in 30 min and 12 hr stress in roots, while in shoots these genes showed higher expression at 1 hr and 24 hr of osmotic stress. Under salt stress, CDCP genes were found to be up regulated at 1, 6 and 12 hr time points in roots, while in shoots these genes were expressed more in 1 hr and 24 hr of salt stress. The analysis of expression of these genes gives an idea that CBS domain containing proteins might play an important role in drought, salt and osmotic stress response/tolerance. Oxidative stress, arising from an imbalance in the generation and removal of reactive oxygen species (ROS) such as hydrogen peroxide (H2O2), is a challenge faced by all aerobic organisms [80, 81]. Although ROS were originally considered to be detrimental to cells, it is now widely recognized that redox regulation involving ROS is a key factor modulating cellular activities . Oxidative stress seems to induce the expression of CDCP genes at 3 hr of stress in roots while in shoot almost all the CDCP genes, which were differentially expressed showed upregulation under oxidative stress.
The cystathionine β-synthase (CBS) domain containing proteins (CDCPs) comprise of a large superfamily of evolutionarily conserved proteins which are present in all kingdoms of life. In plants, CDCPs were never reported and hence their occurrence and possible function is still a mystery. The present study has identified CDCPs in Arabidopsis thaliana and Oryza sativa at the whole genome level. We also propose their classification based on the presence of CBS domain and various other functional domain(s) present in them. The chromosomal position of genes encoding CDCPs gives an insight into their distribution in the whole genome of Arabidopsis and Oryza. In addition, MPSS analysis reveals the differential expression of these genes in various tissues and parts under stress conditions in these plant species. Moreover, microarray expression data gives an overview that the CDCPs might play an important role in stress response/tolerance in Arabidopsis under various stress conditions. Thus, it can be concluded that CDCP genes exhibit strict stress and developmentally regulated expression patterns in Arabidopsis and Oryza. Certainly, there is a need to functionally validate the role of these CDCPs which exhibit "induced expression patterns" under multiple stresses. Another major finding of this work is the observed expansion of CDCP gene family in Oryza as compared to Arabidopsis. This expansion has been noticed in Oryza primarily because of presence of relatively more cases of alternative splicing as well as gene duplications, possibly indicating towards the significant involvement of CDCPs in development and stress responses. This study will be helpful in commenting on the structural and functional aspects of these unexplored proteins with respect to their roles under various abiotic stresses. Tools of functional genomics based on transgenic approach can further help in testing the "candidature" of these proteins with defined features towards improving stress tolerance in crop plants. The leads provided here would also pave the way for elucidating the precise role of individual CDCP proteins in plants.
Search and analysis of CBS domain containing proteins
The CDCP protein sequences were retrieved from whole genome sequences of Arabidopsis thaliana (TIGR version 5 and TAIR 8) and Oryza sativa (TIGR version 6). The sequences obtained from TIGR were also cross checked with TAIR8 sequences for any new instances of CDCP proteins in Arabidopsis. The domain structure of the CBS was used to identify and classify the CDCP proteins using the TIGR A. thaliana genome sequence version 5.0 and O. sativa version 6.0. Profiles unique to the CBS domain were used to screen all predicted proteins using the HMMER software (version 2.3.2; http://hmmer.wustl.edu/. These unique profiles are for Pfam HMM of CBS domain (accession no. PF00571) . We have used these profiles as default parameters in the hmmsearch program of the HMMER package . All the significant hits having positive scores were selected for classification and were then examined individually for accessory domains that are usually present. This was accomplished by searching the sequences against the Pfam database (version 21.0) to map the known domains, such as Voltage_CLC, CorC_HlyC, SIS, PB1, DUF21 and IMPDH1 (see Fig. 2 for a schematic representation). This step, besides mapping accessory domains to the CBS provides the basis of classifying these proteins. The percentage values of the sequence similarity and identity within the groups were obtained using BLAST. The respective CDCP protein sequences were also cross checked from the TAIR8 database for presence of any additional alternative splice forms in Arabidopsis.
For convenience, we have assigned name to protein sequence according to the domains observed in the respective protein sequence, where At is for A rabidopsis t haliana and Os is for O ryza s ativa. The chromosomal positions of the CDCP genes were obtained from TIGR version 6 for O. sativa and TIGR version 5, TAIR version 8 for A. thaliana, and plotted using Dia version 0.96.1, an open source gtk+ based diagram creation program.
Sequence and phylogenetic analysis
Multiple alignment analyses were performed using MUSCLE (version 3.6) program . The unrooted parsimonious tree was plotted using protpars and drawtree program of phylip package (version 3.66)  using default parameters. Similarity and identity values among the protein sequences were analyzed using standalone BLAST (version 2.2.15) . The figures for final alignment were prepared using Jalview multiple sequence alignment editor .
Expression analysis using MPSS database
Expression evidence from MPSS (Massively Parallel Signature Sequencing) tags was determined from the Arabidopsis MPSS project http://mpss.udel.edu/at/ mapped to Arabidopsis and Oryza gene models. The signature was considered to be significant if it uniquely identifies an individual gene and shows perfect match (100% identity over 100% length of the tag). The normalized abundance (tags per million, tpm) of these signatures for a given gene in a given library represents a quantitative estimate of expression of that gene.
The description of MPSS libraries in A. thaliana is: CAF – Callus, hardened tissue that forms to protect the exposed areas of cuttings; INF – Inflorescence, part of the plant that consists of flower bearing stalks; LEF – Leaves – 21 day, untreated, classic MPSS; ROF – Root – 21 day, untreated, classic MPSS; SIFSilique (Seedpod) – 24 to 48 hr post-fertilization, classic MPSS; AP1 – ap1-10 inflorescence (part of the plant that consists of flower bearing stalks) – mixed stage, immature buds; AP3 – ap3-6 inflorescence (part of the plant that consists of flower bearing stalks) – mixed stage, immature buds; AGM – agamous inflorescence (part of the plant that consists of flower bearing stalks) – mixed stage, immature buds; INS – Inflorescence – mixed stage, immature buds; ROS – Root – 21 day, untreated; SAP – sup/ap1 inflorescence – mixed stage, immature buds; S04 – Leaves, 4 hr after salicylic acid treatment; S52 – Leaves, 52 hr after salicylic acid treatment; LES – Leaves – 21 day, untreated; GSE – Germinating seedlings; CAS – Callus (hardened tissue that forms to protect the exposed areas of cuttings) – actively growing, signature MPSS; SIS – Silique (Seedpod) – 24 to 48 hr post-fertilization, signature MPSS.
The description of MPSS libraries in O. sativa is: NYR -14 days – Young Roots, NRA – 60 days – Mature Roots – Replicate A, NRB – 60 days – Mature Roots – Replicate B, NGD – 10 days – Germinating seedlings grown in dark, NST – 60 days – Stem, NYL – 14 days – Young leaves, NLA – 60 days – Mature Leaves – Replicate A, NLB – 60 days – Mature Leaves – Replicate B, NLC – 60 days – Mature Leaves – Replicate C, NLD – 60 days – Mature Leaves – Replicate D, NME – 60 days – Crown vegetative meristematic tissue, NPO – Mature Pollen, NOS – Ovary and mature stigma, NIP – 90 days – Immature panicle, NGS – 3 days – Germinating seed, NCA – 35 days – Callus, NSR – 14 days – Young roots stressed in 250 mM NaCl for 24 h, NSL – 14 days – Young leaves stressed in 250 mM NaCl for 24 h, NDR – 14 days – Young roots stressed in drought for 5 days, NDL – 14 days – Young leaves stressed in drought for 5 days, NCR– 14 days – Young roots stressed in 4 C cold for 24 h, NCL – 14 days – Young leaves stressed in 4 C cold for 24 h, 9RO – Roots, I9RR – Roots – Replicate, 9LA – Leaves, 9LB – Leaves – Replicate, 9LC – Leaves, 9LD – Leaves – Replicate, 9ME – Meristematic Tissue, FRO – F1 Hybrid 60 days Mature Root, FRR – F1 Hybrid 60 days Mature Root-Repl, FLA – F1 Hybrid 60 days Mature Leaf Replicate A, FLB – F1 Hybrid 60 days Mature Leaf Replicate B, FLC – F1 Hybrid 60 days Mature Leaf Replicate C, FLD – F1 Hybrid 60 days Mature Leaf Replicate D, FME – F1 Hybrid 60 days Meristematic tissue, PSC – rice developing seeds, 6 days old cypress high milling (99–1710), PSI – rice developing seeds,6 days old, Ilpumbyeo – High Taste, PSL – rice developing seeds, 6 days old, LaGrue-Low Milling, PSN – rice developing seed, 6 days old, Nipponbare-Grain quality control, PSY – rice developing seeds, 6 days old, Expression values obtained from MPSS database for respective CDCP genes were used for making the heatmap using gplots package of open source R software.
Expression analysis using microarrays
The microarray data of the selected gene expression for various abiotic stress conditions such as cold, UV, wound, genotoxic stress, drought, osmotic, salt and oxidative stress were retrieved from the Arabidopsis Information Resource . The datasets obtained were corresponding to root and shoot tissues at different time sets of stress namely 30 min, 1 hr, 3 hr, 6 hr, 12 hr and 24 hr. Fold increase in transcript abundance under stress conditions were calculated with respect to their controls. The transcript abundance with respect to the control was calculated using PERL scripts. The hierarchical clustering analysis and the heatmaps were made using gplots package of R software.
Authors would like to thank Jawaharlal Nehru University (JNU), International Center for Genetic Engineering and Biotechnology (ICGEB), Department of Science and Technology and Department of Biotechnology, Govt. of India for financial support. HRK would like to thank Council for Scientific and Industrial Research (CSIR), New Delhi, India for Research Fellowship.
- Gollery M, Harper J, Cushman J, Mittler T, Girke T, Zhu JK, Bailey-Serres J, Mittler R: What makes species unique? The contributions of proteins with obscure feature. Genome Biol. 2006, 7: 757-10.1186/gb-2006-7-7-r57.View Article
- Gollery M, Harper J, Cushman J, Mittler T, Mittler R: POFs: what we don't know can hurt us. Trends Plant Sci. 2007, 12: 492-496. 10.1016/j.tplants.2007.08.018.View ArticlePubMed
- Horan K, Jang C, Bailey-Serres J, Mittler R, Shelton C, Harper JF, Zhu JK, Cushman JC, Gollery M, Girke T: Annotating genes of known and unknown functions by large-scale co-expression analysis. Plant Physiol. 2008, 147: 41-57. 10.1104/pp.108.117366.PubMed CentralView ArticlePubMed
- Luhua S, Ciftci-Yilmaz S, Harper J, Cushman J, Mittler R: Enhanced tolerance to oxidative stress in transgenic Arabidopsis plants expressing proteins of unknown function. Plant Physiol. 2008, 148: 280-92. 10.1104/pp.108.124875.PubMed CentralView ArticlePubMed
- Kumari S, Sabharwal VP, Kushwaha HR, Sopory SK, Singla-Pareek SL, Pareek A: Transcriptome map for seedling stage specific salinity stress response indicates a specific set of genes as candidate for salinetolerance in Oryza sativa L. Func Integr Genomics. 2008, 9: 109-123. 10.1007/s10142-008-0088-5.View Article
- Ignoul S, Eggermont J: CBS domains: structure, function, and pathology in human proteins. Am J Physiol Cell Physiol. 2005, 289: C1369-1378. 10.1152/ajpcell.00282.2005.View ArticlePubMed
- Bateman A: The structure of a domain common to archaebacteria and homocystinuria disease proteins. Trends Biochem Sci. 1997, 22: 12-13. 10.1016/S0968-0004(96)30046-7.View ArticlePubMed
- Aitken SM, Kirsch JF: The enzymology of cystathionine biosynthesis: strategies for the control of substrate and reaction specificity. Arch Biochem Biophys. 2005, 433: 166-175. 10.1016/j.abb.2004.08.024.View ArticlePubMed
- Arad M, Benson DW, Perez-Atayde AR, McKenna WJ, Sparks EA, Kanter RJ, McGarry K, Seidman JG, Seidman CE: Constitutively active AMP kinase mutations cause glycogen storage disease mimicking hypertrophic cardiomyopathy. J Clin Invest. 2002, 109: 357-362.PubMed CentralView ArticlePubMed
- Blair E, Redwood C, Ashrafian H, Oliveira M, Broxholme J, Kerr B, Salmon A, Ostman-Smith I, Watkins H: Mutations in the γ2 subunit of AMP-activated protein kinase cause familial hypertrophic cardiomyopathy: evidence for the central role of energy compromise in disease pathogenesis. Hum Mol Genet. 2001, 10: 1215-1220. 10.1093/hmg/10.11.1215.View ArticlePubMed
- Gollob MH, Green MS, Tang AS, Gollob T, Karibe A, Ali Hassan AS, Ahmad F, Lozado R, Shah G, Fananapazir L, et al: Identification of a gene responsible for familial Wolff-Parkinson-White syndrome. N Engl J Med. 2001, 344: 1823-1831. 10.1056/NEJM200106143442403.View ArticlePubMed
- Gollob MH, Seger JJ, Gollob TN, Tapscott T, Gonzales O, Bachinski L, Roberts R: Novel PRKAG2 mutation responsible for the genetic syndrome of ventricular preexcitation and conduction system disease with childhood onset and absence of cardiac hypertrophy. Circulation. 2001, 104: 3030-3033. 10.1161/hc5001.102111.View ArticlePubMed
- Scott JW, Hawley SA, Green KA, Anis M, Stewart G, Scullion GA, Norman DG, Hardie DG: CBS domains form energy-sensing modules whose binding of adenosine ligands is disrupted by disease mutations. J Clin Invest. 2004, 113: 274-284.PubMed CentralView ArticlePubMed
- Bowne SJ, Sullivan LS, Blanton SH, Cepko CL, Blackshaw S, Birch DG, Hughbanks-Wheaton D, Heckenlively JR, Daiger SP: Mutations in the inosine monophosphate dehydrogenase 1 gene (IMPDH1) cause the RP10 form of autosomal dominant retinitis pigmentosa. Hum Mol Genet. 2002, 11: 559-568. 10.1093/hmg/11.5.559.PubMed CentralView ArticlePubMed
- Mahmood NABN, Biemans-Oldehinkel E, Patzlaff JS, Schuurman-Wolters GK, Poolman B: Ion specificity and ionic strength dependence of the osmoregulatory ABC transporter OpuA. J Biol Chem. 2006, 281: 29830-29839. 10.1074/jbc.M604907200.View ArticlePubMed
- Cleiren E, Benichou O, Van Hul E, Gram J, Bollerslev J, Singer FR, Beaverson K, Aledo A, Whyte MP, Yoneyama T, deVernejoul MC, Van Hul W: Albers-Schonberg disease (autosomal dominant osteopetrosis, type II) results from mutations in the ClCN7 chloride channel gene. Hum Mol Genet. 2001, 10: 2861-2867. 10.1093/hmg/10.25.2861.View ArticlePubMed
- Haug K, Warnstedt M, Alekov AK, Sander T, Ramirez A, Poser B, Maljevic S, Hebeisen S, Kubisch C, Rebstock J, et al: Mutations in CLCN2 encoding a voltage-gated chloride channel are associated with idiopathic generalized epilepsies. Nat Genet. 2003, 33: 527-532. 10.1038/ng1121.View ArticlePubMed
- Konrad M, Vollmer M, Lemmink HH, Heuvel van den LP, Jeck N, Vargas-Poussou R, Lakings A, Ruf R, Deschenes G, Antignac C, et al: Mutations in the chloride channel gene CLCNKB as a cause of classic Bartter syndrome. J Am Soc Nephrol. 2000, 11: 1449-1459.PubMed
- Kornak U, Kasper D, Bosl MR, Kaiser E, Schweizer M, Schulz A, Friedrich W, Delling G, Jentsch TJ: Loss of the ClC-7 chloride channel leads to osteopetrosis in mice and man. Cell. 2001, 104: 205-215. 10.1016/S0092-8674(01)00206-9.View ArticlePubMed
- Lloyd SE, Gunther W, Pearce SH, Thomson A, Bianchi ML, Bosio M, Craig IW, Fisher SE, Scheinman SJ, Wrong O, Jentsch TJ, Thakker RV: Characterisation of renal chloride channel, CLCN5, mutations in hypercalciuric nephrolithiasis (kidney stones) disorders. Hum Mol Genet. 1997, 6: 1233-1239. 10.1093/hmg/6.8.1233.View ArticlePubMed
- Pusch M: Myotonia caused by mutations in the muscle chloride channel gene CLCN1. Hum Mutat. 2002, 19: 423-434. 10.1002/humu.10063.View ArticlePubMed
- Lurin C, Güclü J, Cheniclet C, Carde JP, Barbier-Brygoo H, Maurel C: CLC-Nt1, a putative chloride channel protein of tobacco, co-localizes with mitochondrial membrane markers. Biochem J. 2000, 348: 291-295. 10.1042/0264-6021:3480291.PubMed CentralView ArticlePubMed
- Hechenberger M, Schwappach B, Fischer WN, Frommer WB, Jentsch TJ, Steinmeyer K: A family of putative chloride channels from Arabidopsis and functional complementation of a yeast strain with aCLC gene disruption. J Biol Chem. 1996, 271: 33632-33638. 10.1074/jbc.271.52.33632.View ArticlePubMed
- Geelen D, Lurin C, Bouchez D, Frachisse JM, Lelievre F, Courtial B, Barbier-Brygoo H, Maurel C: Disruption of putative anion channel gene AtCLC-a in Arabidopsis suggests a role in the regulation of nitrate content. Plant J. 2000, 21: 259-267. 10.1046/j.1365-313x.2000.00680.x.View ArticlePubMed
- Marmagne A, Vinauger-Douard M, Monachello D, Longevialle AF, Charon C, Allot M, Rappaport F, Wollman FA, Barbier-Brygoo H, Ephritikhine G: Two members of the Arabidopsis CLC (chloride channel) family, AtCLCe and AtCLCf, are associated with thylakoid and Golgi membranes, respectively. J Exp Bot. 2007, 58: 3385-3393. 10.1093/jxb/erm187.View ArticlePubMed
- Nakamura A, Fukuda A, Sakai S, Tanaka Y: Molecular cloning, functional expression and subcellular localization of two putative vacuolar voltage-gated chloride channels in Rice (Oryza sativa L.). Plant Cell Physiol. 2006, 47: 32-42. 10.1093/pcp/pci220.View ArticlePubMed
- Arabidopsis Genome Initiative: Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000, 408: 796-815. 10.1038/35048692.View Article
- Pareek A, Singh A, Kumar M, Kushwaha HR, Lynn AM, Singla-Pareek SL: Whole genome analysis of Oryza sativa L. reveals similar architecture of two-component-signaling-machinery with Arabidopsis. Plant Physiol. 2006, 142: 380-397. 10.1104/pp.106.086371.PubMed CentralView ArticlePubMed
- Gibson MM, Bagga DA, Miller CG, Maguire ME: Magnesium transport in Salmonella typhimurium : the influence of new mutations conferring Co2+ resistance on the CorA Mg2+ transport system. Mol Microbiol. 1991, 5: 2753-2762. 10.1111/j.1365-2958.1991.tb01984.x.View ArticlePubMed
- ter Huurne AA, Muir S, van Houten M, Zeijst van der BA, Gaastra W, Kusters JG: Characterization of three putative Serpulina hyodysenteriae hemolysins. Micro Patho. 1994, 16: 269-282. 10.1006/mpat.1994.1028.View Article
- Mindell JA, Maduke M: ClC chloride channels. Genome Biol. 2001, 2: 1-3003. 10.1186/gb-2001-2-2-reviews3003.View Article
- Zhang J, George AL, Griggs RC, Fouad GT, Roberts J, Kwiecinski H, Connolly AM, Ptacek LJ: Mutations in the human skeletal muscle chloride channel gene (CLCN1) associated with dominant and recessive myotonia congenita. Neurology. 1996, 47: 993-998.View ArticlePubMed
- Schmidt-Rose T, Jentsch TJ: Reconstitution of functional voltage-gated chloride channels from complementary fragments of CLC-1. J Biol Chem. 1997, 272: 20515-20521. 10.1074/jbc.272.33.20515.View ArticlePubMed
- Bateman A: The SIS domain: a phosphosugar-binding domain. Trends Biochem Sci. 1999, 24: 94-95. 10.1016/S0968-0004(99)01357-2.View ArticlePubMed
- Teplyakov A, Obmolova G, Badet-Denisot MA, Badet B, Polikarpov I: Involvement of the C terminus in intramolecular nitrogen channeling in glucosamine 6-phosphate synthase: evidence from a 1.6 Å crystal structure of the isomerase domain. Structure. 1998, 6: 1047-1055. 10.1016/S0969-2126(98)00105-1.View ArticlePubMed
- Small ID, Peeters I: The PPR motif – a TPR-related motif prevalent in plant organellar proteins. Trends Biochem Sci. 2000, 25: 45-47. 10.1016/S0968-0004(99)01520-0.View Article
- Barkan A, Walker M, Nolasco M, Johnson D: A nuclear mutation in maize blocks the processing and translation of several chloroplast mRNAs and provides evidence for the differential translation of alternative mRNA forms. EMBO J. 1994, 13: 3170-3181.PubMed CentralPubMed
- Manthey GM, McEwen JE: The product of the nuclear genePET309 is required for translation of mature mRNA and stability or production of intron-containing RNAs derived from the mitochondrial COX1 locus of Saccharomyces cerevisiae. EMBO J. 1995, 14: 4031-4043.PubMed CentralPubMed
- English JM, Vanderbilt CA: Isolation of MEK5 and differential expression of alternatively spliced forms. J Biol Chem. 1995, 270: 28897-28902. 10.1074/jbc.270.48.28897.View ArticlePubMed
- Zhou G, Bao ZQ, Dixon JE: Components of a new human protein kinase signal transduction pathway. J Biol Chem. 1995, 270: 12665-12669.View ArticlePubMed
- Ponting CP: Novel domains in NADPH oxidase subunits, sorting nexins, and PtdIns 3-kinases: binding partners of SH3 domains?. Protein Sci. 1996, 5: 2353-2357. 10.1002/pro.5560051122.PubMed CentralView ArticlePubMed
- Campbell MA, Haas BJ, Hamilton JP, Mount SM, Buell CR: Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis. BMC Genomics. 2006, 7: 327-10.1186/1471-2164-7-327.PubMed CentralView ArticlePubMed
- Sintchak MD, Fleming MA, Futer O, Raybuck SA, Chambers SP, Caron PR, et al: Structure and mechanism of inosine monophosphate dehydrogenase in complex with the immunosuppressent mycophenolic acid. Cell. 1996, 85: 921-930. 10.1016/S0092-8674(00)81275-1.View ArticlePubMed
- Hardie DG, Hawley SA: AMP-activated protein kinase: the energy change hypothesis revisited. Bioessay. 2001, 23: 1112-1119. 10.1002/bies.10009.View Article
- Jeong BC, Yoo KS, Jung KW, Shin JS, Song HK: Purification, crystallization and preliminary X-ray diffraction analysis of a cystathionine b-synthase domain-containing protein, CDCP2, from Arabidopsis thaliana. Acta Crystallographica. 2008, F64: 825-827.
- Estévez R, Pusch M, Ferrer-Costa C, Orozco M, Jentsch TJ: Functional and structural characteristic conservation of BCS domains from CLC chloride channels. J Physiol. 2004, 557: 363-378. 10.1113/jphysiol.2003.058453.PubMed CentralView ArticlePubMed
- Schwappach B, Stobrawa S, Hechenberger M, Steinmeyer K, Jentsch TJ: Golgi localization and functionally important domain in the NH2 and COOH terminus of the yeast CLC putative chloride channels Gef1p. J Biol Chem. 1998, 273: 15110-15118. 10.1074/jbc.273.24.15110.View ArticlePubMed
- Dhedhiou CJ, Golldack D: Salt-dependent regulation of chloride channel transcripts in rice. Plant Science. 2006, 170: 793-800. 10.1016/j.plantsci.2005.11.014.View Article
- Ouyang Z, Isaacson R: Identification of a novel regulatorfor the Escherichia coli fit iron transport system. Open Microbio J. 2008, 2: 94-99. 10.2174/1874285800802010094.View Article
- Wise RP, Pring DR: Nuclear-mediated mitochondrial gene regulation and male fertility in higher plants: Light at the end of the tunnel?. Proc Natl Acad Sci. 2002, 99: 10240-10242. 10.1073/pnas.172388899.PubMed CentralView ArticlePubMed
- Lurin C, Andrés C, Aubourg S, Bellaoui M, Bitton F, Bruyère C, Caboche M, Debast C, Gualberto J, Hoffmann B, Lecharny A, Le Ret M, Martin-Magniette ML, Mireau H, Peeters N, Renou JP, Szurek B, Taconnat L, Small I: Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins reveals their essential role in organelle biogenesis. Plant Cell. 2004, 16: 2089-2103. 10.1105/tpc.104.022236.PubMed CentralView ArticlePubMed
- Saha D, Prasad AM, Srinivasan R: Pentatricopeptide repeat proteins and their emerging roles in plants. Plant Physiol Biochem. 2007, 45: 521-534. 10.1016/j.plaphy.2007.03.026.View ArticlePubMed
- Moscat J, Diaz-Meco MT, Albert A, Campuzano S: Cell signaling and function organized by PB1 domain interactions. Mol Cell. 2006, 23: 631-640. 10.1016/j.molcel.2006.08.002.View ArticlePubMed
- Sumimoto H, Kamakura S, Ito T: Structure and function of the PB1 domain, a protein interaction module conserved in animals, fungi, amoebas, and plants. Sci STKE. 2007, 401: re6-10.1126/stke.4012007re6.
- Jain M, Tyagi AK, Khurana JP: Genome-wide analysis, evolutionary expansion, and expression of early auxin-responsive SAUR gene family in rice (Oryza sativa). Genomics. 2006, 88: 360-371. 10.1016/j.ygeno.2006.04.008.View ArticlePubMed
- Nijhawan A, Jain M, Tyagi AK, Khurana JP: Genomic survey and gene expression analysis of the basic leucine zipper transcription factor family in rice. Plant Physiol. 2008, 146: 333-350. 10.1104/pp.107.112821.PubMed CentralView ArticlePubMed
- Blanc G, Barakat A, Guyot R, Cooke R, Delseny M: Extensive duplication and reshuffling in the Arabidopsis genome. Plant Cell. 2000, 12: 1093-1101. 10.1105/tpc.12.7.1093.PubMed CentralView ArticlePubMed
- Simillion C, Vandepoele K, Van Montagu MC, Zabeau M, Peer Van de Y: The hidden duplication past of Arabidopsis thaliana. Proc Natl Acad Sci. 2002, 99: 13627-13632. 10.1073/pnas.212522399.PubMed CentralView ArticlePubMed
- Vision TJ, Brown DG, Tanksley SD: The origins of genomic duplications in Arabidopsis. Science. 2000, 290: 2114-2117. 10.1126/science.290.5499.2114.View ArticlePubMed
- Blanc G, Hokamp K, Wolfe KH: A recent polyploidy superimposed on older large-scale duplications in the Arabidopsis genome. Genome Res. 2003, 13: 137-144. 10.1101/gr.751803.PubMed CentralView ArticlePubMed
- Wang X, Shi X, Hao B, Ge S, Luo J: Duplication and DNA segmental loss in the rice genome: implications for diploidization. New Phytol. 2005, 165: 937-946. 10.1111/j.1469-8137.2004.01293.x.View ArticlePubMed
- Ermolaeva MD, Wu MM, Eisen JA, Salzberg SL: The age of the Arabidopsis thaliana genome duplication. Plant Mol Biol. 2003, 51: 859-866. 10.1023/A:1023001130337.View ArticlePubMed
- Bowers JE, Chapman BA, Rong J, Paterson AH: Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature. 2003, 422: 433-438. 10.1038/nature01521.View ArticlePubMed
- Zhang L, Vision TJ, Gaut BS: Patterns of nucleotidesubstitution among simultaneously duplicated gene pairs in Arabidopsis thaliana. Mol Biol Evol. 2002, 19: 1464-1473.View ArticlePubMed
- Ziolkowski PA, Blanc G, Sadowski J: Structural divergence of chromosomal segments that arose from successive duplication events in the Arabidopsis genome. Nucleic Acids Res. 2003, 31: 1339-1350. 10.1093/nar/gkg201.PubMed CentralView ArticlePubMed
- Brenner S, Johnson M, Bridgham J, Golda G, Lloyd DH, Johnson D, Luo S, McCurdy S, Foy M, Ewan M, et al: Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays. Nat Biotechnol. 2000, 18: 630-634. 10.1038/76469.View ArticlePubMed
- Thomashow MF: Arabidopsis thaliana as a model for studying mechanisms of plant cold tolerance. Arabidopsis. Edited by: Meyerowitz E, Somerville C. 1994, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 807-834.
- Kimura M, Yoshizumi T, Manabe K, Yamamoto YY, Matsui M: Arabidopsis transcriptional regulation by light stress via hydrogen peroxide-dependent and -independent pathways. Genes Cells. 2001, 6: 607-617. 10.1046/j.1365-2443.2001.00446.x.View ArticlePubMed
- Reymond P, Weber H, Damond M, Farmer EE: Differential gene expression in response to mechanical wounding and insect feeding in Arabidopsis. Plant Cell. 2000, 12: 707-719. 10.1105/tpc.12.5.707.PubMed CentralView ArticlePubMed
- Heribert Hirt, Kazuo Shinozaki: Plant Responses to Abiotic Stress. Topics in Current Genetics. 2004, Springer, 4:
- Vieira da Silva J, Naylor AW, Kramer J: Some ultrastructural and enzymatic effects of water stress in cotton (Gossypium L.) leaves. Proc Natl Acad Sci. 1974, 71: 3243-3247. 10.1073/pnas.71.8.3243.View Article
- Pham-Thi AT, Borrel-Flood C, Vieira da Silva J, Justin AM, Mazliak P: Effects of drought on [1-14C]-oleic and [1-14C]-linoleic acid desaturation in cotton leaves. Physiol Plant. 1987, 69: 147-150. 10.1111/j.1399-3054.1987.tb01958.x.View Article
- Monteiro de Paula F, Pham Thi AT, Vieira dailva J, Justin AM, Demandre C, Mazliak P: Effects of water stress on the molecular species composition of polar lipids from Vigna unguiculata L. leaves. Plant Sci. 1990, 66: 185-193. 10.1016/0168-9452(90)90203-Z.View Article
- Ferrari-Iliou R, d'Arcy-Lameta A, Pham-Thi AT, Zuily-Fodil Y, Mazliak P: Effect of drought on photodynamic peroxidation of leaf total lipophilic extracts. Phytochem. 1994, 37: 1237-1243. 10.1016/S0031-9422(00)90391-1.View Article
- Sahsah Y, Campos P, Gareil M, Zuily-Fodil Y, Pham-Thi AT: Enzymatic degradation of polar lipids in Vigna unguiculata leaves and influence of drought stress. Physiol Plant. 1998, 104: 577-586. 10.1034/j.1399-3054.1998.1040409.x.View Article
- El Maarouf H, Zuily-Fodil Y, Gareil M, d'Arcy-Lameta A, Pham-Thi AT: Enzymatic activity and gene expression under water stress of phospholipase D in two cultivars of Vigna unguiculata (L.). Plant Mol Biol. 1999, 39: 1257-1265. 10.1023/A:1006165919928.View ArticlePubMed
- Matos AR, d'Arcy-Lameta A, França M, Petres S, Edelman L, Kader J, Zuily-Fodil Y, Pham-Thi AT: A novel patatin-like gene stimulated by drought stress encodes a galactolipid acyl hydrolase. FEBS Lett. 2001, 491: 188-192. 10.1016/S0014-5793(01)02194-9.View ArticlePubMed
- Knight H, Trewavas AJ, Knight MR: Calcium signalling in Arabidopsis thaliana responding to drought and salinity. Plant J. 1997, 12: 1067-1078. 10.1046/j.1365-313X.1997.12051067.x.View ArticlePubMed
- Xiong L, Lee H, Ishitani M, Zhu JK: Regulation of Osmotic Stress-responsive Gene Expression by the LOS6/ABA1 Locus in Arabidopsis. J Biol Chem. 2002, 277: 8588-8596. 10.1074/jbc.M109275200.View ArticlePubMed
- Finkel T, Holbrook NJ: Oxidants, oxidative stress and the biology of ageing. Nature. 2000, 408: 239-247. 10.1038/35041687.View ArticlePubMed
- Allen RG, Tresini M: Oxidative stress and gene regulation. Free Radic Biol Med. 2000, 28: 463-499. 10.1016/S0891-5849(99)00242-7.View ArticlePubMed
- Dat JF, Lopez-Delgado H, Foyer CH, Scott IM: Parallel changes in H2O2 and catalase during thermotolerance induced by salicylic acid or heat acclimation in mustard seedlings. Plant Physiol. 1998, 116: 1351-1357. 10.1104/pp.116.4.1351.PubMed CentralView ArticlePubMed
- Finn RD, Mistry J, Schuster-Bockler B, Griffiths-Jones S, Hollich V, Lassmann T, Moxon S, Marshall M, Khanna A, Durbin R, Eddy SR, Sonnhammer EL, Bateman A: Pfam: clans, web tools and services. Nucl Acids Res. 2006, 34: 247-251. 10.1093/nar/gkj149.View Article
- HMMER. [http://hmmer.janelia.org]
- Edgar , Robert C: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32: 1792-1797. 10.1093/nar/gkh340.PubMed CentralView ArticlePubMed
- Felsenstein J: PHYLIP (Phylogeny Inference Package). version 3.6. Distributed by the author. 2005, Department of Genome Sciences, University of Washington, Seattle
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410.View ArticlePubMed
- Clamp M, Cuff J, Searle SM, Barton GJ: The Jalview Java Alignment Editor. Bioinformatics. 2004, 20: 426-427. 10.1093/bioinformatics/btg430.View ArticlePubMed
- Kilian J, Whitehead D, Horak J, Wanke D, Weinl S, Batistic O, D'Angelo Bornberg-Bauer E, Kudla J, Harter K: The AtGenExpress global stress expression data set: protocols, evaluation and model data analysis of UV-B light, drought and cold stress responses. Plant J. 2007, 50: 347-363. 10.1111/j.1365-313X.2007.03052.x.View ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.