A FERM-adjacent (FA) region defines a subset of the 4.1 superfamily and is a potential regulator of FERM domain function
© Baines; licensee BioMed Central Ltd. 2006
Received: 13 July 2005
Accepted: 20 April 2006
Published: 20 April 2006
Proteins containing FERM domains comprise a diverse group of eukaryotic proteins that bind membrane proteins and lipids. In doing so, they organise membrane microstructure, and coordinate the disposition of signalling and cell adhesion complexes. In protein 4.1R, phosphorylation adjacent to the FERM domain regulates its activity, and membrane mechanical properties.
A novel sequence domain has been detected in a subset of proteins that contain FERM domains. This subset includes the true 4.1 proteins, some tyrosine phosphatases, rho-GEF proteins and type II transmembrane proteins, as well as some uncharacterised FERM proteins. This FERM-adjacent region is always closely proximate to the C-terminal of the FERM domain. This sequence is likely to be folded with elements of α and β structure. The FERM-adjacent region of 4.1R contains serine residues phosphorylated by PKC and PKA; these appear conserved in about half of all other FERM-adjacent regions. Phylogenetic analyses indicate that all proteins containing a FERM-adjacent region arose from a single ancestor after FERM domains had started to proliferate in genomes of animals, plants and mycetozoa.
The FERM-adjacent region defines a subset of the FERM proteins in animals. The conservation of motifs in this region that are potential substrates for kinases together with the known regulatory phosphorylation of 4.1R in this region raises the possibility that the FERM-adjacent region is a regulatory adaptation in this subset of the FERM proteins.
FERM domains define the band 4.1 superfamily . The domain takes its name from the 4.1 (f our point one) and ERM (e zrin, r adixin, m oesin) proteins where it was first discovered , but many metazoan cytoplasmic proteins that associate with membranes contain FERM domains: such proteins include merlin, talin, KRIT1, the uncoventional myosins VIIA, X and XV, certain non-receptor protein tyrosine kinases (e.g. the FAK and JAK kinases) and phosphatases (e.g. PTP-E1 and PTP-H1). A few examples of FERM domains are also found in mycetozoa and plants. This family is of great interest from several points of view. Several members of the family are tumor suppressors (4.1R, 4.1B, merlin) . More generally, this family carries functions that reflect many of the distinctive features of eukaryotic – and most especially animal – life, including tissue-specific signalling through organisation of membrane domains, mechano-protection of membranes from the stresses of animal movement and participation in the formation of complex tissues through cell-cell and cell-matrix junctions .
FERM domains have three-lobed 'cloverleaf' structures; each lobe represents a compactly folded structure. Lobe A (the most N-terminal) has a fold resembling ubiquitin; lobe B (the central lobe) resembles acyl-CoA binding proteins; and lobe C (the most C-terminal) has a fold related to pleckstrin homology domain/phosphotyrosine-binding domain (PTB) [4–10]. The close packing of these domains suggests they do not function independently, but rather form a co-ordinated structure.
FERM domains bind a variety of protein and lipid ligands. For example, in 4.1R, lobe A binds the anion exchanger AE1 (band 3), lobe B binds the PDZ and guanylate kinase protein p55 and lobe C binds glycophorin C . The motif YKRS in Lobe C is required for phosphatidylserine (PS) binding to 4.1R; this motif is required for correct intracellular targeting of 4.1R . In the ERM proteins, the head group of PIP2 binds a basic cleft between lobes A and C; this binding displaces the ERM tail from the FERM domain  thus unmasking the binding site for cell adhesion molecules (such as ICAM1-3 and L1) on lobe C .
The first observation of what is now known as the FERM domain came when Leto et al.  subjected 4.1R to limited chymotryptic proteolysis. The FERM domain was released as a 30 kDa fragment. Another protease-resistant fragment released in this experiment was 16 kDa. This region lies between the FERM domain and another important functional domain, the spectrin-actin binding domain. The 16 kDa fragment contains residues phosphorylated by PKA and PKC [14, 15]. Importantly, PKC phosphorylation of a serine residue in this region modulates membrane mechanical properties by controlling the activities of both the FERM and the spectrin-actin-binding domain .
Mammals have four "true" 4.1 proteins: 4.1R, 4.1N, 4.1G and 4.1B. Sequence alignment revels considerable identity between the N-terminal half of their 16 kDa regions, although the C-terminal halves are much less conserved . Here, I investigate the nature of the conserved part of the 16 kDa region. I report that sequences strongly similar to the conserved part of the 16 kDa region are present in a subset of the 4.1 superfamily. This region seems to form a discrete FERM-adjacent region, with the potential to regulate the activities of its neighbouring FERM domain.
Results and discussion
Identification of the FERM-adjacent region
The FERM domain in human protein 4.1R [Swiss-Prot:41_HUMAN] is now defined as residues 285–488 by X-ray crystallography [PDB:1GG3] . The 16 kDa fragment lies directly adjacent to the FERM domain: residues 494–614.
Sequence alignment of the four mammalian "true" 4.1 proteins, 4.1R, 4.1B, 4.1G and 4.1N, reveals that a high level of sequence identity extends beyond the end of the FERM domain, and into the 16 kDa region . To explore this further, I compared the 16 kDa region of human 4.1R with the sequences of other mammalian 4.1R, 4.1G, 4.1N and 4.1B proteins available in the Uniprot Knowledgebase (Swis-Prot/Trembl) . This revealed a region of strong conservation over approximately 60 amino acid residues at the N-terminus of the 16 kDa region. 60 amino acids is a large enough region to fold, and is similar in size to known folding structures such as the SH3 domain.
Strikingly, all the sequences recognised by the HMM are immediately adjacent to a FERM domain. However, they are clearly distinct from FERM domains in that the HMM does not recognise the sequence of any known FERM domain. The HMM detects no proteins whose structures are represented in the Protein Databank. For simplicity, I shall refer to sequences recognised by the HMM as FERM-adjacent (FA) regions.
All the sequences detected are in animal proteins with one exception. [Trembl:Q8GUI3] is an Arabidopsis thaliana hypothetical protein which contains a FERM domain, and a weak hit (E = 0.00042) is detected adjacent to this. No other plant or mycetozoan protein is detected.
Importantly, many well-characterised members of the band 4.1 superfamily are not detected by the HMM: in particular the ERM proteins, merlin, talin, unconventional myosins and non-receptor protein tyrosine kinases were not found.
A more extended alignment of several FERM proteins that contain the FA region in comparison with representative proteins that do not contain the FA region illustrates the limits of the FA region. Additional file 2 shows the sequences of several human proteins aligned to the sequence of 4.1R Lobe B of the FERM domain through the whole 16 kDa sequence. Note that the FA region is detected as a discrete sequence region corresponding to the N-terminal half of the 16 kDa fragment in class A-E proteins, and that no sequence similarity is detected in other proteins.
Is the FA region folded?
FA regions seem likely to be compactly folded since the 4.1R 16 kDa fragment is resistant to chymotrypsin , despite containing several possible substrate residues (see sequence 41_HUMAN in Fig. 1).
To predict secondary structures, the alignment of all FAs was submitted to JPRED; 4.1R sequence was also submitted to PSI-Pred and Disopred. An α-helix and β-strand and a certain amount of disordered structure are predicted, and the most conserved part of the sequence in the FA alignment is predicted to be buried (annotated on Fig. 1). But the structure cannot be predicted with certainty, because the HMM recognises no sequences of proteins of known structure in the Protein Data Bank. Furthermore, the fold-recognition programmes PSI-Pred and 3D-PSSM do not detect a known fold.
A single evolutionary event links FERM and FA proteins
The presence of a single FA region in a subset of all FERM domain proteins raises the question of their evolutionary origin. Do these proteins derive from a single ancestral FERM protein that acquired an FA region at some point after FERM domains started to multiply in animal, plant and mycetozoan genomes? If the answer to this question is "yes", then the FA-containing subset of the superfamily should have FERM domains more closely related to each other than to non-FA-containing proteins.
Interestingly, the phosphatase PTN3 [Swiss-Prot:PTN3_HUMAN] does not contain an FA region, yet appears in the FA group. Direct alignment of this sequence with that of the related phosphatase PTN4 [Swiss-Prot:PTN4_HUMAN] leaves a gap where the FA region would be (see Additional file 2); furthermore, probing all possible translations of the genomic sequence of PTN3 with the FA region HMM reveals no FA sequence that might be expressed in splice variants. Maximum parsimony and neighbour joining analyses (not shown) support the positioning of PTN3 in the FA cluster. It seems most likely that PTN3 has lost the FA region present in the common ancestor of PTN3 and PTN4.
PKC and PKA phosphorylation sites
It has long been established that protein kinases A and C phosphorylate the 16 kDa portion of erythrocyte 4.1R [14, 15]. Protein kinase C regulates the membrane binding activity of the 4.1R FERM domain [15, 19–21]. Recently Manno et al.  have shown that PKC phosphorylates ser-312 in erythrocyte 4.1R (isoform 3 [IPI:IPI00218698]; the equivalent sequence number in [Swiss-Prot:41_human] is ser-521). The site in the 16 kDa portion phosphorylated by PKA is ser-331. Both these residues are part of the FA region.
Ser-312 is not in a known PKC consensus, so it is difficult to predict if equivalent residues in other proteins will be PKC substrates. However, Fig 1 indicates that about half of the FA region proteins have ser at equivalent positions in the alignment.
Ser-331 is in the protein kinase A substrate motif, [KR] [KR]X [ST], and this is conserved in all mammalian "true" 4.1 proteins. It is also conserved in some of the group that lacks known functional domains in their C-termini (e.g. E41L5) and in some of the group of transmembrane proteins (see Fig. 1).
Protein 4.1G [Swiss-Prot:E41L2_human] is also phosphorylated in vivo in the FA region at ser-550 . The kinase that catalyses this phosphorylation has not been identified. Representatives of each of groups (a)-(e) contain ser or thr at this point in the alignment, indicating that this too is a candidate site for phosphorylation in other FA proteins.
The FA sequence region described here is in all cases immediately adjacent to FERM domains. Based on the chymotrypsin resistance of the 16 kDa 4.1R fragment, and secondary structure predictions, it seems likely that FA regions are folded. The FA region of 4.1R is phosphorylated in vivo by PKA and PKC. The latter phosphorylation is especially important in the red cell, since phosphorylation of ser-312 controls the activity of the adjacent FERM and spectrin-actin binding domains which in turn controls the mechanical properties of the red cell membrane . Since representatives of all the five groups of FA proteins contain ser/thr at sites equivalent to those phosphorylated in the 4.1R and 4.1G FA regions, the possibility arises of a general role of the FA region in regulating the activities of their neighbouring FERM domain.
Sequences were retrieved from the UniProt Knowledgebase via the European Bioinformatics Institute (EBI) . BLAST analyses were done with either the BLAST2 server at EBI  or the BLAST server at the National Center for Biotechnology Information . The HMMER package [26, 27] was used for hidden Markov model analysis; it was run either locally or via a server made available by the Medical Research Council's Rosalind Franklin Centre for Genomics Research . Alignments of sequences to HMMs were done using HMALIGN; alignments were displayed using Chroma . Secondary structure predictions were made using JPRED [30, 31]. PSI-Pred [30, 32] and Disopred [33, 34]. Fold recognition was done using the PSI-PRED and 3D-PSSM [35, 36] servers. For phylogenetic analyses, the Phylip package  was used.
This research was supported by the Biotechnology and Biological Sciences Research Council (grant 96/18062) and by a Research Fellowship from the Leverhulme Trust. The author gratefully acknowledges the help and support of Dr Mohandas Narla and the New York Blood Center during his Fellowship period.
- Chishti AH, Kim AC, Marfatia SM, Lutchman M, Hanspal M, Jindal H, Liu SC, Low PS, Rouleau GA, Mohandas N, Chasis JA, Conboy JG, Gascard P, Takakuwa Y, Huang SC, Benz EJ, Bretscher A, Fehon RG, Gusella AF, Ramesh V, Solomon F, Marchesi VT, Tsukita S, Arpin M, Louvard D, Tonks NK, Anderson JM, Fanning AS, Bryant PJ, Woods DF, Hoover KB: The FERM domain: a unique module involved in the linkage of cytoplasmic proteins to the membrane. Trends Biochem Sci. 1998, 23 (8): 281-282. 10.1016/S0968-0004(98)01237-7.PubMedView ArticleGoogle Scholar
- Sun CX, Robb VA, Gutmann DH: Protein 4.1 tumor suppressors: getting a FERM grip on growth regulation. J Cell Sci. 2002, 115 (Pt 21): 3991-4000. 10.1242/jcs.00094.PubMedView ArticleGoogle Scholar
- Bennett V, Baines AJ: Spectrin and ankyrin-based pathways: metazoan inventions for integrating cells into tissues. Physiol Rev. 2001, 81 (3): 1353-1392.PubMedGoogle Scholar
- Hamada K, Shimizu T, Matsui T, Tsukita S, Hakoshima T: Structural basis of the membrane-targeting and unmasking mechanisms of the radixin FERM domain. Embo J. 2000, 19 (17): 4449-4462. 10.1093/emboj/19.17.4449.PubMedPubMed CentralView ArticleGoogle Scholar
- Pearson MA, Reczek D, Bretscher A, Karplus PA: Structure of the ERM protein moesin reveals the FERM domain fold masked by an extended actin binding tail domain. Cell. 2000, 101 (3): 259-270. 10.1016/S0092-8674(00)80836-3.PubMedView ArticleGoogle Scholar
- Han BG, Nunomura W, Takakuwa Y, Mohandas N, Jap BK: Protein 4.1R core domain structure and insights into regulation of cytoskeletal organization. Nat Struct Biol. 2000, 7 (10): 871-875. 10.1038/82819.PubMedView ArticleGoogle Scholar
- Kang BS, Cooper DR, Devedjiev Y, Derewenda U, Derewenda ZS: The structure of the FERM domain of merlin, the neurofibromatosis type 2 gene product. Acta Crystallogr D Biol Crystallogr. 2002, 58 (Pt 3): 381-391. 10.1107/S0907444901021175.PubMedView ArticleGoogle Scholar
- Shimizu T, Seto A, Maita N, Hamada K, Tsukita S, Hakoshima T: Structural basis for neurofibromatosis type 2. Crystal structure of the merlin FERM domain. J Biol Chem. 2002, 277 (12): 10332-10336. 10.1074/jbc.M109979200.PubMedView ArticleGoogle Scholar
- Garcia-Alvarez B, de Pereda JM, Calderwood DA, Ulmer TS, Critchley D, Campbell ID, Ginsberg MH, Liddington RC: Structural determinants of integrin recognition by talin. Mol Cell. 2003, 11 (1): 49-58. 10.1016/S1097-2765(02)00823-7.PubMedView ArticleGoogle Scholar
- Smith WJ, Nassar N, Bretscher A, Cerione RA, Karplus PA: Structure of the active N-terminal domain of Ezrin. Conformational and mobility changes identify keystone interactions. J Biol Chem. 2003, 278 (7): 4949-4956. 10.1074/jbc.M210601200.PubMedView ArticleGoogle Scholar
- An XL, Takakuwa Y, Manno S, Han BG, Gascard P, Mohandas N: Structural and functional characterization of protein 4.1R- phosphatidylserine interaction: potential role in 4.1R sorting within cells. J Biol Chem. 2001, 276 (38): 35778-35785. 10.1074/jbc.M101364200.PubMedView ArticleGoogle Scholar
- Hamada K, Shimizu T, Yonemura S, Tsukita S, Hakoshima T: Structural basis of adhesion-molecule recognition by ERM proteins revealed by the crystal structure of the radixin-ICAM-2 complex. Embo J. 2003, 22 (3): 502-514. 10.1093/emboj/cdg039.PubMedPubMed CentralView ArticleGoogle Scholar
- Leto TL, Marchesi VT: A Structural Model Of Human-Erythrocyte Protein-4.1. J Biol Chem. 1984, 259 (7): 4603-4608.PubMedGoogle Scholar
- Horne WC, Prinz WC, Tang EK: Identification of two cAMP-dependent phosphorylation sites on erythrocyte protein 4.1. Biochim Biophys Acta. 1990, 1055 (1): 87-92. 10.1016/0167-4889(90)90095-U.PubMedView ArticleGoogle Scholar
- Chao TS, Tao M: Modulation of protein 4.1 binding to inside-out membrane vesicles by phosphorylation. Biochemistry. 1991, 30 (43): 10529-10535. 10.1021/bi00107a023.PubMedView ArticleGoogle Scholar
- Manno S, Takakuwa Y, Mohandas N: Modulation of erythrocyte membrane mechanical function by protein 4.1 phosphorylation. J Biol Chem. 2005, 280 (9): 7581-7587. 10.1074/jbc.M410650200.PubMedView ArticleGoogle Scholar
- Parra M, Gascard P, Walensky LD, Gimm JA, Blackshaw S, Chan N, Takakuwa Y, Berger T, Lee G, Chasis JA, Snyder SH, Mohandas N, Conboy JG: Molecular and functional characterization of protein 4.1B, a novel member of the protein 4.1 family with high level, focal expression in brain. J Biol Chem. 2000, 275 (5): 3247-3255. 10.1074/jbc.275.5.3247.PubMedView ArticleGoogle Scholar
- Bairoch A, Apweiler R, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, Martin MJ, Natale DA, O'Donovan C, Redaschi N, Yeh LS: The Universal Protein Resource (UniProt). Nucleic Acids Res. 2005, D154-159. 33 Database
- Danilov YN, Fennell R, Ling E, Cohen CM: Selective modulation of band 4.1 binding to erythrocyte membranes by protein kinase C. J Biol Chem. 1990, 265 (5): 2556-2562.PubMedGoogle Scholar
- Pinder JC, Gardner B, Gratzer WB: Interaction of protein 4.1 with the red cell membrane: effects of phosphorylation by protein kinase C. Biochem Biophys Res Commun. 1995, 210 (2): 478-482. 10.1006/bbrc.1995.1685.PubMedView ArticleGoogle Scholar
- Chang SH, Low PS: Regulation of the glycophorin C-protein 4.1 membrane-to-skeleton bridge and evaluation of its contribution to erythrocyte membrane stability. J Biol Chem. 2001Google Scholar
- Beausoleil SA, Jedrychowski M, Schwartz D, Elias JE, Villen J, Li J, Cohn MA, Cantley LC, Gygi SP: Large-scale characterization of HeLa cell nuclear phosphoproteins. Proc Natl Acad Sci U S A. 2004, 101 (33): 12130-12135. 10.1073/pnas.0404720101.PubMedPubMed CentralView ArticleGoogle Scholar
- Sequence Retrieval System at EBI. [http://srs.ebi.ac.uk]
- EBIWU-Blast2. [http://www.ebi.ac.uk/blast2/_]
- NCBI Blast. [http://www.ncbi.nlm.nih.gov/BLAST/]
- Eddy SR: Profile hidden Markov models. Bioinformatics. 1998, 14 (9): 755-763. 10.1093/bioinformatics/14.9.755.PubMedView ArticleGoogle Scholar
- HMMER: profile HMMs for protein sequence analysis. [http://hmmer.wustl.edu/]
- Rosalind Franklin Centre for Genomics Research. [http://www.hgmp.mrc.ac.uk]
- Goodstadt L, Ponting CP: CHROMA: consensus-based colouring of multiple alignments for publication. Bioinformatics. 2001, 17 (9): 845-846. 10.1093/bioinformatics/17.9.845.PubMedView ArticleGoogle Scholar
- McGuffin LJ, Bryson K, Jones DT: The PSIPRED protein structure prediction server. Bioinformatics. 2000, 16 (4): 404-405. 10.1093/bioinformatics/16.4.404.PubMedView ArticleGoogle Scholar
- JPRED server. [http://www.compbio.dundee.ac.uk/~www-jpred/]
- PSI-Predserver. [http://www.compbio.dundee.ac.uk/~www-jpred/]
- Ward JJ, McGuffin LJ, Bryson K, Buxton BF, Jones DT: The DISOPRED server for the prediction of protein disorder. Bioinformatics. 2004, 20 (13): 2138-2139. 10.1093/bioinformatics/bth195.PubMedView ArticleGoogle Scholar
- Disopred server. [http://bioinf.cs.ucl.ac.uk/disopred/]
- Kelley LA, MacCallum RM, Sternberg MJ: Enhanced genome annotation using structural profiles in the program 3D-PSSM. J Mol Biol. 2000, 299 (2): 499-520. 10.1006/jmbi.2000.3741.PubMedView ArticleGoogle Scholar
- 3D-PSSM. [http://www.sbg.bio.ic.ac.uk/~3dpssm/]
- Felsenstein J: PHYLIP (Phylogeny Inference Package) version 3.5c. Distributed by the author. 1993, Department of Genetics, University of Washington, SeattleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.