
Histology and transcriptomic analyses of barnacles with different base materials and habitats shed lights on the duplication and chemical diversification of barnacle cement proteins

Abstract

Background

Barnacles are sessile crustaceans that attach to underwater surfaces using barnacle cement proteins. Barnacles have a calcareous or chitinous membranous base, and their substratum varies from biotic (e.g. corals/sponges) to abiotic surfaces. In this study, we tested the hypothesis that the cement protein (CP) composition and chemical properties of different species vary according to the attachment substrate and/or the basal structure. We examined the histological structure of cement glands and explored the variations in cement protein homologs of 12 barnacle species with different attachment habitats and base materials.

Results

Cement gland cells in the rocky shore barnacles Tetraclita japonica formosana and Amphibalanus amphitrite are eosinophilic, while others are basophilic. Transcriptome analyses recovered CP homologs from all species except the scleractinian coral barnacle Galkinia sp. A phylogenomic analysis based on sequences of CP homologs did not reflect a clear phylogenetic pattern in attachment substrates. In some species, certain CPs have a remarkable number of paralogous sequences, suggesting that major duplication events occurred in CP genes. The examined CPs across taxa show a consistent bias toward particular sets of amino acids. However, the predicted isoelectric point (pI) and hydropathy are highly divergent. In some species, conserved regions are highly repetitive.

Conclusions

Instead of developing specific cement proteins for different attachment substrata, barnacles attached to different substrata rely on a highly duplicated cementation genetic toolkit to generate paralogous CP sequences with diverse chemical and biochemical properties. This general CP cocktail might be the key genetic feature enabling barnacles to adapt to a wide variety of substrata.


Background

How marine sessile invertebrates permanently attach to surfaces is a fascinating issue that has attracted attention from both evolutionary and applied scientists for more than a century. Cementation of sessile organisms typically involves three major components: (1) the surface of the sessile organism attaching to the substratum, (2) bioadhesive substances and (3) the substratum to which organisms are attaching. Bioadhesive substances are the key components responsible for gluing an organism’s attachment surface to a substratum. Bioadhesion is therefore highly dependent on the strength of the interactions between the material that makes up the attachment surface and the bioadhesives, and between the bioadhesive substance and the substratum materials. Barnacles, mussels, bryozoans and ascidians are common sessile organisms capable of attaching to a wide range of substrata. However, it remains unclear whether or to what extent the evolution of bioadhesive substances in these sessile organisms is influenced by the proximal bottom structure and/or the substratum materials.

Barnacles are sessile crustaceans that are infamous for being irreversibly affixed to an impressive range of biotic and abiotic surfaces, including whales, sea turtles, jellyfish, crustaceans, mussels, corals, sponges, rocky shores and a variety of man-made objects such as ships [1,2,3] (Fig. 1). They are also considered key marine biofoulers because they significantly impact both the population dynamics of their biological hosts [4, 5] and the costs of antifouling measures. The underlying structural and molecular mechanisms of barnacle attachment are poorly studied, particularly from an evolutionary and functionally comparative standpoint.

Fig. 1

Barnacle cement gland and variations in barnacles’ habitat and attachment substratum. a. Plastic toy of the barnacle Megabalanus rosa, showing the cross section of the barnacle. Cement glands are located among the ovarian tissues at the base of the barnacle. b. Histological cross section of the acorn barnacle Chelonibia testudinaria showing the plaque of cement stained red at the base and the location of cement glands among the ovarian glands (boxes correspond to magnified views in c followed by d). e. Chthamalus malayensis is a membranous-based barnacle that lives on intertidal rocks. f. Wanella milleporae is a membranous-based barnacle that lives exclusively on fire corals. g. Megabalanus volano is a barnacle with a calcareous base that lives on intertidal rocks. h. Chelonibia testudinaria is epibiotic on sea turtles (photo by Ceri Lewis). i. Lepas species often attach to floating wood or other marine animals

Initial barnacle attachment occurs during cypris larvae settlement. After a series of complex substratum exploration behaviors, barnacle cyprids determine the most suitable substratum and commit to permanent attachment by secreting larval bioadhesives from a pair of cement glands [2, 6, 7]. Once the cyprid is firmly attached, the larva immediately commits to metamorphosis and transforms into the juvenile stage [8]. The juvenile and adult remain attached to their substratum by continuously secreting adult cement proteins [9,10,11,12]. Some acorn barnacles (balanomorphan species) possess tubiferous calcareous bases that are physically cemented to the substratum [13]. This strategy is widely seen in species that adhere to ships and rocks and is believed to facilitate the strongest and firmest surface attachment [10, 12]. These calcareous bases remain intact on the substratum, even after the barnacles die. Other balanomorphan and stalked barnacles produce membranous and chitinous bases, which are only indirectly attached to the substratum via the cement [12]. The shells of these membranous-based species may detach from the substratum after they die [14]. Barnacles inhabiting biotic surfaces, such as corals and sponges, exhibit different attachment strategies. In coral-associated barnacles, the bases are embedded into and fused with the coral skeleton [15]. In sponge barnacles, the bases are cup-shaped and in contact with the living sponge tissue [16]. The underlying molecular mechanisms that have allowed barnacles to invade such a mesmerizing array of substrata have, however, remained elusive and poorly studied.

Adult barnacle adhesives are generated from cement glands, which are located close to the ovarian tissue and deliver cement to the basal region (Fig. 1a–d). A set of interconnecting principal and secondary ducts leads from the cement glands to the narrow interface between the barnacle base and the attachment substratum [17,18,19,20]. Barnacle adhesives are mainly composed of a cement protein complex. A number of barnacle cement proteins have been identified from the fouling acorn barnacle species Megabalanus rosa and categorized based on their molecular weight; CP19k, CP20k, CP52k, CP68k and CP100k (CP denotes cement protein) are by far the five most studied cement proteins [21,22,23,24,25,26,27]. Their homologs have been identified in a range of rock-attaching barnacle species. However, the functions of these CPs have only been predicted based on the protein primary structure, predicted pI (isoelectric point) and hydrophobicity of the homologs from M. rosa and A. amphitrite [7, 12]. In these two species, CP19k and CP43k (identical to CP68k) homologs were found to be hydrophobic and are believed to be located at the boundary between the barnacle cement and the substrate, thus making them responsible for the removal of the surface-bound water layer; CP20k was proposed to be responsible for interfacial adhesion [12], but its exact function is still controversial [7]; and CP52k and CP100k were considered bulk insoluble cement proteins that play important roles in internal cohesion [7].

As adult barnacles are structurally specialized to cement onto different substrata in different habitats and lifestyles [1,2,3, 28], whether and to what extent their adhesives display cross-species variation in their chemical composition and make-up are important questions [29]. In fact, the Raman spectrum of the cement of Lepas anatifera, which has a membranous base and attaches to a wide range of floating objects, is different from that of the acorn barnacle Balanus crenatus, which possesses a calcareous base and mainly attaches to rocky surfaces in the intertidal area [30]. Lin et al. [31] interestingly found signatures of interspecific variation in adhesive components of tetraclitid barnacles. These previous studies suggest that the compositions of cement proteins differ across species. Yet, it remains unclear if the gene expression level of key adult cement proteins varies across taxa and whether such signatures adaptively evolved to accommodate barnacle life on different substrata.

Although studies on the adhesive strategies of the stalked lepadid and pollicipedid barnacles have recently increased, studies of barnacle adhesion have traditionally focused on a few closely related biofouling model species, e.g., the balanids A. amphitrite, A. eburneus and M. rosa [29, 30, 32,33,34,35]. Understanding the diverse adhesive components of barnacles may shed further light on how barnacles achieve underwater attachment. In the present study, we used both histological and transcriptomic data to explore the cement glands and cement proteins of an unprecedented diversity of barnacles inhabiting both fire and scleractinian corals, sponges, rocky shores, sea turtles, crabs and floating objects. We hypothesized that the composition and chemical properties of cement proteins from different species might vary according to the attachment substratum and/or the basal structure.

Results

Histological examination of cement glands

Cement glands (CGs) were examined from free living species (mostly attached to rocks), including the acorn barnacles Amphibalanus amphitrite, Tetraclita japonica formosana, Chthamalus malayensis; the stalked barnacle Capitulum mitella; epibiotic barnacles Chelonibia testudinaria (on sea turtles or crabs), Megabalanus ajax (on fire corals), Galkinia sp. (on corals), and Membranobalanus longirostrum and Pectinoacasta sp. (in sponges). Table 1 summarizes the coverage of the taxa, their lifestyle and habitat preferences.

Table 1 Characters of barnacle species examined in this study [14, 15, 36]

The CGs of the acorn barnacles A. amphitrite, T. j. formosana, C. testudinaria, C. malayensis and M. ajax are located in the basal mantle region and closely associated with the ovarian tissue. CGs could be identified by the presence of unicellular cement gland cells (CGCs), which are structurally distinct from the ovarian follicle cells. The CGs of M. ajax are highly reduced and barely observable in histological sections. No cement gland structure could be detected in the coral-associated barnacles Galkinia sp. and Wanella milleporae or the sponge-associated barnacles M. longirostrum and Pectinoacasta sp., despite repeated sectioning of multiple specimens. The CGs in these four coral- or sponge-associated species might be absent or highly reduced, or located in mantle tissue closer to the distal (upper) operculum plates.

On the other hand, the locations of CGs in the three examined stalked barnacle species (Capitulum mitella, L. anatifera, and C. hunteri) are variable. In the lepadid stalked barnacles C. hunteri and L. anatifera, the cement glands are located in the mantle and top region of peduncle; in the pollicipedid stalked barnacle Capitulum mitella, the cement glands are located in the ovary inside the peduncle.

The histochemical properties and the cellular diameter of CGCs are variable among the examined species. The CGCs of L. anatifera are up to 150 μm in diameter and have vacuoles in the basophilic cytoplasm (Fig. 2A); the CGCs of C. mitella occur in clumps and are linked by principal and secondary canals, range from 14 to 70 μm in diameter, and contain vacuoles in the basophilic cytoplasm and a large single nucleolus in the nuclei (Fig. 2B and C); the CGCs of A. amphitrite have diameters up to 110 μm and eosinophilic cytoplasmic contents (Fig. 2D); the CGCs of C. testudinaria range from 34 to 70 μm in diameter, with a single distinct nucleolus and basophilic cytoplasmic contents (Fig. 2E); the CGCs of C. malayensis reach up to 80 μm in diameter, with small vacuoles inside the basophilic cytoplasm (Fig. 2F); C. hunteri has large CGCs with diameters up to 135 μm and large vacuoles inside the basophilic cytoplasm (Fig. 2G); the CGCs of M. ajax range from 15 to 20 μm in diameter, are only sparsely scattered in the basal mantle tissue (Fig. 2H), and have basophilic cytoplasmic contents; the CGCs of T. j. formosana are the largest, with diameters ranging from 128 to 214 μm and multiple nucleoli in the eosinophilic cytoplasm (Fig. 2I). The diameters and histochemical properties of the CGCs of the different species are summarized in Table 1.

Fig. 2

Hematoxylin and eosin staining of histological sections of barnacle cement glands. a. Lepas anatifera, b and c. Capitulum mitella, d. Amphibalanus amphitrite, e. Chelonibia testudinaria, f. Chthamalus malayensis, g. Conchoderma auritum, h. Megabalanus ajax, i. Tetraclita japonica formosana. CHR – chromatin, VAC – vacuole, NU – nucleolus, PC – principal canal, CYT – cytoplasm. Scale bars = 50 μm

Phylogenomic analysis

For each examined barnacle species, a transcriptome assembly was generated using Illumina reads from the prosoma and base (see Additional file 1). The statistics of the assemblies are shown in Table 2. Based on the translated protein database generated from the transcriptomes of thirteen barnacle species, 45,913 orthogroups were identified, among which 6,348 were detected in all examined species and 278 were single copy orthologs (see Additional file 2 for detailed statistics). A maximum likelihood phylogenetic tree was generated based on the alignment of these single copy ortholog sequences (Fig. 3). Each node of the ML tree was supported by bootstrap values of 78–100 % (out of 100 replicates). With the parasitic rhizocephalan barnacle S. yatsui as the outgroup, two major sister clades were identified. The first clade included the two stalked lepadid barnacle species, L. anatifera and C. hunteri, and the second included the remaining species. Within the second clade, C. mitella (Pollicipedidae), which morphologically is considered a stalked barnacle, was the sister to the remaining acorn barnacles. Here, C. malayensis (Chthamalidae) was sister to two additional sub-clades. The first sub-clade included C. testudinaria (Chelonibiidae) and T. j. formosana (Tetraclitidae), and the second included Galkinia sp. and W. milleporae (Balanidae), A. amphitrite and M. ajax (Balanidae), and Pectinoacasta sp. and M. longirostrum (Balanidae).
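The two orthogroup filters described above (present in all species, and single copy in all species) can be illustrated with a minimal sketch. The orthogroup names, species labels and gene counts below are hypothetical placeholders, not values from this study, and the sketch does not reproduce the orthology inference itself.

```python
from typing import Dict, List

# Hypothetical gene counts per orthogroup and species (illustrative only).
counts: Dict[str, Dict[str, int]] = {
    "OG_a": {"sp1": 1, "sp2": 1, "sp3": 1},  # single copy in every species
    "OG_b": {"sp1": 2, "sp2": 1, "sp3": 4},  # present in all, but duplicated
    "OG_c": {"sp1": 1, "sp2": 0, "sp3": 1},  # missing from one species
}
species: List[str] = ["sp1", "sp2", "sp3"]


def present_in_all(og_counts: Dict[str, int], species: List[str]) -> bool:
    """True if every species has at least one gene in the orthogroup."""
    return all(og_counts.get(sp, 0) >= 1 for sp in species)


def single_copy(og_counts: Dict[str, int], species: List[str]) -> bool:
    """True if every species has exactly one gene in the orthogroup."""
    return all(og_counts.get(sp, 0) == 1 for sp in species)


shared = sorted(og for og, c in counts.items() if present_in_all(c, species))
single = sorted(og for og, c in counts.items() if single_copy(c, species))

print("present in all species:", shared)
print("single-copy orthologs:", single)
```

Only the single-copy set is a safe input for a concatenated species tree, because duplicated orthogroups mix orthologs with paralogs.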

Table 2 Statistics on transcriptome assemblies of the 12 species examined in this study
Fig. 3

Phylogeny, attachment substratum and base feature of the examined barnacle species. Maximum-likelihood phylogenetic tree constructed from 278 single-copy orthologs in the 14 examined Cirripedia species as identified by Orthofinder. The number on each node signifies the bootstrap support value out of 100 replications. The attachment substratum of each barnacle species is indicated in the figure. Species with membranous and calcareous bases are indicated in blue and red, respectively

We mapped the primary substratum choice (corals, fire corals, floating objects, rocks, sponges, sea turtles/crabs) and the base material of the examined barnacles as listed in Table 1 onto the phylogenetic tree (Fig. 3). Symbioses and base material arose independently in our tree and are thus not monophyletic.

Identification of barnacle adult cement proteins homologs

We first determined the presence or absence of homologs of the five CPs identified from Megabalanus rosa (CP19k, CP20k, CP43k, CP52k, and CP100k) in our transcriptome dataset (Table 3). The transcript expression levels of CP homologs and the biochemical properties of the translated proteins, such as estimated molecular weight, isoelectric point (pI) and GRAVY hydropathy index, are summarized in Additional file 3. The absence of a CP homolog from our dataset indicates either a failure to detect its mRNA expression signal by RNA-seq or a lack of expression of that homolog; it does not necessarily imply that the CP gene has been lost from the genome.

Table 3 A summary of the cement proteins detected in this study

No cement protein was identified from the transcriptome assembly of the coral barnacle Galkinia sp. CP100k homologs were detected in 10 of the sampled species (all except the coral barnacle Galkinia sp. and the sponge barnacle Pectinoacasta sp.); CP52k homologs were detected in C. testudinaria, C. malayensis, C. mitella, C. hunteri, L. anatifera, M. longirostrum, T. j. formosana, and W. milleporae, but were not detected in M. ajax, Galkinia sp. or Pectinoacasta sp.; CP43k homologs were detected in C. testudinaria, C. malayensis, C. mitella, C. hunteri, Pectinoacasta sp., L. anatifera, M. ajax, and T. j. formosana, but not in M. longirostrum, Galkinia sp., or W. milleporae; CP20k homologs were detected in A. amphitrite, C. hunteri, C. mitella, and C. malayensis; CP19k homologs were detected in A. amphitrite, C. hunteri, C. mitella, and C. malayensis (Table 3).

Cement protein homologs

There were 33 CP19k, 17 CP20k, 28 CP43k, 57 CP52k and 39 CP100k homologs detected from the transcriptomes of the 12 examined barnacle species (Additional file 3). Alignment files of all detected homolog sequences of each CP were provided in Additional file 4. Among the CP homologs, we distinguished transcript isoforms and paralogues based on (1) the TRINITY transcript ID information, (2) protein sequence alignment data, and (3) the orthogroup assignment. For example, CP114k is a known paralogue of CP100k [37]. Although the two proteins were assigned to the same orthogroup (OG OG0001681, see Additional file 3), they only shared about 58 % and did not share any identical region in the alignment (refer to Additional file 3 in [37]).

After removing transcript isoforms based on the sequence similarity and transcript ID information (see Materials and Methods), there were 29 CP19k, 16 CP20k, 20 CP43k, 45 CP52k and 39 CP100k homologs. The CP homolog sequences were assigned to four orthogroups of CP19k, three of CP20k, four of CP43k, six of CP52k, and three orthogroups of CP100k by Orthofinder (Additional files 3 and 5). Certain species have remarkable numbers of CP paralog sequences. For example, 11 CP19k paralog sequences were detected in the C. malayensis transcriptome, and 22 and 14 CP52k were detected from M. longirostrum and C. testudinaria transcriptomes, respectively (Additional file 5).
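As a quick arithmetic check, the homolog counts reported before and after isoform removal can be compared per CP. The counts are taken from the text above; the variable names are ours.

```python
# Homolog counts reported in the text, before and after isoform removal.
before = {"CP19k": 33, "CP20k": 17, "CP43k": 28, "CP52k": 57, "CP100k": 39}
after = {"CP19k": 29, "CP20k": 16, "CP43k": 20, "CP52k": 45, "CP100k": 39}

# Number of sequences collapsed as transcript isoforms for each CP.
removed = {cp: before[cp] - after[cp] for cp in before}

total_before = sum(before.values())
total_after = sum(after.values())

for cp, n in removed.items():
    print(f"{cp}: {before[cp]} -> {after[cp]} ({n} removed)")
print(f"total: {total_before} -> {total_after}")
```

By these counts, CP52k and CP43k lose the most sequences to isoform collapsing, while the CP100k count is unchanged.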

mRNA expression pattern of cement protein homologs

The transcripts of CP homologs were generally more highly expressed in the base than in the prosoma of each species (Additional file 6). The mRNA expression levels of CP homologs were often 10 or more times higher in the base or peduncle than in the prosoma, but a few exceptions were found. The mRNA expression levels of different CP homologs in different species could vary by over 1000-fold (Fig. 4). In the majority of the examined acorn barnacles, the transcript expression level of at least one of the CP homologs exceeded 10 TPM (transcripts per million). In C. hunteri and L. anatifera, however, none of the CP homolog transcript expression levels exceeded 10 TPM in the peduncle (Fig. 4), even though the expression levels were generally higher in the peduncle than in the prosoma (Additional files 3 and 6). The mRNA expression levels of CP homologs in calcareous-based species were not significantly different from those in membranous-based species (Fig. 4).
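For readers unfamiliar with the unit, the following minimal sketch shows the standard TPM definition and a base-versus-prosoma fold change. The transcript names, read counts and lengths are hypothetical, and the quantification software actually used is not described in this section; dedicated tools typically use effective rather than raw transcript lengths.

```python
from typing import Dict


def tpm(counts: Dict[str, float], lengths_kb: Dict[str, float]) -> Dict[str, float]:
    """Standard TPM: length-normalise read counts, then scale to one million."""
    rates = {t: counts[t] / lengths_kb[t] for t in counts}
    total = sum(rates.values())
    return {t: rate / total * 1e6 for t, rate in rates.items()}


# Hypothetical transcripts (illustrative values only).
lengths_kb = {"cp_like": 1.0, "other_a": 2.0, "other_b": 1.0}
base_counts = {"cp_like": 100, "other_a": 200, "other_b": 800}
prosoma_counts = {"cp_like": 10, "other_a": 380, "other_b": 800}

base_tpm = tpm(base_counts, lengths_kb)
prosoma_tpm = tpm(prosoma_counts, lengths_kb)

# Fold change of the CP-like transcript in the base relative to the prosoma.
fold_change = base_tpm["cp_like"] / prosoma_tpm["cp_like"]
print(f"base: {base_tpm['cp_like']:.0f} TPM, "
      f"prosoma: {prosoma_tpm['cp_like']:.0f} TPM, "
      f"fold change: {fold_change:.1f}")
```

Because TPM values sum to one million within each sample, a fold change between tissues compares relative shares of the transcript pool, not absolute transcript numbers.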

Fig. 4

mRNA expression level of CP homologs. The vertical dot plot presents the mRNA expression level of CP homologs in the base or the peduncle. The corresponding species of each CP homolog is indicated by different colors. Grouping of the attachment substratum refers to Table 1. The grey box plot and black box plot illustrate the 75th percentile, the median (50th percentile) and 25th percentile of the expression level (in TPM) of different CPs of calcified-based and membranous-based barnacles examined in this study, respectively

Amino acid composition of CP homologs

We only included full-length CP homologs in the amino acid composition analysis and in the pI and hydropathy index predictions. The amino acid composition of CP homologs revealed similarities between CP19k and CP43k and between CP52k and CP100k, as well as the unique composition of CP20k homologs. Except for CP20k, all examined CP homologs were low (relative abundance below 1 %) in Cysteine (C), Histidine (H), Methionine (M), and Tryptophan (W). Different CP homologs showed a clear bias (average relative abundance > 10 %) towards different amino acids (Fig. 5). CP19k (Fig. 5A) and CP43k (Fig. 5B) homologs showed a clear bias towards Glycine (G), Serine (S), Alanine (A), and Threonine (T); the profile of CP20k homologs was different from that of the other CPs, as these homologs were generally biased towards Cysteine (C), Histidine (H), and Lysine (K) (Fig. 5C); CP52k (Fig. 5D) and CP100k (Fig. 5E), on the other hand, were biased towards Leucine (L), Serine (S), and Alanine (A); CP100k homologs also showed a clear bias towards Valine (V), Isoleucine (I), and Arginine (R) (Fig. 5E).
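The two thresholds used above (relative abundance above 10 % for a bias, below 1 % for a depleted residue) can be sketched as follows. The toy sequence is invented for illustration and is not a real CP. Note also that the sketch applies the thresholds to a single sequence, whereas the text applies the 10 % threshold to the average across homologs.

```python
from collections import Counter
from typing import Dict, List

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def aa_composition(seq: str) -> Dict[str, float]:
    """Relative abundance (%) of each standard amino acid in a sequence."""
    counts = Counter(ch for ch in seq.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return {aa: 100.0 * counts.get(aa, 0) / total for aa in AMINO_ACIDS}


def enriched(comp: Dict[str, float], threshold: float = 10.0) -> List[str]:
    """Residues whose relative abundance exceeds the bias threshold."""
    return sorted(aa for aa, pct in comp.items() if pct > threshold)


def depleted(comp: Dict[str, float], threshold: float = 1.0) -> List[str]:
    """Residues whose relative abundance falls below the depletion threshold."""
    return sorted(aa for aa, pct in comp.items() if pct < threshold)


# Invented toy sequence, rich in G, S, A and T (not a real cement protein).
toy = "GGGSSSAAATTTKLVD"
comp = aa_composition(toy)
print("enriched:", enriched(comp))
print("depleted:", depleted(comp))
```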

Fig. 5

Amino acid (AA) composition of all full-length CP homolog sequences. a. CP19k homologs, b. CP43k homologs, c. CP20k homologs, d. CP52k homologs, e. CP100k homologs. The color scale shows the relative abundance of each amino acid as a percentage of the total number of amino acids in the CP homolog. The amino acid columns are grouped into three clusters by k-means clustering (k = 3) to visualize amino acid bias among CP homologs. Enriched amino acids are highlighted in enlarged red characters. K-means clustering of CP homologs was also performed (row clustering) to reveal similarity in amino acid composition among homologs. Abbreviations: Aamph – Amphibalanus amphitrite, Ctest – C. testudinaria, Cmite – C. mitella, Cmala – C. malayensis, Chunt – C. hunteri, Mrosa – Megabalanus rosa, Majax – Megabalanus ajax, Tform – Tetraclita japonica formosana

The CP homologs of certain species showed remarkable diversification in amino acid composition. For example, the CP19k homologs of A. amphitrite were divided into four different clusters, among which one of the previously reported CP19k homologs, CP19k-6 (accession no. AQA26378, reported in [38]), was distinct from the other CP19k homologs (Fig. 5A); the CP20k homologs of A. amphitrite were divided into two clusters, one of which was rich in Cysteine (C), Histidine (H), and Lysine (K), while the other was rich in Cysteine (C) only (the uppermost row cluster in Fig. 5C); the CP52k homologs of M. longirostrum were divided into three clusters (Fig. 5D).

Chemical properties and motif structures of CP homologs

The relationships between the attachment substrata and the predicted pI and hydropathy index of the recovered CP homologs (full-length proteins only) were analyzed (Fig. 6). The internal conserved motif structures of paralog sequences were identified for all CPs (Fig. 7, Additional file 4: Files S1–5). The paralogous sequences within the top 10 MEME motif regions were aligned for each CP homolog (Additional file 4: Files S6–10), and the motif consensus sequences of all CPs are summarized in Additional file 7: Figure S6.
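The GRAVY (grand average of hydropathy) index referred to throughout this section is the mean Kyte-Doolittle hydropathy value over all residues of a protein; positive values indicate an overall hydrophobic protein and negative values an overall hydrophilic one. A minimal sketch is given below. The test peptides are invented, and we assume the standard Kyte-Doolittle scale, which is the usual basis for GRAVY but is not stated explicitly in this section.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues of the sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue in the input.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Invented peptides: one hydrophobic, one charged and hydrophilic.
print(f"{gravy('IVLLAV'):.2f}")
print(f"{gravy('KKDDEE'):.2f}")
```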

Fig. 6

The predicted chemical properties of CP homologs. The predicted pI and GRAVY index of (A) CP19k homologs, (B) CP43k homologs, (C) CP52k homologs, (D) CP100k homologs, and (E) CP20k homologs. The corresponding species of each CP homolog is indicated by different colors. Grouping of attachment substratum refers to Table 1. Species with calcareous and membranous bases are indicated with circular and triangular nodes in the scatterplot

Fig. 7

Schematic diagram of the MEME motif structure of selected CP homologs. A. CP19k homologs, B. CP20k homologs, C. CP43k homologs, D. CP52k homologs, E. CP100k homologs. MEME was set to present the top 10 most significant motif regions among the homologs of each CP. Each motif region was numbered 1 to 10, where motif 1 has the lowest P-value in MEME. Protein repeats are highlighted and the motif regions within these repeats are shown in each subfigure. Abbreviations: Aamph – Amphibalanus amphitrite, Ctest – C. testudinaria, Cmite – C. mitella, Cmala – C. malayensis, Chunt – C. hunteri, Lanat – Lepas anatifera, Mrosa – Megabalanus rosa, Majax – Megabalanus ajax, Tform – Tetraclita japonica formosana

CP19k homologs

Full-length CP19k homologs were successfully recovered from the transcriptomes of C. malayensis, C. testudinaria, C. mitella, T. j. formosana, A. amphitrite, L. anatifera, M. ajax and, from NCBI, M. rosa. The GRAVY index of full-length CP19k homologs from rock-attaching barnacles (C. malayensis and C. mitella), the crab/sea turtle-attaching C. testudinaria, and the fire coral-inhabiting barnacle M. ajax ranged from a highly hydrophilic −0.67 to 0.06, and the predicted pI of all except M. rosa CP19k ranged from 8.51 to 12.4 (Fig. 6A, Additional file 3). The predicted pI of CP19k in the rock-attaching M. rosa was 5.43. The predicted pI of full-length CP19k homologs of membranous-based species (C. malayensis, C. testudinaria, C. mitella, and T. j. formosana) ranged from 9.4 to 12.4, while that of calcareous-based species (A. amphitrite, M. ajax, and M. rosa) ranged from 5.4 to 10.6. Internal repeating motif regions were observed within the paralogs (Additional file 7: Figure S1).
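A predicted pI is the pH at which the computed net charge of a protein is zero. The sketch below estimates it by bisection on a Henderson-Hasselbalch charge model. The pKa values are approximate textbook-style values assumed for illustration only; the prediction tool used in this study is not described in this section, and dedicated tools use their own calibrated pKa sets, so absolute values will differ. The peptides are invented.

```python
# Approximate, assumed pKa values for illustration (not a calibrated set).
POSITIVE_PKA = {"Nterm": 9.0, "K": 10.5, "R": 12.5, "H": 6.0}
NEGATIVE_PKA = {"Cterm": 2.0, "D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1}


def net_charge(seq: str, ph: float) -> float:
    """Net charge of a linear peptide at a given pH (Henderson-Hasselbalch)."""
    seq = seq.upper()
    charge = 0.0
    for group, pka in POSITIVE_PKA.items():
        n = 1 if group == "Nterm" else seq.count(group)
        charge += n / (1.0 + 10.0 ** (ph - pka))
    for group, pka in NEGATIVE_PKA.items():
        n = 1 if group == "Cterm" else seq.count(group)
        charge -= n / (1.0 + 10.0 ** (pka - ph))
    return charge


def isoelectric_point(seq: str, iterations: int = 60) -> float:
    """Bisection on pH 0 to 14; net charge decreases monotonically with pH."""
    lo, hi = 0.0, 14.0
    for _ in range(iterations):
        mid = (lo + hi) / 2.0
        if net_charge(seq, mid) > 0:
            lo = mid  # still positively charged: the pI lies at higher pH
        else:
            hi = mid
    return (lo + hi) / 2.0


# Invented peptides: one lysine-rich (basic), one rich in Asp/Glu (acidic).
basic_pi = isoelectric_point("KKKKKKGG")
acidic_pi = isoelectric_point("DDDDEEGG")
print(f"basic peptide pI ~ {basic_pi:.1f}, acidic peptide pI ~ {acidic_pi:.1f}")
```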

CP20k homologs

Full-length CP20k homologs were successfully recovered from the transcriptomes of C. malayensis, C. mitella, C. hunteri, A. amphitrite and, from NCBI, M. rosa. The predicted pI of CP20k homologs ranged from 3.9 to 8.3 and the GRAVY index ranged from −1 to −0.2, indicating that these proteins are hydrophilic and can be slightly acidic in certain species (Fig. 6E, Additional file 3). The predicted pI of CP20k homologs of membranous-based species (C. malayensis, C. mitella, and C. hunteri) ranged from 6.5 to 8.3, while that of the calcareous-based species (A. amphitrite and M. rosa) ranged from 3.9 to 7.8. Internal repeating motif regions were observed within the paralogs (Additional file 7: Figure S2).

CP43k homologs

Full-length CP43k homologs were successfully recovered from the transcriptomes of C. malayensis, C. testudinaria, C. mitella, T. j. formosana, A. amphitrite, L. anatifera, and M. ajax. The GRAVY index of full-length CP43k homologs ranged from −0.49 to −0.29 and the predicted pI ranged from 4.3 to 10.2, indicating that the homologs were in general hydrophilic. The CP43k homolog from L. anatifera was the most acidic (predicted pI: 4.3) (Fig. 6B, Additional file 3). The predicted pI of full-length CP43k homologs of all the membranous-based species except L. anatifera (i.e., C. malayensis, C. testudinaria, C. mitella, and T. j. formosana) exceeded 10, while that of the calcareous-based species (A. amphitrite and M. ajax) ranged from 5.5 to 8. Internal repeating motif regions were observed within the paralogs (Additional file 7: Figure S3).

CP52k homologs

Full-length CP52k homologs were successfully recovered from the transcriptomes of C. malayensis, C. testudinaria, C. mitella, C. hunteri, M. longirostrum, A. amphitrite and, from NCBI, M. rosa. The GRAVY index of full-length CP52k homologs ranged from −0.55 to 0.55 and the predicted pI ranged from 4.2 to 11.5 (Fig. 6C, Additional file 3). The predicted pI of CP52k from rock-attaching species (C. malayensis, C. mitella, A. amphitrite, and M. rosa) ranged from 10.5 to 11.5, exhibiting a potential bias towards basic pI. The predicted pI of CP52k homologs of the sponge-inhabiting barnacle M. longirostrum, on the other hand, ranged from 4.2 to 9.2. The GRAVY index of M. longirostrum CP52k homologs ranged from −0.6 to 0.1, while in the rock-attaching barnacles, the GRAVY index of CP52k homologs ranged from −0.4 to 0.6, indicating a more diverse chemical property. Hence, the chemical properties of M. longirostrum CP52k homologs were distinct from those of other species. Internal repeating motif regions were observed within the paralogs (Additional file 7: Figure S4).

CP100k homologs

Full-length CP100k homologs were successfully recovered from the transcriptomes of C. malayensis, C. testudinaria, C. mitella, T. j. formosana, M. ajax, A. amphitrite and, from NCBI, M. rosa. The majority of the homologs from different species were hydrophobic, except the homologs from the fire coral-inhabiting M. ajax and the congeneric rock-attaching species M. rosa (Fig. 6D, Additional file 3). The GRAVY indices of the CP100k homologs of these two Megabalanus species were slightly below zero, indicating that these CP100k homologs were somewhat amphiphilic. The predicted pI values of all full-length CP100k homologs were basic, ranging from 9.4 to 10.75, but the homologs of membranous-based species (C. malayensis, C. testudinaria, C. mitella, and T. j. formosana) were more basic (pI: 10.5 to 10.75) than those of the calcareous-based species (M. ajax, A. amphitrite, and M. rosa; pI: 9.4 to 9.7). Unlike the other CPs, no internal repeat motif region was observed within these paralogs (Additional file 7: Figure S5).

Discussion

Our histological and molecular data supported our hypothesis that cement protein composition and chemical properties in different species vary according to the attachment substratum and/or the barnacle basal attachment surface. Based on the histological results, we found that the diameter and the histochemistry of CGCs in the examined species were highly variable. While the diameter of CGCs could vary in relation to the body size of the species, it is intriguing that not all CGCs showed the same histochemical properties. Cytoplasmic content typically contains a mixture of intracellular proteins with different chemical properties. However, given that CGCs should be dedicated to the production of CPs, the histochemical properties of CGCs could serve as a preliminary indicator that reflects the average chemical properties of the barnacle adhesives, where basophilic and eosinophilic staining indicate the acidic and basic nature of the CGC cellular content, respectively. It is intriguing that the CGCs of the calcareous-based rock-attaching barnacles A. amphitrite and T. j. formosana were eosinophilic, while those of other species with observable CGCs were basophilic. Our histological section data in general support our hypothesis that the chemical properties of the cement of different species vary according to the attachment substratum and/or the basal structure, although how the chemistry of cement is related to cementation on different surfaces remains unclear.

It is worth noting that the failure to observe CGs in the histological sections of four coral- or sponge-associated barnacle species may not necessarily indicate the absence of CGs in these species. CGs are considered a major feature of barnacle ecological success in space and time, and it is not clear what coral symbiotic barnacles would gain by losing them. One explanation may be that coral and sponge barnacles inhabit the interior skeleton or tissues of their hosts; in this case, the barnacle is more or less physically fixed to the coral skeleton or within the sponge. Compared to the other examined species, these four species differ in morphology: their basal structures are not flat but modified into a cup-shaped convex surface (see Fig. 1F). The CGs of these species might be distributed more distally in the mantle region. The presence of CGs in the sponge barnacles M. longirostrum and Pectinoacasta sp. and the fire coral-inhabiting W. milleporae was supported by the transcriptomic data, in which CP transcript expression was detected. However, we do not rule out the possibility that CGs are absent in Galkinia sp., as neither CGs nor CPs were detected in the histological or transcriptomic analyses.

Based on the transcriptomic results, we observed a diversity of CP homologs. Based on sequence alignment, a remarkable number of conspecific CP sequences were found to be paralogues. The presence of CP paralogues was previously reported in A. amphitrite [38]. In this study, we observed more paralogous sequences of CP19k, 20k, 43k, and 52k compared to CP100k. Interestingly, the mRNA expression levels of these paralogs exhibited remarkable variation, suggesting that their requirements in cementation might differ from each other. Species that attached to similar substrata also showed major variations in their cement protein expression. For instance, only one CP43k homolog was detected in the transcriptome of the sponge-inhabiting barnacle Pectinoacasta sp., and its mRNA expression was below 1 TPM (Transcripts Per Million), while in another sponge barnacle, M. longirostrum, mRNA expression of CP52k and CP100k homologs was detected, and the mRNA expression levels were 10 to 100 times higher. Such variation suggests that the composition of cement proteins can be highly variable even for species that attach to similar substrata.

In addition to variations in gene expression level, the predicted pI and hydrophobicity of CP19k, CP20k, CP43k, and CP52k homologs/paralogs exhibited remarkable divergence. CP100k, on the other hand, exhibited remarkable convergence in terms of amino acid composition, estimated pI, and hydrophobicity, suggesting that CP100k is a core component with an indispensable function in barnacle adhesives. In A. amphitrite, hydrophobic CP19k and CP43k paralogs were suggested to serve as a water-repellent to remove non-polar substances on the surface of the substratum and subsequently mediate the formation of fibrous cement protein plaque [7, 38]; CP20k was suggested to play an important role in interfacial adhesion [12]. However, in this study, most of the examined full-length CP homologs were recovered from rock-attaching species, and our results hint at a potential linkage between barnacle base material (calcareous/membranous) and the overall charge (predicted pI) of CP19k, CP20k, CP43k, and CP52k. On the other hand, we are aware that this result is, at this stage, insufficient to conclude such a correlation, owing to the limited number of examined species. Further studies with increased taxa coverage are necessary to elucidate the possible linkage between the overall charge of CP and base material.

In certain species, multiple sequences of CP19k, CP20k, CP43k and CP52k were assigned to more than one orthogroup. For example, 22 CP52k homologs from M. longirostrum were assigned to six different orthogroups, with two to six CP52k homologs assigned to each orthogroup. The results suggest that (1) there were significant duplications of the CP52k gene and (2) the resulting CP52k paralogous sequences have more than one ortholog within the examined species. Remm and colleagues referred to such a phenomenon as an “in-paralog”, in which gene duplication occurred after a speciation event in one genome but not in the orthologous counterpart in the other genome [39]. More interestingly, internal repeating motif regions were observed within the paralogues of CP19k, CP20k, CP43k, and CP52k (Fig. 7), suggesting that intragenic domain duplication occurred in addition to gene duplication. These repetitive motifs were conserved across the examined taxa but the number of motifs varies in different species, suggesting a common ancestral origin but random duplication of these motif regions in different species. Protein repeats could arise from exon duplications and/or transitions from existing genes within proteins that possess regular secondary structures. The variations in numbers of repeats indicate the rapid loss and/or gain of repeats in the evolution of CP19k, CP20k, CP43k, and CP52k in different barnacle species. On the other hand, relatively fewer paralogs and a lack of internal repeat structures among CP100k sequences of the examined species suggested a completely different evolutionary constraint on this CP.

Protein repeats possess regular secondary structures and form multirepeat assemblies in three dimensions of diverse sizes and functions. They often have specific roles, such as protein-protein interaction, protein-biomolecule binding, and formation of fibrous structures. Examples of important protein repeats include Leucine-rich repeats in pattern recognition proteins, PPxxPxPPx repeats in Homeobox protein (summarized in [40]), GXY (where X and Y are frequently occupied by Pro and Hyp, respectively) or PXG repeats in collagen [41] and GA repeats in silk proteins [42]. In squid teeth, the mechanical properties of squid ring teeth proteins increase as a function of the number of tandem repeats [43]. For barnacle cement proteins, those repetitive regions may directly relate to substrate binding capacity because the presence of internal repetition increases the available binding surface area of the protein.

While the exact functions of those repetitive motifs in CPs remain unclear, the remarkable CP gene duplication and domain duplication of CP19k, 20k, 43k, and 52k have resulted in a pool of CPs with diverse chemical and biochemical properties. This is best illustrated by M. longirostrum CP52k paralogs, which are highly diverse in terms of the overall protein charge and hydrophobicity. M. longirostrum is embedded and attached to sponge tissue, which is a composite of organic matrix and inorganic sponge spicules. Development of a highly diverse cement protein cocktail may enable the barnacle to accommodate environments with a variety of substrata. A remarkable repertoire of CP paralogs was also recovered from the rock-attaching species C. malayensis and A. amphitrite, and the sea turtle-inhabiting barnacle C. testudinaria, suggesting that duplications of the CP19k, 20k, 43k, and 52k genes might be a common genetic feature in thoracican barnacles.

Conclusions

In conclusion, our results suggest that cement protein composition and its chemical properties might vary according to the base material of the barnacle species. Instead of developing highly specific cement proteins, barnacles seem to attach to different substrata by relying on a highly duplicated genetic toolkit that generates paralogous CP sequences with diverse chemical and biochemical properties. Such versatility may explain why many barnacle species are extremely cosmopolitan and diverse in their substratum choice.

Methods

Sample collection

A total of 12 barnacle species were collected from a wide variety of taxonomic groups covering diverse morphologies, host substrata, and base materials (Fig. 1; Table 1). Specimens of the intertidal stalked barnacle Capitulum mitella, acorn barnacle Amphibalanus amphitrite, and membranous base high-shore barnacles Chthamalus malayensis and Tetraclita japonica formosana were collected from rocks in Shen-Ao-Kang, Taiwan (25°12’36.59 N, 121°81’97.66E). Specimens of coral reef barnacles, including Megabalanus ajax (on fire corals), Membranobalanus longirostrum (in sponges), Pectinoacasta sp. (in sponges), Wanella milleporae (on fire corals) and Galkinia sp. (on scleractinian corals) were collected from Green Island, Taiwan (22°40’45.03 N, 121°30’11.98E). Specimens of the oceanic epibiotic barnacles Chelonibia testudinaria, Conchoderma hunteri and Lepas anatifera were collected from crabs (C. testudinaria) at fish markets, ropes underneath buoys (C. hunteri) and floating pieces of wood (L. anatifera) in Taiwan.

Histological examination of cement glands

The somatic body (= prosoma) and ovarian tissue (which often mix with cement glands), located at the base region of barnacles, were fixed in 10 % formalin in seawater for one week, and rinsed in freshwater for 10 min before dehydration. The tissues were dehydrated in ascending concentrations of ethanol (75, 95, and 100 % for serial dehydration for 1 h for each concentration), then immersed in xylene (3 h) and embedded in paraffin. The specimens were then cut at a sagittal angle into 10-µm sections using a microtome (Leica). Hematoxylin (4 min) and eosin (2 min) (HE) staining were performed on distinct paraffin sections (see Kiernan 2008 for details on the histological preparation methods). The HE-stained sections were observed using a compound microscope, and cement glands were identified following the morphological description in [18].

RNA extraction, cDNA library preparation, and sequencing

For ten of the 12 species (all except A. amphitrite and T. j. formosana, for which transcriptome data had already been published), tissue samples (single individual of each species) of the prosoma and base/peduncle (where the cement gland is located) were carefully isolated without contaminating each other. The total RNA of the prosoma and base was extracted using TRIzol® Reagent (Invitrogen, Camarillo, CA) and High Pure RNA Isolation Kit (Roche Applied Science, Germany), respectively. RNA quality assessment was conducted by a Bioanalyzer 2100 with RNA 6000 labchip kit (Agilent Technologies, Santa Clara, CA, USA). cDNA libraries for both parts of all 10 species were prepared using Illumina TruSeq RNA Sample Prep Kits v2 and subsequently sequenced on a HiSeq™ 2500 (High-Throughput Mode v4, paired-end 125 base-pair reads) at the High Throughput Genomics Core, Biodiversity Research Center, Academia Sinica, Taiwan, according to the manufacturer’s instructions (Illumina, San Diego, CA).

Sequence de novo assembly, annotation, and mRNA expression pattern

For cross-species comparison, RNA reads from both prosoma and base/peduncle samples in each species were combined into a single input to obtain a consolidated set of contigs. The sequence read quality was checked with FastQC v0.11.5 (Babraham Bioinformatics, Cambridge, UK) and filtered using Trimmomatic v0.35 [44] to discard adaptor sequences and low-quality reads. The trimmed reads were then assembled de novo with Trinity v2.2.0 [45]. Contigs with amino acid sequence similarity higher than 80 % were clustered and regarded as reference sequences, or transcripts. Transcripts with a sum of read counts from base/peduncle and prosoma of no more than 1 were discarded. Gene annotation of each transcript was performed by BLASTx against the NCBI Arthropoda database and non-redundant protein database with an E value of 1e-6 as the cutoff point. Gene expression level was quantified by mapping all short reads of paired-end data to the transcripts in Bowtie [46]. Then RSEM (RNA-Seq by Expectation Maximization) [47] was used to calculate FPKM (fragments per kilobase of transcript per million mapped reads) values. The FPKM measure normalized the raw reads of each contig to avoid bias from transcript length and sequencing depth. The differential expression level between base and prosoma was tested with DESeq2 [48] using negative binomial generalized linear models. Salmon [49] was used to calculate the TPM (Transcripts Per Million) values of transcripts for subsequent comparative analyses.
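The two expression units used above can be illustrated with a minimal Python sketch. It uses the textbook definitions of FPKM and TPM with raw transcript lengths; RSEM and Salmon estimate effective lengths and fragment assignments internally, so this is a simplification rather than a reimplementation of either tool. The counts and lengths are hypothetical.

```python
def fpkm(counts, lengths):
    """FPKM_i = count_i * 1e9 / (length_i * total mapped fragments)."""
    total = sum(counts)
    return [c * 1e9 / (l * total) for c, l in zip(counts, lengths)]


def tpm(counts, lengths):
    """TPM_i = (count_i / length_i) / sum_j(count_j / length_j) * 1e6."""
    rates = [c / l for c, l in zip(counts, lengths)]
    denom = sum(rates)
    return [r / denom * 1e6 for r in rates]


# Hypothetical fragment counts and transcript lengths (bp) for three transcripts.
counts = [100, 300, 600]
lengths = [1000, 1500, 3000]

fpkm_values = fpkm(counts, lengths)
tpm_values = tpm(counts, lengths)
```

By construction, TPM values within one sample always sum to one million, which is one reason TPM is convenient for comparing the relative abundance of a transcript across samples.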

Ortholog clustering

Prior to the in-silico protein translation step, transcripts with FPKM below 0.01 in both base and prosoma were removed. From the transcriptome assembly of each examined species, a translated protein database was generated from candidate coding regions within contigs using Transdecoder v3.0.0 [45]. BLASTp was performed on the translated protein databases against an assembly of translated protein sequences predicted from the genome data of the acorn barnacle Amphibalanus amphitrite [50] using Diamond v0.9.30.131 [51], with the default E value setting and -max_target_seqs set to 1. The BLASTp tabular output file was used as the reference file for the parameter --retain_blastp_hit in the TransDecoder.Predict step. Completely identical protein sequences in the translated protein database of each species were removed by cd-hit with an identity threshold of 1 (-c 1). Ortholog identification was performed using Orthofinder with default settings [52]. Single-copy orthologs were aligned and a maximum-likelihood (ML) phylogenetic tree was constructed (with 100 bootstraps) as described in Lan et al. [53]. Transcriptome data of the parasitic barnacle Sacculina yatsui (BioProject no. PRJDB8012) was incorporated as the outgroup to the examined thoracican barnacle species.
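The filtering logic described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names and the toy identifiers are hypothetical, exact-string deduplication is a simplification of what cd-hit does at -c 1, and the orthogroup table is a hand-written stand-in for Orthofinder output.

```python
def keep_expressed(fpkm_by_tissue, threshold=0.01):
    """Keep transcripts unless FPKM is below the threshold in BOTH tissues."""
    return [tid for tid, (base, prosoma) in fpkm_by_tissue.items()
            if base >= threshold or prosoma >= threshold]


def drop_identical(proteins):
    """Keep the first record of every distinct protein sequence."""
    seen, kept = set(), {}
    for pid, seq in proteins.items():
        if seq not in seen:
            seen.add(seq)
            kept[pid] = seq
    return kept


def single_copy_orthogroups(orthogroups, species):
    """Return orthogroups with exactly one member in every species."""
    return [og for og, members in orthogroups.items()
            if all(len(members.get(sp, [])) == 1 for sp in species)]


# Hypothetical inputs: (base FPKM, prosoma FPKM) per transcript.
fpkm_table = {"t1": (0.5, 0.0), "t2": (0.001, 0.005), "t3": (0.0, 0.02)}
expressed = keep_expressed(fpkm_table)

unique = drop_identical({"p1": "MKT", "p2": "MKT", "p3": "MKV"})

groups = {
    "OG1": {"spA": ["a1"], "spB": ["b1"]},
    "OG2": {"spA": ["a2", "a3"], "spB": ["b2"]},  # spA carries two paralogs
}
single_copy = single_copy_orthogroups(groups, ["spA", "spB"])
```

In this toy example OG2 is excluded from the tree-building set because one species carries two copies, which is the kind of lineage-specific duplication discussed for the CP genes.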

Barnacle cement protein identification and bioinformatics

Barnacle cement proteins were recovered from the NCBI webpage interface for protein sequences (https://www.ncbi.nlm.nih.gov/protein/) using the search command “(cement protein[Protein Name] OR (cement[All Fields] AND protein[All Fields])) AND "Thoracica"[ORGN]”. Barnacle cement protein sequences from Amphibalanus amphitrite and Megabalanus rosa were selected and downloaded for the subsequent analyses. The barnacle cement proteins were named based on Kamino et al. [23], who labeled the identified protein bands based on the estimated molecular weight in the SDS-PAGE. These M. rosa cement proteins included CP19k, CP20k, CP52k, CP68k (not deposited to NCBI) and CP100k, in which the number and “k” reflect the estimated molecular weight of the protein in kilodaltons (kDa). This naming system was adopted for all subsequent studies relating to these reported barnacle cement proteins. So et al. [38] reported an extensive list of M. rosa cement protein homolog sequences. Their proteomic-transcriptomic approach identified three cement protein sequences from a 43 kDa protein band in their SDS-PAGE analysis. These 43 kDa proteins were reportedly homologous to a M. rosa 68 kDa protein (CP68k) [27]. Since M. rosa CP68k was not deposited into NCBI, we referred to the protein sequences homologous to A. amphitrite 43 kDa and M. rosa CP68k as “CP43k” homologs in the present study.

CP19k, CP20k, CP43k, CP52k, CP100k and CP114k (from A. amphitrite, reported by [38]) were used as the query in a tBLASTn search against the transcriptome (cDNA) databases, with an E value cutoff of 1e-5. Alignment of homologous barnacle cement protein sequences was performed using MUSCLE in MEGA7 [54]. The amino acid composition of each CP homolog sequence was computed using the function “readAAStringSet” in the R package “Biostrings” [55]. Protein isoelectric points (pI) were predicted using the web-based interface of the IPC (Isoelectric Point Calculator) [56]. “Average pI” was taken as the predicted pI of the protein sequence. The GRAVY hydropathy index prediction was performed using GRAVY Calculator (http://www.gravy-calculator.de/index.php). Conserved motif identification was performed using MEME v5.2.0 [57], with the number of expected motifs set to 10 (-nmotifs 10), motif length ranging from 6 to 150 amino acids (-minw 6 -maxw 150), motif distribution set as “any number of repetition (anr)” (-mod anr) and the background model as zero order (-markov_order 0).
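For readers who want to see what the three sequence-level descriptors measure, the following Python sketch computes amino acid composition, the GRAVY index (mean Kyte-Doolittle hydropathy) and a simple pI estimate. It is not the workflow used in this study: the pI here comes from a single illustrative pKa set (values commonly attributed to the EMBOSS scale, which is an assumption), whereas the IPC “Average pI” combines several pKa sets, and the function names and test peptides are hypothetical.

```python
from collections import Counter

# Kyte-Doolittle hydropathy scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5,
    "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9,
    "M": 1.9, "F": 2.8, "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9,
    "Y": -1.3, "V": 4.2,
}

# Illustrative pKa values (assumed EMBOSS-style set, not the IPC scale).
PKA_POS = {"K": 10.8, "R": 12.5, "H": 6.5}
PKA_NEG = {"D": 3.9, "E": 4.1, "C": 8.5, "Y": 10.1}
PKA_NTERM, PKA_CTERM = 8.6, 3.6


def composition(seq):
    """Fraction of each amino acid in the sequence."""
    return {aa: n / len(seq) for aa, n in Counter(seq).items()}


def gravy(seq):
    """Grand average of hydropathy: mean Kyte-Doolittle value."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def net_charge(seq, ph):
    """Henderson-Hasselbalch net charge at a given pH."""
    counts = Counter(seq)
    pos = 1 / (1 + 10 ** (ph - PKA_NTERM))
    pos += sum(counts[aa] / (1 + 10 ** (ph - pka)) for aa, pka in PKA_POS.items())
    neg = 1 / (1 + 10 ** (PKA_CTERM - ph))
    neg += sum(counts[aa] / (1 + 10 ** (pka - ph)) for aa, pka in PKA_NEG.items())
    return pos - neg


def isoelectric_point(seq, tol=1e-4):
    """Bisect on pH (0 to 14) for the point where net charge crosses zero."""
    lo, hi = 0.0, 14.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if net_charge(seq, mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2
```

A positive GRAVY value indicates an overall hydrophobic sequence and a negative value a hydrophilic one; a lysine-rich peptide yields a high pI and an aspartate-rich peptide a low one, which mirrors the basic versus acidic distinction drawn for the CP paralogs.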

Availability of data and materials

The datasets generated and/or analysed during the current study are available in this published article and its supplementary information files. The datasets are also available from the GenBank repository, Biosample account numbers SAMN13956698-701, SAMN16953520-23, SAMN16963732-34, SAMN17491171-74, SAMN17525968-71, SAMN17573517-20, SAMN17573696-99, SAMN17573706-09, SAMN17598410-13, SAMN17598515-18, SAMN18042803-06. https://www.ncbi.nlm.nih.gov/genbank/.

References

  1. Chan BKK, Høeg JT. Diversity of lifestyles, sexual systems, and larval development patterns in sessile crustaceans. In: Thiel M, Watling L, editor. Lifestyles and Feeding Biology, The Natural History of the Crustacea, vol. 2. New York: Oxford University Press; 2015. pp. 14–34.

  2. Yu M-C, Dreyer N, Kolbasov GA, Høeg JT, Chan BKK. Sponge symbiosis is facilitated by adaptive evolution of larval sensory and attachment structures in barnacles. Proceedings of the Royal Society B. 2020; 287(1927):20200300.

  3. Dreyer N, Zardus JD, Høeg JT, Olesen J, Yu M-C, Chan BKK. How whale and dolphin barnacles attach to their hosts and the paradox of remarkably versatile attachment structures in cypris larvae. Org Divers Evol. 2020; 20(2):233–249.

  4. Fitridge I, Dempster T, Guenther J, De Nys R. The impact and control of biofouling in marine aquaculture: a review. Biofouling. 2012; 28(7):649–669.

  5. Waiho K, Glenner H, Miroliubov A, Noever C, Hassan M, Ikhwanuddin M et al. Rhizocephalans and their potential impact on crustacean aquaculture. Aquaculture. 2020:735876.

  6. Aldred N, Alsaab A, Clare AS. Quantitative analysis of the complete larval settlement process confirms Crisp’s model of surface selectivity by barnacles. Proceedings of the Royal Society B: Biological Sciences. 2018; 285(1872):20171957.

  7. Liang C, Strickland J, Ye Z, Wu W, Hu B, Rittschof D. Biochemistry of barnacle adhesion: an updated review. Frontiers in Marine Science. 2019; 6:565.

  8. Høeg JT, Maruzzo D, Okano K, Glenner H, Chan BKK. Metamorphosis in balanomorphan, pedunculated, and parasitic barnacles: a video-based analysis. Integr Comp Biol. 2012; 52(3):337–347.

  9. Yule AB, Walker G. Settlement of Balanus balanoides: The effect of cyprid antennular secretion. J Mar Biol Assoc U K. 1985; 65(3):707–712.

  10. Aldred N, Clare AS. Mechanisms and principles underlying temporary adhesion, surface exploration and settlement site selection by barnacle cyprids: a short review. In: Gorb SN, editor. Functional surfaces in biology, vol. 2. Dordrecht: Springer Netherlands; 2009. pp. 43–65.

  11. Power AM, Klepal W, Zheden V, Jonker J, McEvilly P, von Byern J. Mechanisms of Adhesion in Adult Barnacles. In: Byern J, Grunwald I, editor. Biological Adhesive Systems. Vienna: Springer; 2010. pp. 153–168.

  12. Kamino K. Barnacle underwater attachment. In: Smith A, Callow J, editor. Biological Adhesives. Berlin, Heidelberg: Springer; 2006. pp. 145–166.

  13. Cheung P, Ruggieri G, Nigrelli R. A new method for obtaining barnacle cement in the liquid state for polymerization studies. Mar Biol. 1977; 43(2):157–163.

  14. Chan BKK, Prabowo RE, Lee K-S. Crustacean Fauna of Taiwan: Barnacles, Volume I - Cirripedia: Thoracica excluding the Pyrgomatidae and Acastinae, vol. 1. Keelung: National Taiwan Ocean University; 2009.

  15. Chan BKK, Chen Y-Y, Achituv Y. Crustacean Fauna of Taiwan: Barnacles, Volume II - Cirripedia: Thoracica: Pyrgomatidae, vol. 2. Taipei, Taiwan: Biodiversity Research Center, Academia Sinica; 2013.

  16. Yu M-C, Kolbasov GA, Høeg JT, Chan BKK. Crustacean-sponge symbiosis: collecting and maintaining sponge-inhabiting barnacles (Cirripedia: Thoracica: Acastinae) for studies on host specificity and larval biology. J Crustacean Biol. 2019; 39(4):522–532.

  17. Lacombe D. A comparative study of the cement glands in some balanid barnacles (Cirripedia, Balanidae). Biol Bull. 1970; 139(1):164–179.

  18. Lacombe D, Liguori VR. Comparative histological studies of the cement apparatus of Lepas anatifera and Balanus tintinnabulum. Biol Bull. 1969; 137(1):170–180.

  19. Lobo-da-Cunha A, Alves Â, Oliveira E, Cunha I. The cement apparatus of the stalked barnacle Pollicipes pollicipes. Mar Biol. 2017; 164:11.

  20. Saroyan J, Lindner E, Dooley C. Repair and reattachment in the Balanidae as related to their cementing mechanism. Biol Bull. 1970; 139(2):333–350.

  21. Otness JS, Medcalf DG. Chemical and physical characterization of barnacle cement. Comp Biochem Physiol B Comp Biochem. 1972; 43(2):443–449.

  22. Walker G. The biochemical composition of the cement of two barnacle species, Balanus hameri and Balanus crenatus. J Mar Biol Assoc U K. 1972; 52(2):429–435.

  23. Kamino K, Odo S, Maruyama T. Cement proteins of the acorn-barnacle, Megabalanus rosa. Biol Bull. 1996; 190(3):403–409.

  24. Naldrett MJ, Kaplan DL. Characterization of barnacle (Balanus eburneus and B. crenatus) adhesive proteins. Mar Biol. 1997; 127(4):629–635.

  25. Kamino K, Inoue K, Maruyama T, Takamatsu N, Harayama S, Shizuri Y. Barnacle cement proteins: importance of disulfide bonds in their insolubility. J Biol Chem. 2000; 275(35):27360–27365.

  26. Khandeparker L, Anil AC. Underwater adhesion: The barnacle way. Int J Adhesion Adhes. 2007; 27(2):165–172.

  27. So CR, Fears KP, Leary DH, Scancella JM, Wang Z, Liu JL et al. Sequence basis of barnacle cement nanostructure is defined by proteins with silk homology. Sci Rep. 2016; 6:36219.

  28. Liu JCW, Høeg JT, Chan BKK. How do coral barnacles start their life in their hosts? Biol Lett. 2016; 12(6):20160124.

  29. Jonker J-L, Morrison L, Lynch EP, Grunwald I, von Byern J, Power AM. The chemistry of stalked barnacle adhesive (Lepas anatifera). Interface focus. 2015; 5:20140062.

  30. Jonker J-L, Abram F, Pires E, Coelho AV, Grunwald I, Power AM. Adhesive proteins of stalked and acorn barnacles display homology with low sequence similarities. PLoS ONE. 2014; 9(10):e108902.

  31. Lin H-C, Wong YH, Tsang LM, Chu KH, Qian P-Y, Chan BKK. First study on gene expression of cement proteins and potential adhesion-related genes of a membranous-based barnacle as revealed from Next-Generation Sequencing technology. Biofouling. 2014; 30(2):169–181.

  32. Domínguez-Pérez D, Almeida D, Wissing J, Machado AM, Jänsch L, Castro LF et al. The quantitative proteome of the cement and adhesive gland of the pedunculate barnacle, Pollicipes pollicipes. Int J Mol Sci. 2020; 21(7):2524.

  33. Rocha M, Antas P, Castro LFC, Campos A, Vasconcelos V, Pereira F et al. Comparative analysis of the adhesive proteins of the adult stalked goose barnacle Pollicipes pollicipes (Cirripedia: Pedunculata). Mar Biotechnol. 2019; 21(1):38–51.

  34. Zheden V, Klepal W, von Byern J, Bogner FR, Thiel K, Kowalik T et al. Biochemical analyses of the cement float of the goose barnacle Dosima fascicularis–a preliminary study. Biofouling. 2014; 30(8):949–963.

  35. Machado AM, Sarropoulou E, Castro LFC, Vasconcelos V, Cunha I. An important resource for understanding bio-adhesion mechanisms: Cement gland transcriptomes of two goose barnacles, Pollicipes pollicipes and Lepas anatifera (Cirripedia, Thoracica). Marine Genomics. 2019; 45:16–20.

  36. Chan BKK, Dreyer N, Gale AS, Glenner H, Ewers-Saucedo C, Pérez-Losada M, Kolbasov GA, Crandall JA, Høeg JT. The evolutionary diversity of barnacles, with an updated classification of fossil and living forms. Zool. J. Linn. Soc. 2021; zlaa160. https://doi.org/10.1093/zoolinnean/zlaa160.

  37. Wang Z, Leary DH, Liu J, Settlage RE, Fears KP, North SH et al. Molt-dependent transcriptomic analysis of cement proteins in the barnacle Amphibalanus amphitrite. BMC Genomics. 2015; 16:859.

  38. So CR, Scancella JM, Fears KP, Essock-Burns T, Haynes SE, Leary DH et al. Oxidase Activity of the Barnacle Adhesive Interface Involves Peroxide-Dependent Catechol Oxidase and Lysyl Oxidase Enzymes. ACS Appl Mater Interfaces. 2017; 9(13):11493–11505.

  39. Remm M, Storm CE, Sonnhammer EL. Automatic clustering of orthologs and in-paralogs from pairwise species comparisons. J Mol Biol. 2001; 314(5):1041–1052.

  40. Luo H, Nijveen H. Understanding and identifying amino acid repeats. Brief Bioinform. 2014; 15(4):582–591.

  41. Knebelmann B, Deschenes G, Gros F, Hors M, Grünfeld J, Zhou J et al. Substitution of arginine for glycine 325 in the collagen alpha 5 (IV) chain associated with X-linked Alport syndrome: characterization of the mutation by direct sequencing of PCR-amplified lymphoblast cDNA fragments. American journal of human genetics. 1992; 51(1):135–142.

  42. Gatesy J, Hayashi C, Motriuk D, Woods J, Lewis R. Extreme diversity, conservation, and convergence of spider silk fibroin sequences. Science. 2001; 291(5513):2603–2605.

  43. Jung H, Pena-Francesch A, Saadat A, Sebastian A, Kim DH, Hamilton RF et al. Molecular tandem repeat strategy for elucidating mechanical properties of high-strength proteins. Proc Natl Acad Sci USA. 2016; 113(23):6478–6483.

  44. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014; 30(15):2114–2120.

  45. Haas BJ, Papanicolaou A, Yassour M, Grabherr M, Blood PD, Bowden J et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat Protoc. 2013; 8(8):1494–1512.

  46. Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009; 10(3):R25.

  47. Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011; 12:323.

  48. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014; 15(12):550.

  49. Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C. Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods. 2017; 14(4):417–419.

  50. Kim J-H, Kim H, Kim H, Chan BKK, Kang S, Kim W. Draft genome assembly of a fouling barnacle, Amphibalanus amphitrite (Darwin, 1854): the first reference genome for Thecostraca. Front Ecol Evol. 2019; 7:465.

  51. Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2015; 12(1):59–60.

  52. Emms DM, Kelly S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 2019; 20(1):238.

  53. Lan Y, Sun J, Tian R, Bartlett DH, Li R, Wong YH et al. Molecular adaptation in the world’s deepest-living animal: Insights from transcriptome sequencing of the hadal amphipod Hirondellea gigas. Mol Ecol. 2017; 26(14):3732–3743.

  54. Kumar S, Stecher G, Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016; 33(7):1870–1874.

  55. Pagès HA, Gentleman P, DebRoy S. Biostrings: Efficient manipulation of biological strings. R package version 2.56.0; 2020.

  56. Kozlowski LP. IPC - Isoelectric Point Calculator. Biol Direct. 2016; 11(55):16.

  57. Bailey TL, Williams N, Misleh C, Li WW. MEME: discovering and analyzing DNA and protein sequence motifs. Nucleic Acids Res. 2006; 34(suppl_2):W369-W373.

Acknowledgements

We thank the High Throughput Sequencing Core hosted in the Biodiversity Research Center at Academia Sinica for performing the NGS experiments and Kimforest Enterprise Co., Ltd for assisting bioinformatics analyses. We thank Dr. Andrew H.-J. Wang and Dr. Kai-Fa Huang at the Institute of Biological Chemistry, Academia Sinica for their valuable comments; Pei-Chen Tsai and Wei-Pong Hsieh for their help collecting and preparing samples; and Jennie Chien-Wen Liu for assisting with RNA sample preparation. Thanks to Dr. Ceri Lewis (University of Exeter, UK) for providing photos of turtle barnacles (Fig. 1h). Thanks also to Noah Last of Third Draft Editing for his English language editing.

Funding

This work was supported by the Taiwan Protein Project (Grant No. MOST105-0210-01-12-01), awarded to HCL, and Shenzhen University Science Foundation Fund (Project No. 860-000002110380) and the Innovation Team Project of Universities in Guangdong Province (No. 2020KCXTD023) awarded to YHW. BKKC is supported by Academia Sinica Senior Investigator Award AS-IA-105-L03. All these funds provided financial support in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author information

Contributions

HCL, YHW, and BKKC conceived and designed the experiments; HCL and BKKC conducted the sample collection; HCL and CHS generated and analyzed the transcriptome data; YHW conducted the bioinformatic analyses; BKKC performed the histological experiments and analyses; HCL and YHW led the writing with assistance from BKKC. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Benny Kwok Kan Chan.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Lin, HC., Wong, Y.H., Sung, CH. et al. Histology and transcriptomic analyses of barnacles with different base materials and habitats shed lights on the duplication and chemical diversification of barnacle cement proteins. BMC Genomics 22, 783 (2021). https://doi.org/10.1186/s12864-021-08049-4

  • DOI: https://doi.org/10.1186/s12864-021-08049-4

Keywords

  • Barnacle
  • Cement protein
  • Cement gland
  • Transcriptome