Pyrosequencing analysis of the human microbiota of healthy Chinese undergraduates
© Ling et al.; licensee BioMed Central Ltd. 2013
Received: 23 January 2013
Accepted: 3 June 2013
Published: 10 June 2013
Skip to main content
© Ling et al.; licensee BioMed Central Ltd. 2013
Received: 23 January 2013
Accepted: 3 June 2013
Published: 10 June 2013
Elucidating the biogeography of bacterial communities on the human body is critical for establishing healthy baselines from which to detect differences associated with disease; however, little is known about the baseline bacterial profiles from various human habitats of healthy Chinese undergraduates.
Using parallel barcoded 454 pyrosequencing targeting on the 16S rRNA gene V3 region, the bacterial diversity of the nasopharynx, saliva, dominant hands, and feces were investigated from 10 healthy Chinese junior boarding undergraduates at Zhejiang University. The participants were 21–24 years of age with a body mass index (BMI) < 24 kg/m2. A total of 156,717 high-quality pyrosequencing reads were obtained for evaluating bacterial diversity, which represented 29,887 unique phylotypes. The overall taxonomic distribution of the 16S rRNA gene-based amplicons demonstrated that these 4 habitats of the human body harbored distinct microbiota and could be divided into different clusters according to anatomic site, while the established patterns of bacterial diversity followed the human body habitat (feces, hands, saliva, and nasopharynx). Although significant inter-individual variation was observed, the healthy microbiota still shared a large number of phylotypes in each habitat, but not among the four habitats, indicating that a core microbiome existed in each healthy habitat. The vast majority of sequences from these different habitats were classified into different taxonmies that became the predominant bacteria of the healthy microbiota.
We first established the framework of microbial communities from four healthy human habitats of the same participants with similar living environments for the Chinese undergraduates. Our data represent an important step for determining the diversity of Chinese healthy microbiota, and can be used for more large-scale studies that focus on the interactions between healthy and diseases states for young Chinese adults in the same age range.
An enormous number of microorganisms, the vast majority of which are bacterial species, are known to colonize and form complex communities, or microbiota, at various sites within the human body [1, 2]. Microbial cells that thrive on and within the human body are approximately 10 times more numerous than our own cells and contain, in aggregate, approximately 100 times more genes, leading to the suggestion that humans and our microbial symbionts be considered “supraorganisms” . The microbiota can be viewed as a forgotten “organ” exquisitely tuned to our physiology that performs functions that we have not had to evolve on our own . A growing body of evidence suggests that the composition and function of microbiota in different human body habitats, including nasal passages, oral cavities, skin, gastrointestinal tract, and urogenital tract, play vital roles in human development, physiology, immunity, and nutrition [1, 5–15]. With the advent of molecular techniques, especially the development of next-generation high-throughput sequencing techniques, more and more bacterial phylotypes, which were undiscovered by traditional microbiological techniques, were found in these habitats of the human body. Prior studies have shown that the taxonomic composition of the microbial community may affect the propensity to develop obesity, inflammatory bowel disease, type 1 diabetes, cardiovascular diseases, bacterial vaginosis, dental caries and so on [7, 16–22]. Studies of healthy individuals, including the Chinese population, have focused on particular body habitats, such as the gastrointestinal tract, nasopharynx, skin, and oral cavity, using high-throughput sequencing techniques [16, 20–28], and have revealed microbial communities with significant intra- and inter-individual variations [27, 29]. However, the five microbial habitats in the human body mentioned above are not isolated from one another; instead, each person comprises a complex, yet interconnected landscape consisting of many body habitats harboring distinctive microbiota . Recently, the Human Microbiome Project (HMP) has analyzed the largest cohort and set of distinct, clinically relevant body habitats from healthy adults of a Western population to characterize the ecology of human-associated microbial communities using 454 pyrosequencing and Illumina sequencing [31, 32]. However, little is known about the overall structure of microbiota from different body habitats in healthy Chinese individuals. A good definition of commensal microbiota in these habitats and an understanding of the relationship to health are essential in preventing and combating disease.
The purpose of the present study was to characterize the baseline bacterial communities from various human habitats, such as the nasopharynx, saliva, dominant hand and feces, of healthy male and female Chinese junior undergraduates boarding at Zhejiang University using parallel barcoded 454 pyrosequencing targeting the 16S rRNA gene V3 hypervariable region to gain a “whole-body” view [33, 34]. The bacterial profiles of these healthy undergraduates, with the same living environmental circumstances and nearly the same living habits, such as diet, rest, exercise, and hygiene in the university, could represent the real healthy microbiota prevailing in Chinese undergraduates, which might be useful for human microbiota-related diseases analyzing the same age range.
Determining the role of our microbiota in disease predisposition and pathogenesis will critically depend upon first defining it in healthy states. Previous studies have shown that there are fundamental differences in the composition and structure of bacterial communities in the same habitats within the subjects of different ethnic and racial backgrounds [35, 36]. However, most of the studies investigating microbial composition within a specific habitat using deep sequencing analysis have focused on sampling people from the West and some Asian countries [26, 27, 31, 37, 38], and little is known about the overall structure and composition of the healthy Chinese microbiota. This imbalance needs to be addressed to more fully understand the interactions between humans and their microbiota. Human microbial communities were strongly affected by external factors, such as lifestyle, dietary patterns, antibiotic usage, and host genotype [16, 39–44]. In China, undergraduate students, often boarding at the same university for 4 or 5 years, have adapted to the specific environmental circumstances of the university and formed almost the same lifestyle features, such as diet, rest, exercise, and hygiene after 2 years. Thus, Chinese undergraduate boarders might be an excellent model to explore the representative microbiota of the human body in the same age range. Unlike previous studies, the well-controlled similar living environmental circumstances for these participants could reduce the influence of various environmental factors on the healthy human microbiota and minimize the variations between participants [27, 31, 32]. The present study characterized the composition and structure of the healthy microbiota of the nasopharynx, saliva, dominant hands and feces from Chinese male and female boarding undergraduates, and established the baselines for human healthy microbiota, which might be useful in understanding the relationship between health and prevailing diseases in the Chinese population in the same age range. Because most of the undergraduates were not married, the vaginal microbiota from female participants was not included, as gynecologic examinations are not included in routine physical examinations for female participants in China. With nearly the same living environments and habits, as mentioned above, the diversity profiles of bacterial communities from these habitats could be considered as representative microbiota of healthy human bodies. Samples were only collected from a single time point, our data are in agreement with the conventional sampling habits in other studies [23, 26, 45, 46]. Recent studies have also shown that within-subject variation over time is consistently lower than between-subject variation, and that the uniqueness of the microbial community in each individual appears to be relatively stable over time (relative to the population as a whole), which suggests that a set of microbiomes in a given healthy habitat can be considered a core microbiome [2, 31].
Comparison of phylotype coverage and diversity estimation of the 16S rRNA gene libraries at 3% dissimilarity from the pyrosequencing analysis
Good’ s 2
95% C. I .
95% C. I .
It is necessary to define the salivary microbiota of the healthy human body before determining the role of oral bacteria in diseases. Nine phyla, including Actinobacteria, Bacteroidetes, Firmicutes, Fusobacteria, Proteobacteria, Spirochaetes, Tenericutes, and two candidate divisions (SR1, TM7), were found . The vast majority of sequences in saliva (99.2%) belonged to one of the following five major phyla: Actinobacteria (4.9%), Bacteroidetes (16.8%), Firmicutes (58.6%), Fusobacteria (3.0%) and Proteobacteria (15.9%) (Figure 7). In fact, the diversity of salivary microbiota was quite different from those in different ages and ethnic groups . When compared with the salivary microbiota from Chinese undergraduates, Bacteroidetes (28.8%), Fusobacteria (5.2%), and Proteobacteria (19.5%) were overrepresented in healthy Chinese children , while Firmicutes sequences were more abundant in adult. These disparities might be due to different hygiene habits, different dietary constitution, and different physiologic changes, which were clearly associated with age. When compared with healthy adult Americans reported by Keijser (2008), a higher relative abundance of Bacteroidetes (26.6%) and Proteobacteria (19.9%), and a lower relative abundance of Firmicutes (38.5%) were found in salivary microbiota of Americans, which might be ascribed to different lifestyles and different host genotypes . At the genus level, sequences from saliva represented 105 different genera (88 genera from females and 80 genera from males). The most frequently detected genera in saliva (OTU proportions > 1%) were Streptococcus (45.5%), Prevotella (10.1%), Neisseria (7.4%), Haemophilus (5.1%), Porphyromonas (4.7%), Gemella (3.9%), Rothia (3.0%), Granulicatella (2.2%), Fusobacterium (2.1%), Actinomyces (1.8%), Veillonella (1.7%) and Aggregatibacter (1.3%) (Figure 7 and Figure 8B). The overall taxonomic distribution of the 16S rRNA gene-based amplicons at the genus level mentioned above was in general agreement with previous findings , although the relative abundance of several genera was not consistent with the data herein, especially the relative abundance of Streptococcus. The higher relative abundance of Streptococcus, an acid-producing cariogenic bacterium, was more abundant in the Chinese population. The above-mentioned genera provide a framework for Chinese healthy salivary microbiota diversity and investigations of the environmental, dietary, and genetic factors that influence salivary microbiota in individuals. The continuous flow of saliva made it impossible to obtain a stable salivary microbiota, but only captured the temporal stability of salivary microbiota at a single time point.
In the dominant hands, 14 phyla, including Actinobacteria, Bacteroidetes, Chloroflexi, Cyanobacteria, Deinococcus-Thermus, Firmicutes, Fusobacteria, Proteobacteria, Spirochaetes, Tenericutes, Verrucomicrobia, and three candidate divisions (OP10, SR1, and TM7) were found. Most of the sequences (97.6%) were divided into one of the following five major phyla: Actinobacteria (19.7%), Bacteroidetes (7.5%), Deinococcus-Thermus (9.8%), Firmicutes (23.7%) and Proteobacteria (36.9%) (Figure 7). Interestingly, the healthy skin microbiota from Chinese was the only site in which Proteobacteria predominated when compared with other human habitats. One predominant phylum, Deinococcus-Thermus, was detected in healthy skin microbiota with a relatively high abundance, which has been recognized solely in this habitat. Bacteria from the phylum Deinococcus-Thermus are known for their resistance to extreme stresses, including radiation, oxidation, desiccation, and high temperature. Perhaps due to the special environmental conditions of the hand (e.g., sebum production, salinity, and hydration), Deinococcus-Thermus was found with a higher relative abundance in the skin microbiota. However, the phylum of Deinococcus-Thermus could not be recognized as the predominant bacteria in skin through conventional 16S rRNA gene full-length cloning and sequencing technique , which might be ascribed to the methodologic constraints, such as lower resolution . At the genus level, sequences from skin represented 351 different genera (277 genera from females and 262 genera from males). The most frequently detected genera in skin (OTU proportions > 1%) were Enhydrobacter (19.8%), Deinococcus (8.1%), Staphylococcus (7.6%), Propionibacterium (7.6%), Corynebacterium (6.5%), Lactobacillus (6.1%), Acinetobacter (4.3%), Chryseobacterium (3.8%), Streptococcus (2.6%), Finegoldia (1.8%), Sphingomonas (1.2%), and Prevotella (1.1%) (Figure 7 and 8C). Compared with the study conducted by Fierer et al. (2008), the taxonomic composition of skin microbiota of healthy Chinese undergraduates was quite different from healthy Americans . In the present study, the genus, Enhydrobacter (Proteobacteria phylum), was the most predominant bacteria; however, the role of Enhydrobacter on skin microbiota is still unknown. Other genera, as mentioned above, have previously been found to be abundant in other molecular surveys of skin bacteria [24, 25, 53] and are considered to be common skin residents. A previous study showed that the male and female students harbored significantly different bacterial communities on the hand surfaces . However, similar results between the two genders could not be obtained in Chinese adults. Our current pyrosequencing analysis of bacterial 16S rRNA gene amplicons generated directly from superficial human palms raises the possibility of better understanding of the normal microbial ecology of the skin, and for studying the role of novel microbes or microbial communities.
With respect to the gastrointestinal tract, the composition and structure of the fecal microbiota has been recognized in a number of studies. In the current study, nine phyla, including Actinobacteria, Bacteroidetes, Deinococcus-Thermus, Firmicutes, Fusobacteria, Proteobacteria, Spirochaetes, Tenericutes, and Verrucomicrobia, were found. The two most abundant phyla, Bacteroidetes (10.5%) and Firmicutes (87.1%), constituted > 97% of the total number of sequences and should be considered as the predominant bacteria in this habitat (Figure 7). At the genus level, sequences from feces could be classified as 119 different genera (85 genera from females and 105 genera from males). The most dominant taxonomic groups in fecal microbiota (OTU proportions > 1%) were Faecalibacterium (14.8%), unclassified Lachnospiraceae (12.7%), Coprococcus (11.9%), Blautia (10.6%), Pseudobutyrivibrio (9.3%), Bacteroides (7.9%), Roseburia (6.6%), unclassified Clostridiales (2.5%), unclassified Ruminococcaceae (2.4%), Ruminococcus (2.2%), Subdoligranulum (2.1%), Sporacetigenium (2.1%), Oscillibacter (1.6%), Dorea (1.5%), Phascolarctobacterium (1.4%) and Prevotella (1.0%), which accounted for > 90% of the total sequences (Figures 7 and 8D). Except for Bacteroides and Prevotella (members of the order Bacteroidales), other predominant genera belonged to the order Clostridiales. This community was very similar to that reported for the human ileum and likely represented the normal gastrointestinal microbial population . As shown in previous studies, these resident bacteria constitute a healthy gastrointestinal microbiota and exert a conditioning effect on gastrointestinal homeostasis, delivering regulatory signals to the epithelium and instructing mucosal immune responses [5, 8].
There were some limitations to the current study. First, a relatively small number of healthy unmarried Chinese undergraduates participated in the study, which would affect the comprehensive understanding of healthy human microbiota. The microbiota from healthy male and female urogenital tracts of Chinese adults would effectively complete the framework of the human microbiome. Second, only samples from these habitats at a single time point were included; samples from more time points would verify the representative healthy human microbiota. Third, it still could not be ignored the wide variations observed among participants in present study. Living in the similar conditions, the wide OTU differences reported (104–777 in saliva; 219–1215 in feces; 95–1148 in dominant hand; 103–439 in nasopharynx) might be due to the different sequencing depth and different host genotype. A much greater sequencing depth was needed for establishing a database of Chinese healthy microbiota.
In summary, the framework of microbial communities from four healthy human habitats of the same participants has been established for Chinese undergraduates for the first time. The overall structure of healthy microbiota from these habitats was quite different from each other, which might be due to the specific microenvironments of these anatomic locations. As commensal or symbiotic bacteria, these healthy microbial communities co-exist with their specific habitats in the long-term evolutionary process. Hence, the data herein represent an important step for determining the diversity of the Chinese healthy microbiota, and can be used for additional large-scale studies focusing on the interactions between healthy and disease states for young Chinese adults in the same age range. In fact, the essential roles of microbiota in human health and disease, especially novel microbes, are gaining recognition. With an integrated “whole-body” view, many human complex diseases are associated with microbiota from more than one habitat, such as cardiovascular diseases . The role of microbiota in such human diseases will provide new insights regarding pathogenesis and treatment, and can be utilized as a novel therapeutic target for antibiotics, prebiotics, and probiotics, which indicates that the maintenance or restoration of normal microbiota plays a vital role in human health.
Ten healthy junior undergraduates 21–24 years of age boarding at the College of Medicine of Zhejiang University, including five male and five female students from two dormitories, were recruited in the present study. Having stayed in the university for > 2 years, these participants were adapted to the same living environmental circumstances, such as the same living spaces, dining hall, and drinking water, and formed nearly the same living habits, such as diet, rest, exercise, and hygiene (including brushing teeth and washing in the morning and evening). Before sampling, the students presented to our hospital in July 2010 for routine medical examinations to confirm each subject’s health status. The following exclusion criteria were followed: body mass index (BMI) > 24 kg/m2; hypertension; diabetes mellitus; dyslipidemia; allergic rhinitis; asthma; pharyngitis; dental caries; periodontal disease; various dermatoses; gastrointestinal diseases (diarrhea, constipation, ulcers, bleeding, and tumors); the use of antibiotics, probiotics, prebiotics, or synbiotics in the last 30 days; vegetarian; smoking; alcohol consumption; menstruation at the time of sampling; and known active bacterial, fungal, or viral infections. Informed written consent was obtained from all participants prior to enrollment with approval of the Ethics Committee of the First Affiliated Hospital, College of Medicine of Zhejiang University (Zhejiang province, China).
After enrollment, specimens (nasopharyngeal swabs, saliva, dominant palmar swabs, and feces) were collected in our hospital from every participant before washing hands, brushing teeth, or having their morning breakfast. Nasopharyngeal specimens were obtained with flexible, sterile, dry swabs, which could reach the posterior nasopharynx easily (approximately 2 inches) and were gently rotated five times around the inside of both nostrils while applying constant pressure. The palmar surfaces of the dominant hands (usually the right hand) were swabbed with cotton-tipped swabs moistened with a solution of 0.15 M NaCl and 0.1% Tween-20. The entire palmar surface was swabbed in two perpendicular directions to ensure that the maximum surface area of the palm was represented in the sample. A 1-ml spontaneous, unstimulated whole saliva specimen was collected into a sterile cryogenic vial (Corning, NY, USA) and fecal samples (300 mg/tube) were collected in sterile tubes. Saliva and fecal samples were put in aliquots before freezing. All samples for bacterial genomic DNA extraction were transferred immediately to the laboratory in an ice box, and stored at −80°C after preparation within 15 min.
The bacterial cells retrieved on nasopharyngeal swabs and palmar swabs were submerged in 1 ml of sterile normal saline (prepared with RNase-free H2O [pH 7.0]) and vigorously agitated to dislodge cells, while unused swabs served as negative controls, which were handled in the same way. The cells were pelleted by centrifugation (Thermo Electron Corporation, Boston, MA, USA) at full speed (≥ 10,000 g) for 10 min, washed by re-suspending the cells in sterile normal saline, and centrifuged at full speed for 5 min. Then, bacterial DNA was extracted from the swabs using a QIAamp® DNA Mini Kit (QIAGEN, Hilden, Germany) according to the manufacturer’s instructions, with the following minor modification: samples were agitated with 100 mg of zirconium beads (0.1 mm) in a Mini-beadbeater (FastPrep, Thermo Electron Corporation) for 2 min and then incubated at 56°C for 1 h in lysis solution containing proteinase K [20, 21]. Bacteria were pelleted from saliva by centrifugation at full speed for 10 min. Then, bacterial genomic DNA was extracted using a QIAamp® DNA Mini Kit with the aforementioned modifications. Stool bacterial genomic DNA was extracted using a QIAamp® DNA Stool Mini Kit (QIAGEN) according to the manufacturer’s instructions . Bacterial genomic DNA was eluted with elution buffer and stored at −20°C for further analysis.
PCR amplification of the 16S rRNA gene hypervariable V3-region was performed with universal bacterial primers, which correspond to positions 341 to 534 of the conserved Escherichia coli 16S rRNA gene sequence. Amplicon pyrosequencing was performed with standard 454/Roche GS-FLX Titanium protocols, where equal molar of PCR product from each sample was pooled. To pool and sort multiple samples in a single 454 GS-FLX run, we used a set of 8-bp barcodes designed according to Fierer et al. [20, 21, 24, 56–59]. The main criterion of the barcodes is that the adjoining nucleotides are different because the single nucleotide repeats are the main source of errors in pyrosequencing technology. The forward primer was a fusion of the 454 Life Science adaptor A, the barcode, and the 341F primer (5′-GCCTCCCTCGCGCCATCAG-NNNNNNNN-CCTACGGGAGGCAGCAG-3′). The reverse primer was a fusion of the 454 life science adaptor B, the same barcode with the forward primer, and 534R (5′-GCCTTGCCAGCCCGCTCA-NNNNNNNN-ATTACCGCGG CTGCTGG-3′). The PCR amplicon library was created for each individual DNA sample. The PCRs were carried out in triplicate 50 μL reactions with 0.6 μM each of the primer, 20 ng of template DNA, and 1× PCR reaction buffer, 2.5 U of pfu DNA polymerase (MBI, Fermentas, USA). The amplification program consisted of an initial denaturation step at 94°C for 4 min, followed by 25 cycles, where 1 cycle consisted of 94°C for 30 s (denaturation), 55°C for 30 s (annealing) and 72°C for 30 s (extension), and a final extension of 72°C for 10 min. During amplification, negative controls were also performed. Replicate PCR products of the same sample were assembled within a PCR tube. PCR was performed with a PCR thermocycler (Bio-Rad Laboratories, Hercules, California, USA). Prior to sequencing, the DNA concentration of each PCR product was extracted with the MiniElute® Gel Extraction Kit (QIAGEN) and quantified on a NanoDrop ND-1000 spectrophotometer (Thermo Electron Corporation). The products from different samples were mixed at equal ratios for pyrosequencing with the Genome Sequencer FLX (GS-FLX) system (Roche, Basel, Switzerland) at 454 Life Sciences. Emulsion PCR and pyrosequencing were performed according to the manufacturer’s recommendations . Because samples were pooled by equal mass, a variation in the number of sequences recovered from each sample likely reflected slight biases in PCR efficiency among the primer barcodes.
Raw pyrosequencing reads obtained from the sequencer were denoised using the “pre.cluster” command (http://www.mothur.org/wiki/Pre.cluster) in mothur platform (version 1.25.0; http://www.mothur.org)  to remove sequences that are likely due to pyrosequencing error . Using mothur implementation of ChimeraSlayer (http://www.mothur.org/wiki/ChimeraSlayer), chimera sequences arising from the PCR amplification were detected and excluded from the denoised sequences . All pyrosequencing reads were filtered using barcode and primer sequences using a combination of tools from mothur and custom Perl scripts. The resulting sequences were further screened and filtered for quality and length. Sequences were included in the subsequent analysis only if the sequences met all four of the following criteria: (1) the sequence carries the correct barcode and exact match to the primer in at least one end; (2) the sequence carries the correct primer sequence at the other end, even though the barcode is absent; (3) the sequence has a length > 160 nucleotides (excluding barcode and primer A sequences) ; and (4) the sequence has no ambiguous bases (Ns) and homopolymers longer than 8 nucleotides. A total of 156,717 high-quality pyrosequencing reads were produced according to barcode- and primer-sequence filtering, which were used for the following analysis.
The high-quality pyrosequencing reads were assigned to samples according to barcodes (details shown in Additional file 3: Table S1). Sequences were aligned in accordance with the SILVA alignment [61, 65] and clustered into operational taxonomic units (OTUs). The OTUs that reached at a 97% similarity level were used for alpha diversity (Shannon, Simpson, and Evenness), richness (ACE and Chao1), Good’s coverage , Venn diagram, and rarefaction curve analysis using the mothur program. Taxonomy-based analyses were performed by classifying each sequence using the Naïve Bayesian Classifier program of the Michigan State University Center for Microbial Ecology Ribosomal Database Project (RDP) database (http://rdp.cme.msu.edu/) [67, 68] with a 50% bootstrap score. Phylogenetic beta diversity measures such as unweighted UniFrac distance metrics analysis was performed using OTUs for each sample using the mothur program [47, 69], and principal coordinate analysis (PCoA) was conducted according to the distance matrix created by the mothur program and visualized using SigmaPlot.
Statistical analyses were performed using the SPSS Data Analysis Program (version 16.0; SPSS Inc., Chicago, IL, USA) with a 2-tailed t-test or One-Way ANOVA. All tests for significance were two-sided, and p values < 0.05 were considered statistically significant.
The high-quality sequence data from this study has been deposited in the GenBank Sequence Read Archive with accession number SRP016050.
This present work was funded by the grants of the National Basic Research Program of China (973 program) Grant 2013CB531404 and supported by Scientific Research Foundation of the State Key Laboratory for Diagnosis and Treatment of Infectious Diseases.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.