Omic horizon expression: a database of gene expression based on RNA sequencing data

Hu, Yuzhe; Xie, Dong; Li, Xixi; Han, Wenling; Chen, Yingyu; Qi, Huiying; Wang, Pingzhang

doi:10.1186/s12864-023-09781-9

Database
Open access
Published: 08 November 2023

Omic horizon expression: a database of gene expression based on RNA sequencing data

Yuzhe Hu^1,2,
Dong Xie³,
Xixi Li^1,2,
Wenling Han^1,2,
Yingyu Chen^1,2,
Huiying Qi⁴ &
…
Pingzhang Wang^1,2

BMC Genomics volume 24, Article number: 674 (2023) Cite this article

1335 Accesses
1 Citations
Metrics details

Abstract

Background

Gene expression profiles have important significance for gene expression characteristics and further functional studies. More attention has been given to the expression databases in humans and mice, but less attention has been given to rats, while rat models also play an irreplaceable role in biomedical experiments.

Results

To depict the rat gene expression profiles in mRNA expression levels, we analyzed over 2,700 RNA sequencing (RNA-Seq) samples from 48 tissues, 40 primary cell types and 25 cell lines; and then mapped them to the latest version of the rat genome reference, mRatBN7.2. Based on these datasets and reanalysis, we constructed a new database, the Omic Horizon Expression Database (http://immudb.bjmu.edu.cn/expression.html), which allows expressional profile query of over 25,000 rat genes based on non-redundant gene symbols. The database supports requests using gene symbols (or alias), Ensemble and Entrez gene IDs. Gene expression profiles can be queried in three categories: tissues, primary cells and cell lines. Application examples including expression profiling and comparison, as well as identification of novel rat genes, were illustrated to show the utility of the database.

Conclusions

As an omic resource, the Omic Horizon Expression Database provides horizons of gene expression profiles across various tissues and cells, which greatly facilitates the identification of rat genes as well as functional clues.

Peer Review reports

Background

Gene expression profiles play an important role in biological research. The depiction of the gene expression profile helps focus on the role and significance of genes in different diseases and promote the development and application of drugs. Multiple high-throughput sequencing (HTS) techniques have been used to obtain gene expression profiles, and the most commonly used methods are DNA microarray and RNA-Seq. Microarrays have been used as a transcriptomics platform since 1990s [1], while the high-throughput RNA-Seq method has emerged only in the past two decades [2]. Compared to microarrays, RNA-Seq has a higher dynamic range for the detection of low-abundance transcripts and does not necessarily rely on a reference genome, allowing for novel transcript and variant detection [3]. The emergence of HTS has accumulated a large amount of data and made it possible to depict gene expression maps more conveniently. Therefore, benefitting from public data, many databases have been constructed for public users to analyze gene expression profiles.

The exploration of gene expression profiles of experimental animals, such as mice and rats, is useful to clarify the conserved nature of genes in different species, which is an essential premise for the safe and effective application of biological experiments in humans. Mice have a wide variety of strains and mature genome modification techniques, making them the most commonly used laboratory animals. However, in some models, rats are better suited for experimental studies than mice. In some respects, such as cognition and behavior, rats are closer to humans than mice. Compared with mice, rats are larger in size and easier to study physiological characteristics, which are suitable for scientific research on behavior, cells, physiology, biochemistry, pharmacology and toxicology [4]. They are important model animals to increase our understanding of common human diseases and are widely used in physiological studies, such as hypertension, diabetes, breast cancer and neurological diseases.

There are many expression profile databases for humans and mice, such as The Human Protein Atlas (HPA, https://www.proteinatlas.org), Genotype-Tissue Expression (GTEx, https://gtexportal.org), RNA Seq Atlas [5], Mouse Gene Expression Database (MGD, https://www.informatics.jax), Mouse Phenome Database (MPD, https://phenome.jax.org), and Tabula Muris [6]. However, the database for querying rat gene expression has not yet been amply reported. The rat BodyMap database performs RNA-Seq from 11 organs of both sexes of different age rats [7]. The Rat Genome Database [8] (RGD, http://rgd.mcw.edu, 1999) was created by the American Academy of Medical Sciences, collecting data on rat genomics, genetics, physiology and more, but without expression profiles. Additionally, there are some comprehensive databases that can satisfy queries of multiple species, such as ArrayExpress [9] and Gene Expression Omnibus (GEO) [10]. However, both databases are mainly used to store omics data. Microarray datasets in GEO can be analyzed directly online by GEO2R [10]; however, no tools aim for HTS datasets in both databases. Therefore, users without informatics skills cannot systematically analyze and compare gene expression profiles, particularly HTS data.

The analysis of RNA-Seq data generally relies on a target genome for reads mapping and a gene transfer format (GTF) file for expressional quantification. The latest genome versions, such as the human genome reference T2T-CHM13 [11], mouse genome reference GRCm39 [12], and rat genome reference mRatBN7.2 [13], comprise more comprehensive gene information, which facilitates the discovery of some previously unnoticed genes. In addition, the gene-level quantification approach generally uses a GTF file containing gene models, with each model representing the structure of transcripts produced by a given gene. Nevertheless, the gene expression profiles for rat databases mentioned above are annotated using a more previous version of the rat genome reference and GTF file. This makes a lack of genes in the results when the previous or incomplete GTF files are used for gene quantification.

Since the latest rat genome reference mRatBN7.2 was published, there has not been a rat gene expression database satisfying the retrieval requirement. In addition, the current rat gene expression databases are focused on tissues and lack expression information on primary cells and cell lines. To address this need, we constructed the Omic Horizon Expression Database, which facilitates gene expression profiling and further functional clues at multiple levels.

Construction and content

Data collection

All HTS dataset information about rat samples was derived from GEO DataSets [10] searching (https://www.ncbi.nlm.nih.gov/geo/) according to processes similar to those described previously [14, 15]. The downloaded samples were due to March 2022. RNA-Seq data, with RNA being total RNA or polyA RNA, from Rattus norvegicus were retained. The tissue and cell sources were manually annotated based on sample characteristics and sorted in ascending order. Then, the top several samples of each type of tissue and cell were selected. The selected sample size accounts for approximately 10% of the total sample. Because the dataset information was associated with the Sequence Read Archive (SRA), the SRA run (SRR) IDs were extracted from the selected dataset information. Based on the accession numbers, RNA-Seq data were downloaded via SRAtools from the SRA database (https://www.ncbi.nlm.nih.gov/sra/) and used for subsequent analysis.

Data processing

The sra format files were converted to FASTQ format files by the fastq-dump command. The program fastp [16] (https://github.com/OpenGene/fastp) was used for quality control of raw reads according to the standard pipeline. Read alignment to the reference genome of rat mRatBN7.2 was performed by the STAR program [17]. The GTF file (v105) was downloaded from the ENSEMBLES website (http://ftp.ensembl.org/pub/release-105/gtf/rattus_norvegicus/). The rat reference sequences (https://ftp.ncbi.nlm.nih.gov/refseq/R_norvegicus/) was used to help gene symbol annotation. Raw counts of each gene were first calculated with featureCounts [18], and the transcripts per kilobase million (TPM) values (see below) were further calculated and used for the subsequent analysis. During the process, samples with fewer than 5,000 detected genes were removed.

TPM value calculation

To eliminate the effects of sequencing depth and gene length, we used the TPM value to measure the gene expression level. A TPM value is calculated by the following formula: TPM_i = (\(\frac{\text{N}\text{i}}{\text{L}\text{i}}\))*10⁶/\(\left(\sum _{i=1}^{n}\frac{\text{N}\text{i}}{\text{L}\text{i}}\right)\), where N_i indicates the count of reads mapping to gene i and L_i indicates the length of exons of gene i.

Construction of the database

The Omic Horizon Expression database was based on the MVT (Model/View/Template) design pattern and implemented based on the Python web framework-Django. The data were stored and managed by a MySQL relational database (version 8.0.17). Data box plotting via Highchart. Omic Horizon Expression runs on an Apache web server (version 2.4.54). It accesses the database using mysqlclient.

Molecular cloning of Lilrb1 and Lilrb3

Male Sprague-Dawley rats were obtained from Peking University Health Science Center, Department of Laboratory Animal Science. Rats were sacrificed by carbon dioxide. Tissues were isolated and temporarily stored in TRIzol reagent (Trans#ET111-01). Total RNA in tissues was extracted and reverse transcribed into cDNA. The molecules were cloned using nested polymerase chain reaction (nested-PCR), and the primers are shown in Table 1. The outer primers were labeled F1 and R1 for the forward and reverse primers, respectively, while the inner primers were labeled F2 and R2.

Table 1 The primers for molecular cloning

Full size table

Utility and discussion

Basic description of the database

In this study, we reanalyzed the sequencing data, mapped the reads to the reference genome mRatBN7.2, and then quantified gene expression using the v105 version of the GTF file. A new database for rat gene expression, the Omic Horizon Expression Database (or abbreviated as OmicHorizon@Expression), was constructed (Fig. 1), containing the majority of tissues and cells of publicly available HTS data for rats. To date, it contains 2,762 samples in terms of the sequencing run data (Additional file 1), which are categorized into 48 tissues, 40 primary cell types, and 25 cell lines. In addition, some tissues were further divided into several different subtissues according to the anatomical site, while some primary cells were described in detail based on different tissue sources. The summary of datasets collected in the database is shown in Table 2, and the description of cell lines is shown in Table 3. It is available to query the expression profiles of over 30,000 rat genes based on Ensemble gene IDs by the Omic Horizon Expression database, which supports requests using gene symbols (or alias), Ensemble and Entrez gene IDs.

Table 2 The RNA-Seq sample information stored in the Horizon Expression database

Full size table

Table 3 Description of rat cell lines

Full size table

Query of the database

The interface is simple and easy to understand (Fig. 2a,b). On the homepage, we can select a search type from the drop-down menu as tissue, primary cell and cell line. Then, an interesting tissue or cell type is selected before the new drop-down menu pops up for further selection of subtissue or cell sources. When ‘All’ is selected, all tissues or cells under the layer will be queried to show the result. Then, we entered the query gene symbol or gene ID and clicked on the button ‘GO’ to enter the result page for the gene expression profile. Please note that not all legends on the x-axis will be displayed when there are too many retrieved tissues or subtissues, for example, when “All” is selected in the first step. In this situation, the legend option helps users refine illustration when the interested legends are selected from the drop-down list box (Fig. 2c), moreover, the full retrieved results are also shown in a table, which is downloadable.

Understand the query result

The results page includes three sections (Fig. 2c). The first section is the basic information of the queried gene, such as gene description, gene alias, gene IDs, gene biotype and chromosome location. The second section indicates the gene expression profile shown in box plots. In each plot, the x-axis represents the queried tissues, primary cells or cell lines, while the y-axis represents the expressional TPM values. A box plot consists of five data nodes, which arrange a set of data from largest to smallest, as the upper edge (Q3 + 1.5*IQR), Q3 (75th percentile), median (50th percentile), Q1 (25th percentile) and lower edge (Q1-1.5*IQR), which are calculated for each profile. The values that overflow the range from the upper edge to the lower edge are identified as the outliers, and are shown as circles in the plot. The upper and lower edges are set to the maximum and minimum values, respectively, when there are no outliers. Each value represents a sample source in the box plots. The third section is a table form showing the gene expressional TPM values of the queried gene. The box plot and table files are downloadable.

Functions of the database

The Omic Horizon Expression Database provides gene expression profiles in multiple tissues and subtissues that are beneficial for gene functional clues. The “subtissue” catalog in this database provides a more detailed expression profile. For example, the “brain” tissues comprise subtissues from different brain regions, such as the amygdala, anterior pituitary, forebrain, superficial zone and thalamus; the “breast” tissues are divided into subtissues “basal fractions” and “luminal fractions” based on sample location. There are a total of 78 subtissues that are mainly derived from the brain, kidney and muscle tissues. This function offers multiple possibilities for the retrieval and presentation of gene expression.

Expression profiles in primary cells and cell lines help to arrange further functional studies, such as gene overexpression, knockdown or knockout. From the current database version, the primary cells are derived from a total of 33 tissues and subtissues. Some primary cells have multiple sources, such as macrophages from bone marrow, brain, lung and peritoneum. Therefore, differential expression can be analyzed among different tissue sources in the same cell types.

We used several application cases in the next three sections to show how to use the Omic Horizon Expression database to solve biological problems based on gene expression profiles.

Application case 1: conventional gene expression profiling

This is the general usage to show differential gene expression in various tissues, primary cells and cell lines. For example, cytokine-like protein 1 (CYTL1), also called protein C17, is a secretory protein originally identified in human CD34⁺ cells with a predicted cytokine and interleukin structure characteristics [19]. Sequence alignment showed that CYTL1 genes are conserved in humans, rats and mice [20]. Studies have shown that human CYTL1 is highly expressed in the aorta, placenta, and trachea [20] and that mouse Cytl1 is highly expressed in the cartilage, trachea, lung, and heart [21]. Using OmicHorizon@Expression, we found that rat Cytl1 is highly expressed in cartilage and aorta (Fig. 3a), indicating conservative expression among these species and further suggesting a conservative functional role. The queried expressional values of Cytl1 can be downloaded, which provides the chance to users to generate their expression profiles of interest (Fig. 3b). In primary cells, Cytl1 showed relatively high expression in rat germ cells, breast epithelial cells and aortic smooth muscle cells (Fig. 3c).

Application case 2: species-specific gene expression profiling

Considering the large number of gene expression databases in humans and mice, it is possible to determine whether there is species-specific expression based on the current rat database. For example, lysozyme G-like 1 (LYG1) is a classical secretory protein identified through immunogenomics and belongs to the lysozyme G family [22]. The Omic Horizon Expression database shows that rat Lyg1 is highly expressed in the gingiva (Fig. 4a). The gene expression profile database indicates that human LYG1 is highly expressed in the kidney (Fig. 4b) while mouse Lyg1 is highly expressed in the stomach (Fig. 4c). The change in the expression profiles suggests that LYG1 gene should evolve actively, which may result in functional discrimination in different species.

Application case 3: novel genes identification

The database uses the latest genome and the relevant GTF version, which are useful for identifying novel genes. Immune checkpoint therapy (ICT), which is designed to block inhibitory signals mediated by immune checkpoint molecules, such as PD-1 and CTLA-4, has revolutionized the field of cancer immunotherapy because of its clinical success in many cancers [23]. However, only a minority of patients benefit from current immune checkpoint inhibitors, highlighting the need to identify novel drug targets, including novel promising inhibitory receptors. Leukocyte immunoglobulin-like receptor subfamily B (LILRB) proteins (LILRBs 1–5) contain cytoplasmic immunoreceptor tyrosine-based inhibitory motifs (ITIMs) and transduce a negative signal in multiple cell types in the tumor microenvironment, providing novel opportunities for anti-cancer immunotherapy [24, 25]. Identification of the homologous genes of LILRBs in other species, such as mice and rats, contributes to their functional studies, target validation and drug development. However, the homologous genes of LILRBs in mice and rats have not been fully identified. In the NCBI reference gene and sequence database, there are known rat Lilrb2 and Lilrb4 mRNA sequences but only predicted rat Lilrb1 and Lilrb3 genes by automated computational annotation deposited in the database. In addition, rat Lilrb1 and Lilrb3 genes are not yet recorded by the UCSC Genome Browser (http://genome.ucsc.edu) and are not currently searchable. Lilrb1 is also lacking in the previous rat GTF versions. Therefore, this prompted us to use HTS data combined with PCR to verify the existence and exon structures of Lilrb1 and Lilrb3 in the rat genome (Fig. 5).

HTS data from rat tissues support the transcriptional evidence of the rat genes Lilrb1 and Lilrb3 (Fig. 5a). The Omic Horizon expression database revealed that Lilrb1 is highly expressed in bone marrow, spleen, blood, bone, and tendons (Fig. 5b). We successfully cloned Lilrb1 from the spleen (Fig. 5c). There was one transcript of Lilrb1 containing complete open reading frames (ORFs) (Fig. 5d & Additional file 2). However, the sequence similarity between the Lilrb1 cloned transcript and the predicted transcript in the NCBI GenBank database is 91% in nucleotides (Additional file 3) and 85% in amino acids. The sequence inconsistencies may be due to the differences in animal strains, as Brown Norway rat in the NCBI reference genome but Sprague-Dawley rat in our experiment were used, suggesting that there should be active evolution of Lilrb1.

Similarly, the Omic Horizon Expression database shows that Lilrb3 is highly expressed in the spleen, liver, thymus, bone and ear (Fig. 5e). We successfully cloned Lilrb3 from the spleen and liver and confirmed the existence of rat Lilrb3 (Fig. 5c). Lilrb3 has three transcript variants, including Lilrb3-v1, -v2 and -v3, with different exon combinations (Fig. 5d & Additional file 4). The similarities between these cloned transcripts and predicted transcripts of Lilrb3 in the NCBI GenBank database are all 100% at either the nucleic acid or protein level.

Interestingly, sequence alignments based on rat Lilrb1 and Lilrb3 mRNA sequences further confirm the loss of both genes in mice because of multiple pre-stop codons in the deduced coding regions in the homologous genomic region. The protein similarities between human and rat LILRB1 and LILRB3 are 32.5% and 40.6%, respectively (Additional file 5), with similar domains and motifs (Fig. 5f). Therefore, the successful identification of rat Lilrb1 and Lilrb3 will contribute to functional studies based on rat models.

Conclusions

In this study, we constructed a new gene expression database named the Omic Horizon Expression database. It supports gene expression profiling and comparison in 48 tissues, 40 primary cell types and 25 cell lines via the query of gene symbols (or alias), Ensemble and Entrez gene ID for a total of 30,560 rat genes. Through the database, we verified the inexistence of Lilrb1 and Lilrb3 in the mouse genome but their existence in the rat genome, which provided the basis for functional studies and drug development in rat models. In the current version, only rat data can be queried, however, multiple species will be considered in the future, especially for primary cells and cell line data, to which other existing databases often receive less attention. In addition, considering the new gene annotation in the more frequently updated GTF files rather than the reference genome version, as well as the accumulation of public data leading to other tissue and cell types that have not yet been covered in the current study, we will regularly update the database.

Data availability

Omic Horizon Expression Database is free at http://immudb.bjmu.edu.cn/expression.html.

Abbreviations

HTS:: High-throughput sequencing
GEO:: Gene Expression Omnibus
GTF:: Gene transfer format
SRA:: Sequence Read Archive
TPM:: Transcripts per kilobase million
PCR:: Polymerase chain reaction
CYTL1:: Cytokine-like protein 1
LYG1:: Lysozyme G-like 1
LILRBs:: Leukocyte immunoglobulin-like receptor subfamily B
ITIMs:: Immunoreceptor tyrosine-based inhibitory motifs
ORFs:: Open reading frames

References

Schena M, Shalon D, Davis RW, Brown PO. Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science. 1995;270(5235):467–70.
Article CAS PubMed Google Scholar
Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008;5(7):621–8.
Article CAS PubMed Google Scholar
van der Kloet FM, Buurmans J, Jonker MJ, Smilde AK, Westerhuis JA. Increased comparability between RNA-Seq and microarray data by utilization of gene sets. PLoS Comput Biol. 2020;16(9):e1008295.
Article PubMed PubMed Central Google Scholar
Jacob HJ. The rat: a model used in biomedical research. Methods Mol Biol. 2010;597:1–11.
Article PubMed Google Scholar
Krupp M, Marquardt JU, Sahin U, Galle PR, Castle J, Teufel A. RNA-Seq Atlas–a reference database for gene expression profiling in normal tissue by next-generation sequencing. Bioinformatics. 2012;28(8):1184–5.
Article CAS PubMed Google Scholar
Single-cell transcriptomics. Of 20 mouse organs creates a Tabula Muris. Nature. 2018;562(7727):367–72.
Article Google Scholar
Yu Y, Fuscoe JC, Zhao C, Guo C, Jia M, Qing T, Bannon DI, Lancashire L, Bao W, Du T, et al. A rat RNA-Seq transcriptomic BodyMap across 11 organs and 4 developmental stages. Nat Commun. 2014;5:3230.
Article PubMed Google Scholar
Smith JR, Hayman GT, Wang SJ, Laulederkind SJF, Hoffman MJ, Kaldunski ML, Tutaj M, Thota J, Nalabolu HS, Ellanki SLR, et al. The year of the rat: the rat genome database at 20: a multi-species knowledgebase and analysis platform. Nucleic Acids Res. 2020;48(D1):D731–d742.
CAS PubMed Google Scholar
Sarkans U, Füllgrabe A, Ali A, Athar A, Behrangi E, Diaz N, Fexova S, George N, Iqbal H, Kurri S, et al. From ArrayExpress to BioStudies. Nucleic Acids Res. 2021;49(D1):D1502–d1506.
Article CAS PubMed Google Scholar
Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Holko M, et al. NCBI GEO: archive for functional genomics data sets–update. Nucleic Acids Res. 2013;41(Database issue):D991–995.
CAS PubMed Google Scholar
Nurk S, Koren S, Rhie A, Rautiainen M, Bzikadze AV, Mikheenko A, Vollger MR, Altemose N, Uralsky L, Gershman A, et al. The complete sequence of a human genome. Science. 2022;376(6588):44–53.
Article CAS PubMed PubMed Central Google Scholar
Long H, Reeves R, Simon MM. Mouse genomic and cellular annotations. Mamm Genome. 2022;33(1):19–30.
Article PubMed PubMed Central Google Scholar
Howe K, Dwinell M, Shimoyama M, Corton C, Betteridge E, Dove A, Quail MA, Smith M, Saba L, Williams RW, et al. The genome sequence of the Norway rat, Rattus norvegicus Berkenhout 1769. Wellcome Open Res. 2021;6:118.
Article PubMed PubMed Central Google Scholar
Wang P, Yang Y, Han W, Ma D. ImmuSort, a database on gene plasticity and electronic sorting for immune cells. Sci Rep. 2015;5:10370.
Article CAS PubMed PubMed Central Google Scholar
Hu Y, Liu C, Han W, Wang P. A theoretical framework of immune cell phenotypic classification and discovery. Front Immunol. 2023;14:1128423.
Article CAS PubMed PubMed Central Google Scholar
Chen S, Zhou Y, Chen Y, Gu J. Fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinf (Oxford England). 2018;34(17):i884–90.
Google Scholar
Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinf (Oxford England). 2013;29(1):15–21.
CAS Google Scholar
Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinf (Oxford England). 2014;30(7):923–30.
CAS Google Scholar
Liu X, Rapp N, Deans R, Cheng L. Molecular cloning and chromosomal mapping of a candidate cytokine gene selectively expressed in human CD34 + cells. Genomics. 2000;65(3):283–92.
Article CAS PubMed Google Scholar
Zhu S, Kuek V, Bennett S, Xu H, Rosen V, Xu J. Protein Cytl1: its role in chondrogenesis, cartilage homeostasis, and Disease. Cell Mol Life Sci. 2019;76(18):3515–23.
Article CAS PubMed PubMed Central Google Scholar
Zhu W, Yang X, Liu S, Wang M, Ye S, Luo H, Cui S. The involvement of cytokine-like 1 (Cytl1) in chondrogenesis and cartilage metabolism. Biochem Biophys Res Commun. 2020;529(3):608–14.
Article CAS PubMed Google Scholar
Liu H, Zhang Y, Liu Z, Wang P, Mo X, Fu W, Liu W, Cheng Y, Han W. LYG1 exerts antitumor function through promoting the activation, proliferation, and function of CD4(+) T cells. Oncoimmunology. 2017;6(4):e1292195.
Article PubMed PubMed Central Google Scholar
Sharma P, Goswami S, Raychaudhuri D, Siddiqui BA, Singh P, Nagarajan A, Liu J, Subudhi SK, Poon C, Gant KL, et al. Immune checkpoint therapy-current perspectives and future directions. Cell. 2023;186(8):1652–69.
Article CAS PubMed Google Scholar
Zhang CC. A perspective on LILRBs and LAIR1 as immune checkpoint targets for cancer treatment. Biochem Biophys Res Commun. 2022;633:64–7.
Article CAS PubMed Google Scholar
Deng M, Chen H, Liu X, Huang R, He Y, Yoo B, Xie J, John S, Zhang N, An Z, et al. Leukocyte immunoglobulin-like receptor subfamily B: therapeutic targets in cancer. Antib Ther. 2021;4(1):16–33.
CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

Not applicable.

Funding

This work was supported by grants from the National Natural Science Foundation of China (No. 31972899 and No. 32270990).

Author information

Authors and Affiliations

Department of Immunology, NHC Key Laboratory of Medical Immunology (Peking University), School of Basic Medical Sciences, Peking University Health Science Center, No. 38 Xueyuan Road, Beijing, 100191, China
Yuzhe Hu, Xixi Li, Wenling Han, Yingyu Chen & Pingzhang Wang
Peking University Center for Human Disease Genomics, No. 38 Xueyuan Road, Beijing, 100191, China
Yuzhe Hu, Xixi Li, Wenling Han, Yingyu Chen & Pingzhang Wang
School of Basic Medical Sciences, Peking University Health Science Center, No. 38 Xueyuan Road, Beijing, 100191, China
Dong Xie
Department of Health informatics and Management, School of Health Humanities, Peking University Health Science Center, No. 38 Xueyuan Road, Beijing, 100191, China
Huiying Qi

Authors

Yuzhe Hu
View author publications
You can also search for this author in PubMed Google Scholar
Dong Xie
View author publications
You can also search for this author in PubMed Google Scholar
Xixi Li
View author publications
You can also search for this author in PubMed Google Scholar
Wenling Han
View author publications
You can also search for this author in PubMed Google Scholar
Yingyu Chen
View author publications
You can also search for this author in PubMed Google Scholar
Huiying Qi
View author publications
You can also search for this author in PubMed Google Scholar
Pingzhang Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.W. designed this study, guided the data analysis, and revised the manuscript. H.Q. designed and guided the database construction. Y.H. collected and processed data, performed the experiment, and drafted, edited and revised the manuscript. D.X. deposited the data and constructed the database. X.L. performed parts of the data analysis and experiment. W.H. and Y.C. provided suggestions and revised the manuscript. All authors contributed to the article and approved the submitted version.

Corresponding authors

Correspondence to Huiying Qi or Pingzhang Wang.

Ethics declarations

Ethics approval and consent to participate

All animal procedures were approved by the Ethics Committee for Animal Use of the Peking University Health Science Center (Beijing, China) and performed in accordance with the ARRIVE guidelines (Animal Research: Reporting of In Vivo Experiments).

Consent for publication

Not applicable.

Competing interests

The authors declare no competing financial interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Additional file 1

: Brief information of 2,762 samples based on the sequencing run data. Rat strain, age, sex and the associated PubMed IDs are indicated when the information is available on the NCBI web pages

Additional file 2

: Nucleotide and deduced amino acid sequence of rat Lilrb1. The ORF of rat Lilrb1 is underlined and the deduced amino acid sequence is shown below. The boxed letters represent the sequences of the restriction endonucleases HindIII and XhoI. The nucleotide sequence of Lilrb1 has been submitted to GenBank with the accession number OP709921

Additional file 3

: Nucleotide sequence alignment of cloned and predicted rat Lilrb1. The nucleotide sequence alignment of cloned and predicted (XM_003748711) rat Lilrb1 was generated using NCBI Blast (https://blast.ncbi.nlm.nih.gov/)

Additional file 4

: Nucleotides and deduced amino acid sequences of rat Lilrb3. The ORFs of Lilrb3-v1/v2/v3 are underlined, and the deduced amino acid sequences are shown below. The boxed letters represent the sequences of the restriction endonucleases HindIII and XhoI. The accession numbers of Lilrb3 submitted to GenBank are OP709922 (Lilrb3-v1), OP709923 (Lilrb3-v2) and OP709924 (Lilrb3-v3)

Additional file 5

: Protein sequence alignment of human and rat LILRB1 and LILRB3. The sequence alignment of human and rat LILRB1 and LILRB3 was generated using the Clustal Omega program (https://www.ebi.ac.uk/Tools/msa/clustalo/). The symbols below the sequence alignment are explained as follows: An * (asterisk) indicates positions that have a single, fully conserved residue. A : (colon) indicates conservation of strongly similar properties. A. (period) indicates the conservation of weakly similar properties. A - (dash) represents a gap in the alignment

Supplementary Material 6

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Hu, Y., Xie, D., Li, X. et al. Omic horizon expression: a database of gene expression based on RNA sequencing data. BMC Genomics 24, 674 (2023). https://doi.org/10.1186/s12864-023-09781-9

Download citation

Received: 06 April 2023
Accepted: 02 November 2023
Published: 08 November 2023
DOI: https://doi.org/10.1186/s12864-023-09781-9

Omic horizon expression: a database of gene expression based on RNA sequencing data

Abstract

Background

Results

Conclusions

Background

Construction and content

Data collection

Data processing

TPM value calculation

Construction of the database

Molecular cloning of Lilrb1 and Lilrb3

Utility and discussion

Basic description of the database

Query of the database

Understand the query result

Functions of the database

Application case 1: conventional gene expression profiling

Application case 2: species-specific gene expression profiling

Application case 3: novel genes identification

Conclusions

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Electronic supplementary material

Additional file 1

Additional file 2

Additional file 3

Additional file 4

Additional file 5

Supplementary Material 6

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Genomics

Contact us