Research Article
Transcriptome Sequencing and Bioinformatic Analysis of Eucalyptus cloeziana Terminal Buds
2 Forestry College of Guangxi University, Nanning, 530000, China
Author Correspondence author
Tree Genetics and Molecular Breeding, 2022, Vol. 12, No. 7 doi: 10.5376/tgmb.2022.12.0007
Received: 13 Jun., 2022 Accepted: 20 Jun., 2022 Published: 30 Jun., 2022
Lan J., Jiang W.X., Zhang L., Liang X.Y., and Bai T.D., 2022, Transcriptome sequencing and bioinformatic analysis of Eucalyptus cloeziana terminal buds, Tree Genetics and Molecular Breeding, 12(7): 1-13 (doi: 10.5376/tgmb.2022.12.0007)
In order to obtain transcriptome data and predict the key gene function of Eucalyptus cloeziana, an important timber species in China, the Illumina HiSeq X Ten sequencing technology was conducted to carry out transcriptome sequencing of E. cloeziana terminal buds. After Trinity assembly and splicing, Blast software was used to compare and annotate the high-quality Unigene with seven public databases including NR, Swiss-prot, KOG, Go, KEGG, etc. SSR sites search and analysis was performed by MISA software. A total of 26 587 high quality Unigenes were obtained from the terminal bud of E. cloeziana with an average length of 1279.69 bp. A total of 22 099 Unigenes were successfully annotated in at least one biological database. Of these, 11 507 Unigenes were successfully annotated with 25 functions in KOG database, and the most common function was general functional genes prediction. In the GO database, the 14 105 Unigenes annotated were matched to 50 functional gene groups in 3 categories of biological function, cell component and molecular function, respectively. Among them, the biological processes accounted for the largest proportion. Through KEGG pathways analysis, 7 117 Unigenes were successfully annotated and 127 metabolic pathways were detected, with the most abundant metabolism-related genes. Moreover, 1 021 Unigenes annotations were assigned with 65 families of transcription factor (TF) database, among which the bHLH and MYB had the largest proportion. A total of 3 274 Unigenes were annotated in PRG database and formed into 13 resistant gene categories, among which RLP and TNL had the largest number of matched genes. In addition, 12 366 SSR sites were detected by MISA software, with a distribution density of 1/2.75 kb and a variety of repeating primitive types. In this study, abundant transcriptome information from terminal bud was obtained by using high-throughput sequencing, which was beneficial for molecular assisted breeding of E. cloeziana.