Acta Agriculturae Boreali-Sinica ›› 2022, Vol. 37 ›› Issue (1): 42-49. doi: 10.7668/hbnxb.20192406

Topic: Biotechnology

• Crop Genetics and Breeding · Germplasm Resources · Biotechnology •

Transcriptome Sequencing and Bioinformatics Analysis of Astragalus complanatus

WU Yue1,2, CHEN Yang2, WANG Xingzhe2, SHAN Feibiao2, ZHANG Yong2, SUN Hongju1

  1. School of Life Sciences, Inner Mongolia University, Hohhot 010020, China
  2. Bayannur Academy of Agricultural and Animal Sciences, Linhe 015000, China
  • Received: 2021-10-04 Published: 2022-02-28
  • About the authors:
    WU Yue (born 1988), female, from Bayannur, Inner Mongolia; assistant researcher, mainly engaged in research on plant molecular breeding. WU Yue and CHEN Yang contributed equally to this work.
  • Funding:
    Inner Mongolia Autonomous Region Applied Technology Research and Development Fund Project (5213322)

Abstract:

To obtain transcriptome information and the expression characteristics of functional genes in Astragalus complanatus, seedling leaves of A. complanatus were used as material for high-throughput transcriptome sequencing on the Illumina HiSeq platform, followed by systematic bioinformatics analysis. A total of 19,280 Unigenes were assembled, with a total length of 23,472,470 bp and a GC content of 42.74%, indicating good sequencing quality and assembly. Blast analysis showed that 12,541, 10,120, 9,412, 8,953, 7,494, and 5,052 Unigenes were annotated in the Nr, Swiss-Prot, GO, KEGG, KOG, and COG databases, respectively. The annotated Unigenes of A. complanatus were mainly homologous to sequences from leguminous plants; matches were highest with Cicer arietinum, Medicago truncatula, and Abrus precatorius, which belong to the same subfamily (Papilionoideae) and accounted for 31.49%, 14.50%, and 11.65% of matches, respectively. The annotated Unigenes covered 3 GO categories, 354 KEGG metabolic pathways, and 25 KOG functional classes. The most enriched entries were metabolic process (GO), general function prediction only (KOG), and purine metabolism (KEGG), suggesting that the leaf cells of A. complanatus seedlings have active metabolism and abundant gene expression. In addition, a large number of Unigenes were annotated to the infectious disease category in the KEGG database, suggesting that the plant parts of A. complanatus may also have medicinal value. A total of 5,849 simple sequence repeat (SSR) loci were identified with the MISA software, an occurrence frequency of 30.34%. The SSR loci in the A. complanatus genome were abundant and diverse, covering mono- to hexanucleotide repeats; mononucleotide repeats were the most abundant, accounting for 40.56%. The dominant motifs were A/T, AG/CT, and AAG/CTT.

Key words: Astragalus complanatus, Transcriptome, Bioinformatics, Unigene, Molecular marker
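
The summary statistics in the abstract can be cross-checked with a few lines of arithmetic. The sketch below uses only the counts reported above; it assumes (this is not stated in the abstract) that the SSR occurrence frequency is the number of SSR loci divided by the number of Unigenes, an assumption that happens to reproduce the reported 30.34%. The variable and function names are illustrative, not taken from the paper.

```python
# Counts reported in the abstract.
TOTAL_UNIGENES = 19_280
TOTAL_LENGTH_BP = 23_472_470
SSR_LOCI = 5_849

# Unigenes annotated in each database, as reported.
ANNOTATED = {
    "Nr": 12_541,
    "Swiss-Prot": 10_120,
    "GO": 9_412,
    "KEGG": 8_953,
    "KOG": 7_494,
    "COG": 5_052,
}


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100 * part / whole, 2)


# Mean Unigene length implied by the total length and Unigene count.
mean_length_bp = round(TOTAL_LENGTH_BP / TOTAL_UNIGENES, 2)

# Share of all Unigenes annotated in each database.
annotation_rate = {db: percent(n, TOTAL_UNIGENES) for db, n in ANNOTATED.items()}

# ASSUMPTION: frequency = SSR loci / Unigenes (definition not given in the abstract).
ssr_frequency = percent(SSR_LOCI, TOTAL_UNIGENES)

print(f"Mean Unigene length: {mean_length_bp} bp")
for db, rate in annotation_rate.items():
    print(f"{db}: {rate}% of Unigenes annotated")
print(f"SSR frequency: {ssr_frequency}%")
```

Under these assumptions, the derived annotation rates show that roughly two thirds of the Unigenes have an Nr hit while only about a quarter have a COG hit, which is useful context when reading the per-database counts.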

Cite this article

WU Yue, CHEN Yang, WANG Xingzhe, SHAN Feibiao, ZHANG Yong, SUN Hongju. Transcriptome Sequencing and Bioinformatics Analysis of Astragalus complanatus[J]. Acta Agriculturae Boreali-Sinica, 2022, 37(1): 42-49. doi: 10.7668/hbnxb.20192406.