王建新

教授 博士生导师 硕士生导师

所在单位:计算机学院

学历:研究生(博士)毕业

办公地点:中南大学校本部计算机楼303室

性别:男

联系方式:jxwang@mail.csu.edu.cn

学位:博士学位

在职信息:在职

毕业院校:中南大学

曾获荣誉:

教育部新世纪人才

湖南省杰出青年基金获得者

   
当前位置: 中文主页 >> 论文成果

De novo diploid genome assembly using long noisy reads

发布时间:2025-03-05

点击次数:

DOI码:10.1038/s41467-024-47349-7

发表刊物:NATURE COMMUNICATIONS

关键字:SEQUENCE,ALIGNMENT

摘要:The high sequencing error rate has impeded the application of long noisy reads for diploid genome assembly. Most existing assemblers failed to generate high-quality phased assemblies using long noisy reads. Here, we present PECAT, a Phased Error Correction and Assembly Tool, for reconstructing diploid genomes from long noisy reads. We design a haplotype-aware error correction method that can retain heterozygote alleles while correcting sequencing errors. We combine a corrected read SNP caller and a raw read SNP caller to further improve the identification of inconsistent overlaps in the string graph. We use a grouping method to assign reads to different haplotype groups. PECAT efficiently assembles diploid genomes using Nanopore R9, PacBio CLR or Nanopore R10 reads only. PECAT generates more contiguous haplotype-specific contigs compared to other assemblers. Especially, PECAT achieves nearly haplotype-resolved assembly on B. taurus (BisonxSimmental) using Nanopore R9 reads and phase block NG50 with 59.4/58.0 Mb for HG002 using Nanopore R10 reads. Most existing assemblers failed to generate high-quality phased assemblies using long noisy reads. Here, the authors present PECAT, a Phased Error Correction and Assembly Tool, for reconstructing diploid genomes from long noisy reads.

备注:Nie, F (Nie, Fan) [1] , [2] , [3] , [4] ; Ni, P (Ni, Peng) [1] , [2] , [5] ; Huang, N (Huang, Neng) [1] , [2] , [5] ; Zhang, J (Zhang, Jun) [1] , [2] , [5] ; Wang, ZY (Wang, Zhenyu) [6] ; Xiao, CL (Xiao, Chuanle) [7] ; Luo, F (Luo, Feng) [8] ; Wang, JX (Wang, Jianxin) [1] , [2] , [5]

论文类型:期刊论文

卷号:15

期号:1

是否译文:

发表时间:2024-04-05

收录刊物:SCI

上一条: HiTE: a fast and accurate dynamic boundary adjustment approach for full-length transposable element detection and annotation

下一条: DNA 5-methylcytosine detection and methylation phasing using PacBio circular consensus sequencing