Computer Engineering and Applications ›› 2010, Vol. 46 ›› Issue (11): 7-9.DOI: 10.3778/j.issn.1002-8331.2010.11.003

• 博士论坛 • Previous Articles     Next Articles

Design of Genetic Algorithm to enumerate multiple optimal solutions to haplotype assembly problem

XIE Min-zhu1,LIU Xin-qiu2,3   

  1. 1.College of Physics and Information Science,Hunan Normal University,Changsha 410081,China
    2.College of Mathematics and Computer Science,Hunan Normal University,Changsha 410081,China
    3.Department of Foundation Science,Hunan Engineering Vocational Technical College,Changsha 410151,China
  • Received:2009-12-31 Revised:2010-02-01 Online:2010-04-11 Published:2010-04-11
  • Contact: XIE Min-zhu

枚举单体型组装问题多个最优解的遗传算法设计

谢民主1,刘新求2,3   

  1. 1.湖南师范大学 物理信息与科学学院,长沙 410081
    2.湖南师范大学 数学与计算机科学学院,长沙 410081
    3.湖南工程职业技术学院 基础科学系,长沙 410151
  • 通讯作者: 谢民主

Abstract: The haplotype assembly problem aims to reconstruct a pair of haplotype of an individual from its DNA sequencing fragment data.There are some heuristic algorithms and parameterized algorithms for the various computational optimal models.However,these algorithms work out with only one optimal solution,i.e.a pair of haplotypes.However,the optimal solution to a biological problem is usually not unique,or the real solution may be suboptimal.The paper proposes a new genetic algorithm to enumerate multiple optimal solutions to the haplotype assembly problem.Experimental results show that this algorithm is more accurate in haplotype reconstruction and provides the chance for the biologists to choose one from these multiple solutions based on some biological knowledge.

Key words: Single-Nucleotide Polymorphisms(SNPs), haplotype, heuristic algorithm, bioinformatics

摘要: 单体型组装问题就是根据个体基因组测序获得的DNA序列数据重构出该个体的一对单体型。目前单体型组装问题的各种优化计算模型已有相关的启发式算法和参数化精确算法,但是这些算法只能得出一个最优解,即一对单体型。可是生物问题的最优解往往不是唯一的,或者真实解可能只是接近最优的。该文设计了一个新的能枚举出最优的多个解的遗传算法。实验结果表明该算法具有较高的单体型重建精度,并为生物学家根据领域知识在算法获得的多个解的基础进一步选择提供了可能。

关键词: 单核苷酸多态性, 单体型, 启发式算法, 生物信息学

CLC Number: