计算机工程与应用 ›› 2008, Vol. 44 ›› Issue (22): 241-243.DOI: 10.3778/j.issn.1002-8331.2008.22.072

• 工程与应用 • 上一篇    下一篇

数据压缩在序列比对中的应用

杜 娟,呼广跃   

  1. 天津城市建设学院,天津 300384
  • 收稿日期:2007-05-09 修回日期:2007-08-03 出版日期:2008-07-11 发布日期:2008-07-11
  • 通讯作者: 杜 娟

Data compression’s application in alignment

DU Juan,HU Guang-yue   

  1. Tianjin Institute of Urban Construction,Tianjin 300384,China
  • Received:2007-05-09 Revised:2007-08-03 Online:2008-07-11 Published:2008-07-11
  • Contact: DU Juan

摘要: 同源或非同源长基因组序列的分析比较需要高效率的比对算法。开发出一个新的两两比对工具“超级压缩比对”(简称SCA),该系统是建立在Sequitur编码理论专为长基因组序列的两两比对设计。SCA是一个线性算法,并且能够处理序列重排。实验证明SCA算法能够准确快速的完成长基因组序列的比对。

关键词: 序列比对, Sequitur码, 锚定

Abstract: To compare and analyze large genomic DNA sequences of related organisms and of different species,researchers need efficient methods to align long sequences.A new tool Super Compression Alignment(SCA) is developed,a new system specially for rapid global alignment of genomic sequences.The new system is based on Sequitur coding theory.SCA is a linear algorithm and it can deal with rearrangements.SCA has been proved to align genomic sequences efficiently and accurately.

Key words: sequence alignment, Sequitur code, anchor