计算机工程与应用 ›› 2008, Vol. 44 ›› Issue (25): 140-142.DOI: 10.3778/j.issn.1002-8331.2008.25.042

• 数据库、信号与信息处理 • 上一篇    下一篇

基于量子遗传算法的文本特征选择方法研究

邱 烨,刘培玉

  

  1. 山东师范大学 信息科学与工程学院,济南 250014
  • 收稿日期:2008-03-18 修回日期:2008-06-06 出版日期:2008-09-01 发布日期:2008-09-01
  • 通讯作者: 邱 烨

Research of text feature selection method based on quantum Genetic Algorithm

QIU Ye,LIU Pei-yu   

  1. School of Information Science and Engineering,Shandong Normal University,Jinan 250014,China
  • Received:2008-03-18 Revised:2008-06-06 Online:2008-09-01 Published:2008-09-01
  • Contact: QIU Ye

摘要: 特征选择方法是文本自动分类中的一项关键技术,提出了一种基于量子遗传算法的文本特征选择新方法,该方法用量子比特对文本向量进行编码,用量子旋转门和量子非门对染色体进行更新,同时,针对信息过滤的特点,对适应度函数进行了改进,充分考虑了特征权值、文本相似度和向量维数等。实验证明,该方法可以极大地降低文本的维数,提高分类的准确率。

关键词: 文本分类, 特征选择, 量子遗传算法

Abstract: Feature selection method is the critical technique of the automatic text categorization.The paper presents a new method of the text feature selection based on the quantum genetic algorithm.In the method,the text vector is coded by quantum bit,and the chromosome is updated by the quantum rotating gate and quantum not-gate.Meanwhile,according to the characteristics of the information filtering,we consider adequately on the feature weight,text similarity and vector dimension in order to improve the fitness function.The experiment has proved that the method can reduce the dimension of text vector and improve the precision of text classification.

Key words: text categorization, feature selection, quantum Genetic Algorithm