计算机工程与应用 ›› 2008, Vol. 44 ›› Issue (8): 155-157.

• 网络、通信与安全 • 上一篇    下一篇

基于带权语言网络的网页关键词抽取

任克强,赵光甫,张国萍   

  1. 江西理工大学 信息工程学院,江西 赣州 341000
  • 收稿日期:2007-07-09 修回日期:2007-10-19 出版日期:2008-03-11 发布日期:2008-03-11
  • 通讯作者: 任克强

Extracting keywords from Web page based on weighted natural language network

REN Ke-qiang,ZHAO Guang-fu,ZHANG Guo-ping   

  1. Faculty of Information Engineering,Jiangxi University of Science and Technology,Ganzhou,Jiangxi 341000,China
  • Received:2007-07-09 Revised:2007-10-19 Online:2008-03-11 Published:2008-03-11
  • Contact: REN Ke-qiang

摘要: 论述了网页文档带权语言网络的建立过程,给出了结合介数指标与紧密度指标的词语综合中心度度量方法,实验表明采用该方法的关键词抽取结果能够很好地符合网页主题。

Abstract: Building weighted natural language network for the Web page is introduced in the paper,and the centrality of word combines the betweenness metric and the closeness metric.Experiments show that the words extracted have great contribution to the Web page subject.