Computer Engineering and Applications ›› 2007, Vol. 43 ›› Issue (21): 160-164.

• 数据库与信息处理 • Previous Articles     Next Articles

Extended PageRank algorithm based on Web link and content analysis

QIAN Gong-wei1,NI Lin1,MIAO Yuan2,CAO Rong1   

  1. 1.Department of Electronic Engineering and Information Science,University of Science and Technology of China,Heifei 230027,China
    2.School of Computer Science and Mathematics,Victoria University,Australia
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-07-21 Published:2007-07-21
  • Contact: QIAN Gong-wei

基于网页链接和内容分析的改进PageRank算法

钱功伟1,倪 林1,MIAO Yuan2,曹 荣1   

  1. 1.中国科学技术大学 电子工程与信息科学系,合肥 230027
    2.澳大利亚 维多利亚大学 计算机科学与数学系,澳大利亚
  • 通讯作者: 钱功伟

Abstract: An Extended PageRank(EPR) algorithm is presented,combining Web link and Web content analysis.The relevance and authority algorithm demands are met by analyzing the similarity of the contents of Web pages and the link structure respectively.The EPR algorithm provides large space to extend PageRank algorithm,and through experiment,better result set can be retrieved by adjusting appropriate parameters.

Key words: PageRank, Web page ranking, link analysis, similarity analysis

摘要: 结合网页链接分析和网页内容相关性分析提出一种改进的PageRank算法EPR(Extended PageRank),从分析网页内容相似性的角度解决相关性需求,从网页链接分析的角度解决权威性需求。算法为扩展PageRank提供了广阔的空间,并且实验证明,通过选择合适的参数EPR算法可以获得优于传统PageRank算法的排序结果。

关键词: PageRank, 网页排序, 链接分析, 相关性分析