Computer Engineering and Applications ›› 2007, Vol. 43 ›› Issue (36): 177-180.

• Database and Information Processing •

Application of hyperlink reproduction in meta search resource discovery

CHEN Shuang1, QIAN Rong2, CHEN Fu2, LI Su3

  1. School of Computer Science, Northwestern Polytechnical University, Xi’an 710072, China
  2. School of Information Engineering, University of Science and Technology Beijing, Beijing 100083, China
  3. Computer College, Beijing Technology and Business University, Beijing 100037, China
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2007-12-21 Published: 2007-12-21
  • Contact: CHEN Shuang

Abstract: To overcome the limit on the number of results a search engine returns, this paper extends meta-search technology and introduces the concepts of link community and link reproduction, comparing them with biological communities. Link reproduction first takes the results returned by multiple search engines as the initial information sources, then optimizes and integrates those results using predefined reproduction rules, and analyzes the links in the pages they point to in order to reproduce more related information sources. When analyzing the result sets of different search engines, the system dynamically adjusts the priority of the reproduced links according to each engine's ability to discover information sources, both directly and through reproduction, and the quality of those sources. Experiments show that link reproduction can greatly increase the number of topic-specific information sources discovered through search engines.

Key words: topic discovery, meta search engine, link analysis, link reproduction
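
The abstract gives only an outline of link reproduction, so the following Python sketch is one possible reading of it and not the authors' implementation. It assumes an in-memory toy web in place of real HTTP fetching, a keyword test as the "reproduction rule", and a smoothed relevance ratio as the per-engine priority. The names `reproduce_links`, `TOY_WEB`, `fetch`, and `is_relevant` are all hypothetical.

```python
import heapq
from collections import defaultdict


def reproduce_links(seeds_by_engine, fetch, is_relevant, max_pages=100):
    """Expand seed results from several engines by following page links.

    seeds_by_engine: {engine_name: [url, ...]} (hypothetical input shape)
    fetch(url): returns (text, [links]) or None if the page is unavailable
    is_relevant(text): stands in for the paper's predefined reproduction rules
    """
    visited = defaultdict(int)   # pages fetched per originating engine
    relevant = defaultdict(int)  # of those, pages judged on-topic

    def weight(engine):
        # Assumed scoring: smoothed share of on-topic pages in this
        # engine's lineage. The paper's actual formula is not given.
        return (relevant[engine] + 1) / (visited[engine] + 2)

    frontier, seen, order = [], set(), 0

    def push(url, engine):
        nonlocal order
        if url not in seen:
            seen.add(url)
            # Priority is fixed at push time, so re-weighting affects
            # only links reproduced after the counts change.
            heapq.heappush(frontier, (-weight(engine), order, url, engine))
            order += 1

    # Merge and deduplicate the seed results of all engines.
    for engine, urls in seeds_by_engine.items():
        for url in urls:
            push(url, engine)

    discovered = []
    while frontier and len(discovered) < max_pages:
        _, _, url, engine = heapq.heappop(frontier)
        page = fetch(url)
        if page is None:
            continue
        text, links = page
        visited[engine] += 1
        if not is_relevant(text):
            continue  # off-topic pages do not reproduce
        relevant[engine] += 1
        discovered.append(url)
        for link in links:
            push(link, engine)  # offspring inherit the seed's engine
    return discovered


# Toy data standing in for real search results and web pages.
TOY_WEB = {
    "a.example/1": ("python crawler tutorial", ["a.example/2", "b.example/1"]),
    "a.example/2": ("python crawler tips", ["a.example/3"]),
    "a.example/3": ("python crawler faq", []),
    "b.example/1": ("cooking recipes", ["b.example/2"]),
    "b.example/2": ("python crawler hidden", []),
}
seeds = {"engine_a": ["a.example/1"], "engine_b": ["b.example/1", "a.example/1"]}
found = reproduce_links(seeds, TOY_WEB.get, lambda text: "crawler" in text)
print(found)
```

In this toy run the two engines contribute two distinct seed URLs, and following links from the on-topic seed yields additional on-topic pages that neither engine returned directly. Fixing priority at push time keeps the heap simple; a fuller version might re-score queued links whenever the engine weights change.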