Computer Engineering and Applications ›› 2008, Vol. 44 ›› Issue (31): 1-3. DOI: 10.3778/j.issn.1002-8331.2008.31.001

• Doctoral Forum •

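A Blog-Oriented Crawling Algorithm

LI Wei-jiang (李卫疆)¹, ZHAO Tie-jun (赵铁军)²

  1. Computer Application Key Lab. of Yunnan Province, Kunming University of Science and Technology, Kunming 650051
  2. School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001
  • Received: 2008-05-19 Revised: 2008-06-30 Online: 2008-11-01 Published: 2008-11-01
  • Corresponding author: LI Wei-jiang (李卫疆)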

New algorithm of Blog-oriented crawler

LI Wei-jiang¹, ZHAO Tie-jun²

  1. Computer Application Key Lab. of Yunnan Province, Kunming University of Science and Technology, Kunming 650051, China
  2. School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
  • Received: 2008-05-19 Revised: 2008-06-30 Online: 2008-11-01 Published: 2008-11-01
  • Contact: LI Wei-jiang

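Abstract: Because general-purpose search engines are comprehensive rather than specialized, they fall short in precision and speed. For the emerging domain of Blogs, this paper therefore proposes a Blog-oriented Web crawler algorithm, which facilitates Blog corpus collection and related Blog research.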

Key words: Blog, crawler, algorithm

Abstract: General-purpose crawlers help people greatly in finding information on the Web. However, because they are general rather than specialized, they have drawbacks in precision and efficiency. Blogs, an emerging Internet phenomenon, are attracting the attention of more and more people. The authors propose a new Blog-oriented Web crawler algorithm that treats “Blog” as a special “subject”.

Key words: Blog, crawler, algorithm