计算机工程与应用 ›› 2011, Vol. 47 ›› Issue (8): 215-218.

• 工程与应用 • 上一篇    下一篇

基于XML的C代码抄袭检测算法

钟 美,张丽萍,刘东升   

  1. 内蒙古师范大学 计算机与信息工程学院,呼和浩特 010022
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2011-03-11 发布日期:2011-03-11

Plagiarism detection algorithm based on XML for C code

ZHONG Mei,ZHANG Liping,LIU Dongsheng   

  1. Computer and Information Engineering College,Inner Mongolia Normal University,Hohhot 010022,China
  • Received:1900-01-01 Revised:1900-01-01 Online:2011-03-11 Published:2011-03-11

摘要: 主要研究一种基于XML文本的C代码抄袭检测算法。将C程序源代码转化为XML文本,提取XML文本各行标记的属性,计算XML文本行的相似值,得到程序对的相似度。并对抄袭检测算法实行分类测试:按十二种抄袭手段分类;按复杂语义和复杂数据结构分类。测试结果对比验证了抄袭检测算法的有效性。

关键词: XML文本, C代码, 相似度, 抄袭检测算法

Abstract: This paper mainly studies a plagiarism detection algorithm based on XML for C code.It generates a XML text from C source code,extracts the structure information of XML text lines,calculates the XML text lines similarity and obtains the similarity of two programs.Plagiarism detection algorithm is implemented to classification test,specific categories are described as follows:the means of 12 kinds’ plagiarism,complex semantics and complex data structures.Test results verify the effectiveness of plagiarism detection algorithm.

Key words: XML text, C code, similarity, plagiarism detection algorithm