计算机工程与应用 ›› 2012, Vol. 48 ›› Issue (3): 137-139.

• 数据库、信号与信息处理 • 上一篇    下一篇

计量特征在语言风格比较及作家判定中的应用
——以韩寒《三重门》与郭敬明《梦里花落知多少》为例

陈芯莹,李雯雯,王 燕   

  1. 中国传媒大学 应用语言学系,北京 100024
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2012-01-21 发布日期:2012-01-21

Application of quantitative characteristics in comparison of language style and author judgment—Triple Gates of Han Han and Never Flowers in Never Dreams of Guo Jingming as examples

CHEN Xinying, LI Wenwen, WANG Yan   

  1. Department of Applied Linguistics, Communication University of China, Beijing 100024, China
  • Received:1900-01-01 Revised:1900-01-01 Online:2012-01-21 Published:2012-01-21

摘要: 提出了将语言计量研究成果应用于语言风格对比及作家判定中的方法。通过对两个75 000字的语料中12个语言结构特征分布的统计对比,发现了7个具有显著分布差异的语言结构特征。并以这7个语言结构特征作为文本表示特征对两个75 000字的未知作家文本做了相关性分析,并准确判定了未知作家文本的作者。以语言结果的计量特征表示文本的方法加强了语言风格对比及作家判定研究的可解释性,具有较高的理论和应用价值。以语料库和统计方法进行语言结构特征计量研究是汉语语言风格描写研究及作家判定研究的重要方法。

关键词: 语言风格, 语言结构特征, 三重门, 梦里花落知多少

Abstract: The paper proposes the method that applies the results of quantitative language research in comparison of language style and author judgment. The paper discovers 7 language structure characteristics possessing obvious distribution differences through the statistical comparison of 12 language structure characteristics distribution of two corpuses with 75 thousand words. The paper also analyzes two texts with 75 thousand words which are not denoted with authors by regarding the 7 language structure characteristics as text expression characteristics, and accurately judges the authors of the two texts. The method adopting quantitative characteristics of language to denote text can better explain the research of language style and author judgment. The quantitative research of language structure characteristics based on corpus and statistical method is an important method for the research of Chinese language style and author judgment.

Key words: language style, language structure, Triple Gates, Never Flowers in Never Dreams