计算机工程与应用 ›› 2009, Vol. 45 ›› Issue (29): 125-127.DOI: 10.3778/j.issn.1002-8331.2009.29.037

• 数据库、信号与信息处理 • 上一篇    下一篇

法语料库文本语义接受度评价研究

杜家利1,于屏方2   

  1. 1.鲁东大学 外国语学院,山东 烟台 264025
    2.鲁东大学 汉语言文学院,山东 烟台 264025
  • 收稿日期:2009-05-18 修回日期:2009-06-25 出版日期:2009-10-11 发布日期:2009-10-11
  • 通讯作者: 杜家利

Research on evaluation of semantic accessibility scale based on French corpus

DU Jia-li1,YU Ping-fang2   

  1. 1.School of Foreign Languages,Ludong University,Yantai,Shandong 264025,China
    2.School of Chinese Language and Literature,Ludong University,Yantai,Shandong 264025,China
  • Received:2009-05-18 Revised:2009-06-25 Online:2009-10-11 Published:2009-10-11
  • Contact: DU Jia-li

摘要: 法文本语义接受度(SAS)研究属于基于语料库的SAS研究分支。以等距离系统随机抽样方法进行对比实验。抽取公式为A+BX≤C,A为起始页码,B为抽取间距,C为文本总页码,X为可取页数集。当以L’Assommoir(1953版)为语料,B∈(10;5;4;3;2;1)时,词句抽取率(SR)为9.55%,20.29%,25.13%,33.13%,50.36%和100%,SAS为0.089 7,0.084 1,0.085 4,0.084 7,0.084 8和0.085 4。依次攀升SR没有带来SAS显著变化。说明基于英日语料的SAS评价体系也适用于法文本,便于文学评论家对法文本进行量化评析。

关键词: 文学文本, 法语语料库, 语义接受度, 抽取率

Abstract: The study on Semantic Accessibility Scale(SAS) based on French corpus comes under the umbrella of corpus-involved SAS research.Systematic random sampling skill is used here to discuss the comparison of six different groups extracted from one text.The formula,A+BX≤C,is provided to equidistantly extract L’Assommoir(1953 ed.).“A”means the start page;“B”,the space of extraction;“C”,the total number of text;“X”,the set of available pages.If the condition is B∈(10;5;4;3;2;1),sampling ratios(SR)are 9.55%,20.29%,25.13%,33.13%,50.36%,100%,correspondingly bringing different values of SAS,namely,0.089 7,0.084 1,
0.085 4,0.084 7,0.084 8,0.085 4.The non-relevance between SR and SAS establishes the fact that the system of SAS evaluation based on English and Japanese corpora can safely extend to French corpus,which helps the critics to analyze French texts by computer.

Key words: literary texts, French corpus, semantic accessibility scale, sampling ratio

中图分类号: