计算机工程与应用 ›› 2022, Vol. 58 ›› Issue (18): 43-58.DOI: 10.3778/j.issn.1002-8331.2203-0453
李华昱,毕经纶,闫阳
出版日期:
2022-09-15
发布日期:
2022-09-15
LI Huayu, BI Jinglun, YAN Yang
Online:
2022-09-15
Published:
2022-09-15
摘要: 事件抽取是信息抽取领域最具有挑战性的任务之一,也是知识图谱构建中的关键技术。事件抽取在阅读理解、文本摘要、问答系统等领域得到了广泛的应用。限定域事件抽取指的是系统所抽取的事件类型是预定义的,因此针对某一特定领域,限定域事件抽取的研究更具有研究价值,而且中文事件抽取由于中文语言特性问题,面临着较大挑战。介绍了中文事件抽取中面对的挑战,对限定域中文事件抽取的主要方法进行归纳总结,重点介绍了基于深度学习的方法,并总结了少样本情况下的事件抽取方法,介绍了中文事件抽取常用的数据集,展望了中文事件抽取未来的发展趋势。
李华昱, 毕经纶, 闫阳. 限定域中文事件抽取研究综述[J]. 计算机工程与应用, 2022, 58(18): 43-58.
LI Huayu, BI Jinglun, YAN Yang. Survey of Chinese Event Extraction in Restricted Domain[J]. Computer Engineering and Applications, 2022, 58(18): 43-58.
[1] ZENG Y,YANG H,FENG Y,et al.A convolution bilstm neural network model for Chinese event extraction[J].Lecture Notes in Computer Science,2016,10102:275-287. [2] LI P,ZHOU G.Employing morphological structures and sememes for Chinese event extraction[C]//Proceedings of Coling,2012:1619-1634. [3] SHENG J,GUO S,YU B,et al.Casee:a joint learning framework with cascade decoding for overlapping event extraction[C]//Findings of the Association for Computational Linguistics,2021:164-174. [4] CHEN Y B,XU L H,LIU K,et al.Event extraction via dynamic multi-pooling convolutional neural networks[C]//Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing,2015:167-176. [5] NGUYEN T H,CHO K,GRISHMAN R.Joint event extraction via recurrent neural networks[C]//Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies,2016:300-309. [6] RILOFF E.Automatically constructing a dictionary for information extraction tasks[C]//Proceedings of the Eleventh National Conference on Artifical Intelligence,1993:811-816. [7] 姜吉发.自由文本的信息抽取模式获取的研究[D].北京:中国科学院研究生院(计算技术研究所),2004. JIANG J F.A research about the pattern acquisition for free text IE[D].Beijing:Graduate School of Chinese Academy of Sciences(Institute of Computing Technology),2004. [8] 梁晗,陈群秀,吴平博.基于事件框架的信息抽取系统[J].中文信息学报,2006,20(2):40-46. LIANG H,CHEN Q X,WU P B.Information extraction system based on event frame[J].Journal of Chinese Information Processing,2006,20(2):40-46. [9] CHIEU H L,NG H T.A maximum entropy approach to information extraction from semi-structured and free text[C]//Proceedings of the 18th National Conference on Artificial Intelligence,2002:786-791. [10] AHN D.The stages of event extraction[C]//Proceedings of the Workshop on Annotations and Reasoning about Time and Events,Sydney,2006:1-8. [11] 赵妍妍.中文事件抽取的相关技术研究[D].哈尔滨:哈尔滨工业大学,2007. ZHAO Y Y.Research on Chinese event extraction technology[D].Harbin:Harbin Institute of Technology,2007. [12] CHEN Z,JI H.Language specific issue and feature exploration in Chinese event extraction[C]//Proceedings of Human Language Technologies:the 2009 NAACL,volume:Short Papers.Boulder,Colorado,USA:ACL,2009:209-212. [13] 侯立斌,李培峰,朱巧明.基于CRFs和跨事件的事件识别研究[J].计算机工程,2012,38(24):191-195. HOU L B,LI P F,ZHU Q M.Study of event recognition based on crfs and cross-event[J].Computer Engineering,2012,38(24):191-195. [14] CHEN C,NG V.Joint modeling for Chinese event extraction with rich linguistic features[C]//Proceedings of Coling,2012,529-544. [15] 李培峰,周国栋,朱巧明.基于语义的中文事件触发词抽取联合模型[J].软件学报,2016,27(2):280-294. LI P F,ZHOU G D,ZHU Q M.Semantics-based joint model of Chinese event trigger extraction[J].Journal of Software,2016,27(2):280-294. [16] 贺瑞芳,段绍杨.基于多任务学习的中文事件抽取联合模型[J].软件学报,2019,30(4):1015-1030. HE R F,DUAN S Y.Joint Chinese event extraction based multi-task learning[J].Journal of Software,2019,30(4):1015-1030. [17] NGUYEN T H,GRISHMAN R.Event detection and domain adaptation with convolutional neural networks[C]//Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing,2015:365-371. [18] LIN H,LU Y,HAN X,et al.Nugget proposal networks for Chinese event detection[C]//Meeting of the Association for Computational Linguistics,2018:1565-1574. [19] SHA L,QIAN F,CHANG B,et al.Jointly extracting event triggers and arguments by dependency-bridge RNN and tensor-based argument interaction[C]//Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence,(AAAI-18),the 30th in Novative Applications of Artificial Intelligence(IAAI-18),and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence(EAAI-18),2018:5916-5923. [20] FENG X,QIN B,LIU T.A language-independent neural network for event detection[J].Science China(Information Sciences),2018,61(9):81-92. [21] DING N,LI Z R,LIU Z Y.Event detection with trigger-aware lattice neural network[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing,2019:347-356. [22] XI X Y,ZHANG T,YE W,et al.A hybrid character representation for Chinese event detection[C]//International Joint Conference on Neural Networks(IJCNN),2019:1-8. [23] NGUYEN T H,GRISHMAN R.Graph convolutional networks with argument-aware pooling for event detection[C]//Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence(AAAI-18),the 30th Innovative Applications of Artificial Intelligence(IAAI-18),and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence(EAAI-18),2018:5900-5907. [24] LIU X,LUO Z C,HUANG H Y.Jointly multiple events extraction via attention-based graph information aggregation[C]//Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing,2018:1247-1256. [25] YAN H R,JIN X L,MENG X B,et al.Event detection with multi-order graph convolution and aggregated attention[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing,2019:5766-5770. [26] LAI V D,NGUYEN T N,NGUYEN T H.Event detection:gate diversity and syntactic importance scores for graph convolution neural networks[C]//Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing,2020:5405-5411. [27] CUI S Y,YU B W,LIU T W,et al.Edge-enhanced graph convolution networks for event detection with syntactic relation[C]//Findings of the Association for Computational Linguistics(EMNLP 2020),2020:2329-2339. [28] LIU H Z,XU N,LIU A.Self-attention graph residual convolutional network for event detection with dependency relations[C]//Findings of the Association for Computational Linguistics(EMNLP 2021),2021:302-311. [29] CUI S Y,YU B W,CONG X,et al.Label enhanced event detection with heterogeneous graph attention networks[J].arXiv:2012.01878,2020. [30] WU X H,WANG T R,FAN Y P,et al.Chinese event extraction via graph attention network[J].ACM Transactions on Asian and Low-Resource Language Information Processing,2022:1-12. [31] LIU S,CHEN Y,LIU K,et al.Exploiting argument information to improve event detection via supervised attention mechanisms[C]//Proceedings of 55th Annual Meeting of the Association for Computational Linguistics,2017:1789-1798. [32] ZHANG J,ZHOU W,HONG Y,et al.Using entity relation to improve event detection via attention mechanism[C]//Proceedings of CCF International Conference on Natural Language Processing Chinese Computing,2018:171-183. [33] DING R X,LI Z J.Event extraction with deep contextualized word representation and multi-attention layer[C]//Proceedings of International Conference on Advance Data Mining Application,2018:189-201. [34] WU Y,ZHANG J.Chinese event extraction based on attention and semantic features:a bidirectional circular neural network[J].Future Internet,2018. [35] YANG S,FENG D,QIAO L,et al.Exploring pre-trained language models for event extraction and generation[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics,2019:5284-5294. [36] XU N,XIE H H,ZHAO D Y.A novel joint framework for multiple Chinese events extraction[M].Cham:Springer,2020:174-183. [37] DU X,CARDIE C.Event extraction by answering(almost) natural questions[C]//Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing(EMNLP),2020:671-683. [38] LIU J,CHEN Y,LIU K,et al.Event extraction as machine reading comprehension[C]//Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing(EMNLP),2020:1641-1651. [39] 陈敏,吴凡,王中卿,等.基于阅读理解框架的中文事件论元抽取[C]//第十九届中国计算语言学大会,2020:376-389. CHEN M,WU F,WANG Z Q,et al.Chinese event argument extraction using reading comprehension framework[C]//Proceedings of the 19th Chinese National Conference on Computational Linguistics,2020:376-389. [40] ZHOU Y,CHEN Y B,ZHAO J,et al.What the role is vs.What plays the role:semi-supervised event argument extraction via dual question answering[C]//Proceedings of the AAAI Conference on Artificial Intelligence,2021,35(16):14638-14646. [41] LI Q,PENG H,LI J X,et al.Reinforcement learning-based dialogue guided event extraction to exploit argument relations[J].IEEE/ACM Transactions on Audio Speech and Language Processing,2022,30:520-533. [42] LU Y J,LIN H Y,XU J,et al.TEXT2EVENT:controllable sequence-to-structure generationfor end-to-end event extraction[C]//Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing,2021:2795-2806. [43] YANG H,CHEN Y B,LIU K,et al.DCFEE:a document-level Chinese financial event extraction system based on automatically labeled training data[C]//Proceedings of ACL 2018,System Demonstrations,2018:50-55. [44] 仲伟峰,杨航,陈玉博,等.基于联合标注和全局推理的篇章级事件抽取[J].中文信息学报,2019,33(9):88-95. ZHONG W F,YANG H,CHEN Y B,et al.Document-level event extraction based on joint labeling and global reasoning[J].Journal of Chinese Information Processing,2019,33(9):88-95. [45] 张洪宽,宋晖,王舒怡,等.基于BERT的端到端中文篇章事件抽取[C]//第十九届中国计算语言学大会,2020:390-401. ZHANG H K,SONG H,WANG S Y,et al.A BERT-based end-to-end model for Chinese document-level event extraction[C]//Proceedings of the 19th Chinese National Conference on Computational Linguistics,2020:390-401. [46] DU X Y,CARDIE C.Document-level event role filler extraction using multi-granularity contextualized encoding[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics,2020:8010-8020. [47] ZHENG S,CAO W,XU W,et al.Doc2EDAG:an end-to-end document-level framework for Chinese financial event extraction[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing(EMNLP-IJCNLP),2019:337-346. [48] YANG H,SUI D B.Document-level event extraction via parallel prediction networks[C]//Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing,2021:6298-6308. [49] XU R X,LIU T Y,LI L,et al.Document-level event extraction via heterogeneous graph-based interaction model with a tracker[C]//Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing,2021:3533-3546. [50] HUANG Y S,JIA W J.Exploring sentence community for document-level event extraction[C]//Findings of the Association for Computational Linguistics(EMNLP 2021),2021:340-351. [51] LIU S,CHEN Y,HE S,et al.Leveraging frameNet to improve automatic event detection[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics.Stroudsburg,PA:ACL,2016:2134-2143. [52] CHEN Y,LIU S,ZHANG X,et al.Automatically labeled data generation for large scale event extraction[C]//Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics,Stroudsburg.PA:ACL,2017:409-419. [53] WANG X Z,HAN X,LIU Z Y,et al.Adversarial training for weakly supervised event detection[C]//Proceedings of NAACL-HLT 2019,2019:998-1008. [54] HUANG L F,JI H,CHO K,et al.Zero-shot transfer learning for event extraction[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics(Volume 1:Long Papers),2018:2160-2170. [55] DENG S M,ZHANG N Y,LI L Q,et al.OntoED:low-resource event detection with ontology embedding[C]//Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing,2021:2828-2839. [56] ZHANG H M,WANG H Y,ROTH D.Zero-shot label-aware event trigger and argument classification[C]//Findings of the Association for Computational Linguistics(ACL-IJCNLP 2021),2021:1331-1340. [57] DENG S M,ZHANG N Y,KANG J J,et al.Meta-learning with dynamic-memory-based prototypical network for few-shot event detection[C]//The Thirteenth ACM International Conference on Web Search and Data Mining(WSDM’20),2020. [58] LAI V D,DERNONCOURT F,NGUYEN T H.Learning prototype representations across few-shot tasks for event detection[C]//Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing,2021:5270-5277. [59] LAI V D,DERNONCOURT F,NGUYEN T H.Extensively matching for few-shot learning event detection[C]//Proceedings of the 1st Joint Workshop on Narrative Understanding,Storylines,and Events,2020:38-45. [60] CONG X,CUI S Y,YU B W,et al.Few-shot event detection with prototypical amortized conditional random field[C]//Findings of the Association for Computational Linguistics(ACL-IJCNLP 2021),2021:28-40. [61] NGUYEN T M,NGUYEN T H.One for all:neural joint modeling of entities and events[C]//The Thirty-Third AAAI Conference on Artificial Intelligence,2019:6851-6858. [62] LUAN Y,WADDEN D,HE L H,et al.A general framework for information extraction using dynamic span graphs[C]//Proceedings of NAACL-HLT 2019,2019:3036-3046. [63] WADDEN D,WENNBERG U,LUAN Y,et al.Entity,relation,and event extraction with contextualized span representations[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing,2019:5783-5788. [64] LIN Y,JI H,HUANG F,et al.A joint neural model for information extraction with global features[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics,2020:7999-8009. [65] CHEN Y B,YANG H,LIU K,et al.Collective event detection via a hierarchical and bias tagging networks with gated multi-level attention mechanisms[C]//Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing,2018:1267-1276. [66] ZHAO Y,JIN X L,WANG Y Z,et al.Document embedding enhanced event detection with hierarchical and supervised attention[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics(Short Papers),2018:414-419. [67] LOU D F,LIAO Z L,DENG S M,et al.MLBiNet:a cross-sentence collective event detection network[C]//Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing,2021:4829-4839. [68] VEYSEH A P B,NGUYEN M V,TRUNG N N,et al.Modeling document-level context for event detection via important context selection[C]//Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing,2021:5403-5413. [69] HIS A,YANG Y M.Leveraging multilingual training for limited resource event extraction[C]//Proceedings of COLING 2016,the 26th International Conference on Computational Linguistics:Technical Papers,2016:1201-1210. [70] LIU J,CHEN Y,LIU K,et al.Event detection via gated multilingual attention mechanism[C]//Thirty-Second AAAI Conference on Artificial Intelligence,2018:4865-4872. [71] LIU J,CHEN Y,LIU K,et al.Neural cross-lingual event detection with minimal parallel resources[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing(EMNLP-IJCNLP),2019:738-748. [72] SUBBURATHINAM A,LU D,JI H,et al.Cross-lingual structure transfer for relation and event extraction[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing(EMNLP-IJCNLP),2019:313-325. [73] AHMAD W U,PENG N,CHANG K.GATE:graph attention transformer encoder for cross-lingual relation and event extraction[C]//Thirty-Fifth AAAI Conference on Artificial Intelligence,2021:12462-12470. [74] NGUYEN M V,NGUYEN T N,MIN B,et al.Cross-lingual transfer learning for relation and event extraction via word category and class alignments[C]//Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing,2021:5414-5426. [75] LI C,SHENG Y,GE J,et al.Apply event extraction techniques to the judicial field[C]//Proceedings of the 2019 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2019 ACM International Symposium on Wearable Computers,2019:492-497. [76] LI Q,ZHANG Q,YAO J,et al.Event extraction for criminal legal text[C]//2020 IEEE International Conference on Knowledge Graph(ICKG),2020. [77] SHEN S R,QI G L,LI Z,et al.Hierarchical Chinese legal event extraction via pedal attention mechanism[C]//Proceedings of the 28th International Conference on Computational Linguistics,2020:100-113. [78] FENG Y,LI C Y,NG V.Legal Judgment prediction via event extraction with constraints[C]//Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics(Volume 1:Long Papers),2021:648-664. [79] WANG A,WANG J,LIN H,et al.A multiple distributed representation method based on neural network for biomedical event extraction[J].BMC Medical Informatics and Decision Making,2017:59-66. [80] YU X Y,RONG W G,LIU J S,et al.LSTM-based end-to-end framework for biomedical event extraction.[J].IEEE/ACM Transactions on Computational Biology and Bioinformatics,2019,17(6):2029-2039. [81] LI X,LI F,PAN L,et al.DuEE:a large-scale dataset for Chinese event extraction in real-world scenarios[C]//CCF International Conference on Natural Language Processing and Chinese Computing.Cham:Springer,2020:534-545. [82] YAO F,XIAO C J,WANG X Z,et al.LEVEN:a large-scale Chinese legal event detection dataset[C]//Findings of the Association for Computational Linguistics(ACL 2022),2022:183-201. [83] CHEN P,LIU K,CHEN Y B,et al.Probing into the root:a dataset for reason extraction of structural events from financial documents[C]//Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics,2021:2042-2048. [84] LI Q,PENG H,LI J,et al.A comprehensive survey on schema-based event extraction with deep learning[J].arXiv:2107.02126,2021. |
[1] | 隗昊,周爱,张益嘉,陈飞,屈雯,鲁明羽. 深度学习生物医学实体关系抽取研究综述[J]. 计算机工程与应用, 2021, 57(21): 14-23. |
[2] | 吴呈,王朝坤,王沐贤. 基于文本化简的实体属性抽取方法[J]. 计算机工程与应用, 2020, 56(21): 115-122. |
[3] | 赵晓永,王 磊. 电商网页中商品规格信息自动抽取方法研究[J]. 计算机工程与应用, 2017, 53(24): 168-171. |
[4] | 谷楠楠,冯 筠,孙 霞,赵 妍,张 蕾. 中文简历自动解析及推荐算法[J]. 计算机工程与应用, 2017, 53(18): 141-148. |
[5] | 冯钦林,杨志豪,林鸿飞. 疾病-病症和病症-治疗物质的关系抽取研究[J]. 计算机工程与应用, 2017, 53(10): 251-257. |
[6] | 孙红敏,姜楠楠,李 想. 基于文档集的生物信息挖掘模型研究[J]. 计算机工程与应用, 2016, 52(24): 102-106. |
[7] | 伊 政,徐武平,徐爱萍. 一种基于结构分析的网页主题区域发现方法[J]. 计算机工程与应用, 2015, 51(6): 227-230. |
[8] | 黄彦姣,吴 秦,梁久祯. 基于增强约束条件随机场的Web对象信息抽取[J]. 计算机工程与应用, 2015, 51(23): 143-148. |
[9] | 张菲菲1,李宗海2,周晓辉1,李晓戈1,2. 基于层次聚类的跨文本中文人名消歧研究[J]. 计算机工程与应用, 2014, 50(6): 106-111. |
[10] | 昌 磊1,陆 阳1,吴 雷1,2. PDF文档的跨终端发布技术[J]. 计算机工程与应用, 2014, 50(22): 158-162. |
[11] | 李 嘉,徐 前,王 梓,陈 钊. 基于语义的林产品贸易Web信息抽取算法[J]. 计算机工程与应用, 2014, 50(19): 199-204. |
[12] | 袁 璐,蒙祖强,许 珂. 依存分析和HMM相结合的信息抽取方法[J]. 计算机工程与应用, 2012, 48(9): 138-140. |
[13] | 潘 鹏1,2,诸云强2,朱 琦3,赵晓宏4. 隐马尔可夫模型在环保档案信息抽取中的应用[J]. 计算机工程与应用, 2012, 48(26): 243-248. |
[14] | 史存会1,罗文华2,林鸿飞1. 文本挖掘技术在互联网赌博案情分析中的应用[J]. 计算机工程与应用, 2011, 47(28): 113-116. |
[15] | 蔡华利1,2,刘 鲁1,刘志明1,杨跃翔2. 突发事件Web新闻中时间信息分析及抽取[J]. 计算机工程与应用, 2010, 46(34): 107-110. |
阅读次数 | ||||||
全文 |
|
|||||
摘要 |
|
|||||