Computer Engineering and Applications ›› 2024, Vol. 60 ›› Issue (20): 168-179. DOI: 10.3778/j.issn.1002-8331.2405-0434

• Pattern Recognition and Artificial Intelligence •

A Counterfactual-Based Approach for Acquiring Related Background Knowledge

WANG Xuemin, BAO Xuguang, CHANG Liang, HAO Yuanjing

  1. Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology, Guilin, Guangxi 541004, China
  • Online: 2024-10-15  Published: 2024-10-15

Towards Related Background Knowledge Acquisition via Counterfactual

WANG Xuemin, BAO Xuguang, CHANG Liang, HAO Yuanjing   

  1. Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology, Guilin, Guangxi 541004, China
  • Online: 2024-10-15  Published: 2024-10-15

Abstract: In multi-task learning, a learner adds the knowledge it has already learned to a background knowledge base and uses this background knowledge to assist the learning of other tasks. However, as background knowledge accumulates, the knowledge base may grow large and burden the learning system, so it becomes necessary to forget irrelevant background knowledge. Existing forgetting strategies often fail to account for the relevance between background knowledge and the learning task, instead supplying the same background knowledge to different induction tasks. To address this problem, a relevance identification method based on counterfactual thinking, called counterfactual acquisition, is proposed. The method evaluates each hypothesis's specific contribution to the learning task with a relevance function and retains only those hypotheses whose relevance values exceed a preset threshold. The method is further applied to inductive logic programming, yielding a multi-task inductive logic programming learner named Countergol. Theoretical analysis shows that Countergol effectively reduces both the hypothesis space and the sample complexity. Experimental comparisons with other forgetting methods further verify Countergol's advantage when learning a large number of tasks.

Key words: inductive logic programming, counterfactual, multi-task learning

Abstract: In multi-task learning, a learner adds the learned programs to its background knowledge (BK) and reuses them to learn other programs. Continually acquiring BK can lead to an excessive amount of BK, which overwhelms the learning system; hence, it is necessary to forget irrelevant BK. However, existing forgetting approaches rarely consider the relevance between BK and the learning task, commonly providing the same BK for different induction tasks. To address this issue, this paper proposes a relevance identification approach based on counterfactual thinking, termed counterfactual acquisition. The approach first measures each hypothesis's contribution to the learning task with a relevance function and then retains only those hypotheses whose relevance values exceed a predefined threshold. The approach is applied to inductive logic programming (ILP) through a multi-task ILP learner named Countergol. Theoretical analysis demonstrates that Countergol reduces both the hypothesis space and the sample complexity. Experimental comparisons with other forgetting approaches show that Countergol outperforms them, particularly when learning a large number of tasks.

Key words: inductive logic programming, counterfactual, multi-task learning
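The abstract does not include code, so the following Python sketch only illustrates the general idea of threshold-based counterfactual forgetting described above: the relevance of a background hypothesis is estimated by comparing task performance with and without it, and only hypotheses above a threshold are retained. The relevance measure and all names here (counterfactual_acquisition, learn_score, threshold, the toy scoring function) are illustrative assumptions, not Countergol's actual definitions.

from typing import Callable, Hashable, Set

Hypothesis = Hashable  # e.g. a learned clause reusable as background knowledge


def counterfactual_acquisition(
    background: Set[Hypothesis],
    learn_score: Callable[[Set[Hypothesis]], float],
    threshold: float,
) -> Set[Hypothesis]:
    """Keep only background hypotheses whose counterfactual relevance
    to the current task exceeds `threshold`.

    Relevance of h is approximated as the drop in task score when h is
    removed from the background knowledge (an illustrative choice, not
    the paper's exact relevance function).
    """
    full_score = learn_score(background)
    retained: Set[Hypothesis] = set()
    for h in background:
        score_without_h = learn_score(background - {h})
        relevance = full_score - score_without_h
        if relevance > threshold:
            retained.add(h)
    return retained


if __name__ == "__main__":
    # Toy usage: the "task" rewards background items 'p' and 'q' only,
    # so the irrelevant item 'r' is forgotten.
    def toy_score(bk: Set[str]) -> float:
        return len(bk & {"p", "q"})

    print(counterfactual_acquisition({"p", "q", "r"}, toy_score, threshold=0.5))
    # -> {'p', 'q'}

Under this scheme, each induction task receives its own filtered background knowledge rather than the same knowledge base for every task, which is the behaviour the abstract contrasts with existing forgetting strategies.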