自注意力信用评估模型

doi:10.3778/j.issn.1002-8331.1810-0126

计算机工程与应用 ›› 2019, Vol. 55 ›› Issue (13): 36-41.DOI: 10.3778/j.issn.1002-8331.1810-0126

自注意力信用评估模型

刘欣阳，曲彦文，周琪云

江西师范大学计算机信息工程学院，南昌 330022

出版日期:2019-07-01 发布日期:2019-07-01

Self-Attention Credit Scoring Model

LIU Xinyang, QU Yanwen, ZHOU Qiyun

School of Computer Information and Engineering, Jiangxi Normal University, Nanchang 330022, China

Online:2019-07-01 Published:2019-07-01

摘要/Abstract

摘要： 在信用评估问题中，用户信息中既包含类别数据，也包含数值数据。传统的基于人工智能的信用评估模型通常对类别数据进行one-hot变换后，再与数值数据进行拼接作为判别器的输入。与之不同，借鉴了自然语言处理中的词嵌入技术来提取类别数据的词向量；将输入的词向量集合类比为“句子”，并基于自注意力机制从“句子”中提取出用户特征；最后采用多层感知机来预测用户违约的概率。新模型可以使用反向传播算法实现端到端的训练。在三个不同的数据集上将新模型和六种基准算法进行了比较，结果表明该模型能够比基准算法取得更好的性能。

关键词: 信用评估, 自注意力机制, 词嵌入, 特征提取, 深度神经网络

Abstract: In the credit scoring problem, the user information contains both category data and numerical data. Traditional artificial intelligence-based credit scoring algorithms usually transform the category data into one-hot vectors and joints them with numerical data, as the input of the discriminator. In contrast, this paper extracts vectors of category data based on the word embedding techniques which are popularly used in the natural language processing problem. After that, the set of the word vectors is analogized to a “sentence”, and the input feature is extracted from the “sentence” based on the self-attention mechanism. Finally, a Multi-Layer Perception（MLP） neural network is used to predict the probability of default. The new model is trained end-to-end by the back propagation method. Experimental results show the proposed new model achieves better performance than?six other baselines on three well-known benchmark datasets.

Key words: credit scoring, self-attention mechanism, word embedding, feature extraction, deep neural network

刘欣阳，曲彦文，周琪云. 自注意力信用评估模型[J]. 计算机工程与应用, 2019, 55(13): 36-41.

LIU Xinyang, QU Yanwen, ZHOU Qiyun. Self-Attention Credit Scoring Model[J]. Computer Engineering and Applications, 2019, 55(13): 36-41.

[1]	包志强，邢瑜，吕少卿，黄琼丹. 改进YOLO V2的6D目标姿态估计算法[J]. 计算机工程与应用, 2021, 57(9): 148-153.
[2]	王林，柴江云. 深度神经网络在多场景车辆属性识别中的研究[J]. 计算机工程与应用, 2021, 57(9): 162-167.
[3]	许昊，张凯，田英杰，种法广，王子超. 深度神经网络图像描述综述[J]. 计算机工程与应用, 2021, 57(9): 9-22.
[4]	许德刚，王露，李凡. 深度学习的典型目标检测算法研究综述[J]. 计算机工程与应用, 2021, 57(8): 10-25.
[5]	胡文涛，陈秀宏. 基于邻域图的低秩投影学习[J]. 计算机工程与应用, 2021, 57(7): 209-214.
[6]	祝钧桃，姚光乐，张葛祥，李军，杨强，王胜，叶绍泽. 深度神经网络的小样本学习综述[J]. 计算机工程与应用, 2021, 57(7): 22-33.
[7]	韦佶宏，郑荣锋，刘嘉勇. 基于混合神经网络的恶意TLS流量识别研究[J]. 计算机工程与应用, 2021, 57(7): 107-114.
[8]	张晓丽，张魁星，江梅，魏本征，丛金玉. 淋巴瘤图像分类技术研究综述[J]. 计算机工程与应用, 2021, 57(6): 1-9.
[9]	徐建国，刘泳慧，刘梦凡. 基于BILSTM-CRF的高校政策语义角色标注研究[J]. 计算机工程与应用, 2021, 57(6): 207-211.
[10]	熊健，覃仁超，何梦乙，刘建兰，唐风扬. 改进随机森林在Android恶意软件检测中的应用[J]. 计算机工程与应用, 2021, 57(3): 130-136.
[11]	徐志京，汪毅. 青光眼眼底图像的迁移学习分类方法[J]. 计算机工程与应用, 2021, 57(3): 144-149.
[12]	白祉旭，王衡军，郭可翔. 基于深度神经网络的对抗样本技术综述[J]. 计算机工程与应用, 2021, 57(23): 61-70.
[13]	李龙龙，何东健，王美丽. 基于改进型LBP算法的植物叶片图像识别研究[J]. 计算机工程与应用, 2021, 57(19): 228-234.
[14]	李杰，李苗，袁细国. 面向新一代测序数据的病原微生物检测算法[J]. 计算机工程与应用, 2021, 57(19): 282-289.
[15]	郭恒光，刘文彪，余仁波. 用于形状特征提取的spike函数[J]. 计算机工程与应用, 2021, 57(18): 220-226.

自注意力信用评估模型

Self-Attention Credit Scoring Model

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics