Self-Attention Credit Scoring Model

doi:10.3778/j.issn.1002-8331.1810-0126

Abstract

Abstract: In the credit scoring problem, the user information contains both category data and numerical data. Traditional artificial intelligence-based credit scoring algorithms usually transform the category data into one-hot vectors and joints them with numerical data, as the input of the discriminator. In contrast, this paper extracts vectors of category data based on the word embedding techniques which are popularly used in the natural language processing problem. After that, the set of the word vectors is analogized to a “sentence”, and the input feature is extracted from the “sentence” based on the self-attention mechanism. Finally, a Multi-Layer Perception（MLP） neural network is used to predict the probability of default. The new model is trained end-to-end by the back propagation method. Experimental results show the proposed new model achieves better performance than?six other baselines on three well-known benchmark datasets.

Key words: credit scoring, self-attention mechanism, word embedding, feature extraction, deep neural network

摘要： 在信用评估问题中，用户信息中既包含类别数据，也包含数值数据。传统的基于人工智能的信用评估模型通常对类别数据进行one-hot变换后，再与数值数据进行拼接作为判别器的输入。与之不同，借鉴了自然语言处理中的词嵌入技术来提取类别数据的词向量；将输入的词向量集合类比为“句子”，并基于自注意力机制从“句子”中提取出用户特征；最后采用多层感知机来预测用户违约的概率。新模型可以使用反向传播算法实现端到端的训练。在三个不同的数据集上将新模型和六种基准算法进行了比较，结果表明该模型能够比基准算法取得更好的性能。

关键词: 信用评估, 自注意力机制, 词嵌入, 特征提取, 深度神经网络

LIU Xinyang, QU Yanwen, ZHOU Qiyun. Self-Attention Credit Scoring Model[J]. Computer Engineering and Applications, 2019, 55(13): 36-41.

刘欣阳，曲彦文，周琪云. 自注意力信用评估模型[J]. 计算机工程与应用, 2019, 55(13): 36-41.

[1]	BAO Zhiqiang, XING Yu, LYU Shaoqing, HUANG Qiongdan. Improved YOLO V2 6D Object Pose Estimation Algorithm [J]. Computer Engineering and Applications, 2021, 57(9): 148-153.
[2]	WANG Lin, CHAI Jiangyun. Research on Deep Neural Network in Multi-scene Vehicle Attribute Recognition [J]. Computer Engineering and Applications, 2021, 57(9): 162-167.
[3]	XU Hao, ZHANG Kai, TIAN Yingjie, CHONG Faguang, WANG Zichao. Review of Deep Neural Network-Based Image Caption [J]. Computer Engineering and Applications, 2021, 57(9): 9-22.
[4]	XU Degang, WANG Lu, LI Fan. Review of Typical Object Detection Algorithms for Deep Learning [J]. Computer Engineering and Applications, 2021, 57(8): 10-25.
[5]	HU Wentao, CHEN Xiuhong. Low-Rank Projection Learning Based on Neighbor Graph [J]. Computer Engineering and Applications, 2021, 57(7): 209-214.
[6]	ZHU Juntao, YAO Guangle, ZHANG Gexiang, LI Jun, YANG Qiang, WANG Sheng, YE Shaoze. Survey of Few Shot Learning of Deep Neural Network [J]. Computer Engineering and Applications, 2021, 57(7): 22-33.
[7]	WEI Jihong, ZHENG Rongfeng, LIU Jiayong. Research on Malicious TLS Traffic Identification Based on Hybrid Neural Network [J]. Computer Engineering and Applications, 2021, 57(7): 107-114.
[8]	ZHANG Xiaoli, ZHANG Kuixing, JIANG Mei, WEI Benzheng, CONG Jinyu. Review of Image Classification Technology for Lymphoma [J]. Computer Engineering and Applications, 2021, 57(6): 1-9.
[9]	XIONG Jian, QIN Renchao, HE Mengyi, LIU Jianlan, TANG Fengyang. Application of Improved Random Forest Algorithm in Android Malware Detection [J]. Computer Engineering and Applications, 2021, 57(3): 130-136.
[10]	BAI Zhixu, WANG Hengjun, GUO Kexiang. Summary of Adversarial Examples Techniques Based on Deep Neural Networks [J]. Computer Engineering and Applications, 2021, 57(23): 61-70.
[11]	LI Longlong, HE Dongjian, WANG Meili. Study of Plant Leaf Image Recognition Based on Improved Local Binary Pattern Algorithm [J]. Computer Engineering and Applications, 2021, 57(19): 228-234.
[12]	LI Jie, LI Miao, YUAN Xiguo. Detection Algorithm?of Pathogenic Microbes from Next-Generation Sequencing Data [J]. Computer Engineering and Applications, 2021, 57(19): 282-289.
[13]	GUO Hengguang, LIU Wenbiao, YU Renbo. Shape Feature Extraction Using Spike Function [J]. Computer Engineering and Applications, 2021, 57(18): 220-226.
[14]	LI Zhenqiang, WANG Shucai, ZHAO Shida, BAI Yu. Cutting Methods of Sheep’s Trunk Based on Improved DeepLabv3+ and XGBoost [J]. Computer Engineering and Applications, 2021, 57(18): 263-269.
[15]	LIU Xingchen, JIA Juncheng, ZHANG Li, HU Qinhan. Feature Concentration Network for Image Super-Resolution [J]. Computer Engineering and Applications, 2021, 57(16): 213-219.

Self-Attention Credit Scoring Model

自注意力信用评估模型

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics