首页 | 本学科首页   官方微博 | 高级检索  
     

基于半监督广义可加Logistic回归的信用评分方法
引用本文:方匡南,陈子岚. 基于半监督广义可加Logistic回归的信用评分方法[J]. 系统工程理论与实践, 2020, 40(2): 392-402. DOI: 10.12011/1000-6788-2018-1711-11
作者姓名:方匡南  陈子岚
作者单位:1. 厦门大学 经济学院 统计系, 厦门 361000;2. 计量经济学教育部重点实验室(厦门大学), 厦门 361000;3. 厦门大学 学生工作部(处), 厦门 361000
基金项目:国家自然科学基金面上项目(71471152);中央高校基本科研业务费专项资金(20720171095,20720181003);国家统计局重大项目(2019LD02)
摘    要:传统的信用评分模型主要基于有监督学习(supervised learning)方法,但是,在实际的贷款问题中,有标记样本信息的获取往往成本较高、难度较大、周期较长,而无标记样本信息则大量存在.为了能在建模中充分利用无标记样本信息,本文提出了一种基于半监督广义可加(semi-supervised generalized additive,SSGA) Logistic回归的信用评分模型.该模型不但能处理线性不可分问题,也能同时利用有标记与无标记样本信息,并同步实现模型参数的估计和显著变量的选择.通过模拟实验表明,所提出的模型在外推预测和变量选择方面的表现均显著优于有监督模型.最后,将该模型应用于个人信用贷款违约风险的评估中.

关 键 词:半监督  广义可加logistic回归  信用评分  无标记样本
收稿时间:2018-09-03

Credit scoring based on semi-supervised generalized additive logistic regression
FANG Kuangnan,CHEN Zilan. Credit scoring based on semi-supervised generalized additive logistic regression[J]. Systems Engineering —Theory & Practice, 2020, 40(2): 392-402. DOI: 10.12011/1000-6788-2018-1711-11
Authors:FANG Kuangnan  CHEN Zilan
Affiliation:1. Department of Statistics, School of Economics, Xiamen University, Xiamen 361000, China;2. The MOE Key Laboratory of Econometrics(Xiamen University), Xiamen 361000, China;3. Student Affairs Office, Xiamen University, Xiamen 361000, China
Abstract:The traditional credit scoring model is mainly based on the supervised learning method. However, in the actual loan problem, the labeled sample information is often acquired at a higher cost and longer cycle, while the unlabeled sample information is abundant. In order to make full use of the information of unlabeled samples in modeling, this paper proposes a credit scoring model based on semi-supervised generalized additive (SSGA) logistic regression. The model can not only use both labeled and unlabeled sample information, but also realize the estimation of model parameters and the selection of significant variables. The simulation experiments show that the proposed model has significantly better performance in extrapolation prediction and variable selection than the supervised model. Finally, the model is applied to the assessment of personal credit loan default risk.
Keywords:semi-supervised learning  generalized additive logistic regression  credit scoring  unlabeled sample  
本文献已被 CNKI 维普 等数据库收录!
点击此处可从《系统工程理论与实践》浏览原始摘要信息
点击此处可从《系统工程理论与实践》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号