首页 | 本学科首页   官方微博 | 高级检索  
     

Sugeno测度下的征信用户聚类研究
引用本文:韩璐,苏治,李爱华. Sugeno测度下的征信用户聚类研究[J]. 系统工程理论与实践, 2019, 39(11): 2750-2759. DOI: 10.12011/1000-6788-2018-0428-10
作者姓名:韩璐  苏治  李爱华
作者单位:1. 中央财经大学 管理科学与工程学院, 北京 100081;2. 中央财经大学 统计与数学学院, 北京 100081;3. 中央财经大学 金融学院, 北京 100081
基金项目:国家自然科学基金面上项目(71473279);国家哲学社会科学基金重大项目(15ZDC024);中央财经大学青年教师发展基金(QJJ1604);中央财经大学科研创新团队支持计划
摘    要:征信系统是典型的大数据客户系统,如何对征信用户进行合理归类,进而从类别特征中分析用户的特点,是对征信系统进行深入挖掘的焦点问题之一.在征信系统中存在着大量的频数字段,这些变量的距离不能使用欧氏距离来刻画,Sugeno积分提供了一种考虑集合序的测度框架.本研究通过使用Sugeno测度下的部分序积分,构造了两集合的Sugeno距离差,进而在此基础上,重构Kmeans动态聚类算法,通过截集阈值来控制类的合并,从而更准确地对征信用户进行归类,并初步讨论了部分类用户的画像特征.

关 键 词:Sugeno测度  Kmeans聚类  征信系统  用户画像  
收稿时间:2018-03-27

Clustering of credit users under Sugeno measure
HAN Lu,SU Zhi,LI Aihua. Clustering of credit users under Sugeno measure[J]. Systems Engineering —Theory & Practice, 2019, 39(11): 2750-2759. DOI: 10.12011/1000-6788-2018-0428-10
Authors:HAN Lu  SU Zhi  LI Aihua
Affiliation:1. School of Management Science and Engineering, Central University of Finance and Economics, Beijing 100081, China;2. School of Statistics and Mathematics, Central University of Finance and Economics, Beijing 100081, China;3. School of Finance, Central University of Finance and Economics, Beijing 100081, China
Abstract:How to classify users' clusters, then analyze the characteristics of users, and finally make user personas, is one of the key issues to further explore the credit information system. In academic research, clustering algorithms are often implemented for user classification. The clustering algorithm is based on the sample distance measure. In the credit system, there are lots of behavior variables, such as loan times, numbers of credit cards. The difference between these variables cannot be measured by numerical distance. This research is based on this purpose. By Sugeno measure we constructed Sugeno distance and experiment it with Kmeans dynamic clustering algorithm, we used 65536 records which came from the credit system during 2004 and 2009, explored the main clusters of these user, and finally drew personas of these users. With more and more ordinal variables in the credit system, this method will be more practical and useful.
Keywords:Sugeno measure  Kmeans cluster  credit system  user personas  
本文献已被 CNKI 等数据库收录!
点击此处可从《系统工程理论与实践》浏览原始摘要信息
点击此处可从《系统工程理论与实践》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号