首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于PSO的WFCM算法研究及其在医保欺诈行为发现中的应用
引用本文:李华,陈宁江.基于PSO的WFCM算法研究及其在医保欺诈行为发现中的应用[J].广西科学院学报,2017,33(1):32-39.
作者姓名:李华  陈宁江
作者单位:广西大学计算机与电子信息学院,广西南宁,530004
摘    要:【目的】在没有先验知识的前提下,采用基于粒子群优化算法(PSO)的加权模糊C-均值(WFCM)聚类算法,从30多万条记录的医疗保险数据中挖掘出疑似医疗保险欺诈的记录。【方法】首先,引用改进的欧式距离、相似性函数以及交叉熵函数并通过PSO算法极小化交叉熵函数,对属性权重进行分析;其次,选取Calinski-Harabasz(CH)有效性指标,展开聚类有效性的研究;然后,基于数据预处理的结果将数据运用于PSO算法,不断更新得到各属性的权重,并运用聚类有效性评价中的CH有效性指标来动态估计最佳聚类个数,提高FCM聚类的速度;最后,将属性权重和最佳聚类数应用于FCM聚类算法,根据隶属度矩阵聚类得到疑似医疗保险欺诈结果。【结果】基于上述研究方法,本研究根据最后的隶属度矩阵来进行聚类分析。【结论】将优化的权重应用于加权FCM聚类算法与聚类有效性评价,既提高了聚类算法的高效性,又避免了主观评价对分类的影响。

关 键 词:PSO  WFCM  CH有效性指标  医保欺诈
收稿时间:2016/11/26 0:00:00
修稿时间:2016/12/7 0:00:00

Study on WFCM Algorithm based on PSO and Its Application in Identifying Medicare Fraud
LI Hua and CHEN Ningjiang.Study on WFCM Algorithm based on PSO and Its Application in Identifying Medicare Fraud[J].Journal of Guangxi Academy of Sciences,2017,33(1):32-39.
Authors:LI Hua and CHEN Ningjiang
Institution:School of Computer, Electronics and Information in Guangxi University, Nanning, Guangxi, 530004, China and School of Computer, Electronics and Information in Guangxi University, Nanning, Guangxi, 530004, China
Abstract:Objective]This paper aims to find the records of suspected medicare fraud from over 30 million records by using the Weighted Fuzzy C-Means clustering algorithm based on particle swarm optimization (PSO) algorithm with the absence of prior knowledge.Methods]Firstly, the improved Euclidean Distance,similarity function and cross entropy function are introduced and the entropy function is minimized by PSO algorithm to analyze the attribute weight.Secondly, the validity index of CH (Calinski-Harabasz) is selected,and the study of validity of clustering is carried out.Thirdly,the data is applied to the PSO algorithm based on the results of data preprocessing, constantly updated to get the weight of each attribute,and the optimal numbers of clusters are estimated dynamically by validity index of CH,in order to increase the speed of FCM.Finally,the attribute weights and the optimal clustering numbers are applied to the FCM clustering algorithm,and the results of suspected medical insurance fraud are obtained according to the membership matrix.Results]Based on the above method,the final membership matrix is used for carrying out cluster analysis.Conclusion]This paper shows the running efficiency of clustering algorithms can be improved, and the influence of subjective evaluation for classification can be avoided by applying the weights to the WFCM clustering algorithm and clustering validity.
Keywords:PSO  WFCM  validity index of CH  medicare fraud
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《广西科学院学报》浏览原始摘要信息
点击此处可从《广西科学院学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号