
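Research on Demographic Prediction Based on Ensemble Classifiers (基于集成分类器的用户属性预测研究)
Cite this article: WANG Si-Dun, JU Sheng-Gen, ZHOU Gang, LIU Yu-Jiao. Research on Demographic Prediction Based on Ensemble Classifiers[J]. Journal of Sichuan University (Natural Science Edition), 2017, 54(6): 1195-1201.
Authors: WANG Si-Dun (王斯盾)  JU Sheng-Gen (琚生根)  ZHOU Gang (周刚)  LIU Yu-Jiao (刘玉娇)
Affiliations: Logistical Engineering University; School of Computer Science, Sichuan University; School of Computer Science, Sichuan University; School of Computer Science, Sichuan University
Funding: National Natural Science Foundation of China (61332066, 81373239)
Abstract: User attributes play an important role in personalized services, and predicting them from mobile phone data is gradually becoming a new research direction. Using the average daily usage time of each mobile application category and the number of application categories, this paper proposes the concepts of basic attributes and auxiliary attributes. The auxiliary attributes of all unlabeled samples are first discretized. The class-based Hellinger distance of the auxiliary attributes is then taken as the feature weight of the basic attributes, and the product of each basic attribute and its weight is used as the feature for training every base classifier in the ensemble. The out-of-bag accuracy from random forests is introduced as the weight of each base classifier to obtain the final classification result. Experimental results show that the proposed ensemble classifier framework improves user attribute prediction.

Keywords: User attribute prediction  Smartphones  Discretization  Hellinger distance  Feature weight
Received: 2017/5/23
Revised: 2017/6/2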
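
The feature-weighting step described in the abstract can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not give the discretization scheme or the exact weighting formula, so equal-width binning, a two-class comparison, and the helper names `equal_width_bins`, `class_distribution`, `hellinger`, and `weight_features` are all hypothetical choices. The Hellinger distance itself uses the standard definition H(P, Q) = sqrt(sum_i (sqrt(p_i) - sqrt(q_i))^2) / sqrt(2).

```python
import math
from collections import Counter


def equal_width_bins(values, n_bins):
    """Unsupervised discretization: map each value to an equal-width bin index.

    Assumption: the abstract only says the auxiliary attribute is discretized
    without labels; equal-width binning is one simple way to do that.
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0 for _ in values]
    width = (hi - lo) / n_bins
    # The maximum value would land in bin n_bins, so clamp it to the last bin.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def class_distribution(bins, labels, target, n_bins):
    """Relative frequency of each bin among samples labeled `target`."""
    counts = Counter(b for b, y in zip(bins, labels) if y == target)
    total = sum(counts.values())
    if total == 0:
        raise ValueError(f"no samples with label {target!r}")
    return [counts.get(i, 0) / total for i in range(n_bins)]


def hellinger(p, q):
    """Hellinger distance between two discrete distributions, in [0, 1]."""
    squared = sum((math.sqrt(a) - math.sqrt(b)) ** 2 for a, b in zip(p, q))
    return math.sqrt(squared) / math.sqrt(2)


def weight_features(basic_values, weight):
    """Scale a basic attribute by its weight (the product used as the feature)."""
    return [weight * v for v in basic_values]


if __name__ == "__main__":
    # Toy data (made up): one auxiliary attribute and binary class labels.
    auxiliary = [1, 2, 3, 10, 11, 12]
    labels = ["m", "m", "m", "f", "f", "f"]
    bins = equal_width_bins(auxiliary, n_bins=2)
    p = class_distribution(bins, labels, "m", n_bins=2)
    q = class_distribution(bins, labels, "f", n_bins=2)
    w = hellinger(p, q)
    print(w, weight_features([2.0, 4.0], w))
```

A distance near 1 means the auxiliary attribute separates the two classes well, so the associated basic attribute is scaled up relative to attributes whose class distributions overlap.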

Research on Demographic Prediction Based on Ensemble Classifiers
WANG Si-Dun, JU Sheng-Gen, ZHOU Gang, LIU Yu-Jiao. Research on Demographic Prediction Based on Ensemble Classifiers[J]. Journal of Sichuan University (Natural Science Edition), 2017, 54(6): 1195-1201.
Authors: WANG Si-Dun  JU Sheng-Gen  ZHOU Gang  LIU Yu-Jiao
Institution: Logistical Engineering University; School of Computer Science, Sichuan University; School of Computer Science, Sichuan University; School of Computer Science, Sichuan University
Abstract: User attributes play an important role in personalized services, and predicting them from mobile phone data has gradually become a new research direction. This paper uses two attributes, the average daily usage time of each application category and the number of application categories, and proposes the concepts of basic attributes and auxiliary attributes. First, the auxiliary attributes of all unlabeled samples are discretized with an unsupervised method. The class-based Hellinger distance of the auxiliary attributes is then computed and used as the feature weight of the basic attributes. The basic attributes, multiplied by these weights, serve as the features for training each base classifier of the ensemble, and the out-of-bag accuracy from random forests is introduced as the weight of each base classifier to produce the final classification result. Experimental results show that the proposed ensemble classifier framework improves user attribute prediction.
Keywords: User attribute prediction  Smartphones  Discretization  Hellinger distance  Feature weight
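
The final combination step, in which each base classifier is weighted by its out-of-bag accuracy, might look like the sketch below. It assumes weighted majority voting, which the abstract does not spell out, and the names `oob_accuracy` and `weighted_vote` are hypothetical helpers introduced for illustration only. A base classifier is represented as any callable that maps a sample to a label.

```python
from collections import defaultdict


def oob_accuracy(predict, samples, labels, in_bag):
    """Accuracy of `predict` on the samples NOT drawn into its bootstrap sample.

    `in_bag` holds the indices (with repeats) used to train the base
    classifier; the remaining indices are its out-of-bag samples.
    """
    in_bag = set(in_bag)
    oob = [i for i in range(len(samples)) if i not in in_bag]
    if not oob:
        return 0.0  # no out-of-bag samples, so no evidence for this classifier
    correct = sum(predict(samples[i]) == labels[i] for i in oob)
    return correct / len(oob)


def weighted_vote(predictions, weights):
    """Return the label with the largest total weight across base classifiers.

    Assumption: weighted majority voting is used to combine the predictions.
    """
    scores = defaultdict(float)
    for label, weight in zip(predictions, weights):
        scores[label] += weight
    return max(scores, key=scores.get)


if __name__ == "__main__":
    # Toy threshold rule standing in for a trained base classifier.
    def rule(x):
        return "a" if x < 5 else "b"

    samples = [1, 2, 8, 9]
    labels = ["a", "b", "b", "b"]
    print(oob_accuracy(rule, samples, labels, in_bag=[0, 0, 2, 2]))
    print(weighted_vote(["a", "b", "b"], [0.9, 0.3, 0.4]))
```

Weighting by out-of-bag accuracy reuses the samples each bootstrap round leaves out, so no separate validation set is needed to estimate how much each base classifier should be trusted.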
This article is indexed in CNKI and other databases.