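Microblog Topic Detection Based on Overlapping Community Discovery in Complex Networks
Cite this article: YIN Lan, CHENG Fei, REN Ya-Feng, JI Dong-Hong. Microblog Topic Detection Based on Overlapping Community Discovery in Complex Networks[J]. Journal of Sichuan University (Natural Science Edition), 2016, 53(6): 1233-1240.
Authors: YIN Lan  CHENG Fei  REN Ya-Feng  JI Dong-Hong
Affiliations: School of Computer Science, Wuhan University; School of Big Data and Computer Science, Guizhou Normal University, School of Computer Science, Wuhan University, School of Computer Science, Wuhan University, School of Computer Science, Wuhan University
Abstract: Topic detection in social media has long been a popular research problem. It is also a difficult one, because social data are noisy and heterogeneous, time-sensitive, and semantically ambiguous. This study models social text data as a complex network and detects social topics by combining that model with an overlapping community discovery method based on agglomerative hierarchical clustering of maximal cliques. In the text modeling step, a custom burst coefficient quantifies topic words, that is, topic words are treated as keywords with a preferred distribution in the time domain, and a custom correlation coefficient connects topic words to build the topic network. To make the custom coefficients better suited to a dynamic data environment, they were tuned through adaptability tests on real data. The EAGLE overlapping community discovery method was evaluated on public datasets, where its Q-function values were clearly better than those of several current overlapping community discovery strategies. The study also analyzed topics in a sample of 600,000 social media posts by teenagers and visualized the results.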
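The abstract does not state the formulas behind the burst coefficient or the correlation coefficient, so the following Python sketch uses hypothetical stand-ins to illustrate the network construction step only. It assumes the burst coefficient can be approximated by a word's peak share of any one time window divided by its overall share, and the correlation coefficient by the Jaccard overlap of the posts containing two words. The function name `build_topic_network` and the thresholds `burst_min` and `corr_min` are illustrative and do not come from the paper.

```python
from collections import Counter, defaultdict
from itertools import combinations


def build_topic_network(windows, burst_min=1.5, corr_min=0.5):
    """Build a topic-word graph from time-sliced, tokenized posts.

    windows: list of time windows; each window is a list of posts;
             each post is a list of tokens.
    Returns (burst, graph): burst maps word -> hypothetical burst score,
    graph maps word -> set of correlated topic words (undirected).
    """
    # Token counts per time window and over the whole corpus.
    window_counts = [Counter(tok for post in w for tok in post) for w in windows]
    window_sizes = [sum(c.values()) for c in window_counts]
    total = Counter()
    for counts in window_counts:
        total.update(counts)
    n_total = sum(total.values())

    # Assumed burst score: peak relative frequency in any single window
    # divided by the overall relative frequency. Evenly spread words score
    # near 1; words concentrated in one window score higher.
    burst = {}
    for word, freq in total.items():
        overall = freq / n_total
        peak = max(c[word] / n for c, n in zip(window_counts, window_sizes) if n)
        burst[word] = peak / overall
    topic_words = {w for w, b in burst.items() if b >= burst_min}

    # Document frequency and pairwise co-occurrence of topic words.
    doc_freq = Counter()
    co_occur = Counter()
    for w in windows:
        for post in w:
            present = sorted(set(post) & topic_words)
            doc_freq.update(present)
            co_occur.update(combinations(present, 2))

    # Assumed correlation score: Jaccard overlap of the posts that contain
    # each word. Pairs at or above corr_min become edges.
    graph = defaultdict(set)
    for (a, b), n in co_occur.items():
        jaccard = n / (doc_freq[a] + doc_freq[b] - n)
        if jaccard >= corr_min:
            graph[a].add(b)
            graph[b].add(a)
    return burst, dict(graph)
```

The resulting adjacency sets can then be passed to any overlapping community detection routine; the thresholds play the role of the tunable coefficients that the abstract says were adjusted on real data.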

Keywords: complex network  overlapping community discovery  topic detection  teenagers
Received: 2015/11/12
Revised: 2016/2/26

Topic Detection Based on Overlapping Community in Complex Network
YIN Lan, CHENG Fei, REN Ya-Feng and JI Dong-Hong. Topic Detection Based on Overlapping Community in Complex Network[J]. Journal of Sichuan University (Natural Science Edition), 2016, 53(6): 1233-1240.
Authors:YIN Lan  CHENG Fei  REN Ya-Feng and JI Dong-Hong
Institution: School of Computer Science, Wuhan University; School of Big Data and Computer Science, Guizhou Normal University, School of Computer Science, Wuhan University, School of Computer Science, Wuhan University and School of Computer Science, Wuhan University
Abstract: Topic detection in social media is a popular yet challenging problem in social computing, because most of the data are heterogeneous, time-evolving and linguistically ambiguous. In this paper, we explore topic detection through complex network modeling, which has demonstrated excellent interpretability of real-world systems. Specifically, a complex network is constructed from pre-processed topic words, and two parameters, the emergency (burst) coefficient and the correlation coefficient, are introduced to filter social data through the network and to determine possible correlations between words. The approach is applied to 600,000 messages posted by teenage users of Weibo.com, and overlapping communities are identified with the well-established EAGLE algorithm. Compared with other popular approaches such as CONGO and Peacock, the proposed method obtains considerably better Q values.
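The abstract compares methods by Q value without defining it. A minimal sketch follows, assuming the Q value is the extended modularity EQ commonly used with EAGLE, in which each node pair's contribution is divided by the number of communities each node belongs to. The function name `extended_modularity` and the adjacency-set input format are illustrative choices, not taken from the paper.

```python
from collections import Counter


def extended_modularity(adj, communities):
    """Extended modularity EQ for a possibly overlapping cover.

    adj: dict mapping every node to the set of its neighbours
         (undirected, unweighted, no self-loops).
    communities: list of node sets; a node may appear in several sets.

    EQ = 1/(2m) * sum over communities C, over all ordered pairs v, w in C
         (including v == w), of (A_vw - k_v * k_w / (2m)) / (O_v * O_w),
    where O_v is the number of communities that contain v.
    """
    two_m = sum(len(nbrs) for nbrs in adj.values())  # each edge counted twice
    if two_m == 0:
        raise ValueError("graph has no edges")

    # O_v: how many communities each node belongs to.
    membership = Counter(v for c in communities for v in c)

    total = 0.0
    for community in communities:
        for v in community:
            for w in community:
                a_vw = 1.0 if w in adj[v] else 0.0
                expected = len(adj[v]) * len(adj[w]) / two_m
                total += (a_vw - expected) / (membership[v] * membership[w])
    return total / two_m
```

When no node is shared between communities, every O_v equals 1 and the value reduces to the standard Newman modularity, so the same function can score both overlapping and disjoint partitions.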
Keywords:Complex Network  Overlapping Community Discovery  Topic Detection  Teenagers  
This article is indexed in CNKI and other databases.