首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于MapReduce的大规模图挖掘并行计算模型
引用本文:饶君,张仁波,东昱晓,吴斌.基于MapReduce的大规模图挖掘并行计算模型[J].应用科技,2012(3):56-60.
作者姓名:饶君  张仁波  东昱晓  吴斌
作者单位:北京邮电大学计算机科学与技术学院,北京100876
基金项目:国家自然科学基金资助项目(60905025,90924029,61074128)
摘    要:在如何快速发现大规模网络的结构和特性问题中,网络规模及复杂度的快速增长给其分析研究带来了新的挑战.MapReduce及其开源实现Hadoop给大规模图的高效处理带来了希望.基于MapReduce框架的集群系统,提出了1种新的计算模型用于大规模图形的3-clique计算,来实现图挖掘.计算的基本步骤是:首先获取每个节点的第1跳信息,然后是第2跳信息,最后得到所有基于该节点的3-clique.该计算模型可以用来计算聚集系数,并且可以用于三大通话网络的挖掘.实验结果证明这种计算模型具有良好的可扩展性和性能.

关 键 词:图挖掘  社会网络分析  MapReduce  聚集系数  3-clique

A parallel computing model for large-graph mining with MapReduce
RAO Jun,ZHANG Renbo,DONG Yuxiao,WU Bin.A parallel computing model for large-graph mining with MapReduce[J].Applied Science and Technology,2012(3):56-60.
Authors:RAO Jun  ZHANG Renbo  DONG Yuxiao  WU Bin
Institution:School of Computer Science,Beijing University of Posts and Telecommunication,Beijing 100876,China
Abstract:Large-scale graphs exist everywhere.The continued exponential growth in both the size and complexity of the graphs is posing a new challenge for finding the structures and characters of a large-scale graph.An excellent promising clue for dealing with graphs with great sizes is the emerging MapReduce framework and its open-source implementation,Hadoop.The problem of 3-clique enumeration of a graph is an important operation that can help structure mining and a difficult mission for graphs with great sizes on the single computer.In this paper,we propose a parallel computing model for 3-clique enumeration based on cluster system with the help of MapReduce for large-scale graphs.The process of enumeration is firstly to extract one-leap information of the graph,then the two-leap information and finally,the key-based 3-clique enumeration.Also,we apply the computing model to the computation of clustering coefficient.The computing model is applied to three real-world large CALL graphs and the results of the experiments manifest the good scalability and efficiency of the model.
Keywords:graph mining  social network analysis  MapReduce  clustering coefficient  3-clique
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号