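Frequent Subgraph Mining Algorithm for Directed Labeled Graphs Based on Canonical Code
Cite this article: Ma Xiaopei. Frequent Subgraph Mining Algorithm for Directed Labeled Graphs Based on Canonical Code[J]. Science Technology and Engineering, 2012, 12(20): 5060-5065.
Author: Ma Xiaopei
Affiliation: 1. College of Computer and Information, Hohai University, Changzhou 213022
2. College of Computer and Information, Hohai University, Changzhou 213022; Jiangsu Key Laboratory of Power Transmission and Distribution Equipment Technology, Changzhou 213022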
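Abstract: Most frequent subgraph mining algorithms are designed for undirected graphs and do not apply to directed graphs, which have greater practical significance. To address this, the code structure of the undirected graph mining algorithm gSpan is extended and an improved canonical form is adopted so that the code applies to directed graphs. The DADI++ storage structure, designed for directed graphs, is used to store the graph set, which reduces the cost of data access operations. In addition, a hash table stores the hash addresses and supports of isomorphic graphs during mining, which avoids repeated scans of the graph set and direct isomorphism tests. Experimental results on real datasets show that the proposed Dspan algorithm is correct and more efficient than the FFSM algorithm.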

Keywords: directed graph mining; frequent subgraph; subgraph isomorphism; canonical code; DADI++
Received: 4/12/2012 1:52:07 PM
Revised: 4/12/2012 1:52:07 PM

Digraph Labeled Frequent Subgraph Mining Algorithm Based on Canonical Code
Ma Xiaopei. Digraph Labeled Frequent Subgraph Mining Algorithm Based on Canonical Code[J]. Science Technology and Engineering, 2012, 12(20): 5060-5065.
Authors: Ma Xiaopei
Institution: 1. Institute of Computer and Information, Hohai University, Changzhou 213022, P.R. China; 2. Jiangsu Key Laboratory of Power Transmission and Distribution Equipment Technology, Changzhou 213022, P.R. China
Abstract: Most frequent subgraph mining algorithms are based on undirected graphs and are unsuitable for directed graphs, which are more meaningful in practice. By extending the gSpan code structure and adopting an improved canonical form, the code is made applicable to directed graphs. The DADI++ storage structure is used to store the graph set, which reduces the cost of data access operations. During mining, a hash table stores the hash addresses and supports of isomorphic graphs, so the graph set is not scanned repeatedly and direct isomorphism testing is avoided. Experimental results on real datasets show that the proposed algorithm, Dspan, is correct and more efficient than FFSM.
Keywords: digraph mining; frequent subgraph; subgraph isomorphism; canonical code; DADI++
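
The hash-table idea in the abstract can be illustrated with a minimal sketch. It is not the paper's Dspan implementation: the canonical code below is a brute-force minimum over all vertex orderings (practical only for very small patterns), standing in for the extended gSpan code, and DADI++ is not modeled. The names `canonical_code` and `count_support` are hypothetical. The point shown is that isomorphic directed patterns map to one key, so support can be accumulated in a hash table without pairwise isomorphism tests.

```python
from collections import defaultdict
from itertools import permutations


def canonical_code(labels, edges):
    """Brute-force canonical code of a small directed labeled graph.

    labels: list of vertex labels (index = vertex id)
    edges:  iterable of (src, dst, edge_label) triples; direction matters
    Assumption: this stands in for the paper's extended gSpan code.
    """
    n = len(labels)
    best = None
    for perm in permutations(range(n)):
        # perm[v] is the new position assigned to vertex v
        vlabels = [None] * n
        for v in range(n):
            vlabels[perm[v]] = labels[v]
        elist = tuple(sorted((perm[s], perm[d], lab) for s, d, lab in edges))
        code = (tuple(vlabels), elist)
        if best is None or code < best:
            best = code
    return best


def count_support(occurrences):
    """Map each canonical code to the number of distinct graphs containing it.

    occurrences: iterable of (graph_id, labels, edges) pattern occurrences.
    """
    table = defaultdict(set)  # hash table: canonical code -> graph ids
    for graph_id, labels, edges in occurrences:
        table[canonical_code(labels, edges)].add(graph_id)
    return {code: len(ids) for code, ids in table.items()}


# Hypothetical occurrences: A->B found in graphs 0 and 1 under different
# vertex numberings, and B->A found in graph 1 only.
occurrences = [
    (0, ["A", "B"], [(0, 1, "x")]),
    (1, ["B", "A"], [(1, 0, "x")]),
    (1, ["A", "B"], [(1, 0, "x")]),
]
support = count_support(occurrences)
```

Here the first two occurrences share one table entry even though their vertex ids differ, while the reversed edge B->A gets its own entry, which is the distinction a directed canonical form has to preserve.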
This article is indexed in CNKI, Wanfang Data, and other databases.