首页 | 本学科首页   官方微博 | 高级检索  
     检索      

WHPM-Apriori:网页超链接挖掘的Apriori改进算法
引用本文:姜玥,井福荣,谢青,李建阳,杨玉涵.WHPM-Apriori:网页超链接挖掘的Apriori改进算法[J].西南民族学院学报(自然科学版),2007,33(3):644-647.
作者姓名:姜玥  井福荣  谢青  李建阳  杨玉涵
作者单位:西南民族大学计算机科学与技术学院,江西理工大学信息工程学院,重庆邮电学院经济管理学院,西南民族大学计算机科学与技术学院,西南民族大学计算机科学与技术学院 成都610041 四川大学计算机学院,成都610064,赣州341000,重庆400065,成都610041,成都610041
摘    要:网页链接关系的设计影响到用户的访问效率,通过日志挖掘发现网页间的关联关系,使网站设计更趋合理,便于用户访问.为了提取页面间的关系,日志数据预处理后,利用Apriori算法发现频繁集,找到页面间的关联规则.网站结构主要由网页和网页间的超链接组成,针对网页超链接结构的特点:一条超链接只能建立在两个网页上.发现频繁集只需找出所有2-项集即可.提出网页超链接挖掘的Apriori改进算法(WPHM-Apriori).实验表明,该算法有效地降低Apriori的时间复杂度.

关 键 词:数据挖掘  关联规则  网站结构
文章编号:1003-2843(2007)03-0644-04
修稿时间:2006-11-13

WHPM-Apriori:web page hyperlink mining Apriori improving algorithm
JIANG Yue,JING Fu-rong,XIE Qing,LI Jian-yang,YANG Yu-han.WHPM-Apriori:web page hyperlink mining Apriori improving algorithm[J].Journal of Southwest Nationalities College(Natural Science Edition),2007,33(3):644-647.
Authors:JIANG Yue  JING Fu-rong  XIE Qing  LI Jian-yang  YANG Yu-han
Abstract:The design of the hyperlinks between web pages influences the accessing efficiency of the users. Finding the associating relation through mining logs helps design the website more reasonablely and helps users access conveniently. In order to draw out the relations between pages, Apriori algorithm is used to find frequent sets and the associating rules between pages, after logs are preprocessed. The structure of the website is mainly composed of pages and hyperlinks. The feature of the structure of the hyperlink is that one hyperlink only is between two pages. So it is enough to find 2-item sets in order to find frequent sets. Web page hyperlink mining Apriori algorithm(WPHM-Apriori) is put forward. The experiment shows that the improving algorithm efficiently decreases the time complexity of Apriori algorithm.
Keywords:data mining  association rule  structure of website
本文献已被 CNKI 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号