首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一个简单的Web日志挖掘系统
引用本文:杨怡玲,管旭东,陆丽娜,尤晋元.一个简单的Web日志挖掘系统[J].上海交通大学学报,2000,34(7):932-935.
作者姓名:杨怡玲  管旭东  陆丽娜  尤晋元
作者单位:1. 上海交通大学,计算机科学与工程系,上海,200030
2. 西安交通大学,计算机科学与工程系,西安,710049
摘    要:在分析Web日志挖掘的困难及对策的基础上,给出了一个简单的Web日志挖掘系统(SWLMS)的体系结构,具体介绍了SWLMS中日志的预处理过程,包括数据净倾、用户识别、会话识别、路径补充的主要任务及其实现,并着重介绍了预处理之后的序列模式识别过程和算法,包括最大向前路径的识别和频繁遍历路径的发现,并给出了实验结果。

关 键 词:数据挖掘  Web日志挖掘  序列模式识别  SWLMS
修稿时间:1999-08-30

A Simple Web Log Mining System
YANG Yi-ling,GUAN Xu-dong,LU Li-na,YOU Jin-yuan.A Simple Web Log Mining System[J].Journal of Shanghai Jiaotong University,2000,34(7):932-935.
Authors:YANG Yi-ling  GUAN Xu-dong  LU Li-na  YOU Jin-yuan
Abstract:This paper mainly discussed Web log mining, the application of date mining to log data generated by Web servers, which could assist the webmaster to optimize site architecture and increase visiting efficiency. Based on the analysis of difficulties and the corresponding solutions of Web log mining, the architecture of SWLMS, our sample Web log mining system was addressed. The data preprocessing phase in SWMLS, including data cleaning, user recognition, session identification and path filling was discussed in detail. Then, the sequential pattern recognition phase and its algorithms were presented, including the recognition of maximum forward paths and frequent traversal paths, with some experimental results presented.
Keywords:data mining  Web log mining  sequential pattern recognition  maximum forward path
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号