首页 | 本学科首页   官方微博 | 高级检索  
     检索      

FICW: Frequent Itemset Based Text Clustering with Window Constraint
作者姓名:ZHOU  Chong  LU  Yansheng  ZOU  Lei  HU  Rong
作者单位:College of Computer Science and Technology, HuazhongUniversity of Science and Technology, Wuhan 430074,Hubei, China
基金项目:Supported by the Natural Science Foundation of Hubei Province( ABA048)
摘    要:Most of the existing text clustering algorithms overlook the fact that one document is a word sequence with semantic information. There is some important semantic information existed in the positions of words in the sequence. In this paper, a novel method named Frequent Itemset-based Clustering with Window (FICW) was proposed, which makes use of the semantic information for text clustering with a window constraint. The experimental results obtained from tests on three (hypertext) text sets show that FICW outperforms the method compared in both clustering accuracy and efficiency.

关 键 词:文本聚类  搜索引擎  信息检索  语义  FICW
文章编号:1007-1202(2006)05-1345-07
收稿时间:2006-03-14

FICW: Frequent itemset based text clustering with window constraint
ZHOU Chong LU Yansheng ZOU Lei HU Rong.FICW: Frequent itemset based text clustering with window constraint[J].Wuhan University Journal of Natural Sciences,2006,11(5):1345-1351.
Authors:Zhou Chong  Lu Yansheng  Zou Lei  Hu Rong
Institution:(1) College of Computer Science and Technology, Huazhong University of Science and Technology, 430074 Wuhan, Hubei, China
Abstract:Most of the existing text clustering algorithms overlook the fact that one document is a word sequence with semantic information. There is some important semantic information existed in the positions of words in the sequence. In this paper, a novel method named Frequent Itemset-based Clustering with Window (FICW) was proposed, which makes use of the semantic information for text clustering with a window constraint. The experimental results obtained from tests on three (hypertext) text sets show that FICW outperforms the method compared in both clustering accuracy and efficiency.
Keywords:text clustering  frequent itemsets  search engine
本文献已被 CNKI 维普 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号