首页 | 本学科首页   官方微博 | 高级检索  
     检索      

利用关系数据库系统对半结构化数据进行近似查询
引用本文:韩恺,岳丽华,龚育昌.利用关系数据库系统对半结构化数据进行近似查询[J].中国科学技术大学学报,2005,35(5):674-682.
作者姓名:韩恺  岳丽华  龚育昌
作者单位:中国科学技术大学计算机科学与技术系,安徽,合肥,230026
基金项目:中国科学院知识创新工程项目
摘    要:提出一种利用关系数据库系统在一般图结构的半结构化数据上进行近似查询的途径.根据嵌套结构和文本值的相似性来度量路径的相似性;根据路径的相似性得到查询目标节点与数据源节点的相似性.为返回数据源中与查询目标节点相似的节点,首先提取出数据源中长度在固定范围内的所有路径,然后利用关系数据库系统将其与查询路径进行相似性连接,并按相似度从大到小返回所有结果.为提高相似性连接的效率,引入q窗口概念,并利用若干路径相似的必要条件来减少计算相似性函数的次数.试验证明了其有效性.

关 键 词:半结构化数据  图结构  近似查询  相似性度量  相似性连接  关系数据库系统
文章编号:0253-2778(2005)05-0674-09
收稿时间:2003-10-28
修稿时间:2004-04-28

Approximate Querying of Semistructured Data Using a Relational Database System
HAN Kai,YUE Li-hua,GONG Yu-chang.Approximate Querying of Semistructured Data Using a Relational Database System[J].Journal of University of Science and Technology of China,2005,35(5):674-682.
Authors:HAN Kai  YUE Li-hua  GONG Yu-chang
Institution:Department of Computer Science and Technology, USTC, Hefei 230026, China
Abstract:An approach to approximate querying of general graph structured semistructured data is proposed based on a relational database system. A similarity measure for paths based on nesting structures and text values was brought out, from which the similarity between the target query node and a data source node was derived. To get the source nodes similar to the target query node, firstly the paths whose lengths were within an interval were extracted from the data source, then a similarity join process between them and the query paths was carried out using a relational database system. Finally the query result nodes were returned in a descend order of their similarity to the target query node. To make the similarity join process more efficient, the concept of q-windows was introduced and several necessary conditions were used to decrease the time of calculating costly similarity function. The experiments prove the effectiveness of the approach.
Keywords:semistructured data  graph structured  approximate querying  similarity measure  similarity join  RDBMS
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号