首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于PageRank与HITS的改进算法的网页排名优化
作者姓名:库珊  刘钊
作者单位:武汉科技大学计算机科学与技术学院;武汉科技大学智能信息处理与实时工业系统湖北省重点实验室
基金项目:国家自然科学基金资助项目(51874217).
摘    要:针对传统网页排序算法PageRank和HITS中存在的主题漂移、检索效率低等不足,本文提出了一种改进算法PHIA(PageRank and HITS Improved Algorithm)。该算法继承了HITS算法获取根集和基本集的方法,并且使用根集中所有网页的PageRank值作为Hub和Authority初始迭代值,最后根据马尔可夫链求随机矩阵的特征向量的方式来获取网页排名的静态分布。基于随机关键词的检索结果可知,相比于传统的PageRank和HITS算法,改进PHIA算法具有更快的收敛速度,并且在一定程度上提高了网页排序的准确度。

关 键 词:PageRank算法  HITS算法  链接结构  网页排序  算法改进
收稿时间:2018/11/2 0:00:00

An improved algorithm for page rank optimization based on PageRank and HITS algorithms
Authors:Ku Shan and Liu Zhao
Institution:1. College of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430065, China;2. Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System, Wuhan University of Science and Technology, Wuhan 430065, China and 1. College of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430065, China;2. Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System, Wuhan University of Science and Technology, Wuhan 430065, China
Abstract:Aiming at overcoming the disadvantages such as topic drift and low retrieval efficiency in the traditional webpage ranking algorithms PageRank and HITS, an improved algorithm named PHIA (PageRank and HITS Improved Algorithm) was proposed. Firstly, the algorithm inherits the way of HITS algorithm to obtain the root set and the basic set, then employs the PageRank value of all web pages in the root set as the initial iteration value of Hub and Authority, and finally, the page ranking status is obtained by searching the eigenvectors of random matrix based on the Markov chain. The calculation results based on random keyword retrieval show that compared with the traditional PageRank and HITS algorithms, the improved PHIA algorith not only has a faster convergence rate but also improves the accuracy of page ranking to some extent.
Keywords:PageRank algorithm  HITS algorithm  link structure  webpage ranking  algorithm improvement
本文献已被 CNKI 等数据库收录!
点击此处可从《》浏览原始摘要信息
点击此处可从《》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号