首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于多视角对称非负矩阵分解的跨模态信息检索方法
引用本文:柳利芳,马园园.基于多视角对称非负矩阵分解的跨模态信息检索方法[J].山东大学学报(理学版),2022,57(7):65-72.
作者姓名:柳利芳  马园园
作者单位:1.安阳师范学院继续教育学院, 河南 安阳 455000;2.安阳师范学院甲骨文信息处理教育部重点实验室, 河南 安阳 455000
基金项目:国家自然科学科基金资助项目(U1804153);教育部人文社科项目(20YJC740042)
摘    要:针对跨模态信息检索的策略和核心问题,从提升检索性能的角度,分析了多视角对称非负矩阵分解方法用于跨模态检索的优势,提出了一种新的基于对称非负矩阵分解的跨模态检索框架。首先在Wikipedia、Pascal公开数据集上习得一致的子空间表示;然后基于该子空间,设计了一种实时样本在子空间中的投影方法。与典型相关分析、语义匹配和偏最小二乘回归相比,在MAP和PR曲线这2个指标上,本文所提出的方法具有最优的性能表现,表明了该方法应用于跨模态信息检索任务中的潜力。

关 键 词:多视角聚类  对称非负矩阵分解  跨模态检索  子空间学习  

Cross-modal information retrieval method based on multi-view symmetric nonnegative matrix factorization
LIU Li-fang,MA Yuan-yuan.Cross-modal information retrieval method based on multi-view symmetric nonnegative matrix factorization[J].Journal of Shandong University,2022,57(7):65-72.
Authors:LIU Li-fang  MA Yuan-yuan
Abstract:This article summarizes the strategies and core issues in cross-modal information retrieval and analyses the advantages of multi-view symmetric nonnegative matrix factorization for cross-modal retrieval in terms of improving retrieval effect. A new cross-modal retrieval framework based on symmetric non-negative matrix factorization is proposed. Firstly, a consistent subspace representation is learned from the Wikipedia and Pascal datasets. Then, based on the subspace, a method of mapping real-time samples into subspaces is designed. Compared with the canonical correlation analysis, semantic matching and partial least squares regression, the proposed method has the best performance in terms of MAP and PR curves. The results demonstrate that the proposed algorithm has the potential ability in the task of cross-modal information retrieval.
Keywords:multi-view clustering  symmetric nonnegative matrix factorization  cross-modal retrieval  subspace learning  
点击此处可从《山东大学学报(理学版)》浏览原始摘要信息
点击此处可从《山东大学学报(理学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号