
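Cross-modal Retrieval Based on Batch Loss
Cite this article: LIU Shuang, QIAO Han, XU Qingzhen. Cross-modal retrieval based on batch loss[J]. Journal of South China Normal University (Natural Science Edition), 2021, 53(6): 115-121.
Authors: LIU Shuang, QIAO Han, XU Qingzhen
Affiliation: School of Computer Science, South China Normal University, Guangzhou 510631
Funding: Guangdong Provincial Science and Technology Research Program (Project No. 201903010103)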
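Abstract: To address the problem that cross-modal retrieval methods based on pairwise or triplet samples construct highly redundant and uninformative sample pairs, a cross-modal retrieval method based on batch loss (BLCMR) is proposed. First, a batch loss is introduced that takes the similarity of the embedded samples into account and effectively preserves the invariance of cross-modal samples. Then, an iterative method is introduced to correct the predicted category labels, effectively distinguishing the semantic category information of the samples. Experimental results on three public datasets (Wikipedia, Pascal Sentence and NUS-WIDE-10k) show that BLCMR pulls cross-modal samples closer together and effectively improves the final cross-modal retrieval accuracy.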

Keywords: cross-modal retrieval; batch loss; iterative method
Received: 2021-05-02

The Cross-modal Retrieval Based on Batch Loss
Institution:School of Computer Science, South China Normal University, Guangzhou 510631, China
Abstract: Cross-modal retrieval methods that train on pairwise or triplet samples construct sample pairs that are highly redundant and carry little information. To address this problem, a cross-modal retrieval method based on batch loss (BLCMR) is proposed. First, a batch loss is introduced; by taking into account the similarity of the embedded samples, it effectively maintains the invariance of cross-modal samples. Second, an iterative method is introduced to correct the predicted category labels, which effectively distinguishes the semantic category information of the samples. Experimental results on three public datasets (Wikipedia, Pascal Sentence and NUS-WIDE-10k) show that BLCMR reduces the distance between cross-modal samples and effectively improves the final cross-modal retrieval accuracy.
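The abstract does not give the formula of the batch loss, so the following is only an illustrative sketch of the general idea of scoring every image/text pair in a batch instead of sampling pairs or triplets. It is not the BLCMR loss. The function names (`cosine`, `softplus`, `batch_similarity_loss`), the `scale` parameter, the logistic form of the loss, and the assumption that the i-th image and the i-th text share `labels[i]` are all hypothetical choices made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0


def softplus(z):
    """Numerically stable log(1 + exp(z))."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def batch_similarity_loss(img_emb, txt_emb, labels, scale=5.0):
    """Hypothetical batch-level loss (an assumption, not the paper's formula).

    Every image/text pair in the batch contributes one logistic term:
    pairs with the same label are rewarded for high cosine similarity,
    pairs with different labels are rewarded for low similarity.
    """
    total = 0.0
    for i, u in enumerate(img_emb):
        for j, v in enumerate(txt_emb):
            sign = 1.0 if labels[i] == labels[j] else -1.0
            total += softplus(-sign * scale * cosine(u, v))
    return total / (len(img_emb) * len(txt_emb))


# Toy batch: two classes, two samples each, 2-D embeddings.
labels = [0, 0, 1, 1]
images = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
aligned_texts = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
swapped_texts = [[0.0, 1.0], [0.0, 1.0], [1.0, 0.0], [1.0, 0.0]]

print(batch_similarity_loss(images, aligned_texts, labels))
print(batch_similarity_loss(images, swapped_texts, labels))
```

In this toy batch the loss is lower when text embeddings lie close to the image embeddings of the same class than when the two classes are swapped, which is the behaviour a similarity-aware batch-level objective is meant to encourage.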
This article is indexed in Wanfang Data and other databases.