基于超复数小波和图像空域的卷积网络融合注视点预测算法 Gaze prediction algorithm based on hypercomplex wavelet convolutional network期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于超复数小波和图像空域的卷积网络融合注视点预测算法

引用本文：	李策,朱子重,许大有,高伟哲,靳山岗.基于超复数小波和图像空域的卷积网络融合注视点预测算法[J].兰州理工大学学报,2021,47(5):76.

作者姓名：	李策朱子重许大有高伟哲靳山岗

作者单位：	兰州理工大学电气工程与信息工程学院,甘肃兰州 730050

基金项目：	国家自然科学基金(61866022),甘肃省基础研究创新群体项目(1506RJIA031),国防基础科研项目(JCKY2018427C002)

摘要：	针对已有注视点预测模型存在特征细节缺失、尺度单一和背景信息干扰严重导致的注视点预测精度偏低等问题,提出了一种基于超复数小波和图像空域的卷积网络融合注视点预测算法.首先,针对细节特征丢失问题,使用超复数小波变换在频域中提取图像的细节特征,与卷积网络提取的空域特征进行融合.然后,通过空洞空间金字塔池化模块,融合不同感受得到的特征图,有效解决了特征尺度单一的问题.最后,引入了残差卷积注意力模块,结合空间和通道的注意力机制,能够有效抑制背景信息的干扰,提高注视点预测精度.在SALICON数据集上,CC、sAUC和SIM评价指标下,该算法的性能达到0.884 7、0.769 3和0.778 0;在CAT2000数据集上,该算法在相应指标下的性能为0.735 5、0.870 1和0.664 5.主客观对比实验结果表明,该算法具有较好的注视点预测能力.
关键词：	注视点预测超复数小波变换空域特征卷积网络
收稿时间：	2020-01-10
Gaze prediction algorithm based on hypercomplex wavelet convolutional network

LI Ce,ZHU Zi-zhong,XU Da-you,GAO Wei-zhe,JIN Shan-gang.Gaze prediction algorithm based on hypercomplex wavelet convolutional network[J].Journal of Lanzhou University of Technology,2021,47(5):76.

Authors:	LI Ce ZHU Zi-zhong XU Da-you GAO Wei-zhe JIN Shan-gang

Institution:	College of Electrical and Information Engineering, Lanzhou Univ. of Tech., Lanzhou 730050, China

Abstract:	Gaze based prediction algorithms has a wide range of applications in object recognition, video compression, object tracking and so on. For existing gaze prediction models, the accuracy of gaze prediction is low due to the lack of feature details, single scale, and serious background information interference. This paper proposes a gaze prediction algorithm based on hypercomplex wavelet convolutional network. Firstly, aiming at the problem of loss of detailed features, the hypercomplex wavelet transform is used to extract the detailed features of the image in the frequency domain and fused with the spatial features extracted by the convolutional network. Then, through the atrous spatial pyramid pooling module, the feature maps obtained from different receptive fields are fused to effectively solve the problem of single feature scale. Finally, the proposed algorithm introduces a residual convolutional attention module, which combines spatial and channel attention mechanisms to effectively suppress the interference of background information and improve the accuracy of gaze prediction. On the SALICON datasets, CC, sAUC and SIM evaluation metrics, the performance of the proposed algorithm reaches 0.884 7, 0.769 3 and 0.778 0. On the CAT2000 datasets, the performance of the proposed algorithm is 0.735 5, 0.870 1, and 0.664 5. The experimental results show that the proposed algorithm has a good ability to predict fixation points.

Keywords:	gaze prediction hypercomplex wavelet transform spatial features convolutional neural network
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《兰州理工大学学报》浏览原始摘要信息
	点击此处可从《兰州理工大学学报》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏