首页 | 本学科首页   官方微博 | 高级检索  
     检索      

注意力机制下双模态交互融合的目标跟踪网络
引用本文:姚云翔,陈莹.注意力机制下双模态交互融合的目标跟踪网络[J].系统工程与电子技术,2022,44(2):410-419.
作者姓名:姚云翔  陈莹
作者单位:江南大学物联网工程学院, 江苏 无锡 214122
基金项目:国家自然科学基金(61573168)资助课题。
摘    要:针对当前目标跟踪难以适应低光照、运动模糊、目标快速移动等挑战,提出了空间通道注意力下的红外与可见光双模态交互融合跟踪网络.首先,红外图像与可见光图像通过backbone三层卷积提取分层特征,并降维至统一分辨率,之后级联三层特征形成各模态特征.其次,多模态特征通过所设计的空间通道自注意力模块和跨模态交互注意力模块使得模态...

关 键 词:红外与可见光  目标跟踪  深度学习  注意力融合
收稿时间:2021-01-28

Target tracking network based on dual-modal interactive fusion under attention mechanism
YAO Yunxiang,CHEN Ying.Target tracking network based on dual-modal interactive fusion under attention mechanism[J].System Engineering and Electronics,2022,44(2):410-419.
Authors:YAO Yunxiang  CHEN Ying
Institution:College of Computer Internet of Things, Jiangnan University, Wuxi 214122, China
Abstract:Aiming at the challenges of current object tracking that is difficult to low illusion,motion blur,and fast motion,a dual-modal interacive fusion tracking network of infrared and visible under spatial channel attention is proposed.First,the infrared and RGB images are extracted through the backbone three-layer convalution to extract layered features which are normalized to the same resolution via dimension reduction.The three-layer features are cascaded to form each modal feature.Then the features are sent to the designed spatial channel self-attention module and the cross-module interactive attention module which lead network focus on global spatial features and high-response channels and therefore improve the complementarity of the dual-modal information.The interacted features of the dual-modal are cascaded for the fusion and finally sent to three fully connected layers to complete the target tracking.The experimental results of the largest RGB-Themeral(RGB-T)tracking data set RGBT234 show that the proposed network can effectively extract dual-modal interactive features and improve target tracking accuracy.Its Precision/Success Rateis improced by 5.3%and 4.2%,respectively,compared with the baseline network.
Keywords:RGB-Themeral(RGB-T)  object tracking  deep learning  attention fusion
本文献已被 维普 等数据库收录!
点击此处可从《系统工程与电子技术》浏览原始摘要信息
点击此处可从《系统工程与电子技术》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号