结合二阶注意力机制的多尺度人体姿态估计 Multi-scale human pose estimation combined with second-order attention mechanism期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

结合二阶注意力机制的多尺度人体姿态估计

引用本文：	张云绚,董绵绵,王鹏,李晓艳,吕志刚,邸若海,毋宁.结合二阶注意力机制的多尺度人体姿态估计[J].科学技术与工程,2022,22(32):14321-14327.

作者姓名：	张云绚董绵绵王鹏李晓艳吕志刚邸若海毋宁

作者单位：	西安工业大学电子信息工程学院

基金项目：	国家自然基金资助项目(62171360)、陕西省科技厅重点研发计划（2022GY-110）、西安工业大学校长基金面上培育项目（XGPY200217）、西安市智能兵器重点实验室（2019220514SYS020CG042）

摘要：	为解决人体姿态估计任务中存在的不同视角下人体实例尺度变化、遮挡问题导致的人体关键点定位不准确问题，提出融入二阶注意力机制的多尺度人体姿态估计网络模型（GOS-HRNet）。首先，在特征提取阶段为了获得高质量的特征图，通过在多分辨率网络结构中使用Octave卷积，保留更多的图像空间特征信息以提高关键点定位准确率；然后，为有效的利用图像上下文信息，融入二阶注意力模块Gsop使网络能更好的学习各分辨率表征的空间信息；最后，为了应对尺度变换对关键点定位的影响采用尺度增强训练方法，提高模型对尺度变化的鲁棒性。本文提出模型在MS COCO 2017数据集上进行实验，结果表明：提出的GOS-HRNet模型平均检测精度比HRNet模型提升了2.2%，能够更加准确的利用上下文信息、丰富空间特征信息以提高对关键点定位的准确性。
关键词：	多尺度高质量特征图姿态估计注意力机制
收稿时间：	2021/10/18 0:00:00
修稿时间：	2022/8/2 0:00:00
Multi-scale human pose estimation combined with second-order attention mechanism

Zhangyunxuan,Dongmianmian,Wangpeng,Lixiaoyan,Lvzhigang,Diruohai,Wuning.Multi-scale human pose estimation combined with second-order attention mechanism[J].Science Technology and Engineering,2022,22(32):14321-14327.

Authors:	Zhangyunxuan Dongmianmian Wangpeng Lixiaoyan Lvzhigang Diruohai Wuning

Abstract:	In order to solve the problem of inaccurate positioning of the key points of the human body caused by the change of the human body instance scale and the occlusion problem in the human body pose estimation task in different perspectives, a multi-scale human body pose estimation network model (GOS-HRNet) incorporating a second-order attention mechanism is proposed. First of all, in the feature extraction stage, for obtaining high-quality feature maps, Octave convolution is used in the multi-resolution network structure to retain more image spatial feature information to improve the accuracy of key point positioning. Then, the second-order attention module Gsop is incorporated so that the network can better learn the spatial information represented by each resolution, and effectively use the image context information. Finally, so as to cope with the impact of scale transformation on the positioning of key points, a multi-scale training method is adopted to improve the generalization of the model to scale changes. This paper proposes a model to conduct experiments on the MS COCO 2017 data set. The results show that the average detection accuracy of the proposed GOS-HRNet model is improved by 2.2% compared with the HRNet model. It can more accurately use context information and enrich spatial feature information to improve the key Accuracy of point positioning.

Keywords:

	点击此处可从《科学技术与工程》浏览原始摘要信息
	点击此处可从《科学技术与工程》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏