基于语义分割注意力与可见区域预测的行人检测方法 Pedestrian Detection Based on Semantic Segmentation Attention and Visible Region Prediction期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于语义分割注意力与可见区域预测的行人检测方法

引用本文：	王璐,王帅,张国峰,徐礼胜.基于语义分割注意力与可见区域预测的行人检测方法[J].东北大学学报(自然科学版),2021,42(9):1261-1267.

作者姓名：	王璐王帅张国峰徐礼胜

作者单位：	(1. 东北大学计算机科学与工程学院，辽宁沈阳110169; 2. 东北大学医学与生物信息工程学院，辽宁沈阳110169; 3. 沈阳东软智能医疗科技研究院有限公司，辽宁沈阳110167)

基金项目：	中央高校基本科研业务费专项资金资助项目(N181604006); 辽宁省自然科学基金资助项目(20170540312); 国家自然科学基金资助项目(61773110); 沈阳市科学技术计划基金资助项目(20-201-4-10).

摘要：	为改善图像中遮挡和小尺寸行人的检测精度，提出一种基于语义分割注意力和可见区域预测的行人检测方法.具体地，在SSD(single shot multi-box detector)目标检测网络的基础上，首先优化SSD的超参数设置，使其更适于行人检测;然后在主干网络中引入基于语义分割的注意力分支来增强行人检测特征的表达能力;最后提出一种检测预测模块，它不仅能同时预测行人整体和可见区域，还能利用可见区域预测分支所学的特征去引导整体检测特征的学习，提升检测效果.在Caltech行人检测数据集上进行了实验，所提方法的对数平均缺失率为5.5%，与已有方法相比具有一定的优势.
关键词：	行人检测卷积神经网络语义分割注意力行人可见区域预测多任务网络
修稿时间：	2021-01-04
Pedestrian Detection Based on Semantic Segmentation Attention and Visible Region Prediction

WANG Lu,WANG Shuai,ZHANG Guo-feng,XU Li-sheng.Pedestrian Detection Based on Semantic Segmentation Attention and Visible Region Prediction[J].Journal of Northeastern University(Natural Science),2021,42(9):1261-1267.

Authors:	WANG Lu WANG Shuai ZHANG Guo-feng XU Li-sheng

Institution:	1. School of Computer Science & Engineering， Northeastern University， Shenyang 110169， China; 2. School of Medicine and Biological Information Engineering， Northeastern University， Shenyang 110169， China; 3. Neusoft Research of Intelligent Healthcare Technology， Co.， Ltd.， Shenyang 110167， China.

Abstract:	To improve the detection performance on occluded and small pedestrians in images， a pedestrian detection method based on semantic segmentation attention and visible region prediction was proposed. Specifically， based on the single shot multi-box detector(SSD)object detection network， the hyperparameter setting of the SSD was firstly optimized to make it more suitable for pedestrian detection. Then the semantic segmentation attention branch was introduced into the network to enhance the pedestrian detection features learned by the network. Finally， a detection prediction module which can simultaneously detect the full bodies and visible regions of pedestrians was developed. This module has the advantage of leveraging the features learned from visible regions to guide the learning of the full-body detection features， hence improving the overall detection accuracy. The experiment carried out on the Caltech pedestrian detection benchmark shows that the log-average miss rate of the proposed method is 5.5%， which is competitive compared with existing pedestrian detection approaches.

Keywords:	pedestrian detection convolutional neural network semantic segmentation attention(SSA) pedestrian visible region detection multi-task network
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《东北大学学报(自然科学版)》浏览原始摘要信息
	点击此处可从《东北大学学报(自然科学版)》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏