Image Semantic Segmentation Based on Residual Attention and Pyramid Upsampling
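Cite this article: Gao Junli, Zhou Hua, Song Haitao, Guo Jing, Zhang Hui. Image Semantic Segmentation Based on Residual Attention and Pyramid Upsampling[J]. Journal of Xinyang Normal University (Natural Science Edition), 2022, 0(1): 134-140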
Authors: Gao Junli  Zhou Hua  Song Haitao  Guo Jing  Zhang Hui
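Affiliations: 1. School of Automation, Guangdong University of Technology; 2. School of Business Administration, South China University of Technology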
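Abstract: To address the loss of detail and the blurred, coarse boundaries between classes in image semantic segmentation, a residual attention module is designed on top of an encoder-decoder architecture by combining residual modules with an attention mechanism. The attention mechanism strengthens the relationships among feature-map channels, which improves the fineness of the segmentation. To improve the model's ability to recognize objects at multiple scales, a pyramid upsampling module is designed based on the pyramid model. Using the feature maps of different scales produced during encoding,...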

Keywords: residual attention  pyramid model  upsampling  encoder-decoder  convolutional neural network  image semantic segmentation
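
The abstract describes a residual attention module that strengthens the relationships among feature-map channels, but it does not give the module's internals. The following is only a rough sketch of one common way to combine the two ideas: it assumes a squeeze-and-excitation-style channel gate wrapped in an identity shortcut. The function name `channel_attention_residual`, the weight matrices `w1` and `w2`, and the absence of biases and convolutions are all illustrative assumptions, not the authors' design.

```python
import math


def channel_attention_residual(feature_map, w1, w2):
    """Hypothetical sketch: channel gating plus an identity shortcut.

    feature_map: list of C channels, each an H x W list of lists of floats.
    w1: hidden x C weight matrix (bottleneck layer, assumed, no bias).
    w2: C x hidden weight matrix (expansion layer, assumed, no bias).
    """
    # Squeeze: global average pooling yields one descriptor per channel.
    squeezed = [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]
    # Excitation: bottleneck with ReLU, then expansion with a sigmoid,
    # producing one gate in (0, 1) per channel.
    hidden = [
        max(0.0, sum(w * s for w, s in zip(row, squeezed))) for row in w1
    ]
    gates = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
        for row in w2
    ]
    # Reweight each channel by its gate, then add the input back
    # (residual connection), so the output is x * gate + x.
    return [
        [[x * g + x for x in row] for row in channel]
        for channel, g in zip(feature_map, gates)
    ]
```

Adding the input back means the gate can only emphasize channels relative to the identity path rather than suppress them entirely, which is one common motivation for pairing attention with a residual connection.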

Image Semantic Segmentation Based on Residual Attention and Pyramid Upsampling
Affiliation: School of Automation, Guangdong University of Technology; School of Business Administration, South China University of Technology
Abstract: To address the loss of detail and the blurred, coarse class boundaries in image semantic segmentation, a residual attention module is designed on an encoder-decoder architecture by combining residual modules with an attention mechanism. The attention mechanism strengthens the connections among feature-map channels to improve the fineness of the segmentation. To improve recognition of multi-scale objects, a joint pyramid upsampling module is designed based on pyramid models. It uses the feature maps of different scales generated during encoding to extract semantic information and improves the model's ability to recognize scenes. Finally, the proposed scheme is verified experimentally on the VOC 2012 and Cityscapes data sets. Compared with FCN-8s, SegNet, DeepLab-v2, and PSPNet, the mean Intersection over Union (mIoU) and mean Pixel Accuracy (mPA) increase by up to 15.9% and 3.57% on VOC 2012 and by up to 17.8% and 13.3% on Cityscapes, respectively. The semantic segmentation quality is significantly improved.
Keywords: residual attention  pyramid model  upsampling  encoder-decoder  convolutional neural network  image semantic segmentation
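
The abstract reports results as mean Intersection over Union (mIoU) and mean Pixel Accuracy (mPA). As a reminder of what these numbers measure, here is a minimal sketch using the usual confusion-matrix definitions. The paper's exact evaluation protocol (for example, how ignored labels are handled) is not stated in the abstract, so the function names and the choice to skip classes that never occur are assumptions made for illustration.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Count pixels: rows are ground-truth classes, columns are predictions."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def mean_iou(cm):
    """Average of TP / (TP + FP + FN) over classes that occur at all."""
    ious = []
    for c in range(len(cm)):
        tp = cm[c][c]
        fn = sum(cm[c]) - tp                    # class c pixels predicted as other classes
        fp = sum(row[c] for row in cm) - tp     # other pixels predicted as class c
        denom = tp + fp + fn
        if denom > 0:  # assumption: skip classes absent from both labels and predictions
            ious.append(tp / denom)
    return sum(ious) / len(ious)


def mean_pixel_accuracy(cm):
    """Average per-class recall, TP / (TP + FN), over classes present in the labels."""
    accs = [cm[c][c] / sum(cm[c]) for c in range(len(cm)) if sum(cm[c]) > 0]
    return sum(accs) / len(accs)


# Flattened label maps for a tiny two-class example.
labels = [0, 0, 1, 1]
preds = [0, 1, 1, 1]
cm = confusion_matrix(labels, preds, num_classes=2)
print(mean_iou(cm), mean_pixel_accuracy(cm))
```

In this toy example class 0 has IoU 1/2 and class 1 has IoU 2/3, so mIoU is 7/12, while the per-class accuracies are 1/2 and 1, giving an mPA of 0.75.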