首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于上下文情景结构的图像语义分割
引用本文:陈乔松,陶亚,申发海,弓攀豪,孙开伟,王进,邓欣.基于上下文情景结构的图像语义分割[J].重庆邮电大学学报(自然科学版),2020,32(2):287-294.
作者姓名:陈乔松  陶亚  申发海  弓攀豪  孙开伟  王进  邓欣
作者单位:重庆邮电大学 计算机科学与技术学院, 计算智能重庆市重点实验室,重庆 400065,重庆邮电大学 计算机科学与技术学院, 计算智能重庆市重点实验室,重庆 400065,重庆邮电大学 计算机科学与技术学院, 计算智能重庆市重点实验室,重庆 400065,重庆邮电大学 计算机科学与技术学院, 计算智能重庆市重点实验室,重庆 400065,重庆邮电大学 计算机科学与技术学院, 计算智能重庆市重点实验室,重庆 400065,重庆邮电大学 计算机科学与技术学院, 计算智能重庆市重点实验室,重庆 400065,重庆邮电大学 计算机科学与技术学院, 计算智能重庆市重点实验室,重庆 400065
基金项目:国家自然科学基金(61806033),重庆市重点产业共性关键技术创新专项(cstc2017zdcy-zdyfX0012),国家社会科学基金西部项目(18XGL013)资助课题
摘    要:语义分割的目标是对图像进行像素级分类并分割为表示不同语义的区域,以便实现对更复杂场景的分析及理解。考虑到更大的卷积核对目标的定位及分割都有促进作用,故提出的网络里使用了一种基于上下文情景结构层(contextual structure)的卷积神经网络,既增大了感受野的范围,同时解决了增大传统卷积核带来大量参数的问题。自然图像中的目标往往具有不同的尺度和纵横比,提取丰富的细节信息和上下文信息对于像素级的预测至关重要。提出的网络同时实现了多尺度特征的提取,从规模较小区域到规模较大区域,再到完整目标,可以有效提取局部信息和全局信息,达到分割多尺度目标的效果。实验中使用PASCAL VOC 2012数据集对提出的方法进行评测,在综合考虑算法复杂度以及运行时间效率的基础上,提出算法取得了更好的实验结果。

关 键 词:语义分割  上下文情景结构  多尺度
收稿时间:2018/7/19 0:00:00
修稿时间:2019/12/22 0:00:00

Semantic segmentation of images based on contextual structure
CHEN Qiaosong,TAO Y,SHEN Fahai,GONG Panhao,SUN Kaiwei,WANG Jin and DENG Xin.Semantic segmentation of images based on contextual structure[J].Journal of Chongqing University of Posts and Telecommunications,2020,32(2):287-294.
Authors:CHEN Qiaosong  TAO Y  SHEN Fahai  GONG Panhao  SUN Kaiwei  WANG Jin and DENG Xin
Institution:Chongqing Key Laboratory of Computational Intelligence, School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China,Chongqing Key Laboratory of Computational Intelligence, School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China,Chongqing Key Laboratory of Computational Intelligence, School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China,Chongqing Key Laboratory of Computational Intelligence, School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China,Chongqing Key Laboratory of Computational Intelligence, School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China,Chongqing Key Laboratory of Computational Intelligence, School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China and Chongqing Key Laboratory of Computational Intelligence, School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China
Abstract:The goal of semantic segmentation is to classify pixel-level images and divide pixels into regions which represent different semantics in order to implement analysis and understanding of complex scene. Considering that the large convolutional kernel has a promoting effect on positioning and segmentation of targets, the proposed network uses a contextual structural convolutional neural network which increases the receptive fields, and at the same time solves the problems that traditional convolution kernels bring a large number of parameters. Targets in natural images often have different scales and aspect ratios. It is very important to extract detailed and contextual information for pixel-level prediction.The proposed network also extracts multi-scale features, from the smaller area to larger area, and then to the complete target.The extracted information enables the network to achieve the multi-scale segmentation. In the experiment, PASCAL VOC 2012 dataset was used to evaluate the method and get better results on the basis of the structural complexity and time efficiency.
Keywords:semantic segmentation  contextual structure  multi-scale
点击此处可从《重庆邮电大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《重庆邮电大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号