首页 | 本学科首页   官方微博 | 高级检索  
     

基于特征金字塔网络的自然场景图像文本检测
引用本文:林金朝,文盼,庞宇. 基于特征金字塔网络的自然场景图像文本检测[J]. 重庆邮电大学学报(自然科学版), 2022, 34(1): 155-163. DOI: 10.3979/j.issn.1673-825X.202001150022
作者姓名:林金朝  文盼  庞宇
作者单位:重庆邮电大学 通信与信息工程学院,重庆400065;光电信息感测与传输技术重庆市重点实验室,重庆400065,光电信息感测与传输技术重庆市重点实验室,重庆400065
基金项目:国家自然科学基金(61301124,61671091)
摘    要:针对深度学习中对任意形状文本检测准确率不高的问题,提出了一种结合特征金字塔网络(feature pyramid network,FPN)和内核尺度扩展算法的文本检测网络模型.特征金字塔网络能够提取卷积层中更加鲁棒的特征,融合后生成不同尺度的特征内核;内核尺度扩展算法将生成的最小特征内核逐渐扩展为包围完整文本实例的特征图...

关 键 词:深度学习  文本检测  特征金字塔  内核扩展
收稿时间:2020-01-15
修稿时间:2021-12-03

Text detection of natural scene images based on feature pyramid network
LIN Jinzhao,WEN Pan,PANG Yu. Text detection of natural scene images based on feature pyramid network[J]. Journal of Chongqing University of Posts and Telecommunications, 2022, 34(1): 155-163. DOI: 10.3979/j.issn.1673-825X.202001150022
Authors:LIN Jinzhao  WEN Pan  PANG Yu
Affiliation:School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China;Chongqing Key Laboratory of Photoelectronic Information Sensing and Transmitting Technology, Chongqing 400065, P. R. China
Abstract:Aiming at the problem of low accuracy in detecting arbitrary shape text in deep learning, a text detection method combining feature pyramid network (FPN) and kernel scale extension algorithm is proposed in this paper. The feature pyramid network can extract more robust features in the convolutional layer and generate different scale feature kernels after fusion. The kernel scale expansion algorithm gradually expands the generated minimum feature kernel into a feature map which surrounds the entire text instance. At the same time, for the text examples which are difficult to detect in natural scenes, the online hard example mining (OHEM) method was added during the training phase, and two training strategies were used to train in the form of transfer learning. Simulation results show that the algorithm model has good detection performance on different datasets.
Keywords:deep learning  text detection  feature pyramid  kernel expansion
本文献已被 万方数据 等数据库收录!
点击此处可从《重庆邮电大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《重庆邮电大学学报(自然科学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号