
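Text Detection in Natural Scene Based on Visual Attention Model and Multi-scale MSER
Cite this article: WANG Daqian, CUI Rongyi, JIN Jingxuan. Text Detection in Natural Scene Based on Visual Attention Model and Multi-scale MSER[J]. Journal of Applied Sciences, 2020, 38(3): 496-506.
Authors: WANG Daqian  CUI Rongyi  JIN Jingxuan
Affiliation: College of Engineering, Yanbian University, Yanji 133002, Jilin, China
Funding: Supported by the Scientific Research Planning Project of the State Language Commission for the 12th Five-Year Plan (No. YB125-178) and the Jilin Province Higher Education Research Project (No. JGJX2019D20)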
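Abstract: Text detection in natural scenes is easily affected by illumination, complex backgrounds, multilingual text, and variations in font and size. This paper proposes a natural scene text detection algorithm that combines the Itti visual attention model with multi-scale maximally stable extremal regions (MSER). First, an improved Itti visual attention model is used to extract text feature maps, and different combination strategies are applied to obtain a text saliency map at each scale. The saliency maps are then combined with multi-scale MSER regions to obtain three kinds of text candidate regions. The candidate regions are merged into text lines according to geometric rules relating characters and the generated text boxes. Finally, a random forest classifier removes non-text regions to yield the final text regions. Experimental results show that the method achieves relatively high precision and a degree of robustness for text detection in natural scene images.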

Keywords: natural scene  Itti visual attention model  maximally stable extremal region  text region detection
Received: 2018-11-14

Text Detection in Natural Scene Based on Visual Attention Model and Multi-scale MSER
WANG Daqian,CUI Rongyi,JIN Jingxuan.Text Detection in Natural Scene Based on Visual Attention Model and Multi-scale MSER[J].Journal of Applied Sciences,2020,38(3):496-506.
Authors:WANG Daqian  CUI Rongyi  JIN Jingxuan
Institution:College of Engineering, Yanbian University, Yanji 133002, Jilin province, China
Abstract: Text detection in natural scene images is affected by illumination, complex backgrounds, multiple languages, and variations in font and size, which lowers the accuracy of current detection algorithms. To address this, a natural scene text detection algorithm is proposed that combines the Itti visual attention model with multi-scale maximally stable extremal regions (MSER). First, text feature maps are extracted with an improved Itti visual attention model, and text saliency maps at different scales are obtained using different combination strategies. The saliency maps are then combined with multi-scale MSER regions to produce three kinds of text candidate regions, which are merged into text lines according to geometric rules relating characters and the generated text boxes. Finally, a random forest classifier removes non-text regions to yield the final text regions. Experimental results show that the proposed algorithm achieves high detection accuracy and is robust to multi-language text, text distortion, and variation in size.
Keywords:natural scene  Itti visual attention model  maximally stable extremal region (MSER)  text area detection  
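
The abstract states that candidate regions are merged into text lines by geometric rules but does not list those rules. The following is a minimal sketch of one plausible rule set, not the authors' method: two boxes are joined when their heights are similar, they overlap vertically, and the horizontal gap between them is small. The names `Box`, `same_line`, and `merge_into_lines`, and all threshold values, are assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned candidate region (hypothetical type): (x0, y0) top-left, (x1, y1) bottom-right."""
    x0: int
    y0: int
    x1: int
    y1: int

    @property
    def height(self) -> int:
        return self.y1 - self.y0


def same_line(a: Box, b: Box, min_overlap: float = 0.5,
              max_height_ratio: float = 2.0, max_gap_factor: float = 1.0) -> bool:
    """Assumed geometric rules; the thresholds are illustrative, not taken from the paper."""
    tall, short = max(a.height, b.height), min(a.height, b.height)
    if short <= 0 or tall / short > max_height_ratio:
        return False  # degenerate box, or heights too different
    overlap = min(a.y1, b.y1) - max(a.y0, b.y0)
    if max(0, overlap) / short < min_overlap:
        return False  # boxes sit on different rows
    gap = max(a.x0, b.x0) - min(a.x1, b.x1)  # negative when the boxes overlap horizontally
    return gap <= max_gap_factor * tall


def merge_into_lines(boxes: list[Box], **rules: float) -> list[Box]:
    """Greedy left-to-right pass: grow the first compatible line, else start a new one."""
    lines: list[Box] = []
    for box in sorted(boxes, key=lambda b: b.x0):
        for i, line in enumerate(lines):
            if same_line(line, box, **rules):
                lines[i] = Box(min(line.x0, box.x0), min(line.y0, box.y0),
                               max(line.x1, box.x1), max(line.y1, box.y1))
                break
        else:
            lines.append(box)
    return lines


if __name__ == "__main__":
    candidates = [Box(10, 10, 30, 40), Box(35, 12, 55, 41), Box(60, 9, 80, 39),
                  Box(15, 100, 35, 130), Box(300, 10, 320, 40)]
    for line in merge_into_lines(candidates):
        print(line)
```

In this sketch the three adjacent boxes on the top row collapse into one line box, while the box on the lower row and the distant box on the right stay separate. A single greedy pass is order-dependent, so a fuller implementation might repeat the merge until no boxes change.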
This article is indexed in CNKI and other databases.