首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一种基于连通域的蒙古文文档图像版面分析方法
引用本文:魏宏喜,高光来.一种基于连通域的蒙古文文档图像版面分析方法[J].内蒙古大学学报(自然科学版),2007,38(5):586-590.
作者姓名:魏宏喜  高光来
作者单位:内蒙古大学计算机学院,呼和浩特,010021
摘    要:版面分析是一个将文本页面图像分割成不同区域,并标定区域类型(如文字、图片、表格等)的过程,与字符识别具有同等重要的地位.提出了一种基于连通域的蒙古文版面分析方法,它提取文档图像中所有连通域,根据连通域的大小进行聚类,从而可以得到文字连通域和非文字连通域,达到分割版面的目的.实验证明,该算法能够对蒙古文书籍版面进行准确的分析.

关 键 词:蒙古文文档图像  版面分析  自底向上法  自顶向下法  连通域  连通域  蒙古文  文档图像  版面分析  方法  Connected  Components  Based  Images  Document  Layout  Analysis  书籍版面  算法  验证  非文字  聚类  大小  提取  地位  同等重要  字符识别
文章编号:1000-1638(2007)05-0586-05
修稿时间:2006-05-15

A Method of Layout Analysis for Mongolian Document Images Based on Connected Components
WEI Hong-xi,GAO Guang-lai.A Method of Layout Analysis for Mongolian Document Images Based on Connected Components[J].Acta Scientiarum Naturalium Universitatis Neimongol,2007,38(5):586-590.
Authors:WEI Hong-xi  GAO Guang-lai
Institution:College of Computer Science, Inner Mongolia University ,Hohhot 010021 ,China
Abstract:Layout analysis is a process that a document image is segmented into different areas and the areas should be classified.It is as important as the character recognition.A new layout analysis method for the Mongolian document images was proposed based on the connected components analysis.All the connected components of a document image are searched by the pixel labeling.Then,they are clustered by their size.Thereby,many connected components of character and non-character can be achieved separately.Experiment shows that the method is suitable for the layout of Mongolian books
Keywords:Mongolian document image  layout analysis  bottomup approach  top-down approach  connected component
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号