首页 | 本学科首页   官方微博 | 高级检索  
     检索      

复杂中文报纸的版面分析、理解和重构
引用本文:陈明,丁晓青,梁健.复杂中文报纸的版面分析、理解和重构[J].清华大学学报(自然科学版),2001,41(1):29-32.
作者姓名:陈明  丁晓青  梁健
作者单位:清华大学,电子工程系,
基金项目:国家“八六三”高技术项目!(86 3-30 6 -0 3-0 5 -6 ),国家自然科学基金资助项目!(6 96 82 0 0 3)
摘    要:在将纸张介质的文档自动转换成电子文档格式的过程中 ,版面的分析、理解和重构是十分关键的问题。针对复杂中文报纸版面 ,提出了一个基于最近邻连接强度和行列可信度的自底向上的版面分析算法和一个基于规则的块生长的版面理解算法 ,并讨论版面重构的相关问题和实现。综合这些算法并结合汉字识别核心 ,实现了一个完整的自动电子出版物制作系统。实验和实际运行的系统证明了算法的有效性和系统的实用性

关 键 词:版面分析  版面理解  版面重构
文章编号:1000-0054(2001)01-0029-04
修稿时间:1999年12月7日

Analysis, understanding and representation of Chinese newspapers with complex layout
CHEN Ming,Ding Xiaoqing,LIANG Jian.Analysis, understanding and representation of Chinese newspapers with complex layout[J].Journal of Tsinghua University(Science and Technology),2001,41(1):29-32.
Authors:CHEN Ming  Ding Xiaoqing  LIANG Jian
Abstract:Layout Analysis, understanding and representation are important problems when transforming paper documents to electronic versions. A bottom up algorithm of layout analysis based on nearest neighbor connect strength and line confidence is proposed for Chinese newspapers with complex layouts. We also propose a rule based grow the algorithm for layout understanding. The implementation of layout representation is also discussed. These algorithms with a Chinese character recognition engine were used to finish a complete system to automatically do electronic publishing. The algorithms were proven be efficient and practical by experiment results and a practical operating system.
Keywords:layout  analysis  layout understanding  layout representation
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号