首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于拉普拉斯能量的低质量手写体文档图像二值化
引用本文:冯炎,陈汝真.基于拉普拉斯能量的低质量手写体文档图像二值化[J].科学技术与工程,2020,20(26):10835-10839.
作者姓名:冯炎  陈汝真
作者单位:西藏大学信息科学技术学院,拉萨850000
基金项目:国家自然科学基金项目(61661047)
摘    要:文档图像二值化是文档分析与识别中的一个重要环节。本文针对低质量手写体文档图像提出了一种二值化算法,算法首先对文档进行相位保持降噪并计算背景修复模板,然后用图像修复算法和形态学闭运算估计文档背景,用背景补偿算法提高文档对比度,接着用背景补偿后的文档图像构造拉普拉斯(laplacian)能量,最后采用图割算法求得最终二值化结果。实验结果表明,本文所构造拉普拉斯能量能够较准确地区分文字和背景,所提二值化算法在DIBCO2018数据集中的实验结果优于同类算法。

关 键 词:文档图像  二值化  拉普拉斯能量  图割算法
收稿时间:2019/9/18 0:00:00
修稿时间:2020/6/3 0:00:00

A Laplacian Energy for Low Quality Handwriting Document Image Binarization
FENG Yan,CHEN Ru-zhen.A Laplacian Energy for Low Quality Handwriting Document Image Binarization[J].Science Technology and Engineering,2020,20(26):10835-10839.
Authors:FENG Yan  CHEN Ru-zhen
Institution:School of Information Science and Technology,Tibet University
Abstract:Document image binarization can protect the original document and better present the contents to people. In this work a new algorithm for low quality handwriting document binarization is proposed. The algorithm firstly performs phase preserving denoising on document image and calculates inpainting mask, then estimates the background with image inpainting procedure and morphological closing operation, then improves contrast with the background compensation algorithm, and finally the Laplacian energy for the background-compensated image is constructed and the graph cut algorithm is adopted to obtain the final binarization result. The experiments on the DIBCO2018 dataset show that the Laplacian energy constructed by the proposed algorithm can distinguish the text and background more accurately and the binarization results of the proposed algorithm are better than the state-of-the-art techniques.
Keywords:document image  binarization  laplacian energy  graph cut
本文献已被 万方数据 等数据库收录!
点击此处可从《科学技术与工程》浏览原始摘要信息
点击此处可从《科学技术与工程》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号