视频信号中实时字幕信息的提取方法 Real-time text information extraction from videos期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

视频信号中实时字幕信息的提取方法

引用本文：	欧国斌,张利,谢攀. 视频信号中实时字幕信息的提取方法[J]. 清华大学学报(自然科学版), 2002, 42(7): 869-872

作者姓名：	欧国斌张利谢攀

作者单位：	清华大学,电子工程系,北京,100084

基金项目：	国家自然科学基金资助项目 (60 172 0 2 7)

摘要：	为了在视频图像中进行字幕信息的实时提取 ,提出了一套简捷而有效的方法。利用视频图像中文本的频率特性与空间连续性 ,采用改进的投影阈值分割方法对视频中的文本进行实时分割。针对视频字幕在时间上的冗余特性 ,提出了一个基于有限状态机的动态缓冲的模型 ,在提高分割的正确率的同时减小了识别运算量。在识别部分 ,采用了一个 3层前向神经网络进行实时的识别。该算法已经成功地应用于卡拉 OK MTV歌词字幕信息同步提取系统中。
关键词：	分割识别视频动态缓冲分裂合并有限状态机
文章编号：	1000-0054(2002)07-0869-04
修稿时间：	2001-11-12
Real-time text information extraction from videos

OU Guobin,ZHANG Li,XIE Pan. Real-time text information extraction from videos[J]. Journal of Tsinghua University(Science and Technology), 2002, 42(7): 869-872

Authors:	OU Guobin ZHANG Li XIE Pan

Abstract:	A simple and effective method is presented for real time text segmentation and recognition in videos. The frequeny and spatial characteristics of the text are analyzed by a fast segmentation algorithm developed from the conventional threshold method. A dynamic buffering algorithm based on the Finite State Machine is used to eliminate the text's temporal redundancy and at the same time to correct segmentation errors. The recognition algorithm employs a 3 layer BP NN for real time recognition. The algorithms have been successfully applied to a system which automatically extracts lyrics from MTV Karaoke videos.

Keywords:	segmentation recognition video dynamic buffering splitting merging finite state machine
本文献已被 CNKI 万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏