首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于词序列拼积木模型的图像句子标注研究
引用本文:张红斌,殷依,姬东鸿,任亚峰.基于词序列拼积木模型的图像句子标注研究[J].北京理工大学学报,2017,37(11):1144-1149.
作者姓名:张红斌  殷依  姬东鸿  任亚峰
作者单位:华东交通大学软件学院,江西,南昌330013;武汉大学计算机学院,湖北,武汉430072
基金项目:国家自然科学基金资助项目(61373108,61173062);教育部人文社科基金资助项目(16YJAZH029,17YJAZH117);江西省科技厅科技攻关项目(20142BBG70011);江西省社科规划基金资助项目(16TQ02);江西省高校人文社科基金资助项目(XW1502、TQ1503);江西省教育厅科技项目(GJJ160497,GJJ160509,GJJ160531);江西省研究生创新基金(YC2016-S262)
摘    要:用句子标注图像,建立图像与文本间的跨媒体关联,以提升信息检索准确率,改善用户检索交互体验.利用KDES模型抽取图像特征,在多核学习模型中融合出MK-KDES特征,准确刻画图像视觉特性;设计自然语言生成模型:词序列拼积木,评估单词与图像内容的相关性,优选单词,并根据单词间的语义相关性与句法模式约束,将单词组合成N元词序列;把N元词序列输入模板生成句子.结果表明:MK-KDES-1特征聚焦于图像的纹理及形状视觉特性,它是改善句子BLEU-1评分的关键;而单词间的语义相关性与句法模式约束是提升句子BLEU-2评分的重要前提. 

关 键 词:自然语言生成  词序列拼积木WSBB  图像句子标注  N元词序列  语义相关性  句法模式约束
收稿时间:2016/7/25 0:00:00

Image Annotation by Sentences Based on Word Sequence Blocks Building Model
ZHANG Hong-bin,YIN Yi,JI Dong-hong and REN Ya-feng.Image Annotation by Sentences Based on Word Sequence Blocks Building Model[J].Journal of Beijing Institute of Technology(Natural Science Edition),2017,37(11):1144-1149.
Authors:ZHANG Hong-bin  YIN Yi  JI Dong-hong and REN Ya-feng
Institution:1. Software School, East China Jiaotong University, Nanchang, Jiangxi 330013, China;2. Computer School, Wuhan University, Wuhan, Hubei 430072, China
Abstract:Based on image annotation by sentences, the cross-media correlations between the images and the texts were constructed to improve the information retrieval accuracy and users'' retrieval experiences ultimately. The KDES model was applied to extract image features effectively and the MK-KDES features were obtained in turn by fusing the extracted features in the multiple kernel learning model to interpret the key visual characteristics of the images. A new natural language generation model named word sequence blocks building (WSBB) was designed to evaluate the semantic correlations between the words and the images. And several key words were summarized for generating sentences. According to the semantic correlations and syntactic mode constraints between words, many N gram word sequences were made up of those summarized words to better interpret the images. Finally, sentences were generated by inputting the N gram word sequences into the templates. Experimental results show that, the MK-KDES-1 feature can focus on describing the key texture and shape characteristics of the images, which helps to improve the BLEU-1 scores. Moreover, semantic correlation between the words is an important premise of improving the BLEU-2 scores as well as the syntactic mode constraints.
Keywords:natural language generation  word sequence blocks building(WSBB)  image annotation by sentences  N gram word sequence  semantic correlation  syntactic mode constraints
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《北京理工大学学报》浏览原始摘要信息
点击此处可从《北京理工大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号