基于递归长短期记忆网络和镜头序列注意网络的视频摘要生成 Research on Video Summarization Generation based on ReLSTM and Shot-Sequence-Attention期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于递归长短期记忆网络和镜头序列注意网络的视频摘要生成

引用本文：	张晨,王圣焘,武光利.基于递归长短期记忆网络和镜头序列注意网络的视频摘要生成[J].科学技术与工程,2023,23(18):7852-7860.

作者姓名：	张晨王圣焘武光利

作者单位：	甘肃政法大学外国语学院;甘肃政法大学网络空间安全学院

基金项目：	甘肃省自然科学基金(No.21JR7RA570)；甘肃政法大学重大科研创新项目(No.GZF2020XZDA03, 2017XQNLW12)；2022年甘肃省高等学校青年博士基金项目(No. 2022QB-123)；甘肃省高等学校创新基金项目(No.2022A-097)；甘肃省科技计划项目(No.20CX9JA130)；兰州市人才创新创业项目(No.2020-RC-27)

摘要：	为解决基于长短期记忆网络LSTM的视频摘要生成方法当输入序列过长时LSTM网络中的记忆单元不能集中在长时间序列的跨度上。通过深度学习的方法研究了一种基于递归长短期记忆网络(ReLSTM)和序列注意(SSA)的视频摘要生成模型用以提高深度学习网络学习时序特征的能力。该模型使用ReLSTM网络提取时间特征。同时，利用SSA动态调整每个视频序列输入到ReLSTM网络中的特征权重。结果表明：在数据集TVSum上F1-score平均提高2.5%，最高提高0.2%。在数据集SumMe上F1-score平均提高7.8%，最高提高3.4%。可见该方法能有效地学习镜头之间的时序特征。
关键词：	视频摘要 ReLSTM 镜头序列注意力特征融合
收稿时间：	2022/10/17 0:00:00
修稿时间：	2023/4/19 0:00:00
Research on Video Summarization Generation based on ReLSTM and Shot-Sequence-Attention

Zhang Chen,Wang Shengtao,Wu Guangli.Research on Video Summarization Generation based on ReLSTM and Shot-Sequence-Attention[J].Science Technology and Engineering,2023,23(18):7852-7860.

Authors:	Zhang Chen Wang Shengtao Wu Guangli

Institution:	school of foreign studies of gansu university of political science and law;School of Cyber Security

Abstract:	In the current LSTM-based video summary generation methods, memory units in the LSTM network cannot focus on the span of long time series when the input sequence is too long. A video summary generation model based on Recurrent Long short-term memory network (ReLSTM) and sequential attention (SSA) is proposed to improve the ability of deep learning networks to learn temporal features. This model uses the ReLSTM network to extract temporal features. Meanwhile, SSA is used to dynamically adjust the feature weights of each video sequence input into the ReLSTM network. The experimental results show that F1-score is increased by 2.5% on average and 0.2% at the highest on the dataset TVSum. On the dataset SumMe, F1-score was improved by 7.8% on average and 3.4% at the highest. Experimental results show that this method can effectively learn the temporal features between shots.

Keywords:	Video summarization recursive-LSTM shot-sequence-attention feature fusion

	点击此处可从《科学技术与工程》浏览原始摘要信息
	点击此处可从《科学技术与工程》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏