Multiple feature fusion in convolutional neural networks for action recognition

Authors:	Hongyang Li, Jun Chen, Ruimin Hu

Institution:	1. National Engineering Research Center for Multimedia Software, Wuhan University, Hubei, China; 2. State Key Laboratory of Software Engineering, Wuhan University, Hubei, China

Abstract:	Action recognition is important for understanding human behavior in video, and the video representation is the basis for action recognition. This paper proposes a new video representation based on convolutional neural networks (CNNs). To capture human motion information, one CNN takes both optical flow maps and gray images as input and combines multiple convolutional features by max pooling across frames. A second CNN takes a single color frame as input to capture context information. Finally, the vectors from the top fully connected layers are used as the video representation, and classifiers are trained with a linear support vector machine. Experimental results show that the representation integrating optical flow maps and gray images is more discriminative than representations that depend on only one of these inputs. On the challenging HMDB51 and UCF101 data sets, this video representation achieves competitive performance.
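The fusion step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation. It assumes that per-frame motion features have already been extracted as fixed-length vectors, that pooling is an element-wise maximum over the frame axis, and that the motion and context vectors are joined by concatenation (the abstract does not say how the two are combined). The function names `max_pool_across_frames` and `video_representation` and the toy values are hypothetical.

```python
import numpy as np


def max_pool_across_frames(frame_features: np.ndarray) -> np.ndarray:
    """Element-wise maximum over the frame axis.

    frame_features has shape (num_frames, feature_dim); the result has
    shape (feature_dim,), so videos of any length map to a fixed size.
    """
    if frame_features.ndim != 2 or frame_features.shape[0] == 0:
        raise ValueError("expected a non-empty (num_frames, feature_dim) array")
    return frame_features.max(axis=0)


def video_representation(motion_features: np.ndarray,
                         context_features: np.ndarray) -> np.ndarray:
    """Pool motion features over frames, then append the context vector.

    Concatenation is an assumption made for illustration only.
    """
    pooled_motion = max_pool_across_frames(motion_features)
    return np.concatenate([pooled_motion, context_features])


# Toy input: 2 frames with 3-dimensional motion features (made-up values),
# plus a 2-dimensional context vector from a single color frame.
motion = np.array([[0.1, 0.9, 0.0],
                   [0.4, 0.2, 0.3]])
context = np.array([0.5, 0.7])

rep = video_representation(motion, context)
print(rep.shape)
```

A vector like `rep` would then be passed to a linear support vector machine for classification. Max pooling keeps the strongest response per feature dimension regardless of which frame produced it, which is why the output size does not depend on the number of frames.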

Keywords:
This article is indexed in CNKI, SpringerLink, and other databases.