首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Multiple feature fusion in convolutional neural networks for action recognition
Authors:Hongyang Li  Jun Chen  Ruimin Hu
Institution:1.National Engineering Research Center for Multimedia Software,Wuhan University,Hubei,China;2.State Key Laboratory of Software Engineering,Wuhan University,Hubei,China
Abstract:Action recognition is important for understanding the human behaviors in the video, and the video representation is the basis for action recognition. This paper provides a new video representation based on convolution neural networks (CNN). For capturing human motion information in one CNN, we take both the optical flow maps and gray images as input, and combine multiple convolutional features by max pooling across frames. In another CNN, we input single color frame to capture context information. Finally, we take the top full connected layer vectors as video representation and train the classifiers by linear support vector machine. The experimental results show that the representation which integrates the optical flow maps and gray images obtains more discriminative properties than those which depend on only one element. On the most challenging data sets HMDB51 and UCF101, this video representation obtains competitive performance.
Keywords:
本文献已被 CNKI SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号