Optimizing a Parallel Video Encoder with Message Passing and a Shared Memory Architecture |
| |
Authors: | Junli Gu 谷俊丽 Yihe Sun 孙义和 |
| |
Institution: | Institute of Microelectronics, Tsinghua University, Beijing 100084, China |
| |
Abstract: | Implementing video applications on emerging multi-core processors is a promising technique for personal, real-time multi-media applications. However, when porting the legacy parallel video encoders developed for clusters to shared-memory multi-cores, the existing parallel algorithms result in workload imbalances on different cores and communication inefficiencies. This paper describes a strip-wise parallel scheme to balance workloads and a hybrid communication mechanism to reduce communication overhead. The implementation of the H.264 parallel encoder on an eight CPU Intel Xeon system achieves 5x to 6x speed-up over a single thread encoder and achieves a 29% performance improvement over the commonly used master-slave schemes on clusters. The paper also gives further analysis on scalability, parallel efficiency, workload balance, and communication overhead as the number of cores varies. |
| |
Keywords: | parallel video encoder speed improvement message passing shared memory |
本文献已被 CNKI ScienceDirect 等数据库收录! |
|