首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一种新的H.264/AVC标量量化并行VLSI结构
引用本文:彭春干,于敦山,曹喜信,盛世敏.一种新的H.264/AVC标量量化并行VLSI结构[J].北京大学学报(自然科学版),2008,44(4).
作者姓名:彭春干  于敦山  曹喜信  盛世敏
作者单位:1. 北京大学信息科学技术学院微电子学系SoC试验室,北京,100871
2. 北京大学软件微电子学院,北京,102600
摘    要:针对H.264视频编码标准关键技术52级标量量化的VLSI实现过程中,传统结构的速度和面积不能有效满足H.264在高速高并行编码应用中的实时要求,通过采用部分CSD码无符号压缩移位加法树、参考电平连线、对量化系数和步长重新进行分组分段编码等方法,有效替代了H.264标量量化过程中出现的矩阵乘法、查表、除法等不利于硬件加速的算法,提出了一种非常适合流水加速的基于4×4块并行的VLSI结构,通过控制级联加法器级数就可以有效调节其速度性能,当级数为2时,其块处理速率可以达到121.6 MHz, 能够满足4096×2304@120 Hz视频的实时处理要求.该结构在面积和功耗方面较传统结构也有较大的改进,采用SMIC 0.13 μm工艺单元库,综合时钟频率设为100 MHz时,等效门和功耗分别节省了38%和30%.

关 键 词:VLSI结构  视频编码

A Novel Parallel VLSI Architecture for H.264/AVC Scalar Quantization
PENG Chungan,YU Dunshan,CAO Xixin,SHENG Shimin.A Novel Parallel VLSI Architecture for H.264/AVC Scalar Quantization[J].Acta Scientiarum Naturalium Universitatis Pekinensis,2008,44(4).
Authors:PENG Chungan  YU Dunshan  CAO Xixin  SHENG Shimin
Abstract:52-level scalar quantization technology plays an important role in H.264/AVC. A novel parallel VLSI architecture is proposed for its hardware implementation, in which the 4×4 matrix multiplications is replaced by 16 unsigned compressed shift-adder-trees using partial CSD code scheme, switching reference wirings substitutes for look-up operation, and division is also avoided effectively, and no ROM or RAM is adopted in the overall quantizer. It can fulfill all the quantization calculations for all H.264 hybrid transform in 4×4 block parallelism. Its block throughput can reach 121.6 MHz, which can meet the real-time requirement for 4096×2304@120 Hz (119.43936 M/s) video compression. Compared with the conventional architecture, 38% cost and 30% power are saved. Considering speed and cost optimization, this architecture is very suitable for pipeline acceleration, and it is a useful IP for high resolution H.264 encoder VLSI realization.
Keywords:H  264  H  264  VLSI  video coding
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号