Journal of Guangdong University of Technology ›› 2024, Vol. 41 ›› Issue (03): 131-140.doi: 10.12052/gdutxb.230067

• Information and Communication Technology • Previous Articles    

A Fast Combination Algorithm for HEVC Intra-Prediction Based on Neural Network

Fan Jun-yu1, Song Li-feng1,2   

  1. 1. School of Information Engineering, Guangdong University of Technology, Guangzhou 510006, China;
    2. Huizhou Guangdong University of Technology IoT Cooperative Innovation Institute Co., Ltd., Huizhou 516025, China
  • Received:2023-05-11 Online:2024-05-25 Published:2024-06-14

Abstract: To improve the real-time performance of High Efficiency Video Coding (HEVC) intra-frame encoding, a method, which utilizes a lightweight convolutional network with even-length and step-size convolutional kernels and a self-attention mechanism, is proposed to predict the intra-frame partitioning structure of Coding Tree Units(CTU) , thereby reducing the encoding time required for the encoder to perform quadtree recursive traversal partitioning on CTUs. In the original encoding strategy, Rough Mode Decision accelerates the process by estimating the rate-distortion loss value in Rate Distortion Optimization based on the Sum of Absolute Transformed Difference (SATD) -based loss value, but it still consumes a certain amount of encoding time. A proposed method reduces the number of patterns calculated in the Rough Mode Decision process through a sampling search approach, reducing the number of patterns from 35 to 18, and decreasing the time required to estimate the loss value during the Rough Mode Decision process. The more favorable multiple candidate intra-frame modes obtained from the Rough Mode Decision process are used for Rate Distortion Optimization. In order to reduce the number of candidate modes that need to be calculated in Rate Distortion Optimization, an early stopping decision is implemented by filtering out some less likely candidate modes based on the differences in the estimated loss values of the intra-frame prediction angle modes in the candidate mode list, thus reducing the number of candidate modes that need to be evaluated in Rate Distortion Optimization and consequently decreasing the computation time of the Rate Distortion Optimization process. The proposed algorithm achieves an average encoding time reduction of 78.15% on the test sequences, with a BD-PSNR of -0.168dB and a BD-RATE of 3.49%.

Key words: video coding, neural network, intra-frame prediction, fast algorithm

CLC Number: 

  • TN919.81
[1] LENG J, SUN L, IKENAGA T, et al. Content based hierarchical fast coding unit decision algorithm for HEVC[C]//2011 International Conference on Multimedia and Signal Processing. Guilin: IEEE, 2011: 56-59.
[2] 唐燕, 王晓东, 章联军. 一种HEVC的CU分割模式快速算法[J]. 无线通信技术, 2020, 29(3): 12-15.
TANG Y, WANG X D, ZHANG L J. A fast CU partition mode algorithm for HEVC [J]. Wireless Communication Technology, 2020, 29(3): 12-15.
[3] ZHANG Y, KWONG S, JIANG G, et al. Statistical early termination model for fast mode decision and reference frame selection in multiview video coding [J]. IEEE Transactions on Broadcasting, 2012, 58(1): 10-23.
[4] 何书前, 余绪杭, 邓正杰. 高效的H. 265/HEVC快速帧内编码方法[J]. 计算机工程与设计, 2022, 43(9): 2601-2608.
HE S Q, YU X H, DENG Z J. Efficient H. 265/HEVC fast intra frame coding method [J]. Computer Engineering and Design, 2022, 43(9): 2601-2608.
[5] 郭磊, 王晓东, 徐博文, 等. 基于HEVC的帧内预测模式决策和编码单元划分快速算法[J]. 计算机应用, 2018, 38(4): 1157-1163.
GUO L, WANG X D, XU B W, et al. Fast intra mode prediction decision and coding unit partition algorithm based on high efficiency video coding [J]. Journal of Computer Applications, 2018, 38(4): 1157-1163.
[6] SHEN L, ZHANG Z, LIU Z. Effective CU size decision for HEVC intracoding [J]. IEEE Transactions on Image Processing, 2014, 23(10): 4232-4241.
[7] NISHIKORI T, NAKAMURA T, YOSHITOME T, et al. A fast CU decision using image variance in HEVC intra coding[C]//2013 IEEE Symposium on Industrial Electronics & Applications. Kuching: IEEE, 2013: 52-56.
[8] 汤进, 彭勇. 基于时空相关与纹理特性的HEVC编码单元快速划分算法[J]. 计算机与数字工程, 2019, 47(007): 1753-1756.
TANG J, PENG Y. Fast coding unit partition algorithm for HEVC based on temporal-spatial correlation and texture property [J]. Computer & Digital Engineering, 2019, 47(007): 1753-1756.
[9] 伍冠健, 宋立锋. HEVC快速帧内模式和深度决策算法[J]. 广东工业大学学报, 2015, 32(4): 132-137.
WU G J, SONG L F. Fast intra mode and depth decision algorithm for HEVC [J]. Journal of Guangdong University of Technology, 2015, 32(4): 132-137.
[10] JAMALI M, COULOMBE S, CARON F. Fast HEVC intra mode decision based on edge detection and SATD costs classification[C]//2015 Data Compression Conference. Snowbird: IEEE, 2015: 43-52.
[11] MIN B, CHEUNG R. A fast CU size decision algorithm for the HEVC intra encoder [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2014, 25(5): 892-896.
[12] 石敏, 席诗华, 易清明. 基于预测单元尺寸的高效视频编码帧内预测模式快速选择的改进算法[J]. 激光与光电子学进展, 2019, 56(20): 226-234.
SHI M, XI S H, YI Q M. Improved algorithm for intraframe prediction mode fast selecting in high-efficiency video coding based on size of prediction units [J]. Laser & Optoelectronics Progress, 2019, 56(20): 226-234.
[13] CORREA G, ASSUNCAO P A, AGOSTINI L V, et al. Fast HEVC encoding decisions using data mining [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2014, 25(4): 660-673.
[14] HU Q, SHI Z, ZHANG X, et al. Fast HEVC intra mode decision based on logistic regression classification[C]//2016 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB) . Nara: IEEE, 2016: 1-4.
[15] NAIR P S, RAO K R, NAIR M S. A machine learning approach for fast mode decision in HEVC intra prediction based on statistical features [J]. Journal of Intelligent and Fuzzy Systems, 2019, 36(3): 2095-2106.
[16] LIU D, LIU X, LI Y. Fast CU size decisions for HEVC intra frame coding based on support vector machines[C]//2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress (DASC/PiCom/DataCom/CyberSciTech) . Auckland: IEEE, 2016: 594-597.
[17] LIU X, LI Y, LIU D, et al. An adaptive CU size decision algorithm for HEVC intra prediction based on complexity classification using machine learning [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2017, 29(1): 144-155.
[18] RYU S, KANG J. Machine learning-based fast angular prediction mode decision technique in video coding [J]. IEEE Transactions on Image Processing, 2018, 27(11): 5525-5538.
[19] 周帅燃, 杨静. 低复杂度HEVC帧内编码快速划分算法[J]. 小型微型计算机系统, 2021, 42(7): 1475-1478.
ZHOU S R, YANG J. Fast partition algorithm of low complexity HEVC intra coding [J]. Journal of Chinese Computer Systems, 2021, 42(7): 1475-1478.
[20] 易清明, 林成思, 石敏. 利用深度学习的HEVC帧内编码单元快速划分算法[J]. 小型微型计算机系统, 2021, 42(2): 368-373.
YI Q M, LIN C S, SHI M. Fast HEVC coding units partitioning algorithm based on deep learning [J]. Journal of Chinese Computer Systems, 2021, 42(2): 368-373.
[21] LIU Z, YU X, GAO Y, et al. CU partition mode decision for HEVC hardwired intra encoder using convolution neural network [J]. IEEE Transactions on Image Processing, 2016, 25(11): 5088-5103.
[22] LAUDE T, OSTERMANN J. Deep learning-based intra prediction mode decision for HEVC[C]//2016 Picture Coding Symposium (PCS) . Nuremberg: IEEE, 2016: 1-5.
[23] CHEN Z, SHI J, LI W. Learned fast HEVC intra coding[J]. 2020 IEEE Transactions on Image Processing, 2020, 29: 5431-5446.
[24] XU M, LI T, WANG Z, et al. Reducing complexity of HEVC: a deep learning approach [J]. IEEE Transactions on Image Processing, 2018, 27(10): 5044-5059.
[25] 贾克斌, 崔腾鹤, 刘鹏宇, 等. 基于深层特征学习的高效率视频编码中帧内快速预测算法[J]. 电子与信息学报, 2021, 43(7): 2023-2031.
JIA K B, CUI T H, LIU P Y, et al. Fast prediction algorithm in high efficiency video coding intra-mode based on deep feature learning [J]. Journal of Electronics & Information Technology, 2021, 43(7): 2023-2031.
[1] Yin Dan-li, Ling Jie. Android Malware Application Detection Method Based on Heterogeneous Information Network [J]. Journal of Guangdong University of Technology, 2024, 41(02): 56-64.
[2] Chen Rui, Cai Nian, Luo Zhi-hao, Liu Xuan, Li Jian. Individual Survival Analysis of Breast Cancer Based on Multi-task Recurrent Neural Network Banded Regression Model [J]. Journal of Guangdong University of Technology, 2024, 41(01): 34-40.
[3] Huang Xiao-yong, Li Wei-tong. Fall Detection Algorithm Based on TSSI and STB-CNN [J]. Journal of Guangdong University of Technology, 2023, 40(04): 53-59.
[4] Xie Guo-bo, Lin Li, Lin Zhi-yi, He Di-xuan, Wen Gang. An Insulator Burst Defect Detection Method Based on YOLOv4-MP [J]. Journal of Guangdong University of Technology, 2023, 40(02): 15-21.
[5] Zhang Rui, Lyu Jun. Single-channel Speech Separation Based on Separated SI-SNR Regression Estimation and Adaptive Frequency Modulation Network [J]. Journal of Guangdong University of Technology, 2023, 40(02): 45-54.
[6] Qiu Jun-hao, Cheng Zhi-jian, Lin Guo-huai, Ren Hong-ru, Lu Ren-quan. Prescribed Performance Control for a Class of Nonlinear Pure-feedback Systems with Actuator Faults [J]. Journal of Guangdong University of Technology, 2023, 40(02): 55-63.
[7] Chen Jing-yu, Lyu Yi. Frost Detection Method of Cold Chain Refrigerating Machine Based on Spiking Neural Network [J]. Journal of Guangdong University of Technology, 2023, 40(01): 29-38.
[8] Ye Wen-quan, Li Si, Ling Jie. Sparse-view SPECT Image Reconstruction Based on Multilevel-residual U-Net [J]. Journal of Guangdong University of Technology, 2023, 40(01): 61-67.
[9] Peng Mei-chun, Yang Chen, Li Jun-ping, Ye Wei-bin, Huang Wen-wei. A Research on Vehicle Carbon Emission Calculating Method Based on BP Neural Network [J]. Journal of Guangdong University of Technology, 2023, 40(01): 107-112.
[10] Liu Hong-wei, Lin Wei-zhen, Wen Zhan-ming, Chen Yan-jun, Yi Min-qi. A MABM-based Model for Identifying Consumers' Sentiment Polarity―Taking Movie Reviews as an Example [J]. Journal of Guangdong University of Technology, 2022, 39(06): 1-9.
[11] Zhang Yun, Wang Xiao-dong. A Review and Thinking of Deep Learning with a Restricted Number of Samples [J]. Journal of Guangdong University of Technology, 2022, 39(05): 1-8.
[12] Peng Ji-guang, Xiao Han-zhen. Tracking and Obstacle Avoidance of Multi-mobile Robots Under Model Predictive Control [J]. Journal of Guangdong University of Technology, 2022, 39(05): 93-101.
[13] Li Yao-dong, Ren Zhi-gang, Wu Zong-ze. Deep Neural Network Based Predictive Control for Injection Molding Process [J]. Journal of Guangdong University of Technology, 2022, 39(05): 120-126,136.
[14] Zeng Jiang-yi, Li Zhi-sheng, Ou Yao-chun, Jin Yu-kai. PM2.5 Concentration Improving Prediction Modeling of Seasonal Index [J]. Journal of Guangdong University of Technology, 2022, 39(03): 89-94.
[15] Gary Yen, Li Bo, Xie Sheng-li. An Evolutionary Optimization of LSTM for Model Recovery of Geophysical Fluid Dynamics [J]. Journal of Guangdong University of Technology, 2021, 38(06): 1-8.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!