HEVC畫面內/間加速預測編碼之相互影響探討

以作者查詢圖書館館藏

、以作者查詢臺灣博碩士

、以作者查詢全國書目

、勘誤回報

、線上人數：71

、訪客IP：18.217.2.54

姓名

齊繁鴻(Chi-Fan Hung) 查詢紙本館藏

畢業系所

通訊工程學系在職專班

論文名稱

HEVC畫面內/間加速預測編碼之相互影響探討
(Discussion on the mutual influence of HEVC intra/inter accelerated prediction coding)

相關論文

★ 10Gb/s MM XFP光收發模組設計與實現	★ 資訊產品自動化測試之研究
★ 高電流密度鰭式氮化鎵高電子遷移率電晶體研究	★ 電子郵件及壓縮檔案解碼之研究
★ 渦輪碼在光學記錄系統上之應用	★ 離散餘弦轉換硬體架構之研究
★ 動態影像之錯誤隱藏研究	★ 即時性無失真壓縮編碼之研究
★ 類神經網路在手寫數字辨識之研究	★ 事後機率演算法則在資料儲存系統之研究
★ 紅外線傳輸協定及通道之研究	★ 低密度同位元檢查碼在數位資料儲存系統之研究
★ 一種新型的JPEG2000竄改偵測與還原技術	★ 即時性無失真壓縮之研究
★ 混合快速模式決策演算法之研究	★ 光學記錄MEPR2通道系統之時序恢復探討與研究

檔案

[Endnote RIS 格式]

[Bibtex 格式]

[相關文章]

[文章引用]

[完整記錄]

[館藏目錄]

至系統瀏覽論文 (2027-1-22以後開放)

摘要(中)

隨著科技進步影像的品質、解析度也隨之增加，資料量也越來越大，HEVC(High Efficiency Video Coding)又稱為H.265，採用更新的技術來降低位元率，連帶的也提升編碼計算複雜度。本論文採用深度學習與機器學習中的卷積神經網路CNN ( Convolutional Neural Network ) 和支持向量機SVM ( Support Vector Machine )，應用於HEVC編碼單元決策。將一個CTU分類成深度0、深度0~1、深度0~2、深度0~3四種類別，再利用卷積神經網路分層向下細分，藉由HEVC遞迴運算處理編碼單元的方式，在特定深度提前終止後續編碼計算。另外在畫面內使用支持向量機的決策閥值，透過特定條件減少進入卷積神經網路的次數以利於節省編碼時間。整體平均BDBR上升至4.72%，編碼時間平均可節省75.22%，並探討在Random Access和Low Delay架構下性能比較，以及不同GOP大小產生的影響。

摘要(英)

With the advancement of technology, the quality and resolution of images have also increased, and the amount of data becomes larger and larger. HEVC (High Efficiency Video Coding), also known as H.265, uses newer technology to reduce the bit rate, and the associated Improve coding computational complexity. This paper uses the convolutional neural network (CNN) and the support vector machine (SVM) in deep learning and machine learning that have flourished in recent years to apply it to HEVC coding unit decision-making. At the beginning of encoding, SVM is first used to classify the coding unit depth and prediction unit mode, and a CTU is classified into four categories: depth 0, depth 0~1, depth 0~2, and depth 0~3, and then the convolutional neural network is used. The network layer is subdivided downward. By using HEVC′s recursive operation to process coding units, subsequent coding calculations are terminated early at a specific depth. In addition, the decision threshold of the support vector machine is used to reduce the number of entries into the convolutional neural network through specific conditions to save coding time. The overall average BDBR increases to 4.72%, and the encoding time can be saved by 75.22% on average. Discuss the performance comparison between Random Access and Low Delay architectures, and the impact of different GOP size.

關鍵字(中)

★ 編碼單元
★ 快速深度決策
★ 畫面間預測
★ 改善編碼性能
★ 深度學習
★ 移動向量
★ 深度決策

關鍵字(英)

★ HEVC
★ SVM
★ CNN
★ Random Access
★ Low Delay

論文目次

章節目錄
論文摘要 I
Abstract II
圖目錄 VIII
表目錄 XI
第一章、緒論 1
1.1研究動機與目的 1
1.2論文架構 2
第二章、H.265/HEVC介紹 3
2.1 視訊編碼H.265 / HEVC介紹 3
2.2 編碼架構H.265 / HEVC介紹 4
2.2.1 編碼單元(Coding Unit, CU) 5
2.2.2 預測單元(Prediction Unit, PU) 7
2.2.3 轉換單元(Transform Unit, TU) 8
2.2.4 畫面間預測(Inter Prediction) 9
第三章、支持向量機及深度學習介紹 18
3.1支持向量機(Support Vector Machine, SVM) 18
3.2深度學習介紹 20
3.2.1 類神經網路(Artificial Neural Network, ANN) 21
3.2.2 深度神經網路(Deep Neural Networks, DNN) 21
3.2.3 卷積神經網路(Convolutional Neural Networks, CNN) 22
第四章、相關文獻回顧 26
4.1 利用支持向量機減少編碼單元複雜度相關文獻回顧 26
4.1.1 Reduction of Computational Complexity HEVC Inter Prediction With Support Vector Machine 26
4.2利用CNN減少CU編碼複雜度相關文獻回顧 32
4.2.1 Fast CU Depth Decision for HEVC Using Neural Networks 33
4.2.2 SVM/CNN-based CTU partition for HEVC inter prediction 35
第五章、HEVC畫面內/間加速預測編碼效應探討 38
5.1 隨機存取(Random Access)情境下之探討 38
5.1.1 編碼性能(BDBR) 、PSNR比較 38
5.1.2 編碼時間比較(time saving) 44
5.1.3 實驗結果分析 48
5.2 低延遲 (Low Delay)情境下之探討 50
5.2.1 編碼性能(BDBR) 、PSNR比較 50
5.2.2 編碼時間比較(time saving) 56
5.2.3 實驗結果分析與討論 60
5.3 隨機存取與低延遲實驗結果討論 62
5.3.1環境設置 62
5.3.2隨機存取(Random Access)與低延遲(Low Delay)結果比較 63
第六章、結論與未來展望 67
參考文獻 68

參考文獻

[1] JCT-VC, “High efficiency video coding (HEVC) test model 15(HM15) encoder description,” JCTVC-Q1002, JCT-VC Meeting, Valencia, ES, Apr. 2014.
[2] “Generic coding of moving pictures and associated audio information,” ISO/IEC 13818-2: Video (MPEG-2), May 1996.
[3] I. E. G. Richardson, H.264 and MPEG-4 Video Compression: Video Coding for Next-generation Multimedia. Aberdeen, U.K.: John Wiley & Sons, 2003.
[4] P. Helle, S. Oudin, B. Bross, D. Marpe, M. O. Bici, K. Ugur, J. Jung, G. Clare, and T. Wiegand, “Block merging for quadtree-based partitioning in HEVC,” in Proc. IEEE Transactions on circuits and systems for video technology, vol. 22, no. 12, pp. 1720-1731, Dec. 2012.
[5] L. Zhao, X. Guo, S. Lei, S. Ma and D. Zhao, “Simplified AMVP for high efficiency video coding,” in Proc. IEEE ICIP, pp. 1-4, 27-30 Nov. 2012.
[6] J. L. Lin, Y. W. Chen, Y. W. Huang, and S. M. Lei, “Motion vector coding in the HEVC standard,” in Proc. IEEE Journal of Selected Topics in Signal Processing, vol. 7, no. 6, pp. 957-968, 3 July 2013.
[7] Y. Ismail and S. El-etriby, “Fast diamond search algorithm for real time video
coding,” in Proc. IEEE ICNC, pp. 729-733, Feb. 2012.
[8] K. Alex, I. Sutskever, and G. E. Hinton, “ImageNet Classification with Deep
Convolutional Neural Networks,” in Advances in Neural Information Processing
Systems, pp.1097-1105, 2012.
[9]X. Liu, Y. Li, D. Liu, P. Wang, L. T. Yang, “An Adaptive CU Size Decision Algorithm for HEVC Intra Prediction Based on Complexity Classification Using Machine Learning”, IEEE Transactions on Circuits and Systems for Video Technology, Vol 29, pp.144-155, 27 November 2017.
[10]T. Zhang, M. T. Sun, D. Zhao, W. Gao, “Fast Intra-Mode and CU Size Decision for HEVC”, IEEE Transactions on Circuits and Systems for Video Technology, Vol 27, pp.1714-1726, 20 April 2016.
[11]S. J. Cai, Yin yi Lin, “ Reduction of Computation Complexity for HEVC Intra Prediction with Support Vector Machine”, National Central University, Master Thesis, Jun 2017.
[12] J.K. Liu, “Reduction of Computational Complexity for HEVC Inter Prediction with Support Vector Machine,” Department of Communication Engineering National Central University, Taiwan 32054, R.O.C.,Jan 2019.
[13] K. Kim and W. W. Ro, "Fast CU depth decision for HEVC using neural networks", IEEE Trans. Circuits Syst. Video Technol., vol. 29, no. 5, pp. 1462-1473, May 2019.
[14]D.H.Yang,”SVM/CNN-based CTU partition for HEVC inter prediction” Department of Communication Engineering National Central University, Taiwan 32054, R.O.C.,July 2021.
[15] Tianyi Li,Mai Xu,Xin Deng, “ A deep convolutional neural network approach for complexity reduction on intra-mode HEVC”, 2017 IEEE International Conference on Multimedia and Expo (ICME).
[16] Takafumi Katayama,Kazuki Kuroda,Wen Shi,Tian Song,Takashi Shimamoto,“Low-complexity intra coding algorithm based on convolutional neural network for HEVC”, 2018 International Conference on Information and Computer Technologies (ICICT).
[17] Kyungah Kim,Won Woo Ro,“Fast CU Depth Decision for HEVC Using Neural Networks”, IEEE Transactions on Circuits and Systems for Video Technology ( Volume: 29 , Issue: 5 , May 2019 ).
[18] Mai Xu,Tianyi Li,Zulin Wang,Xin Deng,Ren Yang,Zhenyu Guan, “Reducing Complexity of HEVC: A Deep Learning Approach”, IEEE Transactions on Image Processing ( Volume: 27 , Issue: 10 , Oct. 2018 ).
[19] Shiba Kuanar,K.R. Rao,Christopher Conly, “Fast Mode Decision In Hevc Intra Prediction, Using Region Wise CNN Feature Classification”, 2018 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).
[20] Jun Shi,Changsheng Gao,Zhibo Chen, “Asymmetric-Kernel CNN Based Fast CTU Partition for HEVC Intra Coding”, 2019 IEEE International Symposium on Circuits and Systems (ISCAS).
[21] Yongfei Zhang,Gang Wang,Rui Tian,Mai Xu,C. C. Jay Kuo, “Texture-Classification Accelerated CNN Scheme for Fast Intra CU Partition in HEVC”, 2019 Data Compression Conference (DCC).
[22] Wenpeng Ren,Jia Su,Chang Sun,Zhiping Shi, “An IBP-CNN Based Fast Block Partition For Intra Prediction”, 2019 Picture Coding Symposium (PCS).
[23] Tianyi Li,Mai Xu,Xin Deng, “ A deep convolutional neural network approach for complexity reduction on intra-mode HEVC”, 2017 IEEE International Conference on Multimedia and Expo (ICME).
[24] Jun Shi,Changsheng Gao,Zhibo Chen, “Asymmetric-Kernel CNN Based Fast CTU Partition for HEVC Intra Coding”, 2019 IEEE International Symposium on Circuits and Systems (ISCAS).
[25] A. Mercat, M. Viitanen, and J. Vanne, “UVG dataset: 50/120fps 4K sequences for video codec analysis and development,” in Proc. ACM Multimedia Syst. Conf., Istanbul, Turkey, June 2020.
[26] Gitl HEVC/H. 265 Analyzer, [online] Available: https://github.com/lheric/GitlHEVCAnalyzer.

指導教授

林銀議(Yinyi Lin)

審核日期

2024-1-22

推文