Abstract (English) |
Reverberation is the result of sound reflecting off surrounding walls. It has minimal impact on individuals with normal hearing, but it strongly affects people with severe hearing impairment who wear cochlear implants: in environments with noise and reverberation, it becomes difficult for them to follow conversations. Traditional dereverberation methods require prior knowledge of the reverberant signal's characteristics, which is difficult to obtain in real life. Neural network algorithms can overcome this limitation, leveraging their powerful modeling capabilities to suppress reverberation without any such prior knowledge.
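As a concrete illustration of the degradations described above, a reverberant signal can be modeled as the convolution of clean speech with a room impulse response (RIR), with additive noise mixed in at a target SNR. The sketch below uses a synthetic placeholder signal and a toy RIR; in practice, real recordings and measured or simulated RIRs would be used.

```python
import numpy as np

rng = np.random.default_rng(0)

# Placeholder for one second of "clean speech" at 16 kHz
# (a real experiment would load an actual recording).
fs = 16000
clean = rng.standard_normal(fs)

# Toy room impulse response: a direct path followed by
# exponentially decaying late reflections.
rir = np.zeros(int(0.3 * fs))
rir[0] = 1.0
decay = np.exp(-np.arange(1, len(rir)) / (0.05 * fs))
rir[1:] = 0.2 * rng.standard_normal(len(rir) - 1) * decay

# Reverberant speech = clean speech convolved with the RIR.
reverberant = np.convolve(clean, rir)[: len(clean)]

# Add background noise scaled to a chosen SNR (here 5 dB)
# to obtain noisy-reverberant speech.
snr_db = 5.0
noise = rng.standard_normal(len(reverberant))
noise *= np.sqrt(np.mean(reverberant**2) / (10 ** (snr_db / 10) * np.mean(noise**2)))
noisy_reverberant = reverberant + noise
```

The dereverberation models discussed in this thesis learn to map such degraded signals back toward the clean speech.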
This thesis evaluates several neural network models, including the Hierarchical Extreme Learning Machine (HELM), Deep Denoising Autoencoder (DDAE), Integrated Deep and Ensemble Learning Algorithm (IDEA), Late Suppression Long Short-Term Memory (LS-LSTM), and Late Suppression U-Net (LS-Unet), on both normal speech and cochlear implant speech in environments with reverberation, noise, and their combination. The TMHINT dataset is used for the experiments, and the objective evaluations use the Short-Time Objective Intelligibility (STOI) metric for normal speech and the Normalized Covariance Metric (NCM) for cochlear implant speech. The results on normal speech show that LS-Unet achieves the largest improvement in Chinese speech intelligibility in both noisy and reverberant environments, while HELM requires the least training time while still enhancing intelligibility in the reverberation experiment. The reverberation experiment also uses the TIMIT dataset, which is widely used in speech research; there, IDEA yields the best improvement in English intelligibility.
In the experiments combining noise and reverberation, training the models on noisy-reverberant speech does not improve intelligibility when testing on speech containing both noise and reverberation. However, the LS-Unet(Noise) model, trained on noisy speech only, proves effective at enhancing the intelligibility of speech degraded by both noise and reverberation. In the cochlear implant experiments under noisy and reverberant conditions, LS-Unet(Noise) combined with either the traditional ACE coding strategy or the deep-learning-based ElectrodeNet-CS coding strategy improves intelligibility, with LS-Unet(Noise) plus ACE achieving the higher scores.
Finally, a subjective evaluation was conducted with seven normal-hearing participants, who were asked to assess LS-Unet(Noise) combined with the ACE and ElectrodeNet-CS strategies for cochlear implants under different noise and reverberation conditions. With 5 dB noise and reverberation, both strategies achieved an average sentence recognition rate above 80%, reaching up to 90% under low reverberation. With 0 dB noise and reverberation, both strategies exceeded 60%, and under low reverberation the recognition rate approached 80%.
Furthermore, the Mann-Whitney U test was used to examine the difference between the LS-Unet(Noise)+ACE and LS-Unet(Noise)+ElectrodeNet-CS strategies; the results indicated no significant difference between the two methods.
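The significance test above can be sketched with `scipy.stats.mannwhitneyu`. The recognition scores below are hypothetical placeholders, not the thesis data; the point is the shape of the comparison between two small independent samples.

```python
import numpy as np
from scipy.stats import mannwhitneyu

# Hypothetical per-participant sentence recognition rates (%)
# for the two coding strategies (seven listeners each).
ace_scores = np.array([82, 85, 90, 78, 88, 91, 80], dtype=float)
cs_scores = np.array([80, 84, 89, 79, 86, 90, 81], dtype=float)

# Two-sided Mann-Whitney U test: do the two score
# distributions differ?
stat, p_value = mannwhitneyu(ace_scores, cs_scores, alternative="two-sided")

# With alpha = 0.05, p >= 0.05 means no significant difference.
significant = p_value < 0.05
```

The Mann-Whitney U test is a natural choice here because it is non-parametric and makes no normality assumption, which matters with only seven listeners per condition.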
References |
Braun, S., Schwartz, B., Gannot, S., and Habets, E. A. P. (2016). "Late reverberation PSD estimation for single-channel dereverberation using relative convolutive transfer functions," in Proc. Int. Workshop Acoust. Signal Enhancement, pp. 1–5.
Fu, Q.-J., Hsu, C.-J., and Horng, M.-J. (2004). "Effects of Speech Processing Strategy on Chinese Tone Recognition by Nucleus-24 Cochlear Implant Users," Ear & Hearing 25, pp. 501-508.
Goldsworthy, R. L., and Greenberg, J. E. (2004). "Analysis of speech-based speech transmission index methods with implications for nonlinear operations," J. Acoust. Soc. Am., vol. 116, no. 6, pp. 3679–3689.
Habets, E. A. P., Gannot, S., and Cohen, I. (2009). "Late reverberant spectral variance estimation based on a statistical model," IEEE Signal Process. Lett., vol. 16, no. 9, pp. 770–774.
Hussain, T., Siniscalchi, S. M., Wang, H.-L. S., Tsao, Y., Salerno, V. M., and Liao, W.-H. (2020). "Ensemble Hierarchical Extreme Learning Machine for Speech Dereverberation," IEEE Transactions on Cognitive and Developmental Systems, pp. 744–758.
Lee, W.-J., Wang, S.-S., Chen, F., Lu, X., Chien, S.-Y., and Tsao, Y. (2018). "Speech Dereverberation Based on Integrated Deep and Ensemble Learning Algorithm," IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 5454–5458.
Leon, D., and Tobar, F. A. (2021). "Late reverberation suppression using u-nets," arXiv preprint arXiv:2110.02144.
Li, J., Deng, L., Gong, Y., and Haeb-Umbach, R. (2014). "An overview of noise robust automatic speech recognition," IEEE/ACM Trans. Audio, Speech, Language Process., vol. 22, no. 4, pp. 745–777.
Long, J., Shelhamer, E., and Darrell, T. (2015). "Fully convolutional networks for semantic segmentation," in Proc. IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440.
Nachar, N. (2008). "The Mann-Whitney U: A Test for Assessing Whether Two Independent Samples Come from the Same Distribution," Tutorials in Quantitative Methods for Psychology, vol. 4, pp. 13–20.
Nakatani, T., Yoshioka, T., Kinoshita, K., Miyoshi, M., and Juang, B.-H. (2010). "Speech dereverberation based on variance-normalized delayed linear prediction," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1717–1731.
Nisa, H. K. (2021). Speech dereverberation based on HELM framework for cochlear implant coding strategy. Master's Thesis, Institute of Electrical Engineering, National Central University.
Nogueira, W., Büchner, A., Lenarz, T., and Edler, B. (2005). "A Psychoacoustic “NofM”-Type Speech Coding Strategy for Cochlear Implants," EURASIP Journal on Applied Signal Processing 18, pp. 3044-3059.
Stickney, G. S., Assmann, P. F., Chang, J., and Zeng, F.-G. (2007). "Effects of cochlear implant processing and fundamental frequency on the intelligibility of competing sentences," J. Acoust. Soc. Am. 122, pp. 1069-1078.
Taal, C. H., Hendriks, R. C., Heusdens, R., and Jensen, J. (2011). "An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech," IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, no. 7, pp. 2125–2136.
Turner, C. W., Gantz, B. J., Vidal, C., Behrens, A., and Henry, B. A. (2004). "Speech recognition in noise for cochlear implant listeners: Benefits of residual acoustic hearing," J. Acoust. Soc. Am. 115, pp. 1729–1735.
Zhao, X., Wang, Y., and Wang, D. (2014). "Robust speaker identification in noisy and reverberant conditions," IEEE/ACM Trans. Audio, Speech, Language Process., vol. 22, no. 4, pp. 836–845.
Zhao, Y., Wang, D., Xu, B., and Zhang, T. (2018). "Late reverberation suppression using recurrent neural networks with long short-term memory," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5434–5438.
黃國原. (2009). "Effects of channel number, stimulation rate, and binaural hearing of simulated cochlear implants on Mandarin speech recognition in noisy environments." Master's Thesis, Institute of Electrical Engineering, National Central University.
黃銘緯. (2005). "Mandarin speech recognition testing in noise in Taiwan." Graduate Institute of Speech and Hearing Science, National Taipei College of Nursing.
林金賢. (2021). "A study of deep learning for speech reverberation suppression." Master's Thesis, Institute of Electrical Engineering, National Central University.
健康醫療網. (2022). Retrieved June 1, 2023, from https://www.healthnews.com.tw/article/55477
HackMD. (2019). Retrieved June 1, 2023, from https://hackmd.io/@allen108108/rkn-oVGA4
程式人生. (2019). Retrieved June 10, 2023, from https://www.796t.com/content/1546695390.html
IT邦幫忙. (2019). Retrieved June 12, 2023, from https://ithelp.ithome.com.tw/articles/10223055 |