NCU Institutional Repository (theses and dissertations, past exam papers, journal articles, and research projects available for download): Item 987654321/92825


    Please use this permanent URL to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/92825


    Title: Distributed Video Coding On Versatile Video Coding (分散式編碼用於VVC/H.266)
    Author: Chung, Cheng-Hsueh (鍾承學)
    Contributor: Department of Communication Engineering
    Keywords: Versatile Video Coding; support vector machines; convolutional neural networks; coding units; distributed video coding; intra prediction
    Date: 2023-01-16
    Upload time: 2024-09-19 16:21:05 (UTC+8)
    Publisher: National Central University
    Abstract: As networks and technology advance, demand for higher-quality, higher-resolution video keeps growing. To compress these large volumes of video data more efficiently, Versatile Video Coding (VVC) adopts newer techniques such as rectangular coding tree units and rate-distortion optimization, but these also raise encoding complexity. This thesis applies deep learning and machine learning, namely convolutional neural networks and random forest classifiers, to VVC coding unit partitioning decisions. Instead of recursively evaluating the rate-distortion cost of each coding unit as the original VVC does, the proposed method uses a support vector machine and a convolutional neural network at the start of encoding to partition the square coding unit blocks, and then uses a random forest classifier to further subdivide them into rectangular coding unit blocks. Each classified block is encoded only once, which greatly reduces encoding time. Random forest decisions are then used to help the original VVC screen prediction modes, reducing the overall computation to less than 20%. At the decoder, a three-channel residual neural network architecture uses different information to compensate for the distortion introduced at the encoder. In this way the concept of distributed video coding is realized by combining fast prediction mode decisions with decoder-side post-processing that restores image quality. Experimental results show that, compared with VVC, the method saves about 51.48% of the overall encoding and decoding time, with the overall average BDBR reduced by 1.63%.
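The two-stage partitioning described in the abstract can be pictured with a minimal Python sketch. This is only an illustration under stated assumptions, not the thesis's implementation: `Block`, `partition`, `split_square`, and `split_rect` are hypothetical names, the classifiers are stand-in callables (in place of the SVM/CNN and random forest models), and the sketch allows only quad splits plus one binary split per leaf.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Block:
    """A rectangular region of a frame (hypothetical helper type)."""
    x: int
    y: int
    w: int
    h: int


def partition(
    block: Block,
    split_square: Callable[[Block], bool],  # stand-in for the SVM/CNN stage
    split_rect: Callable[[Block], str],     # stand-in for the random forest stage
    min_size: int = 8,                      # assumed smallest square size to split
) -> List[Block]:
    """Return the leaf blocks chosen by the two classifier stages.

    Stage 1 quad-splits square blocks while split_square says so.
    Stage 2 applies at most one binary split ("hor" or "ver") to each leaf.
    No rate-distortion search is performed; each leaf would be encoded once.
    """
    if block.w > min_size and split_square(block):
        half = block.w // 2
        leaves: List[Block] = []
        for dy in (0, half):
            for dx in (0, half):
                child = Block(block.x + dx, block.y + dy, half, half)
                leaves += partition(child, split_square, split_rect, min_size)
        return leaves

    choice = split_rect(block)
    if choice == "hor":  # top and bottom halves
        h2 = block.h // 2
        return [Block(block.x, block.y, block.w, h2),
                Block(block.x, block.y + h2, block.w, h2)]
    if choice == "ver":  # left and right halves
        w2 = block.w // 2
        return [Block(block.x, block.y, w2, block.h),
                Block(block.x + w2, block.y, w2, block.h)]
    return [block]


if __name__ == "__main__":
    # Toy decision rules standing in for trained models.
    leaves = partition(
        Block(0, 0, 32, 32),
        split_square=lambda b: b.w > 16,
        split_rect=lambda b: "hor" if (b.x, b.y) == (0, 0) else "none",
    )
    for leaf in leaves:
        print(leaf)
```

The point of the sketch is the control flow: the classifiers decide the partition up front, so the encoder visits each leaf once instead of comparing rate-distortion costs across candidate splits.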
    Appears in Collections: [Graduate Institute of Communication Engineering] Theses & Dissertations

    Files in this item:

    File          Description    Size    Format    Views
    index.html                   0Kb     HTML      8


    All items in NCUIR are protected by original copyright.
