中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/96272
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 81570/81570 (100%)
Visitors : 47093917      Online Users : 444
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version


    Please use this identifier to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/96272


    Title: 漸進式特徵融合模型應用於 VVC 畫面間編碼之快速演算法;Fast QTMT Partition Algorithm for VVC Inter Prediction with Hierarchical Feature Fusion Model
    Authors: 謝尚融;JUNG, HSIEH SHANG
    Contributors: 通訊工程學系
    Keywords: 多功能影像編碼;支持向量機;卷積神經網路;隨機森林分類器;畫面間編碼預測;深度學習;Versatile Video Coding;Support Vector Machine;Convolutional Neural Network;Random Forest Classifier;Inter-frame Prediction;Deep Learning
    Date: 2024-12-12
    Issue Date: 2025-04-09 17:31:36 (UTC+8)
    Publisher: 國立中央大學
    Abstract: 現今科技日新月異,科技與人們的生活息息相關,但隨著數位內容需求的增加,尤其是在高解析度(如4K、8K甚至更高)的視頻領域,人們對於影像品質和視覺體驗的期望也越來越高,因此H.266/VVC採用許多先進的技術來解決人類的需求,隨之而來的影響是編碼計算複雜度的提升。
    本論文運用機器學習跟深度學習的方式,將其應用在H.266/VVC畫面間預測,首先運用機器學習中的支持向量機(SVM)將CU做第一次劃分,接著運用卷積神經網路(CNN)將CU做第二次劃分,針對H.266/VVC新增的多類型樹劃分模式,採用隨機森林分類器(RFC)將CU劃分的更精細,最後再結合CU-PU Decision進行第三次的劃分,透過漸進式的劃分使得編碼過程,不需再計算冗長的碼率失真代價函數。第四章則探討將隨機森林分類器,銜接在不同位置上,所造成的影像品質與時間節省的差異。最終實驗結果與H.266/VVC相比,平均整體 BDBR 為2.50百分比,而編解碼時間可以達到64.67百分比的節省。;In today’s rapidly evolving technological era, people′s lives are closely connected to technology. With the increasing demand for digital content, especially in high-resolution video(such as 4K, 8K,and even higher),expectations for image quality and visual experience are rising. Therefore, H.266/VVC employs many effective techniques to address these needs.
    However,this comes with the consequence of increased encoding complexity.This paper applies machine learning and deep learning techniques to inter-frame prediction in H.266/VVC. First, the Support Vector Machine (SVM) from machine learning is used for the initial division of Coding Units (CUs). Next, a Convolutional Neural Network (CNN) is
    applied for a second division of the CUs. For the new multi-type tree partitioning mode introduced by VVC, a Random Forest Classifier (RFC) is used to further refine the CU division.
    Finally, the CU-PU Decision is combined for a third division. Through this progressive division, the encoding process avoids the need for calculating the lengthy rate-distortion cost
    function. Chapter four explores the impact on image quality and time savings when connecting the Random Forest Classifier at different stages. The final experimental results show that,
    compared to H.266/VVC, the average Bjøntegaard Delta Bit Rate (BDBR) is 2.50%, and encoding/decoding time is 64.67%.
    Appears in Collections:[Graduate Institute of Communication Engineering] Electronic Thesis & Dissertation

    Files in This Item:

    File Description SizeFormat
    index.html0KbHTML25View/Open


    All items in NCUIR are protected by copyright, with all rights reserved.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明