English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 78818/78818 (100%)
造訪人次 : 34653045      線上人數 : 740
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋


    請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/82290


    題名: 基於深度學習之360度視訊分析與處理;Deep Learning Based 360-Degree Vi Deo Processing and Analysis
    作者: 唐之瑋
    貢獻者: 國立中央大學通訊工程學系
    關鍵詞: 360度視訊;深度學習;CMP投影影像;視訊編碼之位元率控制;視覺追蹤;;360 degree video;deep learning;cubemap projection;rate control of video coding;visual tracking.
    日期: 2020-01-13
    上傳時間: 2020-01-13 14:37:15 (UTC+8)
    出版者: 科技部
    摘要: 近年虛擬實境(virtual reality, VR)興起,360度視訊(360 degree video)提供使用者身臨其境的體驗,而由於360度視訊須由球體域投影至二維平面(sphere-to-plane projection),以致不同區域有不均勻的幾何形變(geometric deformation),因此,幾乎現有基於傳統2D影像而設計的視訊分析與處理方案,於此類投影影像皆無法達到最佳效能。雖然現今360度視訊壓縮標準相關研究顯示,cubemap (CMP) 投影影像的編碼rate-distortion (R-D)效能優於equirectangular projection (ERP)影像,但因球體域上均勻取樣的各點,非均勻地投影至CMP的各面(face)的不同區域,其相關編碼方案仍需改進,其中,目前幾乎無任何已發表文獻探討編碼CMP投影影像的位元率控制(rate control)。另一方面,視覺追蹤為許多電腦視覺應用必要的前端處理,但現有的視覺追蹤方案,卻僅極少數基於360度視訊之CMP投影影像而設計。由於近年電腦視覺領域研究顯示深度學習(deep learning)可有效提升其準確率,因此,本一年期計畫之研究目標,為對360度視訊的CMP投影影像處理與分析的兩個主題提出設計: (1)基於深度學習的視訊編碼(video coding)之位元率控制(rate control)。(2) 基於深度學習的視覺追蹤(visual tracking)。本計畫研究成果預期將有效提升編碼CMP投影影像的位元率控制之準確率,與CMP投影影像的視覺追蹤準確率,進而提升360度視訊相關應用之品質。 ;With the rise of virtual reality, 360 degree videos provide users immersive experiences. However, sphere-to-plane projections of 360 degree videos lead to non-uniform geometric deformations in various regions of 2-D projected images. Accordingly, 2-D projections of 360 degree videos degrade performance of existing video analysis and processing schemes that designed for conventional 2-D images. Although the state-of-the-art in 360-degree video coding indicates that the R-D performance of the cubemap (CMP) projection outperforms that of the equirectangular projection (ERP), the uniform samples in the sphere domain are still projected non-uniformly onto different regions of each face of the CMP projection. Thus, the existing CMP projection based encoder can be further improved while there is few rate control scheme designed for CMP encoding. On the other hand, visual tracking is essential to many applications of computer vision. However, there are few trackers designed for the CMP projection based 360 degree videos. Since research results indicate that deep learning can significantly improve performance of applications of computer vision, this project focuses on two topics of the CMP projection of 360 degree video processing and analysis: (1) Deep learning based rate control of video coding. (2) Deep learning based visual tracking. For the CMP projection of 360 degree videos, this project expects to increase accuracy of rate control of video coding and accuracy of visual tracking. Accordingly, quality of the 360 degree video related applications can be significantly improved.
    關聯: 財團法人國家實驗研究院科技政策研究與資訊中心
    顯示於類別:[通訊工程學系] 研究計畫

    文件中的檔案:

    檔案 描述 大小格式瀏覽次數
    index.html0KbHTML315檢視/開啟


    在NCUIR中所有的資料項目都受到原著作權保護.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明