中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/80996
English  |  正體中文  |  简体中文  |  全文笔数/总笔数 : 83776/83776 (100%)
造访人次 : 59253844      在线人数 : 1568
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜寻范围 查询小技巧:
  • 您可在西文检索词汇前后加上"双引号",以获取较精准的检索结果
  • 若欲以作者姓名搜寻,建议至进阶搜寻限定作者字段,可获得较完整数据
  • 进阶搜寻


    jsp.display-item.identifier=請使用永久網址來引用或連結此文件: https://ir.lib.ncu.edu.tw/handle/987654321/80996


    题名: 基於雙流卷積神經網路的三百六十度視訊等距長方投影之行人追蹤;Pedestrian Tracking Based on Two-flow Convolutional Neural Network for Equirectangular Projection of 360-degree Videos
    作者: 黃郁婷;Huang, Yu-Ting
    贡献者: 通訊工程學系
    关键词: 行人追蹤;三百六十度視訊;等距長方圖投影;雙流卷積網路;損失函數;pedestrian tracking;360-degree videos;equirectangular projection (ERP);two-flow convolutional neural network;loss function
    日期: 2019-07-30
    上传时间: 2019-09-03 15:24:46 (UTC+8)
    出版者: 國立中央大學
    摘要: 對等距長方圖投影(equirectangular mapping projection, ERP)進行的行人追蹤時,因 ERP各區域不同程度的幾何失真,使多數現有追蹤器準確率降低。另外,360度視訊的高畫面率與高空間解析度導致高計算複雜度。因此,本論文提出採用雙流卷積神經網路 (two-flow convolutional neural network)為追蹤架構,且因不須於線上再訓練與更新神經網路參數,而可以高速對360度視訊進行追蹤,目前畫面的搜索視窗及目標模版之輸入,以卷積神經網路(convolutional neural network, CNN)各擷取階層式特徵,使卷積特徵兼具空間及多層特徵資訊。因應目標物於ERP影像不同區域的不均勻幾何失真,網路預測的邊界框(bounding, box)與目標模版的相似度為目標模板更新之標準。其中,相似度計算僅採用目標模版的強健特徵,以提升相似度量測的可靠性。此外,訓練採用的損失函數(loss function) 將依據預測座標狀態而採用L1與GIoU (generalized intersection over union, GIoU),透過採用GIoU loss降低神經網路對目標物大小之敏感度。實驗結果顯示本論文提出之方案,在目標有小幅度的縮放時,有著比SiamFC追蹤器更好的追蹤效果。;Non-uniform geometric distortions of the equirectangular projection (ERP) of 360-degree videos decreases tracking accuracy of most existing trackers. In addition, the high frame rate and spatial resolution of 360-degree videos cause high computational complexity. Hence, this thesis proposes a two-flow convolutional neural network that measures similarity of two inputs for pedestrian tracking on 360-degree videos. High-speed tracking is achieved since on-line re-training and update of the neural network model is not applied. Both the hierarchically spatial and convolutional features are extracted from the search window of the current frame and the target template to improve tracking accuracy. The tracker will update the target template by the similarity between the bounding box of the network prediction and the target template. In addi-tion, to improve the reliability of the similar measurement, the similarity calculation only uses the robust features of the target template. At the training stage, the loss function considers either the L1 loss or the generalized intersection over union (GIoU) according to the predicted location of the bounding box of the target. Experimental results show that the proposed scheme has a better tracking effect than the SiamFC tracker when the target has a small zoom.
    显示于类别:[通訊工程研究所] 博碩士論文

    文件中的档案:

    档案 描述 大小格式浏览次数
    index.html0KbHTML303检视/开启


    在NCUIR中所有的数据项都受到原著作权保护.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明