NCU Institutional Repository (中大機構典藏): theses and dissertations, past exam papers, journal articles, and research project reports: Item 987654321/81097


    Please use this permanent URL to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/81097


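    Title: RetinaNet應用於人臉偵測; Face Detection with RetinaNet
    Author: 柯金泉; Tuyen, Kha Kim
    Contributor: In-service Program, Department of Computer Science and Information Engineering (資訊工程學系在職專班)
    Keywords: RetinaNet應用於人臉偵測; Face Detection with RetinaNet
    Date: 2019-05-31
    Upload time: 2019-09-03 15:34:24 (UTC+8)
    Publisher: National Central University (國立中央大學)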
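    Abstract: Chinese abstract (English translation)

    Face detection is a key step in many face-related applications, such as face alignment, face verification, face recognition, and crowd behavior analysis. However, small face size, occlusion, lighting, pose variation, facial expression, and other adverse factors are common in real-world images and make face detection very challenging. Computational cost is a further obstacle to face detection in real-time applications.

    Traditional methods locate faces by applying hand-designed operations over sliding windows, which requires more computation and hurts accuracy, especially for small faces. Recently, general object detection methods based on deep convolutional neural networks (CNNs) have achieved great success. Modern object detectors include one-stage methods (e.g., YOLO, SSD) and two-stage methods (e.g., Faster RCNN, RFCN). One-stage methods broadly use a single feed-forward fully convolutional network to directly predict the class and corresponding bounding box of each proposal, unlike two-stage methods, which run a separate classification and box-refinement step for every proposal. As a result, one-stage methods have lower computational cost, while two-stage methods usually achieve higher accuracy.

    In this research, I released RetinaNet for face detection, addressing both the small-face problem and the computational cost; in particular, it improves on both one-stage and two-stage methods.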
    Abstract

    Face detection is a critical step for many face-related applications, such as face alignment, face verification, face identification, and crowd behavior analysis. However, small face size, occlusion, illumination, pose deformation, expression, and other adverse factors often appear in real-world images and make face detection very challenging. In addition, computational cost is a major challenge for face detection in real-time applications.

    Traditional approaches scan the image with sliding windows and hand-designed operations to locate faces, which costs considerable computation and hurts accuracy, especially for small faces. Recently, generic object detection based on deep convolutional neural networks (CNNs) has achieved great success. Modern object detectors include one-stage methods (e.g., YOLO, SSD) and two-stage methods (e.g., Faster RCNN, RFCN). One-stage methods refer broadly to architectures that use a single feed-forward fully convolutional neural network to directly predict each proposal's class and corresponding bounding box, without requiring a second-stage per-proposal classification operation and box refinement. Therefore, one-stage methods have the advantage in computational cost, whereas two-stage methods have the advantage in accuracy.

    In this research, I deployed RetinaNet for face detection. It addresses the small-face problem as well as the computational cost and, in particular, combines the benefits of both one-stage and two-stage methods.
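
The one-stage idea described in the abstract (class and box predicted together for every candidate location in a single pass) can be illustrated with a minimal sketch. This is not the thesis implementation: the network's forward pass is replaced by hand-written numbers, and the function names, the single square anchor per cell, the offset parameterization, and the 0.5 threshold are all assumptions chosen for illustration. Post-processing such as non-maximum suppression is omitted.

```python
import math
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) in pixels


def make_anchors(grid_w: int, grid_h: int, stride: float, size: float) -> List[Box]:
    """Place one square anchor (hypothetical layout) at the center of each feature-map cell."""
    half = size / 2.0
    anchors = []
    for gy in range(grid_h):
        for gx in range(grid_w):
            cx = (gx + 0.5) * stride
            cy = (gy + 0.5) * stride
            anchors.append((cx - half, cy - half, cx + half, cy + half))
    return anchors


def decode(anchor: Box, delta: Sequence[float]) -> Box:
    """Apply (dx, dy, dw, dh) offsets to an anchor; a commonly used parameterization."""
    ax1, ay1, ax2, ay2 = anchor
    aw, ah = ax2 - ax1, ay2 - ay1
    acx, acy = ax1 + aw / 2.0, ay1 + ah / 2.0
    dx, dy, dw, dh = delta
    cx, cy = acx + dx * aw, acy + dy * ah
    w, h = aw * math.exp(dw), ah * math.exp(dh)
    return (cx - w / 2.0, cy - h / 2.0, cx + w / 2.0, cy + h / 2.0)


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def detect_faces(anchors, logits, deltas, threshold: float = 0.5):
    """Score and box for every anchor come from the same pass; no per-proposal second stage."""
    detections = []
    for anchor, logit, delta in zip(anchors, logits, deltas):
        score = sigmoid(logit)
        if score >= threshold:
            detections.append((score, decode(anchor, delta)))
    return sorted(detections, key=lambda d: d[0], reverse=True)


if __name__ == "__main__":
    # Stand-in values for what a network would output on a 2x2 feature map.
    anchors = make_anchors(grid_w=2, grid_h=2, stride=16, size=16)
    logits = [-3.0, 2.0, -1.0, -0.5]
    deltas = [(0.0, 0.0, 0.0, 0.0)] * 4
    for score, box in detect_faces(anchors, logits, deltas):
        print(round(score, 3), box)
```

A two-stage detector would instead take the surviving boxes as proposals and run a further classification and refinement step on each one, which is where the extra computational cost mentioned in the abstract comes from.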
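    Appears in collections: [In-service Master Program, Department of Computer Science and Information Engineering] Theses and Dissertations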

    Files in this item:

    File        Description  Size  Format  Views
    index.html               0Kb   HTML    229


    All items in NCUIR are protected by the original copyright.
