NCU Institutional Repository (theses and dissertations, past exam papers, journal articles, and research projects available for download): Item 987654321/89852


    Please use this identifier to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/89852


    Title: Applying Feature Enhancement Strategies in MLP-Mixer Image Classifier
    Authors: Tung, Tzu-Yu (童子祐)
    Contributors: Department of Computer Science and Information Engineering
    Keywords: image feature enhancement; Histogram of Oriented Gradients; Local Binary Patterns; Single-Scale Retinex; MLP-Mixer
    Date: 2022-07-30
    Issue Date: 2022-10-04 12:02:16 (UTC+8)
    Publisher: National Central University
    Abstract: Convolutional neural network models involve a very large amount of computation, and their internal operation is regarded as a black box that resists reasonable explanation and analysis. This study therefore proposes enhancing image features and combining the enhanced images with an MLP-Mixer classifier to improve the interpretability and accuracy of the overall recognition system. The architecture is applied to fish, seed, and Central European forest species recognition datasets. The shape, texture, and color features of each image are first enhanced, and each feature-enhanced image (FEI) is then used as input to the MLP-Mixer classifier, which outputs a Top-5 result for each of the three enhancement methods. These three Top-5 results serve as the input to decision fusion, and multinomial logistic regression outputs the final decision. The proposed system achieves a recognition rate of 99% on a 40-species fish dataset, compared with 96% for the MLP-Mixer classifier without feature enhancement; 90.65% on a 560-class seed dataset, compared with 70.23% for a hybrid neural network (ResNet-50+Siamese); and 97.91% on 153 classes of the Central European forest dataset, compared with 93.4% for a single convolutional neural network architecture.
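The decision-fusion step described in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: the abstract does not say how the three Top-5 outputs are encoded, so zeroing every score outside the Top-5 and concatenating the three vectors is an assumption, as are all function names, the synthetic branch scores standing in for MLP-Mixer outputs, and the plain gradient-descent fit of the multinomial logistic regression.

```python
import math

def top_k_vector(scores, k=5):
    """Keep the k largest class scores of one branch and zero the rest."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    keep = set(order[:k])
    return [s if i in keep else 0.0 for i, s in enumerate(scores)]

def fuse(shape, texture, color, k=5):
    """Concatenate the Top-k vectors of the three branches (assumed encoding)."""
    return top_k_vector(shape, k) + top_k_vector(texture, k) + top_k_vector(color, k)

def softmax(z):
    m = max(z)  # subtract the maximum for numerical stability
    e = [math.exp(v - m) for v in z]
    total = sum(e)
    return [v / total for v in e]

def logits(W, b, x):
    return [sum(w * v for w, v in zip(row, x)) + bias for row, bias in zip(W, b)]

def train_softmax_regression(X, y, n_classes, lr=0.5, epochs=300):
    """Fit multinomial logistic regression with per-sample gradient descent."""
    W = [[0.0] * len(X[0]) for _ in range(n_classes)]
    b = [0.0] * n_classes
    for _ in range(epochs):
        for x, label in zip(X, y):
            p = softmax(logits(W, b, x))
            for c in range(n_classes):
                # Gradient of the cross-entropy loss with respect to logit c.
                grad = p[c] - (1.0 if c == label else 0.0)
                b[c] -= lr * grad
                for j, v in enumerate(x):
                    W[c][j] -= lr * grad * v
    return W, b

def predict(W, b, x):
    z = logits(W, b, x)
    return max(range(len(z)), key=z.__getitem__)

def branch_scores(peak, n_classes):
    """Hypothetical classifier output: 0.5 on one class, 0.1 on the others."""
    return [0.5 if c == peak else 0.1 for c in range(n_classes)]

def build_demo_set(n_classes=6):
    """Synthetic data: for each class, each branch in turn votes for a wrong class."""
    X, y = [], []
    for true in range(n_classes):
        wrong = (true + 1) % n_classes
        for bad_branch in range(3):
            peaks = [wrong if i == bad_branch else true for i in range(3)]
            X.append(fuse(*(branch_scores(p, n_classes) for p in peaks)))
            y.append(true)
    return X, y

X, y = build_demo_set()
W, b = train_softmax_regression(X, y, n_classes=6)
accuracy = sum(predict(W, b, x) == t for x, t in zip(X, y)) / len(y)
print(f"training accuracy of the fused decision: {accuracy:.2f}")
```

In this toy setup one branch is always wrong, so the example only shows that a learned fusion layer can outvote a single mistaken branch; it says nothing about the recognition rates reported above.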
    Appears in Collections: [Graduate Institute of Computer Science and Information Engineering] Electronic Thesis & Dissertation

    Files in This Item:

    File  Description  Size  Format
    index.html  0Kb  HTML  48  View/Open


    All items in NCUIR are protected by copyright, with all rights reserved.
