English  |  正體中文  |  简体中文  |  Items with full text/Total items : 83776/83776 (100%)
Visitors : 61878898      Online Users : 950
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version


    Please use this identifier to cite or link to this item: https://ir.lib.ncu.edu.tw/handle/987654321/98604


    Title: 基於SRAM之Systolic計算式記憶體架構與設計自動化;Systolic SRAM-based Computing-In-Memory Architecture and Design Automation
    Authors: 陳妤蓉;Chen, Yu-Jung
    Contributors: 電機工程學系
    Keywords: 計算式記憶體;資料密集型運算;數位;深度神經網路;管線化;計算單元;convolution;Digital;network;systolic;automation;Computing Unit
    Date: 2025-08-27
    Issue Date: 2025-10-17 12:59:29 (UTC+8)
    Publisher: 國立中央大學
    Abstract: 計算式記憶體(Computing-in-Memory, CIM)被視為解決資料密集型運算的有效方案。本論文提出一種基於SRAM的數位CIM(DCIM)systolic架構,用於深度神經網路(DNN)運算。該架構由具多個處理單元(PU)的管線化計算單元(CU)組成。我們開發了一套自動化設計框架,能在延遲、頻率與頻寬等限制下,探索systolic陣列的深度與寬度分割,以最小化晶片面積。核心概念是透過對部分和(PSUM)加法樹進行分割以縮短關鍵路徑,從而提升時脈頻率並減少所需PU數量。雖然更深的分割會增加PU面積,但框架能找到最佳化的折衷點。與TSMC基準設計相比,本架構在AlexNet與VGG16運算下可同時降低約15%的功耗與面積,且不增加延遲,展現出相較於傳統架構更高的能效與面積效率。;Computing-in-memory (CIM) has emerged as a promising solution for data-intensive workloads. This thesis proposes a systolic SRAM-based digital CIM (DCIM) architecture for deep neural networks (DNNs), consisting of pipelined computing units (CUs) with multiple processing units (PUs). We develop an automated design framework that optimizes depth and width partitioning of the systolic array to minimize silicon area under constraints of latency, frequency, and bandwidth. The key idea is to partition the partial-sum adder tree to shorten critical paths, enabling higher clock rates and reducing the required PU count. While deeper partitioning increases PU area, our framework identifies the optimal trade-off for each workload. Compared with a TSMC baseline design, the proposed architecture achieves ~15% reductions in both power and area without latency loss on AlexNet and VGG16, demonstrating improved efficiency over conventional approaches.
    Appears in Collections:[Graduate Institute of Electrical Engineering] Electronic Thesis & Dissertation

    Files in This Item:

    File Description SizeFormat
    index.html0KbHTML74View/Open


    All items in NCUIR are protected by copyright, with all rights reserved.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明