重新思考虛擬記憶體管理的方式以開放通道式固態硬碟最大限度地減少深度學習推薦系統演算法的讀寫流量

以作者查詢圖書館館藏

、以作者查詢臺灣博碩士

、以作者查詢全國書目

、勘誤回報

、線上人數：27

、訪客IP：18.119.133.228

姓名

狄尚弘(Shang-Hung Ti) 查詢紙本館藏

畢業系所

資訊工程學系

論文名稱

重新思考虛擬記憶體管理的方式以開放通道式固態硬碟最大限度地減少深度學習推薦系統演算法的讀寫流量
(Rethinking Virtual Memory Management to Minimize the I/O Traffic of Deep Learning Recommendation Algorithm to Open Channel SSD)

相關論文

★ 開啟製程相似檢查方法在組裝超級塊上以最小化額外的寫入延遲	★ LaDy: Enabling Locality-aware Deduplication Technology on Shingled Magnetic Recording Drives
★ On Minimizing Writing Overhead to Establish a Low-latency LSM-tree on Skyrmion-based Racetrack Memory	★ WABE: Rethinking B-epsilon-tree to Minimize Write-amplification on NAND Flash Memory
★ Rethinking Bϵ tree Indexing Structure over NVM with the Support of Multi-write Modes	★ Prophet’s Insight: Unleashing Deduplication System Performance in Multi-tier Storage Systems
★ Freeing the Power of High Parallelism: Accelerating the Bϵ-Tree Indexing Scheme Performance on Open-Channel SSD	★ Applying Content-Defined Chunking to OCSSD-based Deduplication Systems
★ GraLoc: Preserving Graph Locality to Minimize Read and Write Amplification on NAND Flash Memory	★ On Minimizing Writing Overhead to Establish a Low-latency LSM-tree on Skyrmion-based Racetrack Memory

檔案

[Endnote RIS 格式]

[Bibtex 格式]

[相關文章]

[文章引用]

[完整記錄]

[館藏目錄]

至系統瀏覽論文 (2024-8-10以後開放)

摘要(中)

由於計算數據的快速增長，基於DRAM的主要存儲裝置無法容納來自數據密集型應用(如機器學習算法和推薦系統)的所有待處理數據。因此，主要存儲裝置和下層存儲設備之間的數據移動導致了一個重要的性能問題。當傳統的基於NAND的固態硬碟(SSD)應用於計算機架構時，性能問題無法得到解決，因為存儲驅動器無法區分來自主機系統的數據類型。然而，一種新型的存儲介質，即開放通道固態硬盤(OCSSD)，已經被提出來，提供了一條從主機端系統優化數據在存儲空間上放置的路徑。在這項研究中，我們為一個著名的數據密集型應用(即深度學習推薦系統(DLRM))在OCSSD存儲驅動器上開發了一個新的存取數據模型。我們的解決方案被稱為OC-DLRM，通過I/O單元將經常訪問的數據放在一起，可以最大限度地減少對快閃記憶體的I/O流量。根據我們的實驗結果，與傳統的虛擬內存管理方案相比，OC-DLRM明顯減少了記憶體和存儲設備之間的I/O流量。

摘要(英)

Due to the rapid growth of computing data, DRAM-based main memory cannot accommodate all to-be-processed data from data-intensive applications (e.g., machine learning algorithms and recommendation systems). Therefore, data movement between main memory and a storage device results in a significant performance issue. When a traditional NAND-based solid-state drive (SSD) is applied to a computer architecture, the performance issue cannot be tackled because a storage drive cannot distinguish the types of data from the host system. However, a new type storage medium, namely open-channel SSD (OCSSD), has been proposed to provide a path to optimize data placement on the storage space from the host-side system. In this study, we develop a new data access model for a well-known data-intensive application (i.e., deep learning recommendation system (DLRM)) on an OCSSD storage drive. Our solution, called OC-DLRM, can minimize the I/O traffic to the flash memory storage device by considering the I/O unit of a flash memory drive to place the frequently-accessed data together. According to our experimental results, the OC-DLRM significantly decrease the amount of I/O traffic between memory and storage devices, compared with the traditional virtual memory management solution.

關鍵字(中)

★ 固態硬碟
★ 推薦系統
★ 類神經網路
★ 虛擬記憶體
★ 深度學習

關鍵字(英)

★ solid state drives
★ recommendation systems
★ neural networks
★ virtual memory
★ deep learning

論文目次

摘要 i
ABSTRACT ii
誌謝 iii
目錄 iv
表目錄 v
圖目錄 vi
Chapter 1 Introduction 1
Chapter 2 Background 3
2-1 Recommendation Systems 3
2-2 Open-Channel SSD 5
2-3 Recommendation Systems on NVM 6
2-4 Motivation 7
Chapter 3 OC-DLRM 10
3-1 Overview 10
3-2 Mapping Strategy 12
3-3 Embedding Vector Management 13
3-4 Garbage Collection 15
Chapter 4 Evaluation 17
4-1 Environment Setup 17
4.2 Experimental Result 18
Chapter 5 Conclusion 23
Reference 24

參考文獻

1. Gupta, U., et al. The architectural implications of facebook′s dnn-based personalized recommendation. in 2020 IEEE International Symposium on High Performance Computer Architecture (HPCA). 2020. IEEE.
2. Zhao, Z., et al. Recommending what video to watch next: a multitask ranking system. in Proceedings of the 13th ACM Conference on Recommender Systems. 2019.
3. Zhou, G., et al. Deep interest evolution network for click-through rate prediction. in Proceedings of the AAAI conference on artificial intelligence. 2019.
4. Lui, M., et al. Understanding capacity-driven scale-out neural recommendation inference. in 2021 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). 2021. IEEE.
5. Hui, B., et al., Personalized recommendation system based on knowledge embedding and historical behavior. Applied Intelligence, 2022. 52(1): p. 954-966.
6. Picoli, I.L., et al. Open-Channel SSD (What is it Good For). in CIDR. 2020.
7. Picoli, I.L., et al. uFLIP-OC: Understanding flash I/O patterns on open-channel solid-state drives. in Proceedings of the 8th Asia-Pacific Workshop on Systems. 2017.
8. Wang, P., et al. An efficient design and implementation of LSM-tree based key-value store on open-channel SSD. in Proceedings of the Ninth European Conference on Computer Systems. 2014.
9. Zhang, X., et al. Optimizing Performance for Open-Channel SSDs in Cloud Storage System. in 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS). 2021. IEEE.
10. González, J., et al. Application-driven flash translation layers on open-channel SSDs. in Proceedings of the 7th non Volatile Memory Workshop (NVMW). 2016.
11. Qin, H., et al., QBLKe: Host-side flash translation layer management for Open-Channel SSDs. Journal of Systems Architecture, 2021. 119: p. 102233.
12. Chen, J., et al. PATCH: Process-variation-resilient space allocation for open-channel SSD with 3D flash. in 2019 Design, Automation & Test in Europe Conference & Exhibition (DATE). 2019. IEEE.
13. Naumov, M., et al., Deep learning recommendation model for personalization and recommendation systems. arXiv preprint arXiv:1906.00091, 2019.
14. Bjørling, M., J. Gonzalez, and P. Bonnet. {LightNVM}: The Linux {Open-Channel}{SSD} Subsystem. in 15th USENIX Conference on File and Storage Technologies (FAST 17). 2017.
15. Wilkening, M., et al. RecSSD: near data processing for solid state drive based recommendation inference. in Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems. 2021.
16. Wan, H., et al. FlashEmbedding: storing embedding tables in SSD for large-scale recommender systems. in Proceedings of the 12th ACM SIGOPS Asia-Pacific Workshop on Systems. 2021.
17. Kim, M. and S. Lee. Reducing tail latency of DNN-based recommender systems using in-storage processing. in Proceedings of the 11th ACM SIGOPS Asia-Pacific Workshop on Systems. 2020.
18. Soltaniyeh, M., et al. Near-Storage Processing for Solid State Drive Based Recommendation Inference with SmartSSDs®. in Proceedings of the 2022 ACM/SPEC on International Conference on Performance Engineering. 2022.
19. Eisenman, A., et al., Bandana: Using non-volatile memory for storing deep learning models. Proceedings of Machine Learning and Systems, 2019. 1: p. 40-52.
20. Paszke, A., et al., Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems, 2019. 32.
21. Levental, M. and E. Orlova, Comparing the costs of abstraction for DL frameworks. arXiv preprint arXiv:2012.07163, 2020.

指導教授

陳增益(Tseng-Yi Chen)

審核日期

2022-8-24

推文