適用於深度增強式學習之瀑布式排程方法;Waterfall Model for Deep Reinforcement Learning Based Scheduling

NCU Institutional Repository > 資訊電機學院 > 通訊工程學系碩士在職專班 > 博碩士論文 > Item 987654321/80978

請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/80978

題名:	適用於深度增強式學習之瀑布式排程方法;Waterfall Model for Deep Reinforcement Learning Based Scheduling
作者:	劉政威;Liu, Zheng-Wei
貢獻者:	通訊工程學系在職專班
關鍵詞:	排程;強化學習;Scheduling;Reinforcement Learning
日期:	2019-07-31
上傳時間:	2019-09-03 15:23:11 (UTC+8)
出版者:	國立中央大學
摘要:	第四代通訊系統已可滿足移動式設備的多媒體應用需求。透過基地台提供的排程服務，用戶設備可在通訊系統的下行鏈路獲取各自所需的資料封包，藉以滿足並獲得更好的應用服務，因此配給通道資源並提供用戶群排程服務的演算法相當關鍵。本文實現一行動通訊排程學習平台，提出基於Deep Deterministic Policy Gradient模型，並採用瀑布模型概念將排程算法流程依序解析為排序挑選、資源評估和通道分配三個階段，透過階段微型算法學習挑選在當前通訊環境下使單位時間資料吞吐量更多並滿足更多用戶需求的瀑布式排程方法。行動通訊排程學習平台由六大模組元件架構而成：基地台與通道資源、強化學習神經網路、用戶設備屬性、應用服務類型、環境資訊與獎勵函式，與階段微型算法與依賴注入。利用反轉控制與依賴注入降低平台軟體耦合性，在階段微型算法與六大模組元件的維護上變得相當容易。;The fourth generation of communication systems has been able to meet the multimedia application needs of mobile devices. Through the scheduling service provided by the base station, the user equipment can obtain the data packets required by the downlink of the communication system to meet and obtain better application services, so the channel resources are allocated and the calculation of the user group scheduling service is provided. The law is quite critical. This paper implements a mobile communication scheduling learning platform, and proposes a Deep Deterministic Policy Gradient model. The waterfall model concept is used to analyze the scheduling algorithm flow into three stages: sorting selection, resource evaluation and channel allocation. A waterfall scheduling method that enables more data throughput per unit time and meets more user needs in the current communication environment. The mobile communication scheduling learning platform is composed of six modular components: base station and channel resources, enhanced learning neural network, user equipment attributes, application service types, environmental information and reward functions, and phase micro-algorithms and dependency injection. . Using inversion control and dependency injection to reduce platform software coupling, it is quite easy to maintain the stage micro-algorithm and the six module components.
顯示於類別:	[通訊工程學系碩士在職專班 ] 博碩士論文

文件中的檔案:

檔案	描述	大小	格式	瀏覽次數
index.html		0Kb	HTML	318	檢視/開啟

在NCUIR中所有的資料項目都受到原著作權保護.

社群 sharing

資料載入中.....