隨著第五代行動通訊技術的發展逐步成熟,人們對於透過行動網路進行交流與溝通的需求也持續飛快增長,未來將出現許多對延遲和可靠度要求嚴格的新型應用服務,例如:以360度8K超高畫質XR來觀看直播或遊玩多人互動式線上遊戲,使得5G網路的環境變得更加擁擠且複雜。因此未來在5G網路中如何有效率地進行資源分配將成為一項重要的任務,並且隨著人工智慧領域的蓬勃發展,許多關於機器學習的研究已被用於處理無線資源管理的問題上。然而傳統的深度學習方法雖然效能強大並已能夠在單一任務上取得優異的表現,但是當遇到變化較大的新環境時,無外乎都得再從頭開始學習。而元學習是機器學習領域近年來興起的新技術,被認為能夠有效地解決這個問題,因此在本研究中我們提出了基於元學習適應性的深度強化學習演算法,用以解決在5G MEC下多使用者互動環境中的資源分配問題,來讓系統的頻寬效益最大化。最後我們的實驗結果顯示,透過元學習的技術不但能有效的提高現有深度強化學習演算法的效能,並且未來將我們的資源分配機器學習模型部署到新環境時,更能快速適應從未接觸過的新環境。;As the development of the fifth-generation mobile communication technology gradually matures, people′s demand for communication through mobile networks continues to grow rapidly. In the future, many new application services with strict requirements on latency and reliability will appear, such as 360-degree 8K Ultra-high-definition XR watching the live stream or playing multiplayer interactive online games makes the 5G network environment more crowded and complicated. Therefore, how to efficiently allocate resources in 5G networks will become an important task in the future, and with the vigorous development of artificial intelligence, many kinds of research on machine learning have been used to deal with wireless resource management issues. Although traditional deep learning methods are powerful and have been able to achieve excellent performance on a single task. But when encountering a new environment with great changes, it is nothing more than learning from scratch. Meta-Learning is a new technology that has emerged in the field of machine learning in recent years. It is considered to be able to effectively solve this problem. In this research, we propose a deep reinforcement learning algorithm based on Meta-Learning adaptability to solve the problem of resource allocation in a multi-user interactive environment under 5G MEC to maximize the bandwidth benefit of the system. The final experimental results show that the Meta-Learning technology can not only effectively improve the performance of existing deep reinforcement learning algorithms, but also when deploying our resource allocation machine learning model to a new environment in the future, it will be able to quickly adapt to a new environment that has never been touched before.