隨著5G技術的快速發展,網路流量不斷增加且對於低延遲的需求日益強烈。衛星網路因此成為下一代通訊網路的關鍵組成部分。然而,由於地球上陸地和海洋分佈的不均,導致全球人口分布存在差異。這使得在衛星網路中使用傳統的路由算法可能導致衛星節點在穿越人口密集區域時出現傳輸壅塞的情況。為了確保衛星網路的性能,並應對不斷增長的網路流量,本研究提出了一種基於強化學習的路由方法,旨在實現流量的有效分配。我們的方法採用集中式管理,融合衛星鏈路的動態資訊,包括剩餘頻寬、傳輸延遲、可用時間和接收節點的平均等待時間作為學習參數。我們採用雙重深度強化學習網路(DDQN)模型來最佳化路由決策。通過與其他負載平衡路由方法進行性能比較,我們在不同網路場景和傳輸需求下進行模擬,結果顯示我們提出的路由方法在較小傳輸延遲的情況下,在剩餘頻寬和封包遺失率等方面表現出色。;With the rapid development of 5G technology, there is a continuous increase in network traffic and a growing demand for low latency. Consequently, satellite networks have become a crucial component of the next-generation communication infrastructure. However, due to the uneven distribution of land and oceans on Earth, there exist disparities in the global population distribution. This disparity can lead to transmission congestion for satellite nodes when passing through densely populated areas, if traditional routing algorithms are employed. In order to ensure the performance of satellite networks and address the ever-growing network traffic, this study introduces a routing approach based on reinforcement learning aimed at achieving effective traffic allocation. Our method involves centralized management and integrates dynamic information from satellite links, including remaining bandwidth, transmission delay, available time, and average waiting time of receiving nodes, as learning parameters. We utilize a Double Deep Q-Network (DDQN) model to optimize routing decisions. Through performance comparisons with other load-balancing routing methods, we conduct simulations under various network scenarios and transmission requirements. The results demonstrate that our proposed routing method exhibits superior performance in terms of reduced transmission delay, improved remaining bandwidth utilization, and reduced packet loss rate.