本論文提出的方法是以辨識物體的特徵作為前處理的步驟,再根據結果分類出特定場景,透過Mask R-CNN算法針對輸入的圖片進行特定室內物件分割的處理,接著以分割完的物件作為場景的特徵,再與場景結合並進行分類。實驗結果證明,透過獲取場景中物件特徵的方法的前處理,能在場景識別中取得更好的場景分類準確度。 ;Scene Recognition is an important operation of Image Semantic Segmentation, in the wide range of scene recognition, it is a thorny issue to correctly and efficient find effective location information in specific scene. In the mission of scene recognition, a scene is mainly comprised of three elements, including object, spatial layout and the relationship between backgrounds, these object types in scene have huge impact on results of classification. Through this matter, scene could be recognized based on those identified objects of scene, for example, bathtub or toilet in the bathroom, bed or writing desk in the bedroom. In this thesis, an effective architecture for scene recognition is proposed. The architecture includes a pre-process step to identify feature of each object, then classify specified scene based on the results of object feature. Moreover, those input pictures will be pre-processed through Mask R-CNN algorithm to identify specific indoor objects by results of segmentation, and those specified indoor objects become elements for scene recognition classification. The experimental results show that through pre-process of object identification, the proposed method has the advantages of accuracy in scene recognition.