隨著自然語言處理相關技術的快速發展,其在跨領域的應用上也有相當的發展。生醫文本探勘是生醫領域相關研究中重要的目的之一,隨著相較於以前的紙本記錄更趨向電子化的紀錄方式,在生醫文本探勘中也提供更多的資源去做研究。我們以醫院病歷作為研究方向,針對不同醫院科別間的病歷轉移學習作為主要目的。要達到這項目標,我們會使用到生醫領域的命名實體標註技術(Named Entity Recognition),藉由其預測出在病歷中的疾病名稱,使醫療人員在統整記錄診斷時能有相當的幫助。過去的研究中,大致上分類為基於規則的生醫文本命名實體標註以及基於字典的命名實體標註兩大方向。但這兩者共同的缺點為會有文字的歧異性,並不能良好的區分語意問題。 為了解決這樣的問題,我們使用機器學習的方法,BioBERT(Bidirectional Encoder Representations from Transformers for Biomedical Text Mining)則是在生醫自然語言處理領域中相當重要的技術之一。在我們的實驗中,我們將以醫院的科別為單位去做病歷文本探勘,以分析在不同科的病歷所訓練出的模型轉移學習到其他科別時的效果與不同科之間的文本差異。;With the rapid development of natural language processing (NLP), there has been considerable development in their cross-domain applications. Biomedical text mining is one of the most important purposes in biomedical research, and with the move towards electronic records as opposed to paper records, it provides more resources for biomedical text mining. We use hospital medical records as the research data source, and the primary objective is to apply the transfer learning of medical records between different hospital departments. To achieve this goal, we use Named Entity Recognition (NER), a technique used in the biomedical field that predicts the name of a disease in the patient’s record, to help medical experts in the consolidation of diagnoses. In the past studies, the two main approaches are roughly classified as rule-based biomedical text NER and dictionary-based NER. However, their common disadvantage is textually ambiguous, which is not the best way to distinguish semantic problems. BioBERT (Bidirectional Encoder Representations from Transformers for Biomedical Text Mining) is one of the most important technologies in the field of biomedical natural language processing, and we use machine learning to solve ambiguous word problems. In our experiment, We will apply text mining on medical records through hospital departments in order to analyze the effect of transferring the model trained in medical records from different departments to other departments and the differences in text between different departments.