由ERP資料庫和BW系統自動化產生同義字之關鍵績效指標和多維度模型

以作者查詢圖書館館藏

、以作者查詢臺灣博碩士

、以作者查詢全國書目

、勘誤回報

、線上人數：11

、訪客IP：18.227.140.54

姓名

陳雅鈞(Ya-Jyun Chen) 查詢紙本館藏

畢業系所

工業管理研究所

論文名稱

由ERP資料庫和BW系統自動化產生同義字之關鍵績效指標和多維度模型
(Automatically generating Key Performance Indices candidates based on synonyms and dimensional model design from ERP database and BW system)

相關論文

★ 以類神經網路探討晶圓測試良率預測與重測指標值之建立	★ 六標準突破性策略—企業管理議題
★ 限制驅導式在製罐產業生產管理之應用研究	★ 應用倒傳遞類神經網路於TFT-LCD G4.5代Cell廠不良問題與解決方法之研究
★ 限制驅導式生產排程在PCBA製程的運用	★ 平衡計分卡規劃與設計之研究-以海軍後勤支援指揮部修護工廠為例
★ 木製框式車身銷售數量之組合預測研究	★ 導入符合綠色產品RoHS之供應商管理-以光通訊產業L公司為例
★ 不同產品及供應商屬性對採購要求之相關性探討－以平面式觸控面板產業為例	★ 中長期產銷規劃之個案探討 -以抽絲產業為例
★ 消耗性部品存貨管理改善研究-以某邏輯測試公司之Socket Pin為例	★ 封裝廠之機台當機修復順序即時判別機制探討
★ 客戶危害限用物質規範研究-以TFT-LCD產業個案公司為例	★ PCB壓合代工業導入ISO/TS16949品質管理系統之研究-以K公司為例
★ 報價流程與價格議價之研究–以機殼產業為例	★ 產品量產前工程變更的分類機制與其可控制性探討-以某一手機產品家族為例

檔案

[Endnote RIS 格式]

[Bibtex 格式]

[相關文章]

[文章引用]

[完整記錄]

[館藏目錄]

至系統瀏覽論文 ( 永不開放)

摘要(中)

在先前的研究中，關鍵績效指標候選的產生，主要透過文字探勘的技術，比對系統中對於實體屬性的欄位敘述與資料倉儲(DW)系統中既有之指標運算域的敘述的相似度來進行實體屬性與運算域的連結，透過預先定義的公式結構來產生候選之關鍵績效指標，並且為績效指標的候選建立多維度模型。
在此研究中，我們透過將運算域的敘述以同義字的方式做關鍵字的展開，希望能夠比對到更多的實體性的欄位敘述來增加KPI候選的數量。另外，由於KPI內運算域的敘述為一個聚合的數值，所以沒有辦法透過字面上的比對來找到相似的實體屬性敘述；在本研究中,我們提供了解析聚合數值的演算法來為聚合運算域找到相關的實體屬性敘述。而在過濾關鍵績效指標的候選時，我們發現一部分的無意義指標的產生，是由於資料本身並沒有進行錯誤資料型態的過濾，在此研究中也會進行修正。最後，我們建立候選關鍵績效指標之多維度模型。為了提升多維度模型的管理意涵，在此研究中將會考慮到階層關係，增加可用來做為管理意涵的分析的維度表格。

摘要(英)

In previous research, the authors treat operand in KPI formula as query word to find similar description of entity attribute to form KPI candidates. However, operand in KPI formula only contains 2-3 words, which are hard to perfectly perform a query so that it may causes worse mining results and also affect the number of KPI candidates generated. After generating KPI candidates, we discover that some aggregate value operand cannot find its mapping attributes. Furthermore, we discover that some of the dropped KPI candidates are due to the data type of operands in KPI candidates are not numeric value. When previous authors generate dimensional model, they only consider the entities that adjacent to fact table as dimension tables, ignoring other entities that connect to the adjacent entities, which may cause information lost.
In this research, in order to increase the number of KPI candidates we generated, we expand the operand words from existing KPI in Data Warehouse (DW) system based on its synonyms from lexical database, attaching these synonyms to operand words as query words. By text mining technique, we modified TFIDF to compare the similarity between description of entity-attributes and query words. Moreover, we also modified the predefined structure that used to generate KPI candidate by switching the operand set based on operator. Besides, in order to decrease the number of meaningless KPI candidates, we filter out those entity-attributes with uncountable data type. For those aggregated operands which cannot find mapping attributes, we proposed an algorithm to disaggregate it to find description of entity attribute. Moreover, eliminating entity-attribute of improper data type may also make TFIDF weighting procedure more precisely. Also, we improve the dimension model through merging the entities that connect to dimension table by their hierarchy.

關鍵字(中)

★ 多維度模型設計
★ 關鍵績效指標

關鍵字(英)

★ dimensional model design
★ Key performance index

論文目次

摘要 ................................................................................................................... i
Abstract ............................................................................................................. ii
Table of Content ............................................................................................... iii
Chapter 1 Introduction ....................................................................................... 1
1.1 Background and motivation .................................................................. 1
1.2 Research problem ................................................................................. 3
1.3 Research objective ............................................................................... 6
1.4 Research methodology ......................................................................... 7
Chapter 2 Literature Review .............................................................................. 9
2.1 Query Expansion .................................................................................. 9
2.2 Lexical semantic relation .................................................................... 10
2.3 Wordnet database ............................................................................... 12
2.4 Text Frequency in Document .............................................................. 13
2.5 dimension table design ....................................................................... 15
2.5.1 Dimension tables combination ................................................. 15
Chapter 3 Methodology ................................................................................... 17
3.1Data preprocessing .............................................................................. 17
3.1.1Extract description of entity-attribute from metadata model ...... 17
3.1.2Extract the formula of existing KPI in DW system .................... 17
3.1.3 Extract the detail component of aggregate value from other
source ...................................................................................................... 18
3.1.4Extract the synonym of operand words from database ............... 18
3.2Operands Classified into Operand Set. ................................................ 19
3.3Data cleaning and transforming ........................................................... 19
3.3.1 data type filtering ..................................................................... 19
3.3.2 Stemming ................................................................................ 20
3.4KPI candidate Generating.................................................................... 21
3.4.1Entity-attribute assignment ....................................................... 21
3.4.2 Find mapping description of entity attribute for aggregate
operand .................................................................................................... 24
3.4.3 Generating KPI candidates....................................................... 32
3.5 Constructing Dimensional Model ....................................................... 33
3.5.1Finding out the shortest path of entities given by KPI candidate 33
3.5.2Fact entity definition ................................................................. 34
3.5.3Dimension tables combination .................................................. 35
3.5.4 Dimension generating .............................................................. 37
Chapter 4 Application ...................................................................................... 39
4.1 Data preprocessing ............................................................................. 39
4.1.1 Extract description of entity-attribute from SAP ERP............... 39
4.1.2 Extract the formula of existing KPI in DW system or other source
................................................................................................................ 40
4.1.3 Extract the synonym of operand words from Wordnet database 43
4.1.4 Extract the detail of existing KPI from WIKI or text book. ...... 45
4.2 Operands Classified into Operand Set. ............................................... 46
4.3 Data cleaning and transforming .......................................................... 48
4.3.1 data type filtering ..................................................................... 48
4.3.2 Stemming ................................................................................ 49
4.4 Entity-attribute assignment and Generating KPI candidates ................ 50
4.5 Constructing Dimensional Model ....................................................... 61
4.5.1 Finding out the fact table for KPI candidate by shortest path .... 61
4.5.2 Dimension tables combination ................................................. 63
4.5.3 Dimension generating .............................................................. 65
Chapter 5 Conclusion ...................................................................................... 67
5.1 Research contribution ......................................................................... 67
5.2 Research limitation............................................................................. 68
5.3 Future research ................................................................................... 68
Reference ........................................................................................................ 69

參考文獻

1. Andy Chiu, Pascal Poupart, and Chrysanne DiMarco,(2006),Learning Lexical Semantic Relations using Lexical Analogies — Extended Abstract,
2. Boehnlein, M., & Ulbrich-vom Ende, A. (1999). Deriving initial data warehouse structures from the conceptual data models of the underlying operational information systems. Proceedings of the 2nd ACM International Workshop on Data Warehousing and OLAP, 15-21.
3. Chen, Y. T., & Hsu, P. Y. (2007). A grain preservation translation algorithm: From ER diagram to multidimensional model. Information Sciences, 177(18), 3679-3695.
4. Dawson, J. L. Suffix removal and word conflation. ALLC Bulletin, 33-46.
5. Dori, D., Feldman, R., & Sturm, A. (2005). Transforming an operational system model to a data warehouse model: A survey of techniques. Proceedings of the IEEE International Conference on Software-Science, Technology & Engineering, 47-56.
6. Ellen M. Voorhees(1994),Query expansion using lexical-semantic relations, ACM, ISBN 0-387-19889-X,spring verlag New York, Inc, New York, NY, USA,
7. Olga Vechtomova & Ying wang(2006), A study of the effect of term proximity on query expansion, Journal of Information Science 2006 32: 324
8. Frakes, W. B., & Baeza-Yates, R. (1992). Information retrieval: Data structures and algorithms Prentice-Hall, Inc. Upper Saddle River, NJ, USA.
9. Golfarelli, M., Maio, D., & Rizzi, S. (1998). Conceptual design of data warehouses from E/R schemes. Proceedings of the Hawaii International Conference on System Sciences, 31, 334-343.
10. George A. Miller,1995,Wordnet: a lexical database for English, ACM New York, NY, USA , Volume 38 Issue 11, Nov. 1995
11. Husemann, B., Lechtenborger, J., & Vossen, G. (2000). Conceptual data warehouse design. Proceedings of the International Workshop on Design and Management of Data Warehouses (DMDW 2000), 83
12. Hang chi(2002),Probabilistic Query Expansion Using Query Logs, ACM 1-58113-449-5/02/0005.
13. Jane Greenberg,Automatic query expansion via lexical–semantic relationships,2001,Journal of the American Society for Information Science and Technology.
14. Kimball, R., & Ross, M. (2002). The data warehouse toolkit (2nd Ed.) Wiley & Sons, Inc.
15. Krovetz, R. (2000). Viewing morphology as an inference process. Artificial Intelligence, 118(1-2), 277-294.
16. Lovins, J. B. (1968). Development of a stemming algorithm. Mechanical Translation and Computational Linguistics, (11), 22-31.
17. Malinowski, E. (2008). Advanced data warehouse design: From conventional to spatial and temporal applications Springer-Verlag New York Inc.
18. Marotta, A., & Ruggia, R. (2002). Data warehouse design: A schema-transformation approach. International Conference of the Chilean Computer Science Society (SCCC’’02), 153.
19. Michael Chau,Xiao Fang, Olivia R,Liu Sheng,2005, Journal of the American Society for Information Science and Technology, Volume 56, Issue 13, pages 1363–1376, November.
20. Moldovan, D.I.,(2002),Using Wordnet and lexical Operator to improve Internet search, IEEE 1089-7801/00/$10.00 c2000
21. Moody, D. L., & Kortink, M. A. R. (2000). From enterprise models to dimensional models: A methodology for data warehouse and data mart design. DMDW’00, Sweden, 5.
22. Paice, C. D. (1990). Another stemmer. ACM SIGIR Forum, 24(3) 56-61.
23. Phipps, C., & Davis, K. (2002). Automating data warehouse conceptual schema design and evaluation. Proc.4th DMDW, Toronto, Canada,
24. Porter, M. F. (1997). An algorithm for suffix stripping. Program, 14(3)
25. Rila Mandala(2002), Ad Hoc Retrieval Experiments Using WordNet and Automatically constructed thesari,
26. Rocchio(1971),Relevance feedback on information retrieval, SMART Retrieval System: Experiments in Automatic Document Processing (1971), pp. 313-323
27. Rowen, W., Song, I. Y., Medsker, C., Arynth, I., & Ewen, E. (2001). An analysis of many-to-many relationships between fact and dimension tables in dimensional modeling. Proceedings of the International Workshop on Design and Management of Data Warehouses (DMDW’2001), Interlaken Switzerland.
28. Salton, G., & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing & Management, 24(5), 513-523.
29. Salton, G., & Lesk, M. (1968). Computer evaluation of indexing and text processing. Journal of the ACM, 15(1), 8-36.
30. Sinz, E. J. (1988). The structures entity-relationship model (SERM). Angewandte Informatik, 30(5), 191-202.
31. Song, I. Y., Khare, R., & Dai, B. (2007). SAMSTAR: A semi-automated lexical method for generating star schemas from an entity-relationship diagram. Proceedings of the ACM Tenth International Workshop on Data Warehousing and OLAP, 9-16.
32. Yih-Chen Wang, James Vandendorpe, and Martha Evens. Relational thesauri in information retrieval. Journal of the American Soczety for Information Sczence, 36(1):15-27, January 1985.
33. Yutaka Matsuo& Mitsuru Ishizuka,2004, International Journal on Artificial Intelligence Tools,392-396
34. 吳佩蓉，「從 ERP交易資料發掘供應商績效評估指標並考慮查詢構面」，國立中央大學，碩士論文，民國95年。
35. 李季庭，「由 ERP 系統中資料模型之敘述自動挖掘關鍵績效指標」，國立中央大學，碩士論文，民國98年。
36. 鐘國華，「利用企業資源規劃以及商業智慧資料產生銷售面績效評估指標」，國立中央大學，碩士論文，民國97年。
37. 黃聖賢, 「由ERP資料庫之實體-關聯模型自動畫產生關鍵績效指標的多維度模型」，國立中央大學，碩士論文，民國99年。
38. 廖敏妏, 「由ERP資料庫和BW系統自動化產生流程導向之關鍵績效指標和多維度模型」，國立中央大學，碩士論文，民國99年。

指導教授

沈國基(Gwo-ji Sheen)

審核日期

2012-7-19

推文