Responsive image
博碩士論文 etd-0705115-170321 詳細資訊
Title page for etd-0705115-170321
論文名稱
Title
利用第二型隸屬函數之遺傳模糊資料探勘
Genetic-Fuzzy Mining with Type-2 Membership Functions
系所名稱
Department
畢業學年期
Year, semester
語文別
Language
學位類別
Degree
頁數
Number of pages
87
研究生
Author
指導教授
Advisor
召集委員
Convenor
口試委員
Advisory Committee
口試日期
Date of Exam
2015-07-24
繳交日期
Date of Submission
2015-08-05
關鍵字
Keywords
資料探勘、關聯規則、第二型模糊集、遺傳模糊資料探勘、隸屬函數
genetic-fuzzy mining, association rule, data mining, type-2 fuzzy set, membership function
統計
Statistics
本論文已被瀏覽 5676 次,被下載 468
The thesis/dissertation has been browsed 5676 times, has been downloaded 468 times.
中文摘要
關聯規則探勘常用於找出資料中具有價值的資訊。而在實際的交易資料庫中,物品通常伴隨著購買數量出現。模糊理論因此被應用於此類型的資料探勘,且許多探勘演算法亦被提出來挖掘模糊關聯規則。然而在過往的研究中,大多專注於利用第一型隸屬函數進行探勘,因此在本論文中,我們採用第二型隸屬函數進行資料挖掘。第二型隸屬函數相較於第一型隸屬函數較為一般化,所以能夠處理資料中更多的不確定性。我們首先提出一個利用第二型隸屬函數挖掘模糊關聯規則的方法。數值化的資料首先透過事先給定的第二型隸屬函數轉為第二型模糊區間,我們接著提出一個重心法將該模糊區間降為模糊值,並利用該值進行模糊關聯規則探勘。隸屬函數對於探勘的結果有著重大的影響,又由於多數的模糊關聯規則探勘演算法需要給定隸屬函數,因此我們針對此問題提出一個學習合適第二型隸屬函數的方法。該方法基於遺傳演算法進行學習,首先將隸屬函數編碼為染色體並設計合理的基因運算,藉由遺傳演算法的幫助找出適當的第二型隸屬函數。最後我們對探勘結果的品質做更進一步的調整,提出一個採用兩組式表示法編碼的染色體及針對提高模糊關聯規則信賴度的適應函數。利用上述提出之方法進行的實驗結果顯示對於模糊關聯規則的數量及品質的確得到提升。
Abstract
Association rule mining is commonly utilized to extract useful information from given data. Since items are usually with quantities in real-world transaction databases, the fuzzy set theory is applied to many mining approaches for deriving fuzzy association rules. In the past, fuzzy mining mainly focused on type-1 membership functions. In this thesis, we attempt to use type-2 membership functions for mining. Type-2 fuzzy sets are generalization of type-1 fuzzy sets and are able to handle more uncertainty than type-1. An interval type-2 fuzzy association rule mining approach is first proposed in this thesis. Rules are mined by predefined interval type-2 membership functions. The quantitative transactions are transformed into fuzzy values according to the corresponding type-2 membership functions. The interval type-2 fuzzy values will be reduced to type-1 values by a centroid type reduction method in order to induce fuzzy association rules. Since membership functions are usually assumed to be known in advance in most of the fuzzy data mining approaches, thus a GA-based type-2 fuzzy association rule mining is proposed to learn appropriate type-2 membership functions. The type-2 membership functions of each item are encoded as a chromosome and appropriate genetic operators are designed to find good solutions. In order to further enhance the quality of mining results, another GA-based representation, the 2-tuple linguistic representation, is also proposed. It adopts a different tuning mechanism and a modified evaluation function for the chromosomes to evolve. Experiments are also made to show the effectiveness of the proposed approaches. From the experimental results, the proposed approaches can mine more rules than using type-1 membership functions, and the qualities of rules are improved as well.
目次 Table of Contents
摘要 ii
Abstract iii
Content iv
List of Figures vii
CHAPTER 1 Introduction 1
1.1 Motivation 1
1.2 Organization of the Thesis 4
CHAPTER 2 Review of Related Work 6
2.1 Type-2 Fuzzy Sets 6
2.2 Fuzzy Data Mining 7
2.3 Genetic-Fuzzy Mining 8
CHAPTER 3 Fuzzy Data Mining with Type-2 Membership Functions 10
3.1 Introduction 10
3.2 Proposed Approach 10
3.3 An Example 15
CHAPTER 4 Genetic-Fuzzy Mining with Type-2 Membership Functions 23
4.1 Introduction 23
4.2 A GA-Based Mining Framework 24
4.2.1 Chromosome Representation 25
4.2.2 Initial Population 27
4.2.3 Fitness Function and Selection 27
4.2.4 Genetic Operators 30
4.3 Proposed Approach 30
4.4 An Example 33
CHAPTER 5 Genetic-Fuzzy Mining with Tuning Mechanisms 41
5.1 Introduction 41
5.2 A GA-Based Tuning Framework 42
5.2.1 Chromosome Representation 43
5.2.2 Initial Population 44
5.2.3 Fitness Function and Selection 44
5.2.4 Genetic Operators 47
5.3 Proposed Approach 48
5.4 An Example 50
CHAPTER 6 Experimental Evaluation 59
6.1 Datasets 59
6.2 Experimental Results 60
6.2.1 Evaluations on Proposed T2FM 60
6.2.2 Evaluations on Proposed T2GFM 63
6.2.3 Evaluations on Proposed 2-Tuple T2GFM 66
CHAPTER 7 Conclusions and Future Work 70
References 72
參考文獻 References
[1] R. Agrawal and R. Strikant, "Fast Algorithms for Mining Association Rules in Large Databases," The International Conference on Very Large Data Bases, pp. 487-499, 1994.
[2] J. Alcalá-Fdez, R. Alcalá, M. J. Gacto and F. Herrera, "Learning the Membership Function Contexts for Mining Fuzzy Association Rules by Using Genetic Algorithms," Fuzzy Sets and Systems, vol. 160, no. 7, pp. 905-921, 2009.
[3] W. H. Au and K. C. C. Chan, "Mining Fuzzy Association Rules in a Bank-Account Database," IEEE Transactions on Fuzzy Systems, vol. 11, no. 2, pp. 238-248, 2003.
[4] D. Bernardo, H. Hagras and E. Tsang, "A Genetic Type-2 Fuzzy Logic Based System for the Generation of Summarised Linguistic Predictive Models for Financial Applications," Soft Computing, vol. 17, no. 12, pp. 2185-2201, 2013.
[5] C. H. Cai, A. W. C. Fu, C. H. Cheng and W. W. Kwong, "Mining Association Rules with Weighted Items," The International Database Engineering and Applications Symposium, pp. 68-77, 1998.
[6] O. Castillo, P. Melin and W. Pedrycz, "Design of Interval Type-2 Fuzzy Models Through Optimal Granularity Allocation," Applied Soft Computing, vol. 11, no. 8, pp. 5590-5601, 2011.
[7] K. C. C. Chan and W. H. Au, "An Effective Algorithm for Discovering Fuzzy Rules in Relational Databases," The IEEE International Conference on Fuzzy Systems, vol. 2, pp. 1314-1319, 1998.
[8] C. H. Chen, T. P. Hong and V. S. Tseng, "A SPEA2-Based Genetic-Fuzzy Mining Algorithm," The IEEE International Conference on Fuzzy Systems, pp. 1-5, 2010.
[9] C. H. Chen, T. P. Hong and Y. C. Lee, "Genetic-Fuzzy Mining with Taxonomy," International Journal of Uncertainty, Fuzziness, and Knowledge-Based Systems, vol. 20, no. 2, pp. 187-205, 2012.
[10] H. P. Chiu, Y. T. Tang and K. L. Hsieh, "Applying Cluster-Based Fuzzy Association Rules Mining Framework into EC Environment," Applied Soft Computing, vol. 12, no. 8, pp. 2114-2122, 2012.
[11] O. Cordón, F. Herrera and P. Villar, "Generating the Knowledge Base of a Fuzzy Rule-Based System by the Genetic Learning of the Data Base," IEEE Transactions on Fuzzy Systems, vol. 9, no. 4, pp. 667-674, 2001.
[12] D. Dubois, H. Prade and T. Sudkamp, "On the Representation, Measurement, and Discovery of Fuzzy Associations," IEEE Transactions on Fuzzy Systems, vol. 13, no. 2, pp. 250-262, 2005.
[13] A. M. El-Nagar and M. El-Bardini, "Derivation and Stability Analysis of the Analytical Structures of the Interval Type-2 Fuzzy PID Controller," Applied Soft Computing, vol. 24, pp. 704-716, 2014.
[14] M. H. Fazel Zarandi, B. Rezaee, I. B. Turksen and E. Neshat, "A Type-2 Fuzzy Rule-Based Expert System Model for Stock Price Analysis," Expert Systems with Applications, vol. 36, no. 1, pp. 139-154, 2009.
[15] M. H. Fazel Zarandi and R. Gamasaee, "A Type-2 Fuzzy System Model for Reducing Bullwhip effects in Supply Chains and Its Application in Steel Manufacturing," Scientia Iranica, vol. 20, no. 3, pp. 879-899, 2013.
[16] M. Fazzolari, R. Alcalá, Y. Nojima, H. Ishibuchi and F. Herrera, "A Review of the Application of Multi-Objective Evolutionary Systems: Current Status and Further Directions," IEEE Transactions on Fuzzy Systems, vol. 21, no. 1, pp. 45-65, 2013.
[17] J. Han and Y. Fu, "Discovery of Multiple-Level Association Rules from Large Databases," The International Conference on Very Large Data Bases, pp. 420-431, 1995.
[18] F. Herrera, M. Lozano and J. L. Verdegay, "Fuzzy Connectives Based Crossover Operators to Model Genetic Algorithms Population Diversity," Fuzzy Sets and Systems, vol. 92, no. 1, pp. 21-30, 1997.
[19] F. Herrera, "Genetic Fuzzy Systems: Taxonomy, Current Research Trends and Prospects," Evolutionary Intelligence, vol. 1, no. 1, pp. 27-46, 2008.
[20] T. P. Hong, C. S. Kuo and S. C. Chi, "Mining Association Rules from Quantitative Data," Intelligent Data Analysis, vol. 3, no. 5, pp. 363-376, 1999.
[21] T. P. Hong, C. H. Chen, Y. L. Wu and Y. C. Lee, "A GA-based Fuzzy Mining Approach to Achieve a Trade-off Between Number of Rules and Suitability of Membership Functions," Soft Computing, vol. 10, no. 11, pp. 1091-1101, 2006.
[22] T. P. Hong, K. Y. Lin and B. C. Chien, "Mining Fuzzy Multiple-Level Association Rules from Quantitative Data," Applied Intelligence, vol. 18, no. 1, pp. 79-90, 2003.
[23] Y. C. Hu, "Determining Membership Functions and Minimum Fuzzy Support in Finding Fuzzy Association Rules for Classification Problems," Knowledge-Based Systems, vol. 19, no. 1, pp. 57-66, 2006.
[24] Y. H. Hu, F. Wu and Y. J. Liao, "An Efficient Tree-Based Algorithm for Mining Sequential Patterns with Multiple Minimum Supports," Journal of Systems and Software, vol. 86, no. 5, pp. 1224-1238, 2013.
[25] T. C. K. Huang, "Discovery of Fuzzy Quantitative Sequential Patterns with Multiple Minimum Supports and Adjustable Membership Functions," Information Sciences, vol. 222, no. 10, pp. 126-146, 2013.
[26] M. Kaya and R. Alhajj, "Genetic Algorithm Based Framework for Mining Fuzzy Association Rules," Fuzzy Sets and Systems, vol. 152, no. 3, pp. 587-601, 2005.
[27] M. Kaya and R. Alhajj, "Utilizing Genetic Algorithms to Optimize Membership Functions for Fuzzy Weighted Association Rules Mining," Applied Intelligence, vol. 24, no. 1, pp. 7-15, 2006.
[28] C. M. Kuok, A. W. C. Fu and M. H. Wong, "Mining Fuzzy Association Rules in Databases," ACM SIGMOD Record, vol. 27, no. 1, pp. 41-46, 1998.
[29] K. M. Lee, "Mining Generalized Fuzzy Quantitative Association Rules with Fuzzy Generalization Hierarchies," IFSA World Congress and 20th NAFIPS International Conference, vol. 5, pp. 2977-2982, 2001.
[30] Y. C. Lee, T. P. Hong and W. Y. Lin, "Mining Fuzzy Association Rules with Multiple Minimum Supports Using Maximum Constraints," Lecture Notes in Computer Science, vol. 3214, pp. 1283-1290, 2004.
[31] Y. C. Lee, T. P. Hong and T. C. Wang, "Multi-Level Fuzzy Mining with Multiple Minimum Supports," Expert Systems with Applications, vol. 34, no. 1, pp. 459-468, 2008.
[32] A. Mangalampalli and V. Pudi, "FPrep: Fuzzy Clustering Driven Efficient Automated Pre-Processing for Fuzzy Association Rule Mining," The IEEE International Conference on Fuzzy Systems, pp. 1-8, 2010.
[33] F. D. Martino and S. Sessa, "Type-2 Interval Fuzzy Rule-Based Systems in Spatial Analysis," Information Sciences, vol. 279, no. 20, pp. 199-212, 2014.
[34] S. G. Matthews, M. A. Gongora, A. A. Hopgood and S. Ahmadi, "Temporal Fuzzy Association Rule Mining with 2-Tuple Linguistic Representation," The IEEE International Conference on Fuzzy Systems, pp. 1-8, 2012.
[35] P. Melin and O. Castillo, "A Review on the Applications of Type-2 Fuzzy Logic in Classification and Pattern Recognition," Expert Systems with Applications, vol. 40, no. 13, pp. 5413-5423, 2013.
[36] J. M. Mendel and R. I. Bob John, "Type-2 Fuzzy Sets Made Simple," IEEE Transactions on Fuzzy Systems, vol. 10, no. 2, pp. 117-127, 2002.
[37] Y. Nojima and H. Ishibuchi, "Incorporation of User Preference into Multiobjective Genetic Fuzzy Rule Selection for Pattern Classification Problems," Artificial Life and Robotics, vol. 14, pp. 418-421, 2009.
[38] A. Parodi and P. Bonelli, "A New Approach of Fuzzy Classifier Systems," The International Conference on Genetic Algorithms, pp. 223-230, 1993.
[39] A. Savasere, E. Omiecinski and S. Navathe, "An Efficient Algorithm for Mining Association Rules in Large Databases," The International Conference on Very Large Data Bases, pp. 432-443, 1995.
[40] R. Srikant and R. Agrawal, "Mining Generalized Association Rules," The International Conference on Very Large Data Bases, pp. 407-419, 1995.
[41] C. H. Wang, T. P. Hong and S. S. Tseng, "Integrating Membership Functions and Fuzzy Rule Sets from Multiple Knowledge Sources," Fuzzy Sets and Systems, vol. 112, pp. 141-154, 2000.
[42] R. R. Yager, "Fuzzy Subsets of Type II in Decisions," Journal of Cybernetics, vol. 10, pp. 137-159, 1980.
[43] S. Yue, E. Tsang, D. Yeung and D. Shi, "Mining Fuzzy Association Rules with Weighted Items," The IEEE International Conference on Systems, Man and Cybernetics, pp. 1906-1911, 2000.
[44] L. A. Zadeh, "The Concept of a Linguistic Variable and Its Application to Approximate Reasoning—I," Information Sciences, vol. 8, no. 3, pp. 199-249, 1975.
電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。
論文使用權限 Thesis access permission:自定論文開放時間 user define
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus: 已公開 available


紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。
開放時間 available 已公開 available

QR Code