Responsive image
博碩士論文 etd-0906111-214151 詳細資訊
Title page for etd-0906111-214151
論文名稱
Title
模糊時序性資料探勘之研究
A Study on Fuzzy Temporal Data Mining
系所名稱
Department
畢業學年期
Year, semester
語文別
Language
學位類別
Degree
頁數
Number of pages
90
研究生
Author
指導教授
Advisor
召集委員
Convenor
口試委員
Advisory Committee
口試日期
Date of Exam
2011-07-22
繳交日期
Date of Submission
2011-09-06
關鍵字
Keywords
項目銷售期間、模糊週期性關聯規則、資料探勘、模糊資料探勘、模糊理論
fuzzy data mining, Fuzzy set, data mining, fuzzy temporal association rule, item lifespan
統計
Statistics
本論文已被瀏覽 5638 次,被下載 128
The thesis/dissertation has been browsed 5638 times, has been downloaded 128 times.
中文摘要
資料挖掘的目的在於如何從資料庫中,擷取出感興趣以及具有意義的頻繁項目集,而其中一個子題,時序性資料探勘,由於其廣泛利用使得此議題,所以,近幾年受到許多注意,其主要的考量在於資料庫中分析具有時間性的資料,並且由其中發現出具時間性的項目集以及規律。在日常生活中,普遍存在的各種模糊性現象,包含人類的思維以及決策等,皆存在有模糊的特質,模糊理論強調許多事情的結果,無法符合傳統的非0即1的二元邏輯,而是介於之間。在論文中,我們提出了三個模糊時序性資料探勘的方法,分別去做探討,針對交易資料庫中每筆項目的交易資料,以及他們所發生的時間,去做資料探勘的動作,這三個方法分別處理不同的時間區間的定義。第一個方法是針對每個項目以它所發生的第一筆交易時間至最後時間為評估的時間週期;第二個方法則是根據每個項目實際出版的時間作為評估週期;第三個方法則是找出在某一時間至最後時間裡,皆為頻繁狀態的時間作為評估週期。本論文所提的方法則是在架構在上述三種不同時間週期來找出各自的模糊週期性關聯規則。最後,實驗結果也證實本論文所提的方法確實能有效的針對不同時間週期進行資料挖掘的程序,在出版時序資料庫裡,能找出更多具感興趣的模糊週期規則。
Abstract
Data mining is an important process of extracting desirable knowledge from existing databases for specific purposes. Nearly all transactions in real-world databases involve items bought, quantities of the items, and the time periods in which they appear. In the past, temporal quantitative mining was proposed to find temporal quantitative rules from a temporal quantitative database. However, the quantitative values of items are not suitable to human reasoning. To deal with this, the fuzzy set theory was applied to the temporal quantitative mining because of its simplicity and similarity to human reasoning. In this thesis, we thus handle the problem of mining fuzzy temporal association rules from a publication database, and propose three algorithms to achieve it. The three algorithms handle different lifespan definitions, respectively. In the first algorithm, the lifespan of an item is evaluated from the time of the first transaction with the item to the end time of the whole database. In the second algorithm, an additional publication table, which includes the publication date of each item in stores, is given, and thus the lifespan of an item is measured by its entire publication period. Finally in the third algorithm, the lifespan of an item is calculated from the end time of the whole database to its earliest time in the database for the item to be a fuzzy temporal frequent item within the duration. In addition, an effective itemset table structure is designed to store and get information about itemsets and can thus speed up the execution efficiency of the mining process. At last, experimental results on two simulation datasets compare the mined fuzzy temporal quantitative itemsets and rules with and without consideration of lifespans of items under different parameter settings.
目次 Table of Contents
Chapter 1 Introduction 1
1.1 Background and Motivation 1
Chapter 2 Review of Related Works 3
2.1 Binary Data Mining 3
2.2 The Apriori Algorithm 4
2.3 Fuzzy Data Mining 5
2.4 Fuzzy Temporal Data Mining 6
2.5 Publication Information Table 7
2.6 Membership Functions 9
2.7 Problem Statement and Definitions 12
Chapter 3 Fuzzy Temporal Association Rule Mining Approach 13
3.1 An Example 17
Chapter 4 General Fuzzy Temporal Association Rule Mining 24
4.1 An Example 30
Chapter 5 General Fuzzy Temporal Association Rules with Effect Lifespans 46
5.1 An example 52
Chapter 6 Experiment Results and Discussion 71
Chapter 7 Conclusion and Future Works 77
References 79
參考文獻 References
[1] R. Agrawal and R. Srikant, “Fast algorithm for mining association rules,” The International Conference on Very Large Data Bases, pp. 487-499, 1994.
[2] R. Agrawal, R. Srikant, and Q. Vu, “Mining association rules with item constraints,” The Third International Conference on Knowledge Discovery in Databases and Data Mining, pp. 67-73, Newport Beach, California, 1997.
[3] R. Agrawal, T. Imielinksi and A. Swami, “Database mining: a performance perspective,” IEEE Transactions on Knowledge and Data Engineering, Vol. 5, No. 6, pp. 914-925, 1993.
[4] R. Agrawal, T. Imielinksi and A. Swami, “Mining association rules between sets of items in large database,” The 1993 ACM SIGMOD Conference, pp. 207-216, Washington DC, USA, 1993.
[5] C. C. Chan and W. H. Au, “Mining fuzzy association rules,” The Conference on Information and Knowledge Management, Las Vegas, pp. 209-215, 1997.
[6] D. Chandra, P. Sao and S. Gajbhiye, “Calendar-based temporal fuzzy weighted association rule mining,” The International Conference on Information Science and Applications, pp. 86-89, 2010.
[7] C. Y. Chang, M. S. Chen, and C. H. Lee, "Mining general temporal association rules for items with different exhibition periods," The IEEE International Conference on Data Mining, pp. 59-66, 2002.
[8] J. Chen, A. Mikulcic, and D. H. Kraft, “An integrated approach to information retrieval with fuzzy clustering and fuzzy inferencing,” in O. Pons, M. A. Vila and J. Kacprzyk (eds.), Knowledge Management in Fuzzy Databases, Heidelberg, Germany: Physica-Verlag, 2000.
[9] L. Chen, S. S. Bhowmick, and J. Li, “Mining temporal indirect associations,” The International Conference on Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 425-434, 2006.
[10] P. A. Heng, T. T. Wong, Y. Rong, Y. P. Chui, Y. M. Xie, K. S. Leung, and P. C. Leung, “Intelligent inferencing and haptic simulation for Chinese acupuncture learning and training,” IEEE Transactions on Information Technology in Biomedicine, Vol. 10, No. 1, pp. 28-41, 2006.
[11] T. P. Hong, C. S. Kuo, and S. C. Chi, “Trade-off between time complexity and number of rules for fuzzy mining from quantitative data,” International Journal of Uncertainty, Fuzziness and Knowledge-based Systems, Vol. 9, No. 5, pp. 587-604, 2001.
[12] IBM Quest Data Mining Project, “Quest synthetic data generation code,” http://www.almaden.ibm.com/cs/quest/syndata.html, 1996.
[13] H. Ishibuchi and T. Yamamoto, “Rule weight specification in fuzzy rule-based classification systems,” IEEE Transactions on Fuzzy Systems, Vol. 13, No. 4, pp. 428-435, 2005.
[14] C. Kuok, A. Fu, and M. Wong, “Mining fuzzy association rules in databases,” SIGMOD Record, Vol. 27, No. 1, pp. 41-46, 1998.
[15] C. H. Lee, C. R. Lin, and M. S. Chen, “On mining general temporal association rules in a publication database,” The IEEE International Conference on Data Mining, pp337-344, 2001.
[16] W. J. Lee, “Discovery of fuzzy temporal and periodic association rules,” National Sun Yat-sen University: Department of Electrical Engineering. 2008.
[17] Y. C. Lee, T. P. Hong, and T. C. Wang, "Multi-level fuzzy mining with multiple minimum supports," Expert Systems with Applications, Vol. 34 , Issue 1, pp. 459-468, 2008.
[18] Y. C. Lee, T. P. Hong, and W. Y. Lin, “Mining fuzzy association rules with multiple minimum supports using maximum constraints,” Lecture Notes in Computer Science, Vol. 3214, pp. 1283-1290, 2004.
[19] C. J. Li and T. Q. Yang, “Effective mining of fuzzy quantitative weighted association rules,” The International Conference on E-Business and E-Government, pp. 1418-1421, 2010.
[20] Y. Liu, W. Liao, and A. Choudhary, “A fast high utility itemsets mining algorithm,” The Utility-Based Data Mining Workshop, pp. 90-99, 2005.
[21] W. Siler and J. James, Fuzzy Expert Systems and Fuzzy Reasoning, John Wiley & Sons, 2004.
[22] R. Srikant and R. Agrawal, “Mining quantitative association rules in large relational tables,” The 1996 ACM SIGMOD International Conference on Management of Data, pp. 1-12, 1996.
[23] S. Yue, E. Tsang, D. Yeung, and D. Shi, “Mining fuzzy association rules with weighted items,” The IEEE International Conference on Systems, Man and Cybernetics, pp. 1906-1911, 2000.
[24] L. A. Zadeh, “Fuzzy sets,” Information and Control, Vol. 8, No. 3, pp. 338-353, 1965.
[25] H. Zhang and D. Liu, Fuzzy Modeling and Fuzzy Control, Springer Verlag, 2006.
[26] C. Zhuo, L. Jiahui, and L. Chen, "A fuzzy calendar-based algorithm for mining temporal association rules and its application", The International Conference on Fuzzy Systems and Knowledge Discovery, pp. 28-33, 2009.
電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。
論文使用權限 Thesis access permission:自定論文開放時間 user define
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus: 已公開 available


紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。
開放時間 available 已公開 available

QR Code