國立中山大學 National Sun Yat-sen University, 學位論文 thesis/dissertation: 數位流行音樂之樂曲結構分析與情緒識別 Structure Analysis and Emotion Recognition of Digital Popular Music

論文名稱 Title	數位流行音樂之樂曲結構分析與情緒識別 Structure Analysis and Emotion Recognition of Digital Popular Music
系所名稱 Department	電機工程學系 Department of Electrical Engineering
畢業學年期 Year, semester	101 學年度第 1 學期 The fall semester of Academic Year 101	語文別 Language	英文 English
學位類別 Degree	碩士 Master	頁數 Number of pages	58
研究生 Author	劉晟志 Cheng-Chih Liu
指導教授 Advisor	葉家宏 Chia-Hung Yeh
召集委員 Convenor	蘇柏齊 Po-Chyi Su
口試委員 Advisory Committee	黃婉甄, 簡鳳村, 陳佳妍 Wan-Jen Huang; Feng-Tsun Chien; Chia-Yen Chen
口試日期 Date of Exam	2013-01-23	繳交日期 Date of Submission	2013-02-05
關鍵字 Keywords	相似度矩陣、樂曲結構、自適性、樂曲情緒辨識 ada-boost, music structure, music emotion recognition, similarity matrix

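中文摘要 Chinese Abstract
This thesis proposes algorithms for structure analysis and emotion recognition of popular music. After several processing stages, the proposed structure analysis algorithm accurately locates the chorus sections of a song, and the proposed emotion recognition algorithm then identifies the emotion conveyed by the music. The main contribution of this thesis is that it thoroughly studies and exploits the rules governing the emotion and structure of popular music and, using the effective classifier structure it proposes, accurately identifies the representative emotion of a song. The proposed structure analysis reaches an accuracy of nearly 75%, and the emotion recognition accuracy reaches 80%. The proposed algorithms also maintain stable accuracy on test databases in different languages. The experimental results show that the proposed algorithms are robust and reliable.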
Abstract
In this thesis, the proposed schemes are designed to analyze the structure and recognize the emotion of popular music. A series of procedures is arranged to locate the chorus sections. The proposed emotion recognition algorithm then identifies the emotion conveyed in the music clips. The major contribution of this thesis is that we investigate and summarize the structural composition rules of popular music, and show how to recover the conveyed emotion with an effective classifier structure. The accuracy of the second phase (emotion recognition) is directly influenced by the performance of the first phase (structure analysis). The accuracy of the structure analysis approaches 75% in the best case, and the overall accuracy of emotion recognition is around 80%. The accuracy remains stable across databases in different languages. Experimental results show that the proposed algorithm is robust.
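
The keywords list a similarity matrix as one ingredient of the structure analysis. The sketch below is only an illustration of that general idea, not the thesis's algorithm: it builds a self-similarity matrix from per-frame feature vectors and picks the time lag at which the music most strongly repeats. The function names, the cosine measure, the lag-scoring rule, and the toy two-dimensional features are all assumptions made for this example.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat silent (all-zero) frames as dissimilar
    return dot / (norm_a * norm_b)


def self_similarity_matrix(frames):
    """Return S where S[i][j] is the similarity of frame i and frame j."""
    n = len(frames)
    return [[cosine_similarity(frames[i], frames[j]) for j in range(n)]
            for i in range(n)]


def best_repetition_lag(matrix, min_lag=1):
    """Return (lag, score) for the off-diagonal with the highest mean.

    A high mean along the diagonal at offset `lag` suggests that the
    material repeats `lag` frames later (hypothetical scoring rule).
    """
    n = len(matrix)
    best_lag, best_score = None, float("-inf")
    for lag in range(min_lag, n):
        diagonal = [matrix[i][i + lag] for i in range(n - lag)]
        score = sum(diagonal) / len(diagonal)
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag, best_score


# Toy input: made-up features for four frames in an A B A B pattern.
frames = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0]]
S = self_similarity_matrix(frames)
lag, score = best_repetition_lag(S)
print(lag, score)
```

On this toy input the strongest repetition is found two frames apart. With real audio, each frame would carry features extracted from the signal, and averaging over short diagonals would need more care than this sketch takes.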

目次 Table of Contents
中文摘要 Chinese Abstract
Abstract
Contents
List of Figures
List of Tables
Chapter 1 Introduction
  1.1 Overview of Music
  1.2 Motivation
  1.3 Contribution
  1.4 Organization
Chapter 2 Background Review
  2.1 Audio Signal Processing
  2.2 Audio Features
  2.3 Emotion Models
  2.4 Ada-boost
  2.5 Dynamic Time Warping
Chapter 3 Chorus Detection
  3.1 Overview
  3.2 The Proposed Structure Analysis Algorithm
Chapter 4 Emotion Detection
  4.1 Overview
  4.2 Classifier Structure
Chapter 5 Experimental Results
  5.1 Structure Analysis
  5.2 Emotion Recognition
  5.3 Summaries
Chapter 6 Conclusions and Future Work
Reference


電子全文 Fulltext
This electronic full text is licensed to users solely for personal, non-profit searching, reading, and printing for academic research. Please comply with the Copyright Act of the Republic of China and do not reproduce, distribute, adapt, repost, or broadcast it without authorization. 論文使用權限 Thesis access permission: 自定論文開放時間 user-defined. 開放時間 Available: 校內 Campus: 永不公開 not available; 校外 Off-campus: 永不公開 not available.
紙本論文 Printed copies
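Public information on printed copies is relatively complete from Academic Year 102 onward. For printed theses from Academic Year 101 or earlier, please contact the printed thesis service desk of the Office of Library and Information Services. 開放時間 Available: 已公開 available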
