國立中山大學,National Sun Yat-sen University,學位論文,thesis/dissertation,深度學習應用：電影票房預測與風機大維護時間預測,Deep learning applications: Prediction of Movie box Office Performance and Great Maintenance Time for Wind Turbine

論文名稱 Title	深度學習應用：電影票房預測與風機大維護時間預測 Deep learning applications: Prediction of Movie box Office Performance and Great Maintenance Time for Wind Turbine
系所名稱 Department	通訊工程研究所 Institute of Communications Engineering
畢業學年期 Year, semester	106 學年度第 2 學期 The spring semester of Academic Year 106	語文別 Language	英文 English
學位類別 Degree	碩士 Master	頁數 Number of pages	68
研究生 Author	余承恩 Cheng-en Yu
指導教授 Advisor	葉家宏 Chia-Hung Yeh
召集委員 Convenor	陳俊良 Jiann-Liang Chen
口試委員 Advisory Committee	彭勝龍 Sheng-Lung Peng
口試日期 Date of Exam	2018-06-13	繳交日期 Date of Submission	2018-07-07
關鍵字 Keywords	支持向量機、狀態監控、卷積神經網路、風機、資料探勘、輿情分析 Wind turbine, Conditional monitoring, Support Vector Machine, Sentiment analysis, Data mining, Convolution Neural Network
統計 Statistics	本論文已被瀏覽 5686 次，被下載 0 次 The thesis/dissertation has been browsed 5686 times, has been downloaded 0 times.

中文摘要
近年來深度學習技術獲得巨大關注與進步，並成功地應用於市場產品；以影像與聲音辨識為大宗，廣泛應用於不同領域。其中卷積神經網絡是最熱門的深度學習神經網路架構，最初在影像處理上取得了驚人的成果，之後其應用更擴及於各大領域。本論文提出使用卷積神經網絡(Convolutional Neural Network)來建立預測模型，並進行了兩項研究。第一部分基於與電影相關的網路關鍵資訊進行國內電影票房分析與預測，第二部分為風機大維護預測。台電於彰化濱海工業區有兩個風場，本論文使用兩個風場過去所收集的資料來進行風機大維護預測研究。實驗結果顯示出深度學習在預測上也能有不錯的效果。
Abstract
In recent years, deep learning has received much public attention and made great progress technically; it has been successfully applied to the products in the marketplace, and it has been widely used in different fields for its capabilities in image and voice identification. Convolutional neural network (CNN) is popular deep-learning neural network architecture, and it has initially achieved impressive performance in image processing and can even be seen in various fields later. This paper proposes two studies based on Convolutional Neural Network to establish a predictive model. The first study analyzes and predicts the box office of Taiwan based on the key information related to the movie. The second study predicts the great maintenance of the wind turbines. Taipower has two wind farms located in Changhua Coastal Industrial Park. This paper uses data collected from two wind farms to do forecast research. The experimental results show that deep learning can also have good performance in forecasting.

目次 Table of Contents
審定書 i 誌謝 ii 中文摘要 iii Abstract iv Contents v List of Figures vii List of Tables viii Chapter 1 Introduction 1 1.1 Introduction 1 1.2 Motivation 4 1.3 Contributions 5 1.4 Organization 6 Chapter 2 Background Review 7 2.1 Sentiment Analysis 7 2.2 Social Media 9 2.3 Movie box office 13 2.4 Condition monitoring of wind turbines 14 2.5 Related algorithm 15 2.5.1 Support Vector Machine 15 2.5.2 Convolutional Neural Network 16 2.5.3 Simple Linear Regression 17 2.5.4 Apriori Algorithm 18 Chapter 3 Proposed Method 20 3.1 Movie Box Office Forecasting 20 3.1.1 Data Collection 21 3.1.2 Data Preprocessing and Prediction Model Construction 22 3.2 Great Maintenance Time Forecasting 32 3.2.1 Data Preprocessing 33 3.2.2 Prediction Model Construction 41 Chapter 4 Experimental Results 43 4.1 Results of Movie Box Office Forecasting 43 4.2 Results of Great Maintenance Time Forecasting 46 Chapter 5 Conclusions 48 Reference 51

參考文獻 References
[1] Twitter, http://twitter.com. [2] Facebook, http://facebook.com [3] M. D. Conover, B. Gonçalves, J. Ratkiewicz, A. Flammini and F. Menczer, “Predicting the political alignment of twitter users,” in Proceedings of the International Conference on Social Computing, pp. 192-199, Boston, USA, Oct. 2011. [4] Y. Liu, J. Huang, A. An and X. Yu, “ARSA: A sentiment-aware model for predicting sales performance using blogs,” in Proceedings of the ACM Special Interest Group on Information Retrieval, pp. 607-614, Amsterdam, The Netherlands, Jul. 2007. [5] Epagogix, http://www.epagogix.com/ [6] A. R. Panaligan, and A. Chen, “Quantifying moviemagic with google search,” Google, 2013. [7] Internet movie database, http://imdb.com. [8] K. R. Apala, M. Jose, S. Motnam, C. C. Chan, K. J. Liszka and F. de Gregorio, “Prediction of movies box office performance using social media,” Advances in Social Networks Analysis and Mining (ASONAM), pp. 1209-1214, Niagara Falls, Canada, Aug. 2013. [9] C. Cortes and V. Vapnik, “Support-vector networks,” Machine learning, vol. 20, no. 3, pp. 273-297, 1995. [10] J. Han, M. Kamber and J. Pei, Data Mining: Concepts and Techniques, Third Edition, Morgan Kaufmann, 2006. [11] B. Pang, L. Lee, and S. Vaithyanathan, “Thumbs up?: sentiment classification using machine learning techniques,” in Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10, pp. 79-86, 2002. [12] J. Erman, M. Arlitt, and A. Mahanti, “Traffic classification using clustering algorithms,” in Proceedings of the 2006 SIGCOMM workshop on Mining network data, pp. 281-286, 2006. [13] A. Kyriakopoulou and T. Kalamboukis, “Text classification using clustering,” in Proceedings of The 17th European Conference on Machine Learning and the 10th European Conference on Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD), pp. 28-38, 2006. [14] N. Slonim, N. Friedman, and N. Tishby, “Unsupervised document classification using sequential information maximization,” in Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 129-136, 2002. [15] A. Tumasjan, T. O. Sprenger, P. G. Sandner, and I. M. Welpe, “Predicting elections with twitter: What 140 characters reveal about political sentiment,” ICWSM, vol. 10, no. 1, pp. 178-185, 2010. [16] T. Sakaki, M. Okazaki, and Y. Matsuo, “Earthquake shakes Twitter users: real-time event detection by social sensors,” in Proceedings of the 19th international conference on World wide web, pp. 851-860. 2010. [17] M. Skoric, N. Poor, P. Achananuparp, E. P. Lim, and J. Jiang, “Tweets and votes: A study of the 2011 singapore general election.” HICSS, pp. 2583-2591, 2012. [18] C. Williams, and G. Gulati, “What is a social network worth? Facebook and vote share in the 2008 presidential primaries,” American Political Science Association, 2008. [19] B. O'Connor, R. Balasubramanyan, B. R Routledge, and N. A. Smith, “From tweets to polls: Linking text sentiment to public opinion time series,” ICWSM, vol. 11, pp. 122-129, 2010. [20] J. Bollen, H. Mao, and X. Zeng, “Twitter mood predicts the stock market” Journal of computational science, vol. 2, no. 1, pp. 1-8, 2011. [21] J. Ritterman, M. Osborne, and E. Klein, “Using prediction markets and Twitter to predict a swine flu pandemic,” In 1st international workshop on mining social media,vol. 9, pp. 9-17, 2009. [22] N. UzZaman, R. Blanco, and M. Matthews, “TwitterPaul: Extracting and aggregating Twitter predictions,” arXiv preprint, 2012. [23] E. Bothos, D. Apostolou, and G. Mentzas, “Using social media to predict future events with agent-based markets,” IEEE Intelligent Systems, vol. 25, no. 6, pp. 50-58, 2010. [24] B. Pang, L. Lee, and S. Vaithyanathan, “Thumbs up?: sentiment classification using machine learning techniques,” In Proceedings of the ACL-02 conference on Empirical methods in natural language, vol. 10, pp. 79-86, 2002. [25] A. Tumasjan, T. O. Sprenger, P. G. Sandner, and I. M. Welpe, “Predicting elections with twitter: What 140 characters reveal about political sentiment,” ICWSM, vol. 10, no. 1, pp. 178-185, 2010. [26] P. T. Metaxas, E. Mustafaraj, and D. Gayo-Avello, “How (not) to predict elections.“ In Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), 2011 IEEE Third International Conference, pp. 165-171, 2011. [27] “An In-Depth Look Inside the Twitter World,” http://www.sysomos.com/insidetwitter. [28] D. Gayo Avello, P. T. Metaxas, and E. Mustafaraj, “Limits of electoral predictions using twitter,” in Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media, 2011. [29] B. J. Jansen, M. Zhang, K. Sobel, and A. Chowdury, “Twitter power: Tweets as electronic word of mouth,” Journal of the Association for Information Science and Technology, vol. 60, no. 11, pp. 2169-2188, 2009. [30] S. Asur, B. A. Huberman, “Predicting the future with social media,” in Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, vol. 1, pp. 492-499, 2010. [31] W. Zhang, and S. Skiena, “Improving movie gross prediction through news analysis,” In Web Intelligence and Intelligent Agent Technologies, vol. 1, pp. 301-304, 2009. [32] M. Joshi, D. Das, K. Gimpel, and N. A. Smith, “Movie reviews and revenues: An experiment in text regression,” In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 293-296, 2010. [33] R. Sharda, and D. Delen, “Predicting box-office success of motion pictures with neural networks,” Expert Systems with Applications, vol. 30 no. 2, pp. 243-254, 2006. [34] B. Pang, and L. Lee, “Opinion mining and sentiment analysis,” Foundations and Trends in Information Retrieval, vol. 2, pp. 1-135, 2008. [35] G. Mishne, and N. S. Glance, “Predicting Movie Sales from Blogger Sentiment,” In AAAI spring symposium: computational approaches to analyzing weblogs, pp. 155-158, 2006. [36] P. Caselitz and J. Giebhardt, “Rotor condition monitoring for improved operational safety of offshore wind energy converters,” Journal of Solar Energy Engineering, vol. 127, no. 2, pp. 253-261, 2005. [37] F. P. G. Márquez, A. M. Tobias, J. M. P. Pérez and M. Papaelias, “Condition monitoring of wind turbines: techniques and methods,” Renewable Energy, vol. 46, pp. 169-178, 2012. [38] H. Muller, M. Poller, A. Basteck, M. Tilscher and J. Pfister, “Grid compatibility of variable speed wind turbines with directly coupled synchronous generator and hydro-dynamically controlled gearbox,” in Proceedings of Sixth International Workshop on Large-Scale Integration of Wind Power and Transmission Networks for Offshore Wind Farms, pp. 307-315, 2006. [39] K. Schroeder, W. Ecke, J. Apitz, E. Lembke and G. Lenschow, “A fibre Bragg grating sensor system monitors operational load in a wind turbine rotor blade,” Measurement Science and Technology, vol. 17, no. 5, 2006. [40] S. Soua, P. Van Lieshout, A. Perera, T. H. Gan, and B. Bridge, “Determination of the combined vibrational and acoustic emission signature of a wind turbine gearbox and generator shaft in service as a pre-requisite for effective condition monitoring,” Renewable Energy, vol. 51, pp. 175-181, 2013. [41] E. Jasiūnienė, R. Raišutis, R. Šliteris, A. Voleišis and M. Jakas, “Ultrasonic NDT of wind turbine blades using contact pulse-echo immersion testing with moving water container,” Ultragarsas, vol. 63, no. 3, pp. 28-32, 2008. [42] A. Kusiak, and A. Verma, “A data-mining approach to monitoring wind turbines,” IEEE Transactions on Sustainable Energy, vol. 3, no. 1, pp. 150-157, 2012. [43] A. Kusiak, and A. Verma, “A data-driven approach for monitoring blade pitch faults in wind turbines,” IEEE Transactions on Sustainable Energy, vol. 2, no. 1, pp. 87-96, 2011. [44] K. Kim, G. Parthasarathy, O. Uluyol, W. Foslien and S. Sheng and P. Fleming, “Use of SCADA data for failure detection in wind turbines,” in Proceedings of ASME 5th International Conference on Energy Sustainability, pp. 2071-2079, 2011. [45] M. Schlechtingen, I. F. Santos and S. Achiche, “Using data-mining approaches for wind turbine power curve monitoring: a comparative study,” IEEE Transactions on Sustainable Energy, vol. 4, no. 3, pp. 671-679, 2013. [46] R. Agrawal and R. Srikant, “Fast algorithms for mining association rules,” in Proceedings of 20th International Conference on Very Large Data Bases, pp. 487-499, 1994. [47] PTT, https://www.ptt.cc [48] 開眼電影網，http://atmovies.com.tw. [49] 觸電網，http://truemovies.com [50] J. Han and M. Kamber, “Data Mining:Concepts and Techniques,” 3rd ed. San Francisco: Morgan Kaufmann Publishers, pp. 112-114, 2011. [51] 台灣最高電影票房，https://zh.wikipedia.org/wiki/台灣最高電影票房. [52] L. P. Jing, H. K. Huang, and H.-B. Shi, “Improved feature selection approach TFIDF in text mining.” in Proceedings of International Conference on Machine Learning and Cybernetics, vol. 2, pp. 944-946, 2002. [53] Y. Goldberg, and O. Levy, “word2vec explained: Deriving mikolov et al.'s negative-sampling word-embedding method,” arXiv preprint, 2014. [54] Y. Kim, “Convolutional neural networks for sentence classification,” arXiv preprint, 2014. [55] jieba, https://github.com/fxsjy/jieba. [56] L. W. Ku and H. H. Chen, “Mining Opinions from the Web: Beyond Relevance Retrieval,” Journal of the American Society for Information Science and Technology, vol. 58, no. 12, pp. 1838-1850, 2007. [57] Keras, https://github.com/keras-team/keras

電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的，進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定，切勿任意重製、散佈、改作、轉貼、播送，以免觸法。論文使用權限 Thesis access permission：自定論文開放時間 user define 開放時間 Available：校內 Campus：已公開 available 校外 Off-campus：已公開 available etd-0528118-144143.pdf
紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊，請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。開放時間 available 已公開 available

QR Code

國立中山大學圖書與資訊處 │ 諮詢服務：2452 論文審查小組 │ 服務信箱 │ 系統開發維運：圖資處知識創新組

Office of Library and Information Services, National Sun Yat-sen University │ Contact Us : 2452 Thesis Format Review Team , Mail │ Development and operations : Knowledge Innovation Division, LIS