國立中山大學,National Sun Yat-sen University,學位論文,thesis/dissertation,利用模糊類神經網路擷取圖片中之人臉及身體 ,A Neuro-Fuzzy Approach to Detection of Human Face and Body for MPEG Video Compression

論文名稱 Title	利用模糊類神經網路擷取圖片中之人臉及身體 A Neuro-Fuzzy Approach to Detection of Human Face and Body for MPEG Video Compression
系所名稱 Department	電機工程學系 Department of Electrical Engineering
畢業學年期 Year, semester	89 學年度第 2 學期 The spring semester of Academic Year 89	語文別 Language	中文 Chinese
學位類別 Degree	碩士 Master	頁數 Number of pages	55
研究生 Author	杜世懷 Shih-Huai Du
指導教授 Advisor	李錫智 Shie-Jue Lee
召集委員 Convenor	黃蓮池 Lain-Chyr Hwang
口試委員 Advisory Committee	吳志宏 Chih-Hong Wu
口試日期 Date of Exam	2001-06-27	繳交日期 Date of Submission	2001-07-24
關鍵字 Keywords	none Video Objects, Neuro-Fuzzy modeling, MPEG-4, MPEG-7
統計 Statistics	本論文已被瀏覽 5836 次，被下載 4678 次 The thesis/dissertation has been browsed 5836 times, has been downloaded 4678 times.

中文摘要
在最新的影像壓縮技術MPEG-4及MPEG-7中，從影像中辨識出主要的VOs是相當重要的一環。在本篇論文中，我們成功的將Neuro-Fuzzy modeling技術應用於影像臉部及身體的擷取。我們首先利用Fuzzy Clustering的技術將影像中的pixels分類成數個clusters，並擷取出Fuzzy rules。接著利用人類臉部的膚色及身體的移動量等特性，判斷出clusters屬於前景或背景。最後，利用Neural network修正Fuzzy rules，將前景及背景間的邊緣地帶做細部的判斷，完成整張影像人體的擷取。由於我們的方法能更精確的取出Neural network所需要的training data，因此能大幅提高臉部及身體擷取的正確性。此外，在VOs移動量不大的情況下，我們的方法仍然可以成功的擷取出所要的VOs。
Abstract
For some new multimedia applications using Mpeg-4 or Mpeg-7 video coding standards, it is important to find the main objects in a video frame. In this thesis, we propose a neuro-fuzzy modeling approach to the detection of human face and body. Firstly, a fuzzy clustering technique is performed to segment a video frame into clusters to generating several fuzzy rules. Secondly, chrominance and motion features are used to roughly classify the clusters into foreground and background, respectively. Finally, the fuzzy rules are refined by a fuzzy neural network, and the ambiguous regions between foreground and background are further distinguished by the fuzzy neural network. Our method improves the correctness of human face and body detection by getting training data more precisely. Besides, we can extract the VOs correctly even the VOs have no obvious motion in the video sequence.

目次 Table of Contents
目錄摘要……………………………………………………...i Abstract………………………………………………....ii 第一章簡介…………...…………………...………..…1 1.1 Video Objects....................................…………………...…….…........2 1.2 Object-based Compression.............................….………………….....4 第二章 Objects Segmentation...…...........…....…...........6 2.1 Introduction To Objects Segmentation.....……………..…...............6 2.2 Segmentation Of Human Face......................……………..…...........7 2.3 Segmentation Of Human Body.........................………..……….......8 2.4 Introduction To Nikolaos Doulamis' Method....………..……..........9 2.5 Introduction To Kompatsiaris’Method………………………........14 2.6 Drawbacks Of these Two Methods...........………………...............17 第三章 Fuzzy Neural Network之簡介..……………...19 3.1 Architecture of the neuro-fuzzy network..............…………….….19 3.2 Hybrid Learning Algorithm......................…………………...........22 第四章 Neuro-Fuzzy Model 應用於 Human Face and Body Detection.…………………………………...…...29 4.1 Self-Contructing Clustering……..............……………….....…......29 4.2 重新編號及合併過小的區塊...........…………...…..........….........32 4.3 Human Face Extraction.............................……………...…….......34 4.4 Human body Extraction......................…………………….............39 4.5 Refining Fuzzy Rules by Neural Fuzzy Network……...........….....43 第五章模擬結果…………………………………..…45 5.1 實際影像模擬結果..……………………........................….............46 5.2 數據比較分析.........................................…………………………..51 第六章總結...........................................………….......52

參考文獻 References
參考文獻 [1] Doulamis, N. D. and Doulamis, A. D. and Kollias, S. D., “Object Based Coding of Video Sequences at Low Bit Rates Using Adaptively Trained Neural Networks,” in Proceeding of the 6th IEEE International Conference on Electronics, Circuits and Systems, pp. 969 –972, 1999. [2] Kim, S. H. and Kim, N. K. and Ahn, S. C. and Kim, H. G., “Object Oriented Face Detection Using Range and Color Information,” in Proceedings of the 3th IEEE International Conference on Automatic Face and Gesture Recognition, pp. 76-81, 1998. [3] Kim, S. H. and Kim, H.G., “Face Detection Using Multi-modal Information,” in Proceeding of the 4th IEEE International Conference on Automatic Face and Gesture Recognition, pp. 14-19, 2000. [4] Kuo, C. M. and Hsieh, C. H. and Huang, Y. R., “A New Temporal-Spatial Image Sequence Segmentation for Object-Orented Video Coding,” in Proceeding of IEEE Asia Pacific Conference on Multimedia Technology and Applications, pp. 117-127, 2000. [5] Wang, H. and Chang, S.-F., “A Highly Efficient System for Automatic Face Region Detection in MPEG Video,” in IEEE Transactions on Circuits and Systems for Video Technology, Vol 7, NO. 4, AUGUST 1997. [6] Doulamis, N. and Doulamis, A. and Kollias, S., “Improving the Performance of MPEG Compatible Encoding at Low Bit Rates Using Adaptive Neural Networks,” in Real-Time Imaging, Vol 6, NO. 5, 2000. [7] Kompatsiaris, I. and Strintzis, M. G., “Spatiotemporal Segmentation and Tracking of Objects for Visualization of Videoconference Image Sequence,” in IEEE Transactions on Circuits and Systems for Video Technology, Vol. 10, NO. 8, DECEMBER 2000. [8] Ouyang, C.-S. and Lee, S.-J., “A New Self-Constructing Approach for Neuro-Fuzzy Modeling,” in Proceedings of the 8th International Fuzzy Systems Association World Congress, pp. 787-791, Taipei, Taiwan, 1999. [9] Ouyang, C.-S. and Lee, S.-J., “An Improved Learning Algorithm for Rule Refinement in Neuro-Fuzzy Modeling,” in Proceedings of the 3rd International Conference on Knowledge-Based Intelligent Information Engineering Systems, pp. 238-241, Adelaide City, Australia, 1999. [10] Chai, D. and Ngan, K. N., “Face Segmentation Using Skin-Color Map in Videophone Applications,” in IEEE Transactions on Circuits and Systems for Video Technology, Vol. 9, NO.4, JUNE 1999.

電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的，進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定，切勿任意重製、散佈、改作、轉貼、播送，以免觸法。論文使用權限 Thesis access permission：校內校外完全公開 unrestricted 開放時間 Available：校內 Campus：已公開 available 校外 Off-campus：已公開 available etd-0724101-130917.pdf
紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊，請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。開放時間 available 已公開 available

QR Code

國立中山大學圖書與資訊處 │ 諮詢服務：2452 論文審查小組 │ 服務信箱 │ 系統開發維運：圖資處知識創新組

Office of Library and Information Services, National Sun Yat-sen University │ Contact Us : 2452 Thesis Format Review Team , Mail │ Development and operations : Knowledge Innovation Division, LIS