國立中山大學,National Sun Yat-sen University,學位論文,thesis/dissertation,不特定語者中量語詞辨識系統之設計研究 ,A design of speaker-independent medium-size phrase recognition system

論文名稱 Title	不特定語者中量語詞辨識系統之設計研究 A design of speaker-independent medium-size phrase recognition system
系所名稱 Department	電機工程學系 Department of Electrical Engineering
畢業學年期 Year, semester	90 學年度第 2 學期 The spring semester of Academic Year 90	語文別 Language	中文 Chinese
學位類別 Degree	碩士 Master	頁數 Number of pages	65
研究生 Author	賴昭華 Zhao-Hua Lai
指導教授 Advisor	陳志堅 Chih-Chien Chen
召集委員 Convenor	汪啟茂 Chii-Maw Uang
口試委員 Advisory Committee	李聰 Tsong Lee
口試日期 Date of Exam	2002-07-24	繳交日期 Date of Submission	2002-09-12
關鍵字 Keywords	語詞辨識、搜尋子空間、倒頻譜、隱藏式馬可夫模型、不特定語者 speaker-independent, cepstrum, hidden Markov model, phrase recognition, search subspace
統計 Statistics	本論文已被瀏覽 5672 次，被下載 43 次 The thesis/dissertation has been browsed 5672 times, has been downloaded 43 times.

中文摘要
對於語詞辨識系統而言,不特定語者(Speaker-Independent)語詞辨識系統實現,具有相當的難度,因此何如建立精確、快速且具強健性的不特定語者辨識系統,一直是一個很大的挑戰。本研究以隱藏式馬可夫模型(Hidden Markov model ,HMM)為基礎, 建立不特定語者(Speaker-Independent)語詞辨識系統,隱藏式馬可夫模型已經被應用在許多領域, 語音辨識就是其中重要的應用, 隱藏式馬可夫模型以狀態(State)描述語音產生的方式,為一可以代表語音時變特性之統計語音模型。本研究所採用的語音資料庫分別為OGI(Oregon Graduate Institute of Science and Technology)英文語料及麥克風中文語料, 分別實現特定語者(speaker-dependent)及不特定語者(speaker-independent) 語詞辨識系統。
Abstract
There are a lot of difficulties that have to be overcome in the speaker-independent (S.I.) phrase recognition system . And the feasibility of accurate ,real-time and robust system pose of the greatest challenges in the system. In this thesis ,the speaker-independent phase recognition system is based on Hidden Markov Model (HMM). HMM has been proved to be of great value in many applications, notably in speech recognition. HMM is a stochastic approach which characterizes many of the variability in speech signal. It applys the state-of-the-art approach to Automatic Speech Recognition .

目次 Table of Contents
目錄頁次論文摘要………………………………………………………………Ⅰ 致謝……………………………………………………………………Ⅱ 目錄……………………………………………………………………Ⅲ 圖表目錄………………………………………………………………Ⅵ 第一章緒論...............................................1 1-1 研究動機與目的......................................1 1-2研究方法.............................................2 1-3 論文架構......................................... ..3 第二章語詞辨識系統與數位語音訊號處理....................5 2-1 語詞辨識系統介紹....................................5 2-2 辨識系統之語音前置處理..............................8 2-2-1 端點偵測(Endpoint Detection)...................10 2-2-2 乘上視窗函數(Window)...........................10 2-3 語音訊號之特徵萃取.................................11 2-3-1倒頻譜(Cepstrum)................................12 第三章隱藏式馬可夫模型為基礎之語詞辨識系統...........19 3-1 語音訊號之隱藏式馬可夫模型.........................19 3-2隱藏式馬可夫模型之建立..............................20 3-3隱藏式馬可夫模型之訓練..............................21 3-3-1期望值最大演算法 ...............................21 3-3-2參數重估(Parameters Reestimation)...............22 3-4隱藏式馬可夫模型之辨識程序..........................27 3-5 減少搜尋空間解碼演算法.............................30 3-5-1 狀態改變點偵測(Change-Point Point Detection....31 3-5-2 搜尋空間.......................................33 第四章系統設計實作結果與比較.........................37 4-1 系統設計...........................................37 4-2 系統實作...........................................39 4-3結果與比較..........................................40 4-3-1 語音特徵萃取與辨識率之關係.....................40 4-3-2 語詞訓練次數與辨識率之關係.....................43 4-3-3 模型狀態數與辨識率之關.........................46 第五章結論與建議................................... .52 5-1 結論...............................................52 5-2 建議...............................................53 參考文獻.................................................54 附錄(一) ...............................................56 附錄(二) ................................................59

參考文獻 References
參考文獻 [1] Erhan Cinlar, To Stochastic Processes, New Jersey : Prentice Hall,Inc.,1975. [2] B. H. Juang and L. R. Rabiner,” Mixture Autoregressive Hidden Markovmodels for speech signals.” IEEE Trans. Speech and Audio Processing,vol.33 ,pp 1404-1413, 1985. [3] J. R. Deller, J. G. Proakis, and J. H. L. Hansen, Discrete Time Processing of Speech Signals, New York: Macmillan Pub. Co., 1993. [4] A,M,Kondoz, Digital Speech coding, New York: John Wiley &Sons Inc., 1994. [5] R.W. Schafer and J.D. Markel, Eds., Speech Analysis, New York: IEEE Press, 1979. [6] J. Makhoul, “Linear prediction: A tutorial review,” Proc. IEEE , vol.63 ,pp. 561-580,Feb. 1989. [7] Lawrence Rabiner and Biing-Hwang Juang, Fundamentals of Speech Recognition, New Jersey: Prentice Hall,Inc.,1993. [8] L. R. Rabiner, “A tutorial on hidden Markov models and selected applications in speech recognition,” Proc. IEEE , vol. 77 , pp.257 - 286 , Feb. 1989. [9] O. Cappe, C.E. Mokbel, D. Jouvet and E. Moulines, “ An algorithm for maximum likelihood estimation of hidden Markov models with unknown state-tying,” IEEE Trans. Speech and Audio Processing,vol. 6 ,pp 61- 70, Jan.1998. [10] Qi Li, “Search-space reduction for fast , optimal HMM decoding in speaker verification,” in IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. 2 , pp. II1189 -II1192 , 2000. [11] Qi Li, “ A detection approach to search-space reduction for HMM state alignment in speaker verification,” IEEE Trans. Speech and Audio Processing,vol. 9 ,pp 569-578, July 2001. [12] M. Bilginer Gulmezoglu, V. Dzhafarov , M. Keskin and A. Barkana, “ A novel approach to isolated word recognition,” IEEE Trans. Speech and Audio Processing,vol. 7 ,pp 620-628, Nov. 1999. [13] Yoshua Bengio,“Markovian Models for Sequential Data,” Neural Computing Surveys 2,pp.129-162,1999. [14] 張照煌, “語音辨識技術應用之發展趨勢,”工研院電通所, 民國85年7月. [15] 龍生雲,“不特定語句之中文語者辨識系統研究,” 國立中山大學電機工程研究所博士論文, 民國88年11月17日. [16] 黃俊豪, “大量語者不特定語句環境下語者辨識系統之特徵設計,” 國立中山大學電機工程研究所碩士論文, 民國90年6月5日. [17] 黃銘崇, “不特定語者語詞辨識系統之特徵設計,”國立中山大學電機工程研究所碩士論文, 民國90年6月5日.

電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的，進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定，切勿任意重製、散佈、改作、轉貼、播送，以免觸法。論文使用權限 Thesis access permission：校內公開，校外永不公開 restricted 開放時間 Available：校內 Campus：已公開 available 校外 Off-campus：永不公開 not available 您的 IP(校外) 位址是 18.222.184.162 論文開放下載的時間是校外不公開 Your IP address is 18.222.184.162 This thesis will be available to you on Indicate off-campus access is not available.
紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊，請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。開放時間 available 已公開 available

QR Code

國立中山大學圖書與資訊處 │ 諮詢服務：2452 論文審查小組 │ 服務信箱 │ 系統開發維運：圖資處知識創新組

Office of Library and Information Services, National Sun Yat-sen University │ Contact Us : 2452 Thesis Format Review Team , Mail │ Development and operations : Knowledge Innovation Division, LIS