論文使用權限 Thesis access permission:校內立即公開,校外一年後公開 off campus withheld
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus: 已公開 available
論文名稱 Title |
不特定語者語詞辨識系統特徵設計 A Feature Design System for Speaker Independent Phrase Recognition |
||
系所名稱 Department |
|||
畢業學年期 Year, semester |
語文別 Language |
||
學位類別 Degree |
頁數 Number of pages |
51 |
|
研究生 Author |
|||
指導教授 Advisor |
|||
召集委員 Convenor |
|||
口試委員 Advisory Committee |
|||
口試日期 Date of Exam |
2001-06-05 |
繳交日期 Date of Submission |
2001-06-15 |
關鍵字 Keywords |
差異子空間、信號空間、梅爾倒頻譜、倒頻譜、語詞辨識、能量-Entropy特徵 Signal Space, Difference Subspace, Cepstrum, Energy-Entropy Feature, Phrase Recognition, Mel-Cepstrum |
||
統計 Statistics |
本論文已被瀏覽 5696 次,被下載 3896 次 The thesis/dissertation has been browsed 5696 times, has been downloaded 3896 times. |
中文摘要 |
本論文採用一種新的語詞辨識方法,將同一類的語詞轉換為差異子空間的方式,消除語者本身或語者之間說話時的差異性,並且應用於多語者的使用環境之中。此外,本論文亦提出一種新的端點偵測法來判斷信號中語音的成分。最後並以Microsoft Windows為作業平台,完成理論的驗證。 |
Abstract |
A novel phrase recognition method is proposed. It eliminates the speech difference between intraspeaker or interspeaker by transform phrases to difference subspace. A new endpoint detection method is also proposed, it can detection the human speech signal more effectively. All methods are test and verify at Microsoft Windows environment. |
目次 Table of Contents |
目 錄 頁 次 論文提要..…….………………………………………………………..Ⅱ 致謝…………………………………………………………………….Ⅲ 目錄…………………………………………………………………….Ⅳ 圖表目錄……………………………………………………………….Ⅴ 第一章 緒論………………………………………………..…………1 1-1 研究動機………………………………………………………1 1-2 語音辨識系統介紹……………………………………………1 1-3 論文主題………………………………………………………6 1-4 論文架構………………………………………………………7 第二章 語音訊號處理…………………………………………………8 2-1 語音處理介紹…………………………………………………8 2-1-1 端點偵測(Endpoint Detection)……………………...9 2-1-2 預強(Preemphasize)…………………………………11 2-1-3 加窗函數…………………………………………….12 2-2 端點偵測之研究………………………………………………15 2-3 倒頻譜係數(Cepstrum Coefficient)…………………………...21 2-4 梅爾-倒頻譜(Mel-Cepstrum Coefficient) …………………….23 第三章 語詞辨識之研究………………………………………………27 3-1 信號空間………………………………………………………27 3-2 語詞信號空間…………………………………………………29 3-3 共通向量與差異子空間的正交性質…………………………33 3-4 LBG分類器 ………………………………………………….34 第四章 實驗結果及系統設計…………………………………………37 4-1 英文語音資料測試結果………………………………………41 4-2 中文語音資料測試結果………………………………………46 4-3 視窗程式架構…………………………………………………49 第五章 結論與建議……………………………………………………50 |
參考文獻 References |
[1] John R. Deller,Jr. , John G. Proakis, and John H. L. Hansen—“Discrete- Time Processing of Speech Signals”, New Jersey, Prentice Hall, 1987. [2] 林燾,王理嘉—“語音學教程”,五南圖書出版公司,1995. [3] Rivarol Vergin, Douglas O’Shaughnessy, and Azarshid Farhat— “Generalized Mel Frequency Cepstral Coefficients for Large-Vocabulary Specker-Independent Continuous-Speech Recognition”, IEEE Transactions on Speech and Audio Processing, Vol. 7, NO.5, September 1999. [4] Zimer Tranter—“Principles of Communications”, Houghton Mifflin Company, 1995, 4th Edition. [5] Steven J.Leon—“Linear Algebra with Application”, Macmillan Publishing Company, New York, 3rd, 1990. [6] M.B.Gulmezoglu , V.Dzhafarov , M.Keskin and A.Barkana ,”A Novel Approach to Isolated Word Recognition,” IEEE Trans.on Speech and Audio Processing,Vol.7 , No.6 , pp.620-628,1999. [7] 龍生雲, “不特定語句之中文語者辨識系統研究”, 國立中山大學電機工程研究所博士論文, pp.20, 民國88年11月17日 [8] Lawrence Rabiner, Biing-Hwang Juang—“Fundamentals of Speech Recognition”, Prentice Hall, 1993. [9] Liang-sheng Huang and Chung-ho Yang, “A Novel Approach to Robust Speech Endpoint Detection in Car Environments”, Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on , Volume: 3 , pp.1751-1754. [10] Joseph W.Picone, “Signal Modeling Technique in Speech Recognition”, Proceedings of The IEEE, Vol. 81, NO.9, pp.1221-1223, September 1993. [11] Alan V. Oppenheim ,Ronald W. Schafer , with John R.Buck—1st ed .”Discrete-Time Signal Processing”, Chapter 12, Prentice Hall , 1989 . [12] Yariv Ephraim and Harry L.Van Trees, “A Signal Subspace Approach for Speech Enhancement”, IEEE Transactions on Speech and Audio Processing, Vol.3, NO. 4, pp251-266, July 1995 [13] 林合仁, “中文語詞辨識系統之視窗軟體設計研究”, 國立中山大學電機工程研究所碩士論文, 民國88年6月30日 [14] Shallom, I.D., Haimi-Cohen, R., Rannon, Z.M., “Dynamic Time Warping with Generalized Templates for Speaker Independent Speech Recognition”, Electrical and Electronics Engineers in Israel. The Sixteenth Conference of page1-page4. 1989. |
電子全文 Fulltext |
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。 論文使用權限 Thesis access permission:校內立即公開,校外一年後公開 off campus withheld 開放時間 Available: 校內 Campus: 已公開 available 校外 Off-campus: 已公開 available |
紙本論文 Printed copies |
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。 開放時間 available 已公開 available |
QR Code |