National Sun Yat-sen University, thesis/dissertation, A Feature Design System for Speaker Independent Phrase Recognition

Title	A Feature Design System for Speaker Independent Phrase Recognition (不特定語者語詞辨識系統特徵設計)
Department	Department of Electrical Engineering
Year, semester	Academic Year 89 (ROC calendar, 2000-2001), 2nd semester (spring)	Language	Chinese
Degree	Master	Number of pages	51
Author	Ming-Chong Huang (黃銘崇)
Advisor	Chih-Chien Chen (陳志堅)
Convenor	Chii-Maw Yang (汪啟茂)
Advisory Committee	Tsung Lee (李聰)
Date of Exam	2001-06-05	Date of Submission	2001-06-15
Keywords	Signal Space, Difference Subspace, Cepstrum, Energy-Entropy Feature, Phrase Recognition, Mel-Cepstrum
Statistics	The thesis/dissertation has been browsed 5696 times and downloaded 3896 times.

Abstract
This thesis proposes a novel phrase recognition method. Phrases of the same class are transformed into a difference subspace, which eliminates the variation within one speaker's utterances (intraspeaker) and between different speakers (interspeaker), so that the method can be applied in a multi-speaker environment. A new endpoint detection method is also proposed, which identifies the speech portions of a signal more effectively. All methods were implemented and verified on the Microsoft Windows platform.
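The record does not spell out the algorithm, so the following is only a minimal sketch of one way a difference subspace can remove within-class variation. It is built around the common vector that the thesis outline names alongside the difference subspace. It assumes each utterance has already been reduced to a fixed-length feature vector. The function names (`project_out`, `difference_basis`, `train_class`, `classify`), the labels, and the toy data are hypothetical and are not taken from the thesis.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def sub(u, v):
    return [a - b for a, b in zip(u, v)]

def project_out(x, basis):
    """Remove from x its component in the span of an orthonormal basis."""
    r = list(x)
    for z in basis:
        c = dot(r, z)
        r = [ri - c * zi for ri, zi in zip(r, z)]
    return r

def difference_basis(vectors, tol=1e-10):
    """Gram-Schmidt orthonormalization of the differences a_i - a_1 (i > 1)."""
    basis = []
    for a in vectors[1:]:
        r = project_out(sub(a, vectors[0]), basis)
        norm = math.sqrt(dot(r, r))
        if norm > tol:  # skip differences that add no new direction
            basis.append([ri / norm for ri in r])
    return basis

def train_class(vectors):
    """Return (difference-subspace basis, common vector) for one class."""
    basis = difference_basis(vectors)
    common = project_out(vectors[0], basis)  # same result for any training vector
    return basis, common

def classify(x, models):
    """Pick the class whose common vector is closest after removing
    that class's difference-subspace component from x."""
    best_label, best_dist = None, float("inf")
    for label, (basis, common) in models.items():
        d = sub(project_out(x, basis), common)
        dist = dot(d, d)
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label

# Toy data (assumed): each class varies only along the last two axes.
class_a = [[1.0, 0.0, 0.0, 0.0], [1.0, 0.0, 1.0, 0.0], [1.0, 0.0, 0.0, 2.0]]
class_b = [[0.0, 1.0, 0.0, 0.0], [0.0, 1.0, 0.5, 0.0], [0.0, 1.0, 0.0, -1.0]]
models = {"a": train_class(class_a), "b": train_class(class_b)}
print(classify([0.9, 0.1, 5.0, -3.0], models))
```

In this toy setup the test vector differs strongly from every training vector along the last two axes, yet those axes lie in the difference subspace of both classes and are discarded before comparison, so the decision rests only on the part that the training vectors of a class share.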

Table of Contents
Abstract
Acknowledgments
Table of Contents
List of Figures and Tables
Chapter 1 Introduction
1-1 Research Motivation
1-2 Introduction to Speech Recognition Systems
1-3 Thesis Topic
1-4 Thesis Organization
Chapter 2 Speech Signal Processing
2-1 Introduction to Speech Processing
2-1-1 Endpoint Detection
2-1-2 Preemphasis
2-1-3 Window Functions
2-2 A Study of Endpoint Detection
2-3 Cepstrum Coefficients
2-4 Mel-Cepstrum Coefficients
Chapter 3 A Study of Phrase Recognition
3-1 Signal Space
3-2 Phrase Signal Space
3-3 Orthogonality of the Common Vector and the Difference Subspace
3-4 LBG Classifier
Chapter 4 Experimental Results and System Design
4-1 Test Results on English Speech Data
4-2 Test Results on Chinese Speech Data
4-3 Windows Program Architecture
Chapter 5 Conclusions and Suggestions


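Fulltext
This electronic full text is licensed to users solely for personal, non-profit searching, reading, and printing for the purpose of academic research. Please comply with the Copyright Act of the Republic of China and do not reproduce, distribute, adapt, repost, or broadcast it without authorization. Thesis access permission: on campus, public immediately; off campus, public after one year. Available: Campus: available; Off-campus: available. etd-0615101-191556.pdf
Printed copies
Public availability information for printed theses is relatively complete from Academic Year 102 onward. To look up information on printed theses from Academic Year 101 or earlier, please contact the printed thesis service desk of the Office of Library and Information Services. We apologize for any inconvenience. Available: available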


Office of Library and Information Services, National Sun Yat-sen University │ Contact Us : 2452 Thesis Format Review Team , Mail │ Development and operations : Knowledge Innovation Division, LIS