國立中山大學,National Sun Yat-sen University,學位論文,thesis/dissertation,特定語者中文語詞辨識系統之設計研究,A Design of Speaker Dependent Mandarin Recognition System

論文名稱 Title	特定語者中文語詞辨識系統之設計研究 A Design of Speaker Dependent Mandarin Recognition System
系所名稱 Department	電機工程學系 Department of Electrical Engineering
畢業學年期 Year, semester	93 學年度第 2 學期 The spring semester of Academic Year 93	語文別 Language	中文 Chinese
學位類別 Degree	碩士 Master	頁數 Number of pages	41
研究生 Author	潘睿慈 Ruei-tsz Pan
指導教授 Advisor	陳志堅 Chih-Chien Chen
召集委員 Convenor	汪啟茂 Chii-Maw Uang
口試委員 Advisory Committee	李聰 Tsung Lee
口試日期 Date of Exam	2005-07-26	繳交日期 Date of Submission	2005-09-02
關鍵字 Keywords	梅爾倒頻譜係數、隱藏式馬可夫模型、端點偵測、線性預估編碼激發源、母音模型、語詞辨識 Hidden Markov model (HMM), end-point detection, vowel model, phrase recognition, LPC scaled excitation, Mel-frequency cepstrum coefficients

中文摘要
論文裡探討如何利用梅爾倒頻譜參數、線性預估碼激發源、母音模型、隱藏式馬可夫模型及維特比演算法等語詞辨識相關技術，來設計一套中文語詞的語音辨識系統。主要辨識系統採用目前被廣泛地應用在語音辨識的隱藏式馬可夫模型。此外，為了加快辨識速度，吾人利用中文母音結構的穩定特性，結合母音辨識的方法來完成。在語者相依、實驗室的環境之下，平均在一秒左右可完成單詞辨識，辨識率達98%。
Abstract
This thesis proposes a Mandarin phrase recognition system based on Mel-frequency cepstral coefficients (MFCC), LPC scaled excitation, a vowel model, hidden Markov models (HMM), and the Viterbi algorithm. The core recognizer uses HMMs, which are widely used in speech recognition today. To speed up recognition, we exploit the stability of vowel structure in Mandarin and combine the HMM recognizer with vowel-class recognition. In the speaker-dependent case and in a laboratory environment, a single phrase is recognized in about one second on average, with a recognition rate of 98%.
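As an illustration of the HMM and Viterbi step named in the abstract, the following is a minimal sketch of log-domain Viterbi scoring for isolated-phrase recognition. It is not taken from the thesis: the function names (`viterbi_log_score`, `recognize`), the two-state left-to-right toy model, and the per-frame likelihood values are all assumptions made for the example, and the MFCC front end and vowel-class pre-selection are omitted.

```python
import math

NEG_INF = float("-inf")


def safe_log(p):
    """Natural log that maps probability 0 to -inf instead of raising."""
    return math.log(p) if p > 0.0 else NEG_INF


def viterbi_log_score(log_pi, log_A, log_B):
    """Return (best_log_prob, best_state_path) for one HMM.

    log_pi[i]   : log initial probability of state i
    log_A[i][j] : log transition probability from state i to state j
    log_B[t][j] : log likelihood of frame t under state j
    """
    n_states = len(log_pi)
    # Initialisation with the first frame.
    delta = [log_pi[j] + log_B[0][j] for j in range(n_states)]
    backptr = []
    # Recursion over the remaining frames.
    for t in range(1, len(log_B)):
        new_delta, ptr = [], []
        for j in range(n_states):
            best_i = max(range(n_states), key=lambda i: delta[i] + log_A[i][j])
            new_delta.append(delta[best_i] + log_A[best_i][j] + log_B[t][j])
            ptr.append(best_i)
        delta = new_delta
        backptr.append(ptr)
    # Termination and backtracking.
    last = max(range(n_states), key=lambda j: delta[j])
    path = [last]
    for ptr in reversed(backptr):
        path.append(ptr[path[-1]])
    path.reverse()
    return delta[last], path


def recognize(models):
    """Pick the phrase whose model gives the highest Viterbi score.

    models maps a phrase label to (log_pi, log_A, log_B), where log_B holds
    the frame likelihoods of the same utterance under that phrase's model.
    """
    return max(models, key=lambda name: viterbi_log_score(*models[name])[0])


def to_log(matrix):
    """Convert a matrix of probabilities to log probabilities."""
    return [[safe_log(p) for p in row] for row in matrix]


# Hypothetical two-state left-to-right model shared by both toy phrases.
LOG_PI = [safe_log(1.0), safe_log(0.0)]
LOG_A = to_log([[0.5, 0.5], [0.0, 1.0]])
# Hypothetical per-frame state likelihoods of one utterance under each model.
MODELS = {
    "phrase_a": (LOG_PI, LOG_A, to_log([[0.9, 0.1], [0.2, 0.8], [0.1, 0.9]])),
    "phrase_b": (LOG_PI, LOG_A, to_log([[0.1, 0.9], [0.8, 0.2], [0.9, 0.1]])),
}
```

Calling `recognize(MODELS)` scores the utterance against each phrase model and returns the label with the highest log probability. Working in the log domain avoids numerical underflow on long utterances, which is a common design choice for this kind of decoder.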

目次 Table of Contents
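Abstract
Acknowledgments
Table of Contents
List of Figures
List of Tables
Chapter 1 Introduction
1-1 Research Motivation and Objectives
1-2 Research Direction and Overview of Methods
1-3 Chapter Outline
Chapter 2 Theoretical Background
2-1 Phrase Recognition Procedure
2-2 Feature Extraction
2-2-1 Maximum Likelihood Ratio (MLR)
2-2-2 Window Function
2-2-3 Mel-scale Triangular Filter Banks
2-2-4 Discrete Cosine Transform (DCT)
2-3 Syllable Segmentation
2-3-1 Linear Predictive Coding (LPC)
2-3-2 LPC Scaled Excitation
2-4 Recognition System
2-4-1 HMM Model Description
2-4-2 HMM Model Training
2-4-3 HMM Recognition Rule
Chapter 3 Experimental Results
3-1 Syllables
3-2 Phrases
Chapter 4 Conclusions and Suggestions
4-1 Conclusions
4-2 Suggestions
References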

