國立中山大學,National Sun Yat-sen University,學位論文,thesis/dissertation,中文語音辨識系統增進辨識率之策略研究 - 以人名系統與二、三、四字詞系統為例,A Design of Recognition Rate Improving Strategy for Speech Recognition System - A Case Study on Mandarin Name and Phrase Recognition System

論文名稱 Title	中文語音辨識系統增進辨識率之策略研究 - 以人名系統與二、三、四字詞系統為例 A Design of Recognition Rate Improving Strategy for Speech Recognition System - A Case Study on Mandarin Name and Phrase Recognition System
系所名稱 Department	電機工程學系 Department of Electrical Engineering
畢業學年期 Year, semester	96 學年度第 2 學期 The spring semester of Academic Year 96	語文別 Language	中文 Chinese
學位類別 Degree	碩士 Master	頁數 Number of pages	43
研究生 Author	陳儒平 Ru-Ping Chen
指導教授 Advisor	陳志堅 Chih-Chien Chen
召集委員 Convenor	汪啟茂 Chii-Maw Uang
口試委員 Advisory Committee	李聰 Tsung Lee
口試日期 Date of Exam	2008-07-25	繳交日期 Date of Submission	2008-08-30
關鍵字 Keywords	隱藏式馬可夫模型、梅爾倒頻譜係數 MFCC, Hidden Markov Model
統計 Statistics	本論文已被瀏覽 5646 次，被下載 0 次 The thesis/dissertation has been browsed 5646 times, has been downloaded 0 times.

中文摘要
本論文之主要目的，在設計與實作中文人名與二、三、四字語詞辨識系統。系統運用梅爾倒頻譜係數、隱藏式馬可夫模型與語音文字比對策略，來做語詞的候選機制。實驗證實在語者相依的情況下，於訓練時加入重疊音框以及混合訓練之策略，中文人名以及二、三、四字語詞之辨識率，約可分別提升4%、5%、4%與2%。系統在 Intel Celeron 2.4 GHz CPU之個人電腦與Red Hat Linux 9.0之運算環境下，語詞辨識平均約可在2秒內完成。
Abstract
The objective of this thesis is to design and implement a speech recognition system for Mandarin names and phrases. This system utilizes Mel frequency cepstral coefficients, hidden Markov model and lexicon search strategy to select the phrase candidates. The experimental results indicate that for the speaker dependent case, a strategy incorporating overlapping frames and hybrid training can result in an improvement of 4%, 5%, 4% and 2% on the recognition rate for the Mandarin name, two-word, three-word and four-word phrase recognition systems respectively. Under Redhat Linux 9.0 operating system, any Mandarin name or phrase can be recognized within 2 seconds by a computer with Intel Celeron 2.4 GHz CPU.

目次 Table of Contents
摘要I 致謝II 目錄III 圖目錄V 表目錄VI 第一章緒論1 1-1 研究動機與目的1 1-2 研究方法1 1-3 章節概要2 第二章系統架構與語音訊號處理之相關技術3 2-1 語音辨識系統架構3 2-2 切割單音5 2-2-1 能量(Energy5 2-2-2 越零率(Zero Crossing Rate)5 2-2-3 線性預估係數誤差能量(LPCEE)6 2-3 特徵萃取(使用梅爾倒頻譜係數(MFCC))8 2-3-1 漢明視窗(Hamming Window)8 2-3-2 離散傅立葉轉換(DFT)8 2-3-3 梅爾三角濾波器8 2-3-4 離散餘弦轉換(DCT)10 2-3-5 對數能量與差量倒頻譜參數10 第三章隱藏式馬可夫模型(HMM，Hidden Markov Model) 12 3-1 模型訓練13 3-2 文字比對17 第四章人名與字號系統設計之介紹19 4-1 資料庫建立與規劃19 4-2 訓練單音的錄製方式19 4-3 系統輸入與顯示方式20 第五章實驗結果24 5-1 實驗的模擬結果24 5-2 系統相關參數設定31 第六章結論與討論32 6-1 結論32 6-2 討論33 第七章參考文獻34

參考文獻 References
[1] 鄭鶴得,“中文二字語詞辨識系統之設計研究”,國立中山大學電機工程研究所碩士論文, 民國96年7月。 [2] 吳俊榮,“中文二、三、四字語詞辨識系統之設計研究”,國立中山大學電機工程研究所碩士論文, 民國96年7月。 [3] 杜秋娟,“十萬個中文人名語音辨識系統之設計研究”,國立中山大學電機工程研究所碩士論文, 民國96年7月。 [4] 王小川,“語音訊號處理”,全華,民國93年。 [5] 陳躍升,“中文履歷表之語音建構系統設計”,國立中山大學電機工程研究所碩士論文, 民國95年7月。 [6] 林維琦，“古今中外人名語音辨識系統之設計研究”，國立中山大學電機工程研究所碩士論文，民國95年7月。 [7] Ben Gold and Nelson Morgan, “Speech and Audio Signal Processing: Processing and Perception of Speech and Music”, John Wiley & Sons. Inc. 2000 [8] 國語日報出版中心主編，“新編國語日報辭典”，出版者 : 國語日報社，民國96年。 [9] Tze Fen Li, “Speech recognition of mandarin monosyllables,” Patter Recognition, vol.36 pp2713-2721, April 2003. [10] L. R. Rabiner, “A tutorial on hidden Markov modles and selected application in speech recognition”, Proc. IEEE, vol.77, pp. 257-286, Feb. 1989 [11] 林士翔,“數據擬合與分群方法於強健語音特徵擷取之研究”,國立台灣師範大學資訊教育研究所碩士論文, 民國96年7月。 [12] 張志豪,“強健性和鑑別力語音特徵擷取技術於大詞彙連續語音辨識之研究”,國立台灣師範大學資訊工程研究所碩士論文, 民國94年7月。 [13] Lawrence Rabiner and Biing-Hwang Juang, "Fundamentals of Speech Recognition", N.J.: Prentice Hall, 1993。 [14] 馬自毅、顧宏義注譯，“新譯百家姓” ，三民書局印行，民國94年3月。 [15] 中國人名大辭典/方毅、臧勵龢等編.--台灣商務,民79。 [16] 四庫全書傳記資料索引附字號索引.--台北:商務,民79。

電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的，進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定，切勿任意重製、散佈、改作、轉貼、播送，以免觸法。論文使用權限 Thesis access permission：校內校外均不公開 not available 開放時間 Available：校內 Campus：永不公開 not available 校外 Off-campus：永不公開 not available 您的 IP(校外) 位址是 18.218.113.150 論文開放下載的時間是校外不公開 Your IP address is 18.218.113.150 This thesis will be available to you on Indicate off-campus access is not available.
紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊，請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。開放時間 available 已公開 available

QR Code

國立中山大學圖書與資訊處 │ 諮詢服務：2452 論文審查小組 │ 服務信箱 │ 系統開發維運：圖資處知識創新組

Office of Library and Information Services, National Sun Yat-sen University │ Contact Us : 2452 Thesis Format Review Team , Mail │ Development and operations : Knowledge Innovation Division, LIS