國立中山大學,National Sun Yat-sen University,學位論文,thesis/dissertation,韓文語音辨識系統之設計研究,A Design of Korean Speech Recognition System

論文名稱 Title	韓文語音辨識系統之設計研究 A Design of Korean Speech Recognition System
系所名稱 Department	電機工程學系 Department of Electrical Engineering
畢業學年期 Year, semester	98 學年度第 2 學期 The spring semester of Academic Year 98	語文別 Language	中文 Chinese
學位類別 Degree	碩士 Master	頁數 Number of pages	63
研究生 Author	吳秉洋 Bing-Yang Wu
指導教授 Advisor	陳志堅 Chih-Chien Chen
召集委員 Convenor	盧而輝 Er-Hui Lu
口試委員 Advisory Committee	李聰, 汪啟茂, 柏小松 Tsung Lee; Chii-Maw Uang; Xiao-Song Bo
口試日期 Date of Exam	2010-07-28	繳交日期 Date of Submission	2010-08-24
關鍵字 Keywords	隱藏式馬可夫模型、線性預估倒頻譜係數、梅爾倒頻譜係數 Mel-frequency cepstral coefficients, Linear predictive cepstrum coefficients, Hidden Markov model
統計 Statistics	本論文已被瀏覽 5633 次，被下載 0 次 The thesis/dissertation has been browsed 5633 times, has been downloaded 0 times.

中文摘要
本論文探討韓文語音辨識系統之設計與實作策略，以韓文常用單音節之語音特徵，作為主要的訓練與辨識方式。運用韓語發音規則，每輪錄製二種不同音調之常用單音，五輪每個單音可得10次之聲紋特性來作訓練資料。系統採用梅爾倒頻譜係數與線性預估倒頻譜係數，來作特徵參數之萃取；運用隱藏式馬可夫模型，來作單音之辨識。在Pentium 2.4 GHz之個人電腦與Ubuntu 9.04作業系統下，針對4865個韓文語詞，吾人約可達到92.25%之正確辨識率，平均所需辨識時間約在1.5秒以內。
Abstract
This thesis investigates the design and implementation strategies for a Korean speech recognition system. It utilizes the speech features of the common Korean mono-syllables as the major training and recognition methodology. A training database of 10 utterances per mono-syllable is established by applying Korean pronunciation rules. These 10 utterances are collected through reading 5 rounds of the same mono-syllables twice with different tones. The first pronounced pattern has high pitch of tone 1,while the second one has falling pitch of tone 4.Mel-frequency cepstral coefficients, linear predictive cepstrum coefficients, and hidden Markov model are used as the two feature models and the recognition model respectively. Under the Pentium 2.4 GHz personal computer and Ubuntu 9.04 operating system environment, a correct phrase recognition rate of 92.25% can be reached for a 4865 Korean phrase database. The average computation time for each phrase is about 1.5 seconds.

目次 Table of Contents
摘要 II 致謝 III 目錄 IV 圖目錄 VI 表目錄 VII 第一章緒論 1 1-1 研究動機 1 1-2 研究方法 2 1-3 論文章節概要 3 第二章韓文語音學基礎 4 2-1 韓文之字母 4 2-2 韓文之音節 6 2-3 韓文之文字 7 2-4 韓文之羅馬拼音 10 第三章語音辨識系統的流程與數學架構 13 3-1 前處理流程 14 3-1-1 音節切割 14 3-1-2 語音訊號之前置處理 18 3-2 特徵萃取流程 20 3-2-1 線性預估倒頻譜係數 20 3-2-2 梅爾倒頻譜係數 22 3-3 隱藏式馬可夫模型介紹 26 3-4 隱藏式馬可夫模型之訓練 28 3-4-1 參數初始化 28 3-4-2 參數重估 31 3-5 維特比演算法 34 第四章韓文語音辨識系統之訓練策略 36 4-1 硬體環境與軟體規範 36 4-2 模擬語詞數量 38 4-3 單音模型之訓練方式 40 4-3-1 辨識率與單音訓練次數之關係 40 4-3-2 單音訓練每次錄製不同個數之單音 42 4-3-3 每輪每次錄製兩個不同音調之單音 44 第五章韓文語音辨識系統實作成果與辨識效能 47 5-1 韓文常用語詞辨識系統 47 5-2 韓文能力測驗檢定辨識系統 49 5-3 韓國地名辨識系統 51 第六章結論與未來展望 54 6-1 結論 54 6-2 未來展望 54 參考文獻 56

參考文獻 References
[1] The Unicode Consortium,The Unicode Standard:Worldwide Character Encoding, Version 1.0,Addison-Wesley,1992. [2] 吳俊榮，“中文二，三，四字詞語詞辨識系統之設計研究”，國立中山大學碩士論文，民國96年 [3] 陳孟揚，“日文語音辨識系統之設計研究”，國立中山大學碩士論文，民國98年 [4] 王俊，韓語語音學，水牛圖書出版 [5] 簡江作，最新韓國語會話，國立編譯館，民國91年 [6] 王小川，語音訊號處理，全華圖書出版，民國93年 [7] 謝文廣，“中文語音辨識系統增進辨識率之策略研究以地址系統與二、三、四字詞系統為例”，國立中山大學碩士論文，民國98年 [8] Keiichi Tokuda, Takao Kobayashi, Takashi Masuko, and Satoshi Imai, “Mel-generalized cepstral analysis-A unified approach to speech spectral estimation”Proc.ICASSP,pp.1043-1046,1994. [9] 賴昭榮，“中文語音辨識系統降低訓練量之策略研究-以地址系統與二、三、四字詞系統為例”，國立中山大學碩士論文，民國97年 [10] 蔡昀庭，“基於隱藏式馬可夫模型之中文語音合成系統”，國立清華大學碩士論文，民國98年 [11] 鄭鶴得，“中文二字語詞辨識系統之設計研究”，國立中山大學碩士論文，民國96年 [12] 周思源，“T3台語剖析樹語料庫與Brill詞類標記”，國立清華大學碩士論文，民國94年 [13] 韓國語能力考試 http://topik.or.kr/ [14] 韓國教育課程評價院 http://www.kice.re.kr/index.do [15] 中華民國交通部觀光局 http://admin.taiwan.net.tw/ [16] 中文維基百科 http://zh.wikipedia.org/

電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的，進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定，切勿任意重製、散佈、改作、轉貼、播送，以免觸法。論文使用權限 Thesis access permission：校內校外均不公開 not available 開放時間 Available：校內 Campus：永不公開 not available 校外 Off-campus：永不公開 not available 您的 IP(校外) 位址是 3.140.198.43 論文開放下載的時間是校外不公開 Your IP address is 3.140.198.43 This thesis will be available to you on Indicate off-campus access is not available.
紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊，請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。開放時間 available 已公開 available

QR Code

國立中山大學圖書與資訊處 │ 諮詢服務：2452 論文審查小組 │ 服務信箱 │ 系統開發維運：圖資處知識創新組

Office of Library and Information Services, National Sun Yat-sen University │ Contact Us : 2452 Thesis Format Review Team , Mail │ Development and operations : Knowledge Innovation Division, LIS