國立中山大學,National Sun Yat-sen University,學位論文,thesis/dissertation,西文語音辨識系統之設計研究,A Design of Spanish Speech Recognition System

論文名稱 Title	西文語音辨識系統之設計研究 A Design of Spanish Speech Recognition System
系所名稱 Department	電機工程學系 Department of Electrical Engineering
畢業學年期 Year, semester	98 學年度第 2 學期 The spring semester of Academic Year 98	語文別 Language	中文 Chinese
學位類別 Degree	碩士 Master	頁數 Number of pages	51
研究生 Author	史世洲 Shih-Jhou Shih
指導教授 Advisor	陳志堅 Chih-Chien Chen
召集委員 Convenor	柏小松 XIAO-SONG BO
口試委員 Advisory Committee	汪啟茂, 李聰, 盧而輝 Chii-Maw Uang; Tsung Lee; ER-HUI LU
口試日期 Date of Exam	2010-07-28	繳交日期 Date of Submission	2010-08-24
關鍵字 Keywords	隱藏式馬可夫模型、語音辨識、梅爾倒頻譜係數、線性預估倒頻譜係數 Hidden Markov model, Speech recognition, Mel-frequency cepstral coefficients, Linear predictive cepstral coefficients
統計 Statistics	本論文已被瀏覽 5693 次，被下載 0 次 This thesis/dissertation has been browsed 5693 times and downloaded 0 times.

中文摘要
本論文主要探討西文語音辨識系統之設計與實作策略。系統以西文常用單音節作為主要的訓練與辨識方式。運用西語發音規則，將242個常用單音節，每個錄製6輪，每輪唸一聲與四聲兩種不同音調的單音各一次，六輪每個單音可得12次之聲紋特性作為訓練語料。系統採用梅爾倒頻譜係數與線性預估倒頻譜係數，經由隱藏式馬可夫模型，來作聲音之辨識。在CPU時脈為1.6GHz的AMD Sempron Processor 2800+ 之個人電腦與Ubuntu 9.04作業系統下，針對吾人所收集之4217筆西文語詞，在切字正確的前提下，吾人約可達到86%之正確辨識率，平均所需辨識時間約在1.5秒以內。
Abstract
This thesis investigates design and implementation strategies for a Spanish speech recognition system. The system trains on and recognizes the 242 common Spanish mono-syllables. Following Spanish pronunciation rules, each mono-syllable is recorded in six rounds, and in each round it is read once in each of two tones: the high pitch of tone one and the falling pitch of tone four. This yields twelve utterances per mono-syllable as training data. Mel-frequency cepstral coefficients and linear predictive cepstral coefficients serve as the two feature sets, and hidden Markov models serve as the recognition model. On a personal computer with a 1.6 GHz AMD Sempron Processor 2800+ running Ubuntu 9.04, the system reaches a correct recognition rate of about 86% on a database of 4217 Spanish phrases collected by the author, provided that the syllable segmentation is correct. The average recognition time per phrase is within about 1.5 seconds.
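The abstract names hidden Markov models as the recognition model but this record does not give the model details. The following is a minimal sketch of how one model per mono-syllable could be scored with the Viterbi algorithm, with the best-scoring model taken as the recognized syllable. It assumes discrete observation symbols for brevity, whereas a system built on cepstral feature vectors would more likely use continuous densities. The function names, the labels `la` and `sol`, and all probabilities are hypothetical and are not taken from the thesis.

```python
import math

NEG_INF = float("-inf")

def log(p):
    """Natural log that maps probability 0 to -inf instead of raising."""
    return math.log(p) if p > 0.0 else NEG_INF

def viterbi_log_score(obs, start, trans, emit):
    """Return (best log-probability, best state path) for a discrete HMM.

    obs   : list of observation symbol indices
    start : start[i]    = P(first state is i)
    trans : trans[i][j] = P(next state is j | current state is i)
    emit  : emit[i][k]  = P(symbol k | state i)
    """
    n = len(start)
    # delta[i] = best log-probability of any state path ending in state i
    delta = [log(start[i]) + log(emit[i][obs[0]]) for i in range(n)]
    backptr = []
    for symbol in obs[1:]:
        prev = delta
        delta, choices = [], []
        for j in range(n):
            # Best predecessor state for landing in state j at this step
            best_i = max(range(n), key=lambda i: prev[i] + log(trans[i][j]))
            choices.append(best_i)
            delta.append(prev[best_i] + log(trans[best_i][j]) + log(emit[j][symbol]))
        backptr.append(choices)
    # Backtrack from the best final state to recover the path
    state = max(range(n), key=lambda i: delta[i])
    best = delta[state]
    path = [state]
    for choices in reversed(backptr):
        state = choices[state]
        path.append(state)
    path.reverse()
    return best, path

def recognize(obs, models):
    """Pick the label whose HMM gives the highest Viterbi score."""
    return max(models, key=lambda label: viterbi_log_score(obs, *models[label])[0])

# Hypothetical two-state left-to-right models over two symbols (0 and 1).
LEFT_TO_RIGHT = [[0.6, 0.4], [0.0, 1.0]]
MODELS = {
    "la":  ([1.0, 0.0], LEFT_TO_RIGHT, [[0.9, 0.1], [0.2, 0.8]]),
    "sol": ([1.0, 0.0], LEFT_TO_RIGHT, [[0.1, 0.9], [0.8, 0.2]]),
}

if __name__ == "__main__":
    print(recognize([0, 0, 1, 1], MODELS))
```

Working in the log domain avoids numerical underflow on long observation sequences, which is why the sketch adds log-probabilities rather than multiplying probabilities.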

目次 Table of Contents
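Abstract
Acknowledgements
Contents
List of Figures
List of Tables
Chapter 1 Introduction
1-1 Research Motivation
1-2 Research Method
1-3 Thesis Outline
Chapter 2 Fundamentals of Spanish Phonetics
2-1 Alphabet
2-2 Pronunciation Rules for Consonants and Vowels
2-3 Syllable Division in Pronunciation
2-4 Accent Marks and Pronunciation Rules
Chapter 3 Introduction to Speech Signal Processing Techniques
3-1 Recognition System Architecture and Preprocessing
3-1-1 Syllable Segmentation and Silence Removal
3-1-2 Energy and Zero-Crossing Rate
3-1-3 Linear Prediction Coefficient Error Energy
3-2 Feature Extraction Procedure
3-2-1 Pre-emphasis Filter
3-2-2 Windowing
3-2-3 Discrete Fourier Transform
3-2-4 Mel-Frequency Filters
3-2-5 Discrete Cosine Transform
3-2-6 Linear Predictive Cepstral Coefficients
3-3 Hidden Markov Model
3-3-1 Parameter Initialization
3-3-2 Parameter Re-estimation
3-3-3 Forward and Backward Procedures
3-3-4 Re-estimation of the State Transition Probability Matrix
3-3-5 Re-estimation of the State Observation Probability Matrix
3-4 Viterbi Algorithm
Chapter 4 Implementation Results and Recognition Performance of the Spanish Speech Recognition System
4-1 Building the Mono-syllable Models
4-2 Mono-syllable Model Training Method (1)
4-3 Mono-syllable Model Training Method (2)
4-4 Recognition System for Common Spanish Phrases
4-5 Chinese-Spanish Language Recognition System
Chapter 5 Conclusion and Future Work
5-1 Conclusion
5-2 Future Work
References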
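Section 3-1-2 of the outline pairs energy with the zero-crossing rate as part of preprocessing, alongside silence removal in section 3-1-1. This record does not give the thesis's formulas or thresholds, so the following is only a minimal sketch of the common textbook approach, assuming a frame is kept as speech when either measure exceeds a threshold. The function names, frame sizes, and threshold values are illustrative assumptions.

```python
def frame_signal(samples, frame_len, hop):
    """Split samples into frames of frame_len taken every hop samples.

    A trailing partial frame is dropped. frame_len must be at least 2.
    """
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]

def short_time_energy(frame):
    """Sum of squared sample values in the frame."""
    return sum(x * x for x in frame)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def speech_frames(samples, frame_len, hop, energy_threshold, zcr_threshold):
    """Return the indices of frames kept as speech.

    Assumed rule: keep a frame when its energy is high (typical of
    voiced sounds) or its zero-crossing rate is high (typical of
    unvoiced consonants); treat everything else as silence.
    """
    kept = []
    for idx, frame in enumerate(frame_signal(samples, frame_len, hop)):
        if (short_time_energy(frame) > energy_threshold
                or zero_crossing_rate(frame) > zcr_threshold):
            kept.append(idx)
    return kept

if __name__ == "__main__":
    # Toy signal: silence, a loud alternating burst, then silence again.
    signal = [0.0] * 8 + [0.5, -0.5] * 4 + [0.0] * 8
    print(speech_frames(signal, frame_len=4, hop=4,
                        energy_threshold=0.1, zcr_threshold=0.5))
```

In practice the thresholds would be tuned against the background noise of the recordings, since real silence is rarely exactly zero and low-level noise can itself cross zero often.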


電子全文 Fulltext
This electronic full text is licensed to users solely for personal, non-profit searching, reading, and printing for academic research. Please comply with the Copyright Act of the Republic of China and do not reproduce, distribute, adapt, repost, or broadcast it without authorization. 論文使用權限 Thesis access permission: not available on campus or off campus. 開放時間 Available: 校內 Campus: never available; 校外 Off-campus: never available.
紙本論文 Printed copies
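Availability information for printed copies is relatively complete from Academic Year 102 onward. For information on printed copies from Academic Year 101 or earlier, please contact the printed thesis service desk of the Office of Library and Information Services. We apologize for any inconvenience. 開放時間 Available: publicly available.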
