國立中山大學,National Sun Yat-sen University,學位論文,thesis/dissertation,醫療資訊語音查詢系統之設計研究,A Design of Spoken Language Querying System for Medical Information

論文名稱 Title	醫療資訊語音查詢系統之設計研究 A Design of Spoken Language Querying System for Medical Information
系所名稱 Department	電機工程學系 Department of Electrical Engineering
畢業學年期 Year, semester	104 學年度第 1 學期 The fall semester of Academic Year 104	語文別 Language	中文 Chinese
學位類別 Degree	碩士 Master	頁數 Number of pages	74
研究生 Author	王怡婷 Yi-Ting Wang
指導教授 Advisor	陳志堅 Chih-Chien Chen
召集委員 Convenor	柏小松 Sheau-Shong Bor
口試委員 Advisory Committee	汪啟茂, 李聰 Chii-Maw Wang; Tsung Lee
口試日期 Date of Exam	2015-07-29	繳交日期 Date of Submission	2016-01-21
關鍵字 Keywords	音位結構學、線性預估倒頻譜係數、醫療查詢、語音辨識、隱藏式馬可夫模型、梅爾頻率倒頻譜係數 Hidden Markov model, Phonotactics, Linear predictive cepstral coefficients, Mel-frequency cepstral coefficients, Speech recognition system, Medical information query
統計 Statistics	本論文已被瀏覽 5677 次，被下載 20 次 The thesis/dissertation has been browsed 5677 times, has been downloaded 20 times.

中文摘要
政府自民國八十四年施行全民健保以來，有效降低了國民就醫的負擔，但也同時間接地提高了國民就醫的頻率。因此，醫療環境與就醫程序，一直都是近年來備受討論的議題。民國一百零二年，二代健保上路。行政院二代健保規劃小組，將醫院診所的基本資訊，如：開診科別、院所地址、電話號碼與醫療紀錄等公開，列為重要政策，以便提供民眾就醫選擇的參考。為此，吾人運用衛生福利部之公開資訊與各醫務相關單位之網站訊息，設計了一套語音醫療資訊查詢系統，希望藉由將掛號及醫療查詢服務，用語音的方式，來提升使用上的便利性，獲得更好的醫療程序及環境。對於不善操作手機與電腦的族群，或有閱讀障礙的同胞，亦能透過講話的方式，達到掛號或查詢的目的。本系統依國語發音規則，挑選出2,699個常用二字詞，來建立單音模型，以作為語音訓練之基礎。吾人首先以梅爾頻率倒頻譜係數與線性預估倒頻譜係數，來萃取音節之雙特徵參數，其次運用隱藏式馬可夫模型，來估量與儲存各個單音的統計特性。最後在辨識階段，透過音位結構學的交叉比對，得到正確答案。在Intel Core2 Quad 2.5GHz CPU之個人電腦與Ubuntu 10.04之作業系統環境下，吾人針對全台21家大型教學醫院，79間區域醫院，以及台北聯合門診中心，共兩萬兩千多筆醫療資訊，實作了語音掛號系統，每間醫院掛100次不同科別及醫生，掛號正確率可達95.87%。另外針對全台6,548家牙醫診所與四萬多位醫師，亦實作了查詢系統，其查詢正確率分別可達98.79%%與93.975%。
Abstract
The National Health Insurance (NHI) has been operative in Taiwan since 1995. It did reduce the medical expense of the residents, but at the same time increased the visit frequency of the patients. Therefore, the medical environment and the patient workflow have always been a disputable issue in those days. The government then implemented the 2nd generation NHI in 2013. One of its major policies was to make the medical information of hospitals and clinics public and provide patients convenient access to the data of departments, addresses, phone numbers and medical records. By the use of open data from the Ministry of Health and Welfare, and other medical agencies, a spoken language querying system for medical information is designed and implemented in this thesis to further accommodate the oral access capability. Doctor appointment making, hospital information searching and the dental clinics inquiring can be accomplished by speech. This helps the people who are not good at using computers or cell phones, especially for those who are dyslexic, blind, or elderly. In this system, a training database of 2,699 two-syllable phrases is selected from the Chinese pronunciation rules. Mel-frequency cepstral coefficients and linear predicted cepstral coefficients are first used to extract the bi-parametric feature for each recorded syllable. Hidden Markov model is then employed to find and store the syllable probabilistic properties. Finally, in the recognition stage, the phonotactic rules are applied to the medical information database to acquire the final answer. Under the Intel Core2 Quad 2.5 GHz CPU and Ubuntu 10.04 operating system environment, a success rate of 95.87% for the hospital appointment scheduling can be achieved by using a 100 appointment per agency test database. This figure is counted over 22,000 medical entries from the 21 medical centers, 79 regional hospitals and Taipei Joint Outpatient Center. Moreover, correct querying rates of 98.79%% and 93.975% can also be reached for a 6,548 dental clinics database and a 40,000 doctor names database respectively.

目次 Table of Contents
目錄論文審定書 i 誌謝 ii 摘要 iii Abstract iv 目錄 v 圖目錄 vii 表目錄 ix 第一章緒論 1 1-1 研究動機 1 1-2 研究目的與研究背景 2 1-3 論文章節概要 3 第二章國語語音學與台灣醫療體制介紹 4 2-1 國語發展史 4 2-2 國語簡介和拼音結構介紹 4 2-2-1 國語變調介紹 6 2-3 語言學介紹 8 2-3-1 語音產生的過程與傳遞 8 2-3-2 發聲的生理原理 9 2-3-3 語音感知的生理原理 12 2-4 台灣醫療分級介紹 13 2-5 二代健保介紹 17 第三章語音辨識系統架構與設計概念 18 3-1 音節端點偵測與前處理 19 3-1-1 越零率 19 3-1-2 能量 19 3-2 前處理 20 3-2-1 視窗 20 3-2-2 漢明窗 20 3-2-3 預強調 21 3-3 特徵萃取 22 3-3-1 線性預估倒頻譜（LPCC） 22 3-3-2 梅爾倒頻譜（MFCC） 27 3-4 隱藏式馬可夫模型 32 3-4-1 隱藏式馬可夫鏈 32 3-4-2 隱藏式馬可夫模型之建立 33 3-4-3 隱藏式馬可夫模型之訓練 34 第四章語音辨識系統實作流程與成果分析 38 4-1 系統參數介紹 38 4-2 系統設計流程介紹 39 4-3 模擬詞彙建構及測試結果 52 第五章未來展望 60 參考文獻 61 圖目錄圖1-1 台灣醫院掛號批價情況 2 圖2-1 國語注音音調變化圖 5 圖2-2 語音產生過程與傳遞 8 圖2-3 人類發聲器官圖 10 圖2-4 聲門狀態_日常呼吸情況 11 圖2-5 聲門狀態_耳語情況 11 圖2-6 聲門狀態_平時發聲 12 圖2-7 人耳構造 13 圖2-8 醫療分級制度的規劃圖 13 圖2-9 台灣醫學評鑑流程圖 15 圖2-10 台灣醫學教育分析圖 16 圖2-11 各級醫院掛號收費標準 17 圖3-1 語音辨識流程圖 18 圖3-2 漢明窗示意圖 21 圖3-3 線性預估倒頻譜係數計算流程圖 27 圖3-4 梅爾圖頻率與真實頻率對應關係圖 29 圖3-5 梅爾三角濾波器 30 圖3-6 語音訊號及其隱藏式馬可夫模型 33 圖4-1 系統顯示結果 39 圖4-2 系統時效功能顯示 40 圖4-3 全台牙醫診所查詢系統顯示圖_1 48 圖4-4 全台牙醫診所查詢系統顯示圖_2 48 圖4-5 全台醫師查詢系統顯示圖_1 49 圖4-6 全台醫師查詢系統顯示圖_2 50 圖4-7 全台醫師查詢系統顯示圖_3 51 表目錄表2-1 國語注音符號表 5 表2-2 中文二字變調範例 6 表2-3 中文三字變調範例_1 7 表2-4 中文三字變調範例_2 7 表4-1 系統軟硬體及參數設定 38 表4-2 全台灣醫學中心列表 42 表4-3 全台灣區域醫院列表 46 表4-4 全台灣牙醫診所列表 47 表4-5 時間辨識率 52 表4-6 各醫院及整體掛號辨識率 57 表4-7 全台牙醫診所查詢系統辨識率 58 表4-8 全台醫師查詢系統辨識率 59

參考文獻 References
參考文獻 [1] 以民眾為考之知醫療品質資訊現況，行政院二代健保規劃小組總結報告之政策建議, http://www.mohw.gov.tw/MOHW_Upload/dmc_history/UpFile [2] 台南成大醫院診間叫號系統暨診間多媒體看板＆病患報到系統啟用上線，兆銀資訊系統股份有限公司, http://www.megasilver.com.tw/20100731.html [3] 維基百科-中華文化, https://zh.wikipedia.org/wiki/中华文化 [4] 維基百科-注音符號, https://zh.wikipedia.org/wiki/注音符號 [5] 漢語拼音-三聲變調:講解, http://cle.ust.hk/online_resources/common/pinyin/content/toneChange/thirdVoices/html/intro.html [6] 王士元、彭剛，語言、與音與技術，香港城市大學出版社，民國96年 [7] 謝國平，語言學概論，三民書局，民國100年 [8] 坂井建雄、橋本尚詞，人體構造地圖，瑞昇文化，民國100年 [9] 衛生福利部中央健康保險署, http://www.nhi.gov.tw [10] 李卓倫，如何配合總額預算分配落實分級醫療研究，行政院衛生署九十三年度補助研究計畫 [11] 財團法人醫院評鑑暨醫療品質策進會, http://www.tjcha.org.tw/FrontStage/index.aspx [12] 王俊文、葉德豐、曾坤儀，台灣醫院評鑑趨勢之探討，中山醫學雜誌，14卷4期，p.513-522 [13] 衛生福利部醫事司, http://www.mohw.gov.tw/cht/DOMA/DM1.aspx?f_list_no=608&fod_list_no=897 [14] Vikramjit Mitra,Wen Wang, Andreas Stolcke,Hosung Nam,Colleen Richey, Jiahong Yuan and Mark Liberman, “Articulatory for large-vocabulary speech recognition,” IEEE International Conference on Speech and Signal Processing, pp.7145-7149, May.2013 [15] 王小川，語音訊號處理，全華圖書出版社，民國98年 [16] 吳俊榮，中文二、三、四字詞語詞辨識系統之設計研究，國立中山大學電機工程學系碩士論文，民國96年 [17] L.R. Rabiner,“A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition,” IEEE International Conference on Speech and Signal Processing, vol.77, pp.257-286, Feb.1989 [18] Gaurav Talwar, Robert F. Kubichek and Hongkang Liang,“Hiddenness Control of Hidden Markov Models and Application to Objective Speech Quality and Isolated-Word Speech Recognition,” IEEE International Conference on Signals Systems and Computers, pp.1076-1080, Oct.2006 [19] Georgios Kotsalis, Alexandre Megretski and Munther A.Dahleh, “A Model Reduction Algorithm for Hidden Markov Models,” IEEE International Conference on Decision and Control,pp.3424-3429, Dec.2006 [20] Vikramjit Mitra, Horacio Franco, Martin Graciarena and Dimitra Vergyri, “Medium-duration modulation cepstral feature for robust speech recognition,” IEEE International Conference on Speech and Signal Processing, pp. 1749-1753, USA, 2014. [21] 台灣牙醫界2015年，第34卷第11期

電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的，進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定，切勿任意重製、散佈、改作、轉貼、播送，以免觸法。論文使用權限 Thesis access permission：自定論文開放時間 user define 開放時間 Available：校內 Campus：已公開 available 校外 Off-campus：已公開 available etd-0008116-145650.pdf
紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊，請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。開放時間 available 已公開 available

QR Code

國立中山大學圖書與資訊處 │ 諮詢服務：2452 論文審查小組 │ 服務信箱 │ 系統開發維運：圖資處知識創新組

Office of Library and Information Services, National Sun Yat-sen University │ Contact Us : 2452 Thesis Format Review Team , Mail │ Development and operations : Knowledge Innovation Division, LIS