Responsive image
博碩士論文 etd-0611117-233036 詳細資訊
Title page for etd-0611117-233036
論文名稱
Title
估計最佳切分點個數及其位置的統計方法
Methods of estimating the optimal number and locations of cut points
系所名稱
Department
畢業學年期
Year, semester
語文別
Language
學位類別
Degree
頁數
Number of pages
42
研究生
Author
指導教授
Advisor
召集委員
Convenor
口試委員
Advisory Committee
口試日期
Date of Exam
2017-06-30
繳交日期
Date of Submission
2017-07-12
關鍵字
Keywords
概似估計量、切分點個數、交叉驗證、AIC、切分點位置、多變量 Cox 比例風險模型
cross validation, cox proportional hazard model, Cut points, genetic algorithm
統計
Statistics
本論文已被瀏覽 5724 次,被下載 0
The thesis/dissertation has been browsed 5724 times, has been downloaded 0 times.
中文摘要
在醫學中常會需要將連續型的預後因子做切分,轉換成類別型的變數,方便臨床上的判定及診斷。在傳統方法中,一般而言是找一個切分點。本文將提供一個方法,當不只一個切分點的時候,可以正確決定出切分點位置(cut point) 及切分點個數(cut number),藉由多變量Cox 比例風險模型及基因演算法,運用概似估計量當作判定最佳切分點位置的準則,及交叉驗證(cross validation) 來修正p-value、AIC ,藉此選擇正確切分點個數。最後,利用子宮頸癌(cervical cancer) 的資料來研究不同預後因子下最佳的切分點個數及切分點位置。
Abstract
In clinical,we often need to cut the continuous prognostic factor into categorical variables
in order to determine the clinical diagnosis. Traditionally, there are many ways to find one cut point to divide data into two groups. This paper will provide a method, when you need more than one cut point, you can correctly determine the optimal numbers of cut points and their locations. By multivariate Cox proportional hazard model and genetic algorithm , using the log-likelihood statistic as the criterion to determine the optimal cut points, and two-fold cross validation to correct the p-value, AIC, and select the correct
number of cut points. At last, we use cervical cancer data to discuss the different prognostic
factors under the optimal number of points and their locations.
目次 Table of Contents
論文審定書 i
誌謝 ii
摘要 iii
Abstract iv
1 研究動機與目的 1
2 資料描述 2
2.1資料處理 2
2.2 變數介紹 2
2.2.1連續型變數 2
2.2.2離散型變數 2
3 研究方法5
3.1 Kaplan-Meier 存活函數 5
3.2 Cox 比例風險模型 5
3.3 羅吉斯迴歸(Logistic regression) 5
3.4 AIC (Akaike information criterion) 6
3.5 傳統方法 6
3.6 估計切分點個數和位置 6
3.6.1 概似比檢定Likelihood ratio test 7
3.7 校正型一誤差Corrected Type I error 8
3.7.1 部分概似函數Partial likelihood 9
3.8 兩摺交叉驗證及拔靴法Twofold Cross-Validation with bootstrap 9
3.9 基因演算法Genetic Algorithms 12
3.10 模擬 14
3.10.1 模擬設計 14
4 研究結果15
4.1 決定切分點 15
v
4.1.1 二元資料 15
4.1.2 存活資料 16
4.1.3 多變量存活資料 17
4.2 模擬結果 19
4.2.1 單變量模擬結果 19
4.2.2 多變量模擬結果 21
4.3 子宮頸癌資料分析 24
5 結論與結語 27
參考文獻 28
A 附錄 29
附錄 29
A.1 附錄一 29
A.2 附錄二 31
圖次
表次
參考文獻 References
[1] Chang, C., Hsieh, M.K., Chang, W.Y., Chiang, A.J., Cen, J. (2017). Determining the
Optimal Number and Location of Cutoff Points with Application to Data of Cervical
Cancer. Plos One. https://doi.org/10.1371/journal.pone.0176231.
[2] Clark, L.H. , Jackson, A.L., Soo, A.E., Orrey, D.C., Gehrig, P.A., Kim, K.H. (2016).
Extremes in body mass index affect overall survival in women with cervical cancer.
Gynecol Oncol, 141, 497-500.
[3] Faraggi, D. and Simon, R. (1996). A simulation study of cross-validation for selecting
an optimal cutpoint in univariable survival analysis. Statistics in Medicine, 15, 22032213.
[4] Hilsenbeck, S.G., Clark, G.M., McGuire, W.L. (1992). Why do so many prognostic
factors fail to pan out? Breast Cancer Research and Treatment, 22, 197-206.
[5] Kizer, N.T., Thaker, P.H., Gao, F., Zighelboim, I., Powell, M.A.,Rader, J.S., et al.
(2011). The effects of body mass index on complications and survival outcomes in
patients with cervical carcinoma undergoing curative chemoradiation therapy. Cancer,
117, 948-956.
[6] Mazumdar, M., Smith, A., Bacik, J. (2003). Methods for categorizing a prognostic
variable in a multivariate setting. Statistics in medicine, 22, 559-571.
[7] Mebane, W.R. and Sekhon, J.S. (2011). Genetic optimization using derivatives:The
rgenoud package for R. Journal of Statistical Software, 42, 1-26.
[8] Miller, R., and Siegmund, D. (1982). Maximally selected chi square statistics. Biometrics,
38, 1011-1016.
電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。
論文使用權限 Thesis access permission:自定論文開放時間 user define
開放時間 Available:
校內 Campus:永不公開 not available
校外 Off-campus:永不公開 not available

您的 IP(校外) 位址是 3.15.147.53
論文開放下載的時間是 校外不公開

Your IP address is 3.15.147.53
This thesis will be available to you on Indicate off-campus access is not available.

紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。
開放時間 available 永不公開 not available

QR Code