Responsive image
博碩士論文 etd-0105115-014637 詳細資訊
Title page for etd-0105115-014637
論文名稱
Title
利用半胱胺酸標籤之雙硫鍵連結模式預測方法
The Disulfide Bonding Pattern Prediction with the Cysteine Labels
系所名稱
Department
畢業學年期
Year, semester
語文別
Language
學位類別
Degree
頁數
Number of pages
50
研究生
Author
指導教授
Advisor
召集委員
Convenor
口試委員
Advisory Committee
口試日期
Date of Exam
2015-01-26
繳交日期
Date of Submission
2015-02-08
關鍵字
Keywords
行為知識空間、支持向量機、蛋白質、半胱胺酸、雙硫鍵
behavior knowledge space, disulfide bond, support vector machine, protein, cysteine
統計
Statistics
本論文已被瀏覽 5698 次,被下載 616
The thesis/dissertation has been browsed 5698 times, has been downloaded 616 times.
中文摘要
在這篇論文,在事先給予半胱胺酸的鍵結情形下我們提出了一個新方法來解決蛋白質雙硫鍵的鍵結圖形問題。
我們的方法首先利用整體的半胱胺酸資訊來預測半胱胺酸的標籤,然後使用該標籤預測的結果來預測整體的雙硫鍵鍵結圖形。
如實驗結果顯示的,在相同的訓練資料之下,我們的方法相較於其他的方法擁有較高的準確度。
更進一步的,為了提升我們方法的準確度,我們使用半胱胺酸隔離輪廓方法與行為知識空間來建立我們的混合方法來;該混合方法是由陳等學者所提出。
Abstract
There are four versions for the disulfide prediction problem, including
chain classification, bonding state of each cysteine, connectivity of each cysteines pair and disulfide bonding pattern.
Among these problems, the prediction of disulfide bonding pattern is the most difficult, especially for the proteins with four or more disulfide bonds.
In that case, a large amount of training proteins should be collected in order to obtain a reliable prediction model.
In this thesis, we propose a novel algorithm to solve the disulfide bonding pattern problem for the proteins that the bonding states of cysteines are given.
Our method first predicts the labels of cysteines with global information, and then uses the label prediction results to predict the whole disulfide bonding pattern.
As the experimental results show, our method achieves a higher accuracy than other previous methods for given the same training dataset.
Furthermore, to improve the accuracy of our method, we use the CSP method and the BKS table to build our hybrid model, which was proposed by Chen et al.
目次 Table of Contents
中文審定書 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . i
英文審定書 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ii
謝辭 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii
摘要 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv
Abstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v
TABLE OF CONTENT. . . . . . . . . . . . . . . . . . . . . . . . . . . . . vi
LIST OF FIGURES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . viii
LIST OF TABLES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
Chapter 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Chapter 2. Preliminary . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.1 Proteins and Amino Acid Residues . . . . . . . . . . . . . . . . . . . 5
2.2 Support Vector Machine . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.3 Position-Specific Scoring Matrix . . . . . . . . . . . . . . . . . . . . . 7
2.4 The Behavior Knowledge Space Method . . . . . . . . . . . . . . . . 8
2.5 Previous Studies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.5.1 Cysteine Separations Profiles . . . . . . . . . . . . . . . . . . 9
2.5.2 Chen's Behavior Knowledge Space with SVM Methods . . . . 10
2.5.3 Lin's Method with Structural Information and Multiple Tra-
jectory Search . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
Chapter 3. Our Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.1 Cysteine Label Prediction . . . . . . . . . . . . . . . . . . . . . . . . 13
3.2 The Pairwise Method with Label Prediction Results . . . . . . . . . . 15
3.3 The Hybrid Method . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
Chapter 4. Experimental Results . . . . . . . . . . . . . . . . . . . . . . 21
4.1 Datasets and Performance Evaluation . . . . . . . . . . . . . . . . . . 21
4.2 Experimental Results . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
Chapter 5. Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
BIBLIOGRAPHY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
參考文獻 References
[1] S. F. Altschul, T. L. Madden, A. A. Schaffer, J. Zhang, Z. Zhang, W. Miller,
and D. J. Lipman, "Gapped blast and psi-blast: a new generation of protein
database search programs," Nucleic Acids Research, Vol. 25, No. 17, pp. 3389-
3402, 1997.
[2] C. C. Chang and C. J. Lin, "LIBSVM: A library for support vector machines,"
2001. Software available at http://www.csie.ntu.edu.tw/ cjlin/libsvm.
[3] H. Y. Chen, C. B. Yang, K. T. Tseng, and C. Y. Hor, "The disulfide connec-
tivity prediction with support vector machine and behavior knowledge space,"
Proc. of International Conference on Knowledge Discovery and Information
Retrieval, Vilamoura, Algarve, Portugal, pp. 112-118, Sept. 19-22, 2013.
[4] J. Cheng, H. Saigo, and P. Baldi, "Large-scale prediction of disulphide bridges
using kernel methods, two-dimensional recursive neural networks, and weighted
graph matching," PROTEINS: Structure, Function, and Genetics, Vol. 62,
pp. 617-629, 2006.
[5] C. C. Chuang, C. Y. Chen, J. M. Yang, P. C. Lyu, and J. K. Hwang, "Relation-
ship between protein structures and disulfide-bonding patterns," PROTEINS:
Structure, Function, and Genetics, Vol. 53, pp. 1-5, 2003.
[6] M. O. Dayho , R. M. Schwartz, and B. C. Orcutt, "A model of evolutionary
change in proteins," Atlas of Protein Sequence and Structure (M. O. Dayho ,
ed.), pp. 345-352, 1978.
[7] R. Ed, "A C program to solve maximum weight matching," 1999. Software
available at http://elib.zib.de/pub/Packages/mathprog/.
[8] S. Heniko and J. G. Heniko , "Amino acid substitution matrices from protein
blocks," Proceedings of the National Academy of Sciences of the United States
of America, Vol. 89, No. 22, pp. 10915-10919, 1992.
[9] D. T. Jones, "Protein secondary structure prediction based on position-speci c
scoring matrices," Journal of Molecular Biology, Vol. 292, No. 2, pp. 195-202,
1999.
[10] H. H. Lin, J. C. Hsu, Y. N. Hsu, R. H. Pan, Y. F. Chen, and L. Y. Tseng,
"Disulfide connectivity prediction based on structural information without a
prior knowledge of the bonding state of cysteines," Computers in Biology and
Medicine, Vol. 43, pp. 1941-1948, 2013.
[11] H. H. Lin and L. Y. Tseng, "Prediction of disulfide bonding pattern based on a
support vector machine and multiple trajectory search," Information Sciences,
Vol. 199, pp. 167-178, 2012.
[12] H. L. Liu and S. C. Chen, "Prediction of disulfide connectivity in proteins
with support vector machine," Journal of the Chinese Institute of Chemical
Engineers, Vol. 38, No. 1, pp. 63-70, 2007.
[13] C. H. Lu, Y. C. Chen, C. S. Yu, and J. K. Hwang, "Predicting disulfide con-
nectivity patterns," PROTEINS: Structure, Function, and Genetics, Vol. 67,
pp. 262-270, 2007.
[14] G. L. J. Rama, A. P. Shilton, M. M. Parker, and M. Palaniswami, "Prediction
of cystine connectivity using svm," Bioinformation, Vol. 1, No. 2, pp. 69-74,
2005.
[15] S. Raudys and F. Roli, "The behavior knowledge space fusion method: Analysis
of generalization error and strategies for performance improvement," In Proc.
Int. Workshop on Multiple Classi er Systems (LNCS 2709, pp. 55-64, Springer,
2003.
[16] A. Sali and T. Blundell, "Comparative protein modelling by satisfaction of
spatial restraints," Journal of Molecular Biology, Vol. 1, pp. 779-815, 1993.
[17] V. N. Vapnik, The Nature of Statistical Learning Theory. Springer, 1999.
[18] A. Vullo and P. Frasconi, "Disulfide connectivity prediction using recursive
neural networks and evolutionary information," Bioinformatics, Vol. 20, No. 5,
pp. 653{659, 2004.
[19] C. J.Wang, C. B. Yang, C. Y. Hor, and K. T. Tseng, "Disulfide bond prediction
with hybrid models," Proc. of the 2012 International Conference on Computing
and Security (ICCS12), Ulaanbaatar, Mongolia, jul, 2012.
[20] T. F. Wu and C. J. Lin, "Probability estimates for multi-class classification
by pairwise coupling," Journal of Machine Learning Research, Vol. 5, pp. 975-
1005, 2004.
[21] E. Zhao, H. L. Liu, C. H. Tsai, H. K. Tsai, C. H. Chan, and C. Y. Kao,
"Cysteine separations pro les on protein sequences infer disulfide connectivity,"
Bioinformatics, Vol. 21, No. 8, pp. 1415-1420, 2005.
電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。
論文使用權限 Thesis access permission:自定論文開放時間 user define
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus: 已公開 available


紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。
開放時間 available 已公開 available

QR Code