論文使用權限 Thesis access permission:自定論文開放時間 user define
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus: 已公開 available
論文名稱 Title |
非重疊反轉與轉位距離之計算方法 An Algorithm for Computing the Distance of the Non-overlapping Inversion and Transposition |
||
系所名稱 Department |
|||
畢業學年期 Year, semester |
語文別 Language |
||
學位類別 Degree |
頁數 Number of pages |
47 |
|
研究生 Author |
|||
指導教授 Advisor |
|||
召集委員 Convenor |
|||
口試委員 Advisory Committee |
|||
口試日期 Date of Exam |
2017-01-16 |
繳交日期 Date of Submission |
2017-01-24 |
關鍵字 Keywords |
動態規劃、Run、基因重組、反轉、非重疊、轉位 Non-Overlapping, Genome Rearrangement, Inversion, Transposition, Dynamic Programming, Run |
||
統計 Statistics |
本論文已被瀏覽 5667 次,被下載 298 次 The thesis/dissertation has been browsed 5667 times, has been downloaded 298 times. |
中文摘要 |
鑒於基因重組問題的重要性益增,Ta等人於2015年定義了非重疊反轉與轉位距離問題,並提出O(n^3)時間之方法。目的為計算兩長度皆為n的字串之突變距離,允許的突變為不互相重疊的反轉與轉位。本論文應用非重疊轉位與run之關聯,提出有效率之方法於O(n^2)時間計算非重疊反轉與轉位距離。 |
Abstract |
Reacting to the growing interest in large-scale mutation events, Ta et al. introduced the non-overlapping inversion and transposition distance problem in 2015, which is to calculate the mutation distance between two strings of the same length n. The allowed operations include non-overlapping inversions and transpositions. They presented an algorithm with O(n^3) time. In this thesis, we propose an efficient algorithm to solve the same problem in O(n^2) time. Our algorithm depends on a strong connection between the non-overlapping transposition and the run structure. |
目次 Table of Contents |
論文審定書 i THESIS VERIFICATION FORM ii 論文公開授權書 iii 謝 辭 iv 摘 要 v ABSTRACT vi LIST OF FIGURES viii Chapter 1. Introduction 1 Chapter 2. Preliminaries 4 2.1 The Non-overlapping Inversion and Transposition Distance Problem 4 2.2 Ta's Algorithm for the Non-overlapping Inversion and Transposition Distance Problem 5 2.3 Repetitions and Runs 8 Chapter 3. Our Algorithm 11 3.1 Computation Overview of the Non-overlapping Inversion and Transposition Distance 11 3.2 Computation of Equality and Inversion Matrices 15 3.3 Computation of the Transposition Matrix with a Brute-Force Method 16 3.4 An Analysis of the Brute-Force Method 21 3.5 An Improved Method for Computing the Transposition Matrix 24 Chapter 4. Conclusion 33 BIBLIOGRAPHY 34 |
參考文獻 References |
[1] A. V. Aho and J. E. Hopcroft, The Design and Analysis of Computer Algorithms. Boston, Massachusetts, USA: Addison-Wesley Longman, 1974. [2] A. Apostolico and F. P. Preparata, "Optimal o -line detection of repetitions in a string," Theoretical Computer Science, Vol. 22, No. 3, pp. 297-315, 1983. [3] V. Bafna and P. A. Pevzner, "Genome rearrangements and sorting by reversals," SIAM Journal on Computing, Vol. 25, No. 2, pp. 272-289, 1996. [4] V. Bafna and P. A. Pevzner, "Sorting by transpositions," SIAM Journal on Discrete Mathematics, Vol. 11, No. 2, pp. 224-240, 1998. [5] H. Bannai, S. Inenaga, Y. Nakashima, M. Takeda, K. Tsuruta, et al., "The "runs" theorem," arXiv preprint arXiv:1406.0263, 2014. [6] J. Berstel, A. Lauve, C. Reutenauer, and F. V. Saliola, Combinatorics on Words: Christo el Words and Repetitions in Words. Providence, Rhode Island, USA: American Mathematical Society, 2009. [7] L. Bulteau, G. Fertin, and I. Rusu, "Sorting by transpositions is difficult," SIAM Journal on Discrete Mathematics, Vol. 26, No. 3, pp. 1148-1180, 2012. [8] A. Caprara, "Sorting by reversals is difficult," Proceedings of the first annual international conference on Computational molecular biology, Santa Fe, New Mexico, USA, pp. 75-83, ACM, 1997. [9] D. A. Christie, Genome rearrangement problems. PhD thesis, University of Glasgow, 1998. [10] M. Crochemore, "An optimal algorithm for computing the repetitions in a word," Information Processing Letters, Vol. 12, No. 5, pp. 244-250, 1981. [11] M. Crochemore, "Linear searching for a square in a word," Bulletin-European Association for Theoretical Computer Science, Vol. 24, No. 1, pp. 66-72, 1984. [12] M. Crochemore, L. Ilie, and L. Tinta, "The "runs" conjecture," Theoretical Computer Science, Vol. 412, No. 27, pp. 2931-2941, 2011. [13] M. Crochemore, C. S. Iliopoulos, M. Kubica, J. Radoszewski, W. Rytter, and T. Wale n, "Extracting powers and periods in a word from its runs structure," Theoretical Computer Science, Vol. 521, pp. 29-41, 2014. [14] M. Crochemore, M. Kubica, J. Radoszewski, W. Rytter, and T.Wale n, "On the maximal sum of exponents of runs in a string," Journal of Discrete Algorithms, Vol. 14, pp. 29-36, 2012. [15] M. Crochemore and W. Rytter, Text Algorithms. Singapore: World Scientific, 1994. [16] G. Fertin, A. Labarre, I. Rusu, E. Tannier, and S. Vialette, Combinatorics of Genome Rearrangements. Cambridge, Massachusetts, USA: MIT Press, 2009. [17] D. Gus field, Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology. Cambridge, England: Cambridge University Press, 1997. [18] L. S. Heath and J. P. C. Vergara, "Sorting by bounded block-moves," Discrete Applied Mathematics, Vol. 88, No. 1, pp. 181-206, 1998. [19] C. S. Iliopoulos, D. Moore, and W. F. Smyth, "A characterization of the squares in a Fibonacci string," Theoretical Computer Science, Vol. 172, No. 1, pp. 281-291, 1997. [20] N. Jacobson, Basic Algebra I. New York, New York, USA: W.H. Freeman, 1985. [21] J. Kececioglu and D. Sanko , "Exact and approximation algorithms for sorting by reversals, with application to genome rearrangement," Algorithmica, Vol. 13, No. 1-2, pp. 180-210, 1995. [22] R. Kolpakov and G. Kucherov, "Finding maximal repetitions in a word in linear time," Proceedings of the 40th Annual Symposium on Foundations of Computer Science, New York, New York, USA, pp. 596-604, IEEE Computer Society, 1999. [23] C. Ledergerber and C. Dessimoz, "Alignments with non-overlapping moves, inversions and tandem duplications in O(n^4) time," Journal of Combinatorial Optimization, Vol. 16, No. 3, pp. 263-278, 2008. [24] V. Levenshtein, "Binary codes capable of correcting deletions, insertions and reversal," Soviet Physics Doklady, Vol. 10, No. 8, pp. 707-710, 1966. [25] M. Lothaire, Combinatorics on Words. Cambridge, England: Cambridge University Press, 1997. [26] M. E. MacDonald, C. M. Ambrose, M. P. Duyao, R. H. Myers, C. Lin, L. Srinidhi, G. Barnes, S. A. Taylor, M. James, N. Groot, et al., "A novel gene containing a trinucleotide repeat that is expanded and unstable on huntington's disease chromosomes," Cell, Vol. 72, No. 6, pp. 971-983, 1993. [27] M. Maes, "On a cyclic string-to-string correction problem," Information Processing Letters, Vol. 35, No. 2, pp. 73-78, 1990. [28] M. G. Main, "Detecting leftmost maximal periodicities," Discrete Applied Mathematics, Vol. 25, pp. 145-153, 1989. [29] M. G. Main and R. J. Lorentz, "An O(n log n) algorithm for finding all repetitions in a string," Journal of Algorithms, Vol. 5, No. 3, pp. 422-432, 1984. [30] E. M. McCreight, "A space-economical suffix tree construction algorithm," Journal of the ACM, Vol. 23, No. 2, pp. 262-272, 1976. [31] E. W. Myers, "An O(ND) di erence algorithm and its variations," Algorithmica, Vol. 1, No. 1-4, pp. 251-266, 1986. [32] M. Schoniger and M. S. Waterman, "A local algorithm for DNA sequence alignment with inversions," Bulletin of Mathematical Biology, Vol. 54, pp. 521-536, 1992. [33] D. Shapira and J. A. Storer, "Edit distance with move operations," Proceedings of the Annual Symposium on Combinatorial Pattern Matching, Fukuoka, Japan, pp. 85-98, Springer, 2002. [34] J. Simpson, "Modi ed padovan words and the maximum number of runs in a word," Australasian Journal of Combinatorics, Vol. 46, pp. 129-145, 2010. [35] T. T. Ta, C.-Y. Lin, and C. L. Lu, "An efficient algorithm for computing non-overlapping inversion and transposition distance," Proceedings of the 32nd Workshop on Combinatorial Mathematics and Computation Theory, Taichung, Taiwan, pp. 55-61, 2015. [36] A. F. Vellozo, C. E. Alves, and A. P. do Lago, "Alignment with non-overlapping inversions in O(n^3)-time," Proceedings of the International Workshop on Algorithms in Bioinformatics, Zurich, Switzerland, pp. 186-196, Springer, 2006. [37] R. A. Wagner and M. J. Fischer, "The string-to-string correction problem," Journal of the ACM, Vol. 21, No. 1, pp. 168-173, 1974. [38] M. E. M.Walter, Z. Dias, and J. Meidanis, "Reversal and transposition distance of linear chromosomes," Proceedings of the String Processing and Information Retrieval: A South American Symposium, Santa Cruz de La Sierra, Bolivia, pp. 96-102, IEEE, 1998. [39] G. Watterson, W. J. Ewens, T. E. Hall, and A. Morgan, "The chromosome inversion problem," Journal of Theoretical Biology, Vol. 99, No. 1, pp. 1-7, 1982. [40] I. Witten, A. Mo at, and T. Bell, Managing Gigabytes: Compressing and Indexing Documents and Images. Burlington, Massachusetts, USA: Morgan Kaufmann, 1999. |
電子全文 Fulltext |
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。 論文使用權限 Thesis access permission:自定論文開放時間 user define 開放時間 Available: 校內 Campus: 已公開 available 校外 Off-campus: 已公開 available |
紙本論文 Printed copies |
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。 開放時間 available 已公開 available |
QR Code |