Responsive image
博碩士論文 etd-0022117-230149 詳細資訊
Title page for etd-0022117-230149
論文名稱
Title
非重疊反轉與轉位距離之計算方法
An Algorithm for Computing the Distance of the Non-overlapping Inversion and Transposition
系所名稱
Department
畢業學年期
Year, semester
語文別
Language
學位類別
Degree
頁數
Number of pages
47
研究生
Author
指導教授
Advisor
召集委員
Convenor
口試委員
Advisory Committee
口試日期
Date of Exam
2017-01-16
繳交日期
Date of Submission
2017-01-24
關鍵字
Keywords
動態規劃、Run、基因重組、反轉、非重疊、轉位
Non-Overlapping, Genome Rearrangement, Inversion, Transposition, Dynamic Programming, Run
統計
Statistics
本論文已被瀏覽 5667 次,被下載 298
The thesis/dissertation has been browsed 5667 times, has been downloaded 298 times.
中文摘要
鑒於基因重組問題的重要性益增,Ta等人於2015年定義了非重疊反轉與轉位距離問題,並提出O(n^3)時間之方法。目的為計算兩長度皆為n的字串之突變距離,允許的突變為不互相重疊的反轉與轉位。本論文應用非重疊轉位與run之關聯,提出有效率之方法於O(n^2)時間計算非重疊反轉與轉位距離。
Abstract
Reacting to the growing interest in large-scale mutation events, Ta et al. introduced the non-overlapping inversion and transposition distance problem in 2015, which is to calculate the mutation distance between two strings of the same length n. The allowed operations include non-overlapping inversions and transpositions. They presented an algorithm with O(n^3) time. In this thesis, we propose an efficient algorithm to solve the same problem in O(n^2) time. Our algorithm depends on a strong connection between the non-overlapping transposition and the run structure.
目次 Table of Contents
論文審定書  i
THESIS VERIFICATION FORM  ii
論文公開授權書  iii
謝 辭  iv
摘 要  v
ABSTRACT  vi
LIST OF FIGURES  viii
Chapter 1. Introduction  1
Chapter 2. Preliminaries  4
2.1 The Non-overlapping Inversion and Transposition Distance Problem  4
2.2 Ta's Algorithm for the Non-overlapping Inversion and Transposition Distance Problem  5
2.3 Repetitions and Runs  8
Chapter 3. Our Algorithm  11
3.1 Computation Overview of the Non-overlapping Inversion and Transposition Distance  11
3.2 Computation of Equality and Inversion Matrices  15
3.3 Computation of the Transposition Matrix with a Brute-Force Method  16
3.4 An Analysis of the Brute-Force Method  21
3.5 An Improved Method for Computing the Transposition Matrix  24
Chapter 4. Conclusion  33
BIBLIOGRAPHY  34
參考文獻 References
[1] A. V. Aho and J. E. Hopcroft, The Design and Analysis of Computer Algorithms. Boston, Massachusetts, USA: Addison-Wesley Longman, 1974.
[2] A. Apostolico and F. P. Preparata, "Optimal o -line detection of repetitions in a string," Theoretical Computer Science, Vol. 22, No. 3, pp. 297-315, 1983.
[3] V. Bafna and P. A. Pevzner, "Genome rearrangements and sorting by reversals," SIAM Journal on Computing, Vol. 25, No. 2, pp. 272-289, 1996.
[4] V. Bafna and P. A. Pevzner, "Sorting by transpositions," SIAM Journal on Discrete Mathematics, Vol. 11, No. 2, pp. 224-240, 1998.
[5] H. Bannai, S. Inenaga, Y. Nakashima, M. Takeda, K. Tsuruta, et al., "The "runs" theorem," arXiv preprint arXiv:1406.0263, 2014.
[6] J. Berstel, A. Lauve, C. Reutenauer, and F. V. Saliola, Combinatorics on Words: Christo el Words and Repetitions in Words. Providence, Rhode Island, USA: American Mathematical Society, 2009.
[7] L. Bulteau, G. Fertin, and I. Rusu, "Sorting by transpositions is difficult," SIAM Journal on Discrete Mathematics, Vol. 26, No. 3, pp. 1148-1180, 2012.
[8] A. Caprara, "Sorting by reversals is difficult," Proceedings of the first annual international conference on Computational molecular biology, Santa Fe, New Mexico, USA, pp. 75-83, ACM, 1997.
[9] D. A. Christie, Genome rearrangement problems. PhD thesis, University of Glasgow, 1998.
[10] M. Crochemore, "An optimal algorithm for computing the repetitions in a word," Information Processing Letters, Vol. 12, No. 5, pp. 244-250, 1981.
[11] M. Crochemore, "Linear searching for a square in a word," Bulletin-European Association for Theoretical Computer Science, Vol. 24, No. 1, pp. 66-72, 1984.
[12] M. Crochemore, L. Ilie, and L. Tinta, "The "runs" conjecture," Theoretical Computer Science, Vol. 412, No. 27, pp. 2931-2941, 2011.
[13] M. Crochemore, C. S. Iliopoulos, M. Kubica, J. Radoszewski, W. Rytter, and T. Wale n, "Extracting powers and periods in a word from its runs structure," Theoretical Computer Science, Vol. 521, pp. 29-41, 2014.
[14] M. Crochemore, M. Kubica, J. Radoszewski, W. Rytter, and T.Wale n, "On the maximal sum of exponents of runs in a string," Journal of Discrete Algorithms, Vol. 14, pp. 29-36, 2012.
[15] M. Crochemore and W. Rytter, Text Algorithms. Singapore: World Scientific, 1994.
[16] G. Fertin, A. Labarre, I. Rusu, E. Tannier, and S. Vialette, Combinatorics of Genome Rearrangements. Cambridge, Massachusetts, USA: MIT Press, 2009.
[17] D. Gus field, Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology. Cambridge, England: Cambridge University Press, 1997.
[18] L. S. Heath and J. P. C. Vergara, "Sorting by bounded block-moves," Discrete Applied Mathematics, Vol. 88, No. 1, pp. 181-206, 1998.
[19] C. S. Iliopoulos, D. Moore, and W. F. Smyth, "A characterization of the squares
in a Fibonacci string," Theoretical Computer Science, Vol. 172, No. 1, pp. 281-291, 1997.
[20] N. Jacobson, Basic Algebra I. New York, New York, USA: W.H. Freeman, 1985.
[21] J. Kececioglu and D. Sanko , "Exact and approximation algorithms for sorting by reversals, with application to genome rearrangement," Algorithmica, Vol. 13, No. 1-2, pp. 180-210, 1995.
[22] R. Kolpakov and G. Kucherov, "Finding maximal repetitions in a word in linear time," Proceedings of the 40th Annual Symposium on Foundations of Computer Science, New York, New York, USA, pp. 596-604, IEEE Computer Society, 1999.
[23] C. Ledergerber and C. Dessimoz, "Alignments with non-overlapping moves, inversions and tandem duplications in O(n^4) time," Journal of Combinatorial Optimization, Vol. 16, No. 3, pp. 263-278, 2008.
[24] V. Levenshtein, "Binary codes capable of correcting deletions, insertions and reversal," Soviet Physics Doklady, Vol. 10, No. 8, pp. 707-710, 1966.
[25] M. Lothaire, Combinatorics on Words. Cambridge, England: Cambridge University Press, 1997.
[26] M. E. MacDonald, C. M. Ambrose, M. P. Duyao, R. H. Myers, C. Lin, L. Srinidhi, G. Barnes, S. A. Taylor, M. James, N. Groot, et al., "A novel gene containing a trinucleotide repeat that is expanded and unstable on huntington's disease chromosomes," Cell, Vol. 72, No. 6, pp. 971-983, 1993.
[27] M. Maes, "On a cyclic string-to-string correction problem," Information Processing Letters, Vol. 35, No. 2, pp. 73-78, 1990.
[28] M. G. Main, "Detecting leftmost maximal periodicities," Discrete Applied Mathematics, Vol. 25, pp. 145-153, 1989.
[29] M. G. Main and R. J. Lorentz, "An O(n log n) algorithm for finding all repetitions in a string," Journal of Algorithms, Vol. 5, No. 3, pp. 422-432, 1984.
[30] E. M. McCreight, "A space-economical suffix tree construction algorithm," Journal of the ACM, Vol. 23, No. 2, pp. 262-272, 1976.
[31] E. W. Myers, "An O(ND) di erence algorithm and its variations," Algorithmica, Vol. 1, No. 1-4, pp. 251-266, 1986.
[32] M. Schoniger and M. S. Waterman, "A local algorithm for DNA sequence alignment with inversions," Bulletin of Mathematical Biology, Vol. 54, pp. 521-536, 1992.
[33] D. Shapira and J. A. Storer, "Edit distance with move operations," Proceedings of the Annual Symposium on Combinatorial Pattern Matching, Fukuoka, Japan, pp. 85-98, Springer, 2002.
[34] J. Simpson, "Modi ed padovan words and the maximum number of runs in a word," Australasian Journal of Combinatorics, Vol. 46, pp. 129-145, 2010.
[35] T. T. Ta, C.-Y. Lin, and C. L. Lu, "An efficient algorithm for computing non-overlapping inversion and transposition distance," Proceedings of the 32nd Workshop on Combinatorial Mathematics and Computation Theory, Taichung, Taiwan, pp. 55-61, 2015.
[36] A. F. Vellozo, C. E. Alves, and A. P. do Lago, "Alignment with non-overlapping inversions in O(n^3)-time," Proceedings of the International Workshop on Algorithms in Bioinformatics, Zurich, Switzerland, pp. 186-196, Springer, 2006.
[37] R. A. Wagner and M. J. Fischer, "The string-to-string correction problem," Journal of the ACM, Vol. 21, No. 1, pp. 168-173, 1974.
[38] M. E. M.Walter, Z. Dias, and J. Meidanis, "Reversal and transposition distance of linear chromosomes," Proceedings of the String Processing and Information Retrieval: A South American Symposium, Santa Cruz de La Sierra, Bolivia, pp. 96-102, IEEE, 1998.
[39] G. Watterson, W. J. Ewens, T. E. Hall, and A. Morgan, "The chromosome inversion problem," Journal of Theoretical Biology, Vol. 99, No. 1, pp. 1-7, 1982.
[40] I. Witten, A. Mo at, and T. Bell, Managing Gigabytes: Compressing and Indexing Documents and Images. Burlington, Massachusetts, USA: Morgan Kaufmann,
1999.
電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。
論文使用權限 Thesis access permission:自定論文開放時間 user define
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus: 已公開 available


紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。
開放時間 available 已公開 available

QR Code