論文使用權限 Thesis access permission:校內校外完全公開 unrestricted
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus: 已公開 available
論文名稱 Title |
附加條件限制之最長共同子序列方法 An Approach for Solving the Constrained Longest Common Subsequence Problem |
||
系所名稱 Department |
|||
畢業學年期 Year, semester |
語文別 Language |
||
學位類別 Degree |
頁數 Number of pages |
29 |
|
研究生 Author |
|||
指導教授 Advisor |
|||
召集委員 Convenor |
|||
口試委員 Advisory Committee |
|||
口試日期 Date of Exam |
2003-07-11 |
繳交日期 Date of Submission |
2003-08-28 |
關鍵字 Keywords |
最長共同子序列、限制、序列比對、動態規劃 sequence alignment, dynamic programming, costrained, longest common subsequence |
||
統計 Statistics |
本論文已被瀏覽 5734 次,被下載 3744 次 The thesis/dissertation has been browsed 5734 times, has been downloaded 3744 times. |
中文摘要 |
一個子序列是經由把一個給定的序列不刪或刪掉一些符號得來的。給定兩個序列,最長共同子序列問題就是找出一個共同擁有的子序列中長度最長的。而有條件限制的最長共同子序列問題則是找出一個最長的內含指定序列的子序列。注意一個附加條件限制的最長子序列可能比一般的最長子序列短。 在本論文中我們提出一個動態規劃方法的演算法來解有條件限制的最長共同子序列問題。所需的時間複雜度是O(pmn),其中m和n為給定的序列的長度,p為要求的序列的長度。我們的演算法也可以應用到附加條件限制的序列比對問題。也就是序列比對問題在額外要求指定的符號對齊在同一行。 |
Abstract |
A subsequence is obtained by deleting zero or more symbols from a given sequence. For given two sequences, the longest common subsequence problem is to find a common subsequence whose length is the longest. The constrained longest common subsequence (CLCS) problem is to find a longest common subsequence that contains a specific subsequence. Note a CLCS may be shorter than an LCS. In this thesis, we propose a dynamic programming algorithm for solving the CLCS problem. The time complexity is O(pmn), where m and n are the lengths of the given sequences and p is the length of the constraint sequence. Our algorithm can also be applied to solve the constrained sequence alignment problem, which is a sequence alignment problem with the requirement that some specific symbols must be aligned together. |
目次 Table of Contents |
LIST OF FIGURES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 LIST OF TABLES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 ABSTRACT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 Chapter 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Chapter 2. Preliminary . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 2.1 The Longest Common Subsequence Problem . . . . . . . . . . . . . . . . . . 4 2.2 The Constrained LCS Problem . . . . . . . . . . . . . . . . . . . . . . . 8 Chapter 3. An Algorithm for Solving the Constrained LCS Problem . . . . . . .10 3.1 The Constrained LCS Problem . . . . . . . . . . . . . . . . . . . . . . . 10 3.2 Algorithm and Complexity . . . . . . . . . . . . . . . . . . . . . . . . 13 3.3 Proof of the Algorithm . . . . . . . . . . . . . . . . . . . . . . . . . 17 Chapter 4. Simplified Algorithm . . . . . . . . . . . . . . . . . . . . . . . 21 4.1 Simplification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 4.2 Algorithm and Complexity . . . . . . . . . . . . . . . . . . . . . . . . 22 Chapter 5. Extension to the Sequences Alignment Problem . . . . . . . . . . . 23 5.1 Multiple Sequences Alignment . . . . . . . . . . . . . . . . . . . . . . 23 5.2 Multiple Sequences Alignment with Constraints . . . . . . . . . . . . . . 25 Chapter 6. Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27 BIBLIOGRAPHY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 |
參考文獻 References |
[1] A. Aho, D. Hirschberg, and J. Ullman, “Bounds on the complexity of the longest common subsequence problem,” Journal of the ACM, Vol. 23, No. 1, pp. 1–12, 1976. [2] A. Apostolico and C. Guerra, “The longest common subsequence problem revisited,” Algorithmica, No. 2, pp. 315–336, 1987. [3] L. Bergroth, H. Hakonen, and T. Raita, “A survey of longest common subsequence algorithms,” Seventh International Symposium on String Processing Information Retrieval, pp. 39–48, 2000. [4] V. Chavatal, D. Klarner, and D. Knuth, “Selected combinatorial research problem,” Tech. Rep. STAN-CS-72-292, Stanford Univ., p. 26, 1972. [5] D. S. Hirschberg, “Algorithms for the longest common subsequence problem,” Journal of the ACM, Vol. 24, No. 4, pp. 664–675, 1977. [6] J. W. Hunt and T. G. Szymanski, “A fast algorithm for computing longest common subsequences,” Communications of the ACM, Vol. 20, No. 5, pp. 350–353, 1977. [7] J. Modelevsky, “Computer applications in applied genetic engineering,” Advances in Applied Microbiology, Vol. 30, pp. 169–195, 1984. [8] E. W. Myers, “An O(nd) difference algorithm and its variations,” Algorithmica, No. 1, pp. 251–266, 1986. [9] N. Nakatsu, Y. Kambayashi, and S. Yajima, “A longest common subsequence algorithm suitable for similiar texts,” Acat Informatica, Vol. 18, pp. 171–179, 1982. [10] C. Rick, “Simple and fast linear space computation of longest common subsequences,” Information Processing Letters, pp. 275–281, 2000. [11] T. F. Smith and M. S. Waterman, “Comparison of biosequences,” Advances in Applied Mathematics, Vol. 2, pp. 482–489, 1981. [12] R. A. Wagner and M. J. Fischer, “The string-to-string correction problem,”Journal of the ACM, Vol. 21, No. 1, pp. 168–173, 1975. [13] S. Wu, U. Manber, G. Myers, and W. Miller, “An O(NP) sequence comparison algorithm,” Information Processing Letters, Vol. 35, pp. 317–323, 1990. [14] C. B. Yang and R. C. T. Lee, “Systolic algorithm for the longest common subsequence problem,” Journal of the Chinese Institue of Engineers, Vol. 10, No. 6, pp. 691–699, 1987. |
電子全文 Fulltext |
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。 論文使用權限 Thesis access permission:校內校外完全公開 unrestricted 開放時間 Available: 校內 Campus: 已公開 available 校外 Off-campus: 已公開 available |
紙本論文 Printed copies |
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。 開放時間 available 已公開 available |
QR Code |