論文使用權限 Thesis access permission:自定論文開放時間 user define
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus: 已公開 available
論文名稱 Title |
限制最長共同子序列之對角線演算法 A Diagonal-Based Algorithm for the Constrained Longest Common Subsequence Problem |
||
系所名稱 Department |
|||
畢業學年期 Year, semester |
語文別 Language |
||
學位類別 Degree |
頁數 Number of pages |
69 |
|
研究生 Author |
|||
指導教授 Advisor |
|||
召集委員 Convenor |
|||
口試委員 Advisory Committee |
|||
口試日期 Date of Exam |
2018-07-30 |
繳交日期 Date of Submission |
2018-08-01 |
關鍵字 Keywords |
對角線、相似度、支配、限制最長共同子序列、最長共同子序列、演算法設計 diagonal, domination, similarity, constrained longest common subsequence, longest common subsequence, design of algorithm |
||
統計 Statistics |
本論文已被瀏覽 5659 次,被下載 133 次 The thesis/dissertation has been browsed 5659 times, has been downloaded 133 times. |
中文摘要 |
過去數十年以來,最長共同子序列問題及其變形都已經有很深的研究。其中,限制最長共同子序列問題為給定 A 與 B 兩條序列以及一條限制序列 C,長度分別為m、n、r,找出 A 與 B 的最長共同子序列,而且此序列必須包含限制序列 C。本論文中提出一個演算法基於Nakatsu等學者提出的對角線概念來得到CLCS的長度。我們的演算法能更有效找到CLCS的長度並且時間複雜度與空間複雜度為O(rL(m-L))跟O(mr),L 為CLCS長度。如實驗結果所示,我們的演算法表現出比先前發表的演算法還要好的效能。 |
Abstract |
The longest common subsequence (LCS) problem and its variations have been studied deeply in past decades. In the constrained longest common subsequence (CLCS) problem, given three sequences A, B, and C of lengths m, n, and r, respectively, its goal is to find the LCS of A and B that C is a subsequence contained in the LCS answer. This thesis proposes an algorithm for obtaining the CLCS length based on the diagonal concept for finding the LCS length proposed by Nakatsu et al. Our algorithm can find the CLCS length more efficiently with O(rL(m - L)) time and O(mr) space, where L is the CLCS length. As the experimental results show, our CLCS algorithm outperforms the previously published algorithms. |
目次 Table of Contents |
TABLE OF CONTENTS Page VERIFICATION FORM . . . . . . . . . . . . . . . . . . . . . . . . . . . . i THESIS AUTHORIZATION FORM . . . . . . . . . . . . . . . . . . . . iii ACKNOWLEDGMENTS . . . . . . . . . . . . . . . . . . . . . . . . . . . iv CHINESE ABSTRACT . . . . . . . . . . . . . . . . . . . . . . . . . . . . v ABSTRACT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vi LIST OF FIGURES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix LIST OF TABLES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xii LIST OF SYMBOLS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xv Chapter 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Chapter 2. Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2.1 The Longest Common Subsequence Problem . . . . . . . . . . . . . . 5 2.2 The Longest Common Subsequence Algorithm Proposed by Nakatsu et al. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 2.3 The Constrained Longest Common Subsequence Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 2.3.1 The Constrained Longest Common Subsequence Algorithm Proposed by Tsai . . . . . . . . . . . . . . . . . . . . . . . . . 11 2.3.2 The Constrained Longest Common Subsequence Algorithm Proposed by Chin et al. . . . . . . . . . . . . . . . . . . . . . 12 2.3.3 The Constrained Longest Common Subsequence Algorithm Proposed by Arslan and Egecioglu . . . . . . . . . . . . . . . 16 Chapter 3. The Diagonal-based Algorithm . . . . . . . . . . . . . . . . 17 Chapter 4. Experimental Results . . . . . . . . . . . . . . . . . . . . . . 24 Chapter 5. Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38 BIBLIOGRAPHY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 Appendixes A. Miscellaneous Experimental Results . . . . . . . . . . . . . . . . . . 42 |
參考文獻 References |
[1] H.-Y. Ann, C.-B. Yang, C.-T. Tseng, and C.-Y. Hor, "Fast algorithms for computing the constrained LCS of run-length encoded strings," Theoretical Computer Science, Vol. 432, pp. 1-9, 2012. [2] A. N. Arslan and O. Egecioglu, "Algorithms for the constrained longest common subsequence problems," International Journal of Foundations of Computer Science, Vol. 16, No. 06, pp. 1099-1109, 2005. [3] D. Becerra, W. Soto, L. Nino, and Y. Pinzon, "An algorithm for constrained LCS," Proceedings of 2010 IEEE/ACS International Conference on Computer Systems and Applications (AICCSA), Hammamet, Tunisia, pp. 1-7, IEEE, 2010. [4] G. S. Brodal, K. Kaligosi, I. Katriel, and M. Kutz, "Faster algorithms for computing longest common increasing subsequences," Proceedings of Annual Symposium on Combinatorial Pattern Matching, Barcelona, Spain, pp. 330- 341, Springer, 2006. [5] Y.-C. Chen and K.-M. Chao, "On the generalized constrained longest common subsequence problems," Journal of Combinatorial Optimization, Vol. 21, No. 3, pp. 383-392, Apr. 2011. [6] F. Y. L. Chin, A. D. Santis, A. L. Ferrara, N. L. Ho, and S. K. Kim, "A simple algorithm for the constrained sequence problems," Information Processing Letters, Vol. 90(4), pp. 175-179, 2004. [7] S. Deorowicz, "Fast algorithm for the constrained longest common subsequence problem," Theoretical and Applied Informatics, Vol. 19, No. 2, pp. 91-102, 2007. [8] S. Deorowicz, "Bit-parallel algorithm for the constrained longest common subsequence problem," Fundamenta Informaticae, Vol. 99, No. 4, pp. 409-433, 2010. [9] S. Deorowicz and J. Obstoj, "Constrained longest common subsequence computing algorithms in practice," Computing and Informatics, Vol. 29, No. 3, pp. 427-445, 2012. [10] D. He and A. N. Arslan, "A space-ecient algorithm for the constrained pairwise sequence alignment problem," Genome Informatics, Vol. 16, No. 2, pp. 237-246, 2005. [11] D. S. Hirschberg, "A linear space algorithm for computing maximal common subsequences," Communications of the ACM, Vol. 18, No. 6, pp. 341-343, 1975. [12] W.-C. Ho, "A fast algorithm for the constrained longest common subsequence problem with small alphabet," Master's Thesis, Department of Computer Science and Engineering, National Sun Yat-Sen University, Kaohsiung, Taiwan, 2017. [13] W.-C. Ho, K.-S. Huang, and C.-B. Yang, "A fast algorithm for the constrained longest common subsequence problem with small alphabet," Proceedings of the 34th Workshop on Combinatorial Mathematics and Computation Theory, Taichung, Taiwan, pp. 13-25, 2017. [14] K.-S. Huang, C.-B. Yang, K.-T. Tseng, H.-Y. Ann, and Y.-H. Peng, "Efficient algorithms for nding interleaving relationship between sequences," Informa- tion Processing Letters, Vol. 105, No. 5, pp. 188-193, 2008. [15] J. W. Hunt and T. G. Szymanski, "A fast algorithm for computing longest common subsequences," Communications of the ACM, Vol. 20, No. 5, pp. 350- 353, 1977. [16] C. S. Iliopoulos and M. S. Rahman, "New efficient algorithms for the LCS and constrained LCS problems," Information Processing Letters, Vol. 106(1), pp. 13-18, 2008. [17] J. B. Kruskal, "An overview of sequence comparison: Time warps, string edits, and macromolecules," SIAM Review, Vol. 25, No. 2, pp. 201-237, 1983. [18] N. Nakatsu, Y. Kambayashi, and S. Yajima, "A longest common subsequence algorithm suitable for similar text strings," Acta Informatica, Vol. 18, pp. 171- 179, 1982. [19] C.-L. Peng, "An approach for solving the constrained longest common subsequence problem," Master's Thesis, Department of Computer Science and Engineering, National Sun Yat-Sen University, Kaohsiung, Taiwan, 2003. [20] Y.-H. Peng, C.-B. Yang, K.-S. Huang, and K.-T. Tseng, "An algorithm and applications to sequence alignment with weighted constraints," International Journal of Foundations of Computer Science, Vol. 21, pp. 51-59, 2010. [21] Z. Peng and H.-F. Ting, "Time and space ecient algorithms for constrained sequence alignment," Proceedings of International Conference on Implementa- tion and Application of Automata, Kingston, Ontario, Canada, pp. 237-246, Springer, 2004. [22] Y. T. Tsai, "The constrained longest common subsequence problem," Informa- tion Processing Letters, Vol. 88, pp. 173-176, 2003. [23] K.-T. Tseng, D.-S. Chan, C.-B. Yang, and S.-F. Lo, "Efficient merged longest common subsequence algorithms for similar sequences," Theoretical Computer Science, Vol. 708, pp. 75-90, Jan. 2018. [24] R. Wagner and M. Fischer, "The string-to-string correction problem," Journal of the ACM, Vol. 21, No. 1, pp. 168-173, 1974. [25] W.-L.Wang, "Longest common subsequence with constraint.," Master's Thesis, Department of Computer Science and Information Engineering, National Chi- Nan University, Nantou, Taiwan, 2006. [26] I.-H. Yang, C.-P. Huang, and K.-M. Chao, "A fast algorithm for computing a longest common increasing subsequence," Information Processing Letters, Vol. 93, No. 5, pp. 249-253, 2005. |
電子全文 Fulltext |
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。 論文使用權限 Thesis access permission:自定論文開放時間 user define 開放時間 Available: 校內 Campus: 已公開 available 校外 Off-campus: 已公開 available |
紙本論文 Printed copies |
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。 開放時間 available 已公開 available |
QR Code |