Responsive image
博碩士論文 etd-0701118-144718 詳細資訊
Title page for etd-0701118-144718
論文名稱
Title
限制最長共同子序列之對角線演算法
A Diagonal-Based Algorithm for the Constrained Longest Common Subsequence Problem
系所名稱
Department
畢業學年期
Year, semester
語文別
Language
學位類別
Degree
頁數
Number of pages
69
研究生
Author
指導教授
Advisor
召集委員
Convenor
口試委員
Advisory Committee
口試日期
Date of Exam
2018-07-30
繳交日期
Date of Submission
2018-08-01
關鍵字
Keywords
對角線、相似度、支配、限制最長共同子序列、最長共同子序列、演算法設計
diagonal, domination, similarity, constrained longest common subsequence, longest common subsequence, design of algorithm
統計
Statistics
本論文已被瀏覽 5659 次,被下載 133
The thesis/dissertation has been browsed 5659 times, has been downloaded 133 times.
中文摘要
過去數十年以來,最長共同子序列問題及其變形都已經有很深的研究。其中,限制最長共同子序列問題為給定 A 與 B 兩條序列以及一條限制序列 C,長度分別為m、n、r,找出 A 與 B 的最長共同子序列,而且此序列必須包含限制序列 C。本論文中提出一個演算法基於Nakatsu等學者提出的對角線概念來得到CLCS的長度。我們的演算法能更有效找到CLCS的長度並且時間複雜度與空間複雜度為O(rL(m-L))跟O(mr),L 為CLCS長度。如實驗結果所示,我們的演算法表現出比先前發表的演算法還要好的效能。
Abstract
The longest common subsequence (LCS) problem and its variations have been studied deeply in past decades. In the constrained longest common subsequence (CLCS) problem, given three sequences A, B, and C of lengths m, n, and r, respectively, its goal is to find the LCS of A and B that C is a subsequence contained in the LCS answer. This thesis proposes an algorithm for obtaining the CLCS length based on the diagonal concept for finding the LCS length proposed by Nakatsu et al. Our algorithm can find the CLCS length more efficiently with O(rL(m - L)) time and O(mr) space, where L is the CLCS length. As the experimental results show, our CLCS algorithm outperforms the previously published algorithms.
目次 Table of Contents
TABLE OF CONTENTS
Page
VERIFICATION FORM . . . . . . . . . . . . . . . . . . . . . . . . . . . . i
THESIS AUTHORIZATION FORM . . . . . . . . . . . . . . . . . . . . iii
ACKNOWLEDGMENTS . . . . . . . . . . . . . . . . . . . . . . . . . . . iv
CHINESE ABSTRACT . . . . . . . . . . . . . . . . . . . . . . . . . . . . v
ABSTRACT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vi
LIST OF FIGURES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
LIST OF TABLES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xii
LIST OF SYMBOLS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xv
Chapter 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Chapter 2. Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.1 The Longest Common Subsequence Problem . . . . . . . . . . . . . . 5
2.2 The Longest Common Subsequence Algorithm Proposed by Nakatsu
et al. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.3 The Constrained Longest Common Subsequence
Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.3.1 The Constrained Longest Common Subsequence Algorithm
Proposed by Tsai . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.3.2 The Constrained Longest Common Subsequence Algorithm
Proposed by Chin et al. . . . . . . . . . . . . . . . . . . . . . 12
2.3.3 The Constrained Longest Common Subsequence Algorithm
Proposed by Arslan and Egecioglu . . . . . . . . . . . . . . . 16
Chapter 3. The Diagonal-based Algorithm . . . . . . . . . . . . . . . . 17
Chapter 4. Experimental Results . . . . . . . . . . . . . . . . . . . . . . 24
Chapter 5. Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
BIBLIOGRAPHY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
Appendixes
A. Miscellaneous Experimental Results . . . . . . . . . . . . . . . . . . 42
參考文獻 References
[1] H.-Y. Ann, C.-B. Yang, C.-T. Tseng, and C.-Y. Hor, "Fast algorithms for
computing the constrained LCS of run-length encoded strings," Theoretical
Computer Science, Vol. 432, pp. 1-9, 2012.
[2] A. N. Arslan and  O. Egecioglu, "Algorithms for the constrained longest common
subsequence problems," International Journal of Foundations of Computer
Science, Vol. 16, No. 06, pp. 1099-1109, 2005.
[3] D. Becerra, W. Soto, L. Nino, and Y. Pinzon, "An algorithm for constrained
LCS," Proceedings of 2010 IEEE/ACS International Conference on Computer
Systems and Applications (AICCSA), Hammamet, Tunisia, pp. 1-7, IEEE,
2010.
[4] G. S. Brodal, K. Kaligosi, I. Katriel, and M. Kutz, "Faster algorithms for
computing longest common increasing subsequences," Proceedings of Annual
Symposium on Combinatorial Pattern Matching, Barcelona, Spain, pp. 330-
341, Springer, 2006.
[5] Y.-C. Chen and K.-M. Chao, "On the generalized constrained longest common
subsequence problems," Journal of Combinatorial Optimization, Vol. 21, No. 3,
pp. 383-392, Apr. 2011.
[6] F. Y. L. Chin, A. D. Santis, A. L. Ferrara, N. L. Ho, and S. K. Kim, "A simple
algorithm for the constrained sequence problems," Information Processing
Letters, Vol. 90(4), pp. 175-179, 2004.
[7] S. Deorowicz, "Fast algorithm for the constrained longest common subsequence
problem," Theoretical and Applied Informatics, Vol. 19, No. 2, pp. 91-102, 2007.
[8] S. Deorowicz, "Bit-parallel algorithm for the constrained longest common subsequence
problem," Fundamenta Informaticae, Vol. 99, No. 4, pp. 409-433,
2010.
[9] S. Deorowicz and J. Obstoj, "Constrained longest common subsequence computing
algorithms in practice," Computing and Informatics, Vol. 29, No. 3,
pp. 427-445, 2012.
[10] D. He and A. N. Arslan, "A space-ecient algorithm for the constrained
pairwise sequence alignment problem," Genome Informatics, Vol. 16, No. 2,
pp. 237-246, 2005.
[11] D. S. Hirschberg, "A linear space algorithm for computing maximal common
subsequences," Communications of the ACM, Vol. 18, No. 6, pp. 341-343, 1975.
[12] W.-C. Ho, "A fast algorithm for the constrained longest common subsequence
problem with small alphabet," Master's Thesis, Department of Computer Science
and Engineering, National Sun Yat-Sen University, Kaohsiung, Taiwan,
2017.
[13] W.-C. Ho, K.-S. Huang, and C.-B. Yang, "A fast algorithm for the constrained
longest common subsequence problem with small alphabet," Proceedings of
the 34th Workshop on Combinatorial Mathematics and Computation Theory,
Taichung, Taiwan, pp. 13-25, 2017.
[14] K.-S. Huang, C.-B. Yang, K.-T. Tseng, H.-Y. Ann, and Y.-H. Peng, "Efficient
algorithms for nding interleaving relationship between sequences," Informa-
tion Processing Letters, Vol. 105, No. 5, pp. 188-193, 2008.
[15] J. W. Hunt and T. G. Szymanski, "A fast algorithm for computing longest
common subsequences," Communications of the ACM, Vol. 20, No. 5, pp. 350-
353, 1977.
[16] C. S. Iliopoulos and M. S. Rahman, "New efficient algorithms for the LCS
and constrained LCS problems," Information Processing Letters, Vol. 106(1),
pp. 13-18, 2008.
[17] J. B. Kruskal, "An overview of sequence comparison: Time warps, string edits,
and macromolecules," SIAM Review, Vol. 25, No. 2, pp. 201-237, 1983.
[18] N. Nakatsu, Y. Kambayashi, and S. Yajima, "A longest common subsequence
algorithm suitable for similar text strings," Acta Informatica, Vol. 18, pp. 171-
179, 1982.
[19] C.-L. Peng, "An approach for solving the constrained longest common subsequence
problem," Master's Thesis, Department of Computer Science and Engineering,
National Sun Yat-Sen University, Kaohsiung, Taiwan, 2003.
[20] Y.-H. Peng, C.-B. Yang, K.-S. Huang, and K.-T. Tseng, "An algorithm and
applications to sequence alignment with weighted constraints," International
Journal of Foundations of Computer Science, Vol. 21, pp. 51-59, 2010.
[21] Z. Peng and H.-F. Ting, "Time and space ecient algorithms for constrained
sequence alignment," Proceedings of International Conference on Implementa-
tion and Application of Automata, Kingston, Ontario, Canada, pp. 237-246,
Springer, 2004.
[22] Y. T. Tsai, "The constrained longest common subsequence problem," Informa-
tion Processing Letters, Vol. 88, pp. 173-176, 2003.
[23] K.-T. Tseng, D.-S. Chan, C.-B. Yang, and S.-F. Lo, "Efficient merged longest
common subsequence algorithms for similar sequences," Theoretical Computer
Science, Vol. 708, pp. 75-90, Jan. 2018.
[24] R. Wagner and M. Fischer, "The string-to-string correction problem," Journal
of the ACM, Vol. 21, No. 1, pp. 168-173, 1974.
[25] W.-L.Wang, "Longest common subsequence with constraint.," Master's Thesis,
Department of Computer Science and Information Engineering, National Chi-
Nan University, Nantou, Taiwan, 2006.
[26] I.-H. Yang, C.-P. Huang, and K.-M. Chao, "A fast algorithm for computing
a longest common increasing subsequence," Information Processing Letters,
Vol. 93, No. 5, pp. 249-253, 2005.
電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。
論文使用權限 Thesis access permission:自定論文開放時間 user define
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus: 已公開 available


紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。
開放時間 available 已公開 available

QR Code