Responsive image
博碩士論文 etd-0723112-012802 詳細資訊
Title page for etd-0723112-012802
論文名稱
Title
以查表為主的函數計算與錯誤更正碼之設計
Table Based Design for Function Evaluation and Error Correcting Codes
系所名稱
Department
畢業學年期
Year, semester
語文別
Language
學位類別
Degree
頁數
Number of pages
133
研究生
Author
指導教授
Advisor
召集委員
Convenor
口試委員
Advisory Committee
口試日期
Date of Exam
2012-06-26
繳交日期
Date of Submission
2012-07-23
關鍵字
Keywords
非等份切割法、函數近似方法、多項式逼近法、里德所羅門編碼器、查表法錯誤糾正碼
function evaluation, piecewise polynomial approximation, Reed-Solomon code, table-based methods, error correcting coding
統計
Statistics
本論文已被瀏覽 5697 次,被下載 237
The thesis/dissertation has been browsed 5697 times, has been downloaded 237 times.
中文摘要
在各種研究領域中,我們常使用查表法來加速硬體的效能及減少硬體的面積。本篇論文提出了幾種應用於錯誤糾正碼以及函數近似法中以查表法為主的新設計。在第三章中,我們設計了一個新的函數近似法,稱為”二階段近似法”。在這個方法中,使用piecewise的一階多項式做為第一階段的初始化逼近,接著再利用共享”正規化後的差值函數 ”來做為第二階段的進一步逼近。我們在第四章中,則透過非等份切割法,切割表格後再將表格重排,以取得可較節省面積的對應方式,此方法針對函數近似演算法加以改良,可有效減少表格面積、整體硬體面積以及額外的硬體負擔。在前面所提的函數近似法中,是使用多項式逼近法將原本的函數曲線切割成許多子區間,每個子區間再以較低階的多項式逼近,並將多項式的係數儲存於ROM中。而piecewise架構在四個部份會產生誤差,分別為多項式逼近(approximation errors)、係數量化(coefficient quantization errors)、乘法器及平方器等元件截斷(truncation errors)和最後四捨五入(rounding error)的誤差。不像傳統的piecewise架構必須事先做誤差分析,評估每個部分有多少誤差預算,再將誤差分配給每個會產生誤差的部份。因此在第五章中,本論文提出了一個新的方法,不再需要事先分配誤差給各部份,而是將上述所有誤差一併考慮,可以有效降低ROM和算術單元的面積與延遲時間。同樣,類似的查表法概念也可以被應用在錯誤糾正碼的方面。所以在第六章中,我們發展了一個以拉格朗日插值法(Lagrange Interpolation)為基礎的里德所羅門編碼器(Reed-Solomon Encoder),此方法主要的核心概念為將修改過的拉格朗日多項式係數存在表格中以加速硬體的計算,並且達到低功率的效果。
Abstract
Lookup-table (LUT)-based method is a common approach used in all kinds of research topics. In this dissertation, we present several new designs for table-based function evaluation and table-based error correcting coding. In Chapter 3, a new function evaluation method, called two-level approximation, is presented where piecewise degree-one polynomials are used for initial approximation in the first level, followed by the refined approximation for the shared normalized difference functions in the second level. In Chapter 4, we present a new non-uniform segmentation method that searches for the optimal segmentation scheme with the different design goals of minimizing either ROM, total area, or delay. In Chapter 5, a new design methodology for table-based function evaluation is presented. Unlike previous approaches that usually determine the bit widths by assigning allowable errors for individual hardware components, the total error budget of our new design is considered jointly in order to optimized the bit widths of all the hardware components, leading to significant improvements in both area and delay. Finally, in Chapter 6, the similar table-based concept is used in the design of error correcting encoder using the modified polynomial of the Lagrange interpolation formula, resulting in smaller critical path delay and lower power consumption.
目次 Table of Contents
中文論文審定書 i
英文論文審定書 ii
Acknowledgement(誌謝) iv
中文摘要 v
Abstract vi
Chapter 1 Introduction 1
1.1 Motivation 1
1.2 Dissertation Organization 3
Chapter 2 Survey of Related Works 4
2.1 Arithmetic Categories 4
2.1.1 Table-bound Methods 6
2.1.2 In-between Methods 12
2.1.3 Summary 26
2.2 Design of error correcting encoder 27
2.2.1 Linear Feedback Shift Register (LFSR) 28
2.2.2 Lagrange 31
Chapter 3 Table-Based Two-Level Approximation 33
3.1 Two-level Approximation 33
3.1.1 Initial Approximation 33
3.1.2 Refined Approximation 35
3.2 Hardware Architecture 37
3.2.1 Overall Hardware Architecture 37
3.2.2 Trade-off Between Two levels of Approximations 38
3.3 Error Analysis and Bit Width Optimization 39
3.4 Experimental results and comparison 40
3.5 Conclusion 44
Chapter 4 Non-uniform Segmentation for Function Evaluation with Address Re-mapping 45
4.1 Previous Uniform and Non-uniform Segmentation 45
4.2 Proposed Non-uniform Segmentation with Remapping 46
4.2.1 Address Remapping 49
4.2.2 Hardware Implementation 52
4.3 Experimental Results 54
4.3.1 Comparison of Coefficient ROM and Segment Address Encoder 54
4.3.2 Optimization of ROM, Total Area and Delay 56
4.3.3 Comparison with Hierarchical Segmentation 59
4.3.4 Trade-Off of Different Design Methods 66
4.4 Conclusions 69
Chapter 5 Hardware-Efficient Function Evaluation with Joint Error Consideration 70
5.1 Conventional Error Analysis 70
5.2 Proposed Design 72
5.2.1 Proposed Vertical Truncation Algorithm 72
5.2.2 Proposed Slanted Truncation Algorithm 79
5.3 Experimental Results 80
5.4 Conclusions 84
Chapter 6 Table-Based Design of Error Correcting Encoder 85
6.1 VLSI Design of Lagrange Interpolation Formula 85
6.1.1 Pipelined Lagrange Interpolation Architecture 88
6.1.2 Low Power Design 91
6.2 Experimental Results and Comparison 98
6.2.1 Critical Path Analysis 98
6.2.2 Experimental Results and Comparison 100
6.3 Conclusions 106
Chapter 7 Conclusions 108
7.1 Conclusions 108
7.2 Future Works 108
Bibliography 111
參考文獻 References
[1] J. M. Muller, “A few results on table-based methods,” Reliable Computing, Vol. 5, No. 3, pp. 279-288, August 1999.
[2] M. J. Schulte and J. E. Stine, “Symmetric bipartite tables for accurate function approximation,” in Proc. 13th Symp. Computer Arithmetic, Asilomar, 1997, pp. 175-183.
[3] J. E. Stine and M. J. Schulte, “The symmetric table addition method for accurate function approximation,” Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology, Vol. 21, pp. 167-177, 1999.
[4] M. J. Stine and J. E. Stine, “Approximating elementary functions with symmetric bipartite tables,” IEEE Transactions on Computers, Vol. 48, No. 8, pp. 842-847, August 1999.
[5] F. D. Dinechin and A. Tisserand, “Some improvements on multipartite table methods,” in Proc. 15th IEEE Symp. Computer Arithmetic, Vail, 2001, pp. 128-135.
[6] F. D. Dinechin and A. Tisserand, “Multipartite table method,” IEEE Transactions on Computers, Vol. 54, No. 3, pp. 319-330, March 2005.
[7] T.-B. Jung, S.-F. Hsiao, and M.-J. Tsai, “Para-CORDIC: parallel CORDIC rotation algorithm,” IEEE Transactions on Circuits and System-I, Vol. 51, No. 8, pp. 1515-1524, August 2004.
[8] E. Antelo and J. Villalba, “Low latency pipelined circular CORDIC,” in Proc. 17th IEEE Symp. Computer Arithmetic, Cape Cod, 2005, pp. 280-287.
[9] E. Antelo, J. Villalba, and E. L. Zapata, “Low-latency pipelined 2D and 3D CORDIC processors,” IEEE Transactions on Computers, vol. 57, no. 3, pp.404-417, March 2008.
[10] M. J. Schulte and E. E. Swartzlander, “Hardware designs for exactly rounded elementary functions,” IEEE Transactions on Computers, Vol. 43, No. 8, August 1994.
[11] J. Cao, B. Wei, and J. Cheng, “High-performance architectures for elementary function generation,” in Proc. 15th IEEE Symp. Computer Arithmetic, Vail, 2001, pp. 136-144.
[12] J. A. Pineiro, J. D. Bruguera, and J. M. Muller, “Faithful powering computation using table look-up and fused accumulation tree,” in Proc. IEEE 15th Symp. Computer Arithmetic, Vail, 2001, pp. 40-47.
[13] J. M. Muller, “Partially rounded small-order approximations for accurate, hardware-oriented, table-based methods,” in Proc. IEEE 16th Symp. Computer Arithmetic, Santiago de Compostela, 2003, pp. 1-8.
[14] E. G. Walters-III and M. J. Schulte, “Efficient function approximation using truncated multipliers and squarers,” in Proc. IEEE 17th Symp. Computer Arithmetic, Cape Cod, 2005.
[15] J. A. Pineiro, J. M. Muller, and J. D. Bruguera, “High-speed function approximation using a minimax quadratic interpolator,” IEEE Transactions on Computers, Vol. 54, No. 3, pp. 304-318, March 2005.
[16] D.-U. Lee, W. Luk, J. Villasenor, and P. Y. K. Cheung, “Hierarchical segmentation schemes for function evaluation,” in Proc. 11th IEEE Int’l Conf. Field-Programmable Technology, Tokyo, 2003, pp. 92-99.
[17] D.-U. Lee, W. Luk, J. Villasenor, and P. Y. K. Cheung, “Non-uniform segmentation for hardware function evaluation,” in Proc. 11th Int’l Conf. Field Proframmable Logic and Applications, Tokyo, 2003, pp. 796-807.
[18] T. Sasao, S. Nagayama, and J. T. Butler, “Numerical function generators using LUT cascades,” IEEE Transactions on Computers, Vol. 56, No. 6, June 2007.
[19] D.-U. Lee, R. C. C Cheung, W. Luk, and J. D. Villasenor, “Hardware implementation trad-offs of polynomial approximations and interpolations,” IEEE Transactions on Computers, Vol. 57, No. 5, May 2008.
[20] D.-U. Lee, A. A. Gaffar, O. Mencer, and W. Luk, “Adaptive range reduction for hardware function evaluation,” in Proc. IEEE Int’l Conf. Field-Programmable Technology, Brisbane, 2004, pp.169-176.
[21] D.-U. Lee, A. A. Gaffar, O. Mencer, and W. Luk, “Optimizing hardware function evaluation,” IEEE Transactions on Computers, Vol. 54, No. 12, December 2005.
[22] H.-J. Ko and S.-F. Hsiao, “Design and application of faithfully rounded and truncated multipliers with combined deletion, reduction, truncation, and rounding,” IEEE Transactions on Circuits and Systems II , pp. 304-308, May 2011.
[23] S.-F. Hsiao, P.-C. Wei, and C.-P. Lin, “An automatic hardware generator for special arithmetic functions using various ROM-based approximation approaches,” in Proc. IEEE Int’l Symp. on Circuits and Systems (ISCAS) , Seattle, 2008, pp. 468-471.
[24] H.-J. Ko, S.-F. Hsiao, and W.-L. Huang, “A new non-uniform segmentation and addressing remapping strategy for hardware-oriented function evaluators based on polynomial approximation,” in Proc. IEEE Int’l Symp. on Circuits and Systems (ISCAS), Paris, 2010, pp. 4153-4156.
[25] D.-U. Lee, R. C. C. Cheung, W. Luk and J. D. Villasenor, “Hierarchical segmentation for function evaluation,” IEEE Transactions on on VLSI systems, Vol. 17, No. 1, pp. 103-116, January 2009.
[26] B. G. Nam, H. Kim, and H. J. Yoo, “Power and area-efficient unified computation of vector and elementary functions for handheld 3D graphics systems,” IEEE Transactions on Computers, Vol. 57, No. 4, pp. 490-504, April 2008.
[27] A. Munshi, “Opengl ES common/common-lite profile specification,” Ver. 1.1, November 2004
[28] A. Munshi, “Opengl ES common/common-lite profile specification,” Ver. 2.0, November. 2004
[29] K.-M. Huang, “Design, implementation, and verification of a programmable floating- and fixed- point vertex shader,” MS Thesis, Dept. of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan, 2009.
[30] E. R. Berlekamp. “Bit-serial Reed-Solomon encoders,” IEEE Transactions on Information Theory, Vol. 28, No.6, pp. 869-874. , 1982
[31] E. R. Berlekamp, Algebraic coding theory. New York : McGraw-Hill, 1968.
[32] ESTI, EN300744 v1.4.1. Digital video broadcasting (DVB); framing structure, channel coding and modulation for digital terrestrial television. European telecommunication standard, 2001.
[33] G. Fettweis, and M. Hassner, “A combined Reed-Solomon encoder and syndrome generator with small hardware complexity,” in Proc. IEEE Int’l Symp. on Circuits and Systems (ISCAS), San Diego, 1992, pp. 1871-1874.
[34] M. A. Hasan, and V. K. Bhargava, “Architecture for a low complexity rate-adaptive Reed-Solomon encoder,” IEEE Transactions on Computers, Vol. 44, No.7, pp. 938-942, 1995.
[35] C.-T. Hsueh, J.-F. Wang, and J.-Y. Lee, “Using Lagrange’s interpolation formula to construct Reed-Solomon codes over GF(p),” Electronics Letters, Vol. 24, No.3, pp. 174-175, 1988.
[36] I.-S. Hsu, I. S. Reed, and T.-K. Truong, “The VLSI implementation of a Reed Solomon encoder using Berlekamp's bit-serial multiplier algorithm,” IEEE Transactions on Computers, Vol. 33, No.10, pp. 906 – 911, 1984.
[37] S. Lin, and D. J. Costello, Error control coding: fundamentals and applications. Englewood Cliffs, NJ: Prentice-Hall, 1983.
[38] T.-C. Lin、H.-C. Chang、H.-P. Lee、S.-I. Chu、T.-K. Truong, "Decoding of the (31, 16, 7) quadratic residue code," Journal of the Chinese Institute of Engineers , Vol.33, No.4, pp.573-580, June 2010.
[39] C. Liu, “The RS encoding implementing of bit-parallel algorithm based on dual basis,” Journal of Beijing Broadcasting Institute, Vol.12, No.1, pp.43-48, 2005.
[40] K. Y. Liu, “Architecture for VLSI design of Reed-Solomon encoders,” IEEE Transactions on Computers, Vol.31, No.2, pp. 170 – 175, 1982.
[41] R. L. Miller, and T.-K. Truong, “Fast algorithm for encoding the (255,223) Reed-Solomon code over GF(28),” Electronics Letters, Vol. 16, No.6, pp. 222-223, 1980.
[42] I. S. Reed and G. Solomon, “Polynomial codes over certain finite fields,” Journal of the Society for Industrial and Applied Mathematics, Vol. 8, No.2, pp. 300-304, 1960.
[43] Z. Ren and D. Yao, “An improved high-speed RS encoding algorithm,” in Proc. IEEE Int’l Symp. on Microwave, Antenna, Propagation and EMC Technologies for Wireless Communications (MAPE), Beijing, 2009, pp. 521 – 523.
[44] G. Seroussi, “A systolic Reed-Solomon encoder,” IEEE Transactions on Information Theory, Vol. 37, No.4, pp. 1217 – 1220, 1991.
[45] S. Smith, M. Benaissa and D. Taylor, “High level synthesis of an (N,K) Reed-Solomon encoder using VHDL,” in Proc. IEE Colloquium on VHDL (Very High Speed Integrated Circuits Hardware Description Language) - Applications and CAE Advances, London, 1993, 1/1 - 1/4.
[46] M.-A. Song, S.-Y. Kuo and I.-F. Lan, “A low complexity design of Reed Solomon code algorithm for advanced RAID system,” IEEE Transactions on Consumer Electronics, Vol. 53, No.2, pp. 265–273, 2007.
[47] Z.-F. Tan, H. Xie, G.-J. Wu, M.-X. Liao, “Design and implementation of Reed-Solomon encoder in CMMB system,” in Proc. Int’l Conf. on Wireless Communications Networking and Mobile Computing (WiCOM), Chengdu, 2010, pp. 1 – 4.
[48] S. B. Wicker, Error control systems for digital communication and storage. Englewood Cliffs, NJ : Prentice Hall, 1995.
[49] X.-J. Chang, J. Guo and Z. Li, “RS encoder design based on FPGA,” in Proc. IEEE Int’l Conf. on Advanced Computer Control (ICACC), Shenyang, 2010, pp. 419-421.
[50] S. K. Jain and K. K. Parhi, “Efficient standard basis Reed–Solomon encoder,” in Proc. IEEE Int’l Conf. Acoustics, Speech, Signal Process. (ICASSP), Atlanta, 1996, Vol. 6, pp. 3287–3290.
電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。
論文使用權限 Thesis access permission:自定論文開放時間 user define
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus: 已公開 available


紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。
開放時間 available 已公開 available

QR Code