國立中山大學,National Sun Yat-sen University,學位論文,thesis/dissertation,效能下降容忍應用方法之開發與數值預測器電路之案例探討,Development of A Performance Degradation Tolerance Utilization Methodology and A Case Study on Value Prediction Unit

論文名稱 Title	效能下降容忍應用方法之開發與數值預測器電路之案例探討 Development of A Performance Degradation Tolerance Utilization Methodology and A Case Study on Value Prediction Unit
系所名稱 Department	電機工程學系 Department of Electrical Engineering
畢業學年期 Year, semester	102 學年度第 2 學期 The spring semester of Academic Year 102	語文別 Language	中文 Chinese
學位類別 Degree	碩士 Master	頁數 Number of pages	80
研究生 Author	郭俊緯 Chun-Wei Kuo
指導教授 Advisor	謝東佑 Tong-Yu Hsieh
召集委員 Convenor	王太平 Tai-Ping Wang
口試委員 Advisory Committee	鄺獻榮, 丁信文 Shiann-Rong Kuang; Hsin-Wen Ting
口試日期 Date of Exam	2014-07-31	繳交日期 Date of Submission	2014-09-02
關鍵字 Keywords	效能下降容忍、效能下降錯誤、數值預測器、錯誤容忍、良率、穩定度 value prediction unit, fault tolerance, yield, performance degradation tolerance, performance degrading fault, reliability
統計 Statistics	本論文已被瀏覽 5666 次，被下載 34 次 The thesis/dissertation has been browsed 5666 times, has been downloaded 34 times.

中文摘要
隨著半導體製程的進步，電子元件尺寸可有效縮小。然而晶片也因此更容易受到製造缺陷(defect)或製程參數飄移的影響。如何有效提升晶片良率一直以來為學術界及工業界的重點研究項目之一。另一方面，對於醫療、汽車、飛機、處理器等應用來說，穩定度相當重要，錯誤存在時所需付出的代價可能極高。傳統上可透過容錯技術將錯誤效應進行遮罩或修正，但所需成本可能相當可觀。效能下降容忍是近幾年來被提出可有效率提升電路良率及穩定度之嶄新觀念。此觀念之基礎在於電路中可能存在一特殊種類的錯誤，稱之為效能下降錯誤。當系統內存在此種錯誤時系統功能不會產生任何錯誤結果，僅會使系統效能下降。倘若下降之幅度對市場應用來說仍可接受，則此晶片極有可能仍可繼續使用。效能下降容忍技術聚焦在分析待測電路內效能下降錯誤所導致之效能下降幅度，並根據分析結果將待測電路進行適當分類，將不同效能等級的電路應用於適當的電子產品中，藉此提升電路的有效良率及利潤。針對電路中存在效能下降錯誤將導致嚴重系統效能下降的元件，我們則可使用容錯設計加以保護，使系統效能仍在可接受範圍內。由於這些關鍵元件通常僅占整體面積之一小部分，透過僅保護這些元件，所需之硬體成本將可有效降低。本論文提出一效能下降容忍應用方法，並使用可用來提升處理器運算效能之數值預測器作為案例探討。此方法提供一流程讓使用者可一步步將效能下降容忍應用在適當電路。針對數值預測器中佔有絕大面積之記憶體，我們注入不同錯誤密度的多重stuck-at faults並分析其效能下降程度。利用CPU95 與 CPU2006標準測試程式所進行的實驗結果顯示，數值預測器中所有的錯誤均為效能下降錯誤。當錯誤密度為1%時，幾乎不會導致任何效能下降，而即使錯誤密度高達20%，效能下降也僅有10.95%。針對存在錯誤時將導致18.51%至22.13%效能下降的關鍵邏輯元件(面積僅佔整體電路的0.046%)，我們使用常用之三模冗餘技術加以保護，而所需額外付出之面積成本僅有0.13%。
Abstract
The advance in semiconductor manufacturing processes leads to feature size shrink of transistors. However, chips thus become more sensitive to process defects and variation. How to effectively improve yield has been one of the hot research topics in both the academia and the industry. On the other hand, for some critical applications such as medical systems, vehicle and aircraft systems or processors, reliability is of great importance. Conventionally by using some fault tolerance techniques, fault effects can be masked or corrected. Nevertheless, the required cost may not be affordable. Performance degradation tolerance (PDT) is a new notion that has been proposed recently to efficiently enhance effective yield and reliability of designs. This notion concentrates on one special type of fault, called performance degrading fault (pdef). This type of faults can only result in some performance degradation without any computation errors. If the degree of the degraded performance is still acceptable for marketing, the chips containing pdef are quite likely to be still marketable. The main focus of PDT is to carefully analyze the induced performance degradation by pdef, and properly grade target chips according to the analysis results. By selling the graded chips to different applications, the effective yield and profit of target products can be enhanced. For the critical components of a target design where pdef would induce significant performance degradation, fault tolerance techniques can be used to protect these components such that the degraded performance is still acceptable. Since such components usually occupy only a small area of the whole design, by only protecting only these components, the required hardware cost can be effectively reduced. In this thesis we propose a PDT application methodology, and employ a value prediction unit that can enhance the performance of processors as a case study. This methodology provides a step-by-step guideline for the users to apply PDT to adequate applications. Targeting the memories that occupy the most area of a value predictor, we inject multiple stuck-at faults with various fault densities to analyze the induced performance degradation. The experimental results based on CPU95 and CPU2006 benchmark programs show that all faults in a value prediction unit are all pdef. When the fault density is 1%, almost no degradation is induced. Even when the fault density is 20%, the degradation is only 10.95%. For the critical logic part where pdef would induce 18.51%~22.13% degradation, the common triple modular redundancy (TMR) method is used to protect this part. The required hardware overhead is only 0.13%.

目次 Table of Contents
論文審定書 i 摘要 ii Abstract iii 目錄 iv 圖次 vi 表次 ix 第一章介紹 1 1.1 研究動機 1 1.2 貢獻 5 1.3 章節介紹 6 第二章背景知識與過去相關研究 8 2.1 容錯 8 2.2 效能下降錯誤 8 2.3資料相依性 10 2.4數值預測及數值預測器背景介紹 12 2.5效能下降之分支預測器 16 2.6效能下降之快取記憶體 17 第三章支援效能下降容忍流程 19 第四章數值預測器效能下降容忍度分析 22 4.1 目標電路 - 混合型數值預測器 22 4.1.1二階步階數值預測器 23 4.1.2限定內容方法數值預測器 27 4.1.3 數值標籤預測器 31 4.1.4 混合型預測器選擇機制 34 4.2數值預測器組態 36 4.3實作結果 38 4.4分析環境 39 4.5效能評估 - 預測準確度與覆蓋率 41 4.6效能分析 42 4.7錯誤分析 45 4.7.1二階步階數值預測器分析結果 46 4.7.2限定內容方法數值預測器分析結果 49 4.7.3數值標籤預測器分析結果 52 4.7.4預測器選擇機制分析結果 61 4.8重新設計 62 4.9 有效良率提升評估 64 第五章結論 66 參考文獻 67

參考文獻 References
[1] Y. Zorian, D. Gizopoulos, C. Vandenberg, and P. Magarshack, "Guest editors' introduction: design for yield and reliability," IEEE Design & Test of Computers, vol.21, no.3, pp.177-182, 2004. [2] L. T. Wang, C. E. Stroud, and N. A. Touba, System-on-Chip Test Architectures: Nanometer Design for Testability, Elsevier, Morgan Kaufmann Publishers, 2007. [3] T. Y. Hsieh, M. A. Breuer, M. Annaveram, S. K. Gupta, and K. J. Lee, "Tolerance of performance degrading faults for effective yield improvement," Proc. Int’l Test Conf., pp.1-10, 2009 [4] K. J. Lee, T. Y. Hsieh, and M. A. Breuer, "A novel test methodology based on error-rate to support error-tolerance," Proc. Int’l Test Conf., pp.1136-1144, 2005. [5] M. A. Breuer, and H. Zhu, "Error-tolerance and multi-media," Proc. Int’l Conf. on Intelligent Information Hiding and Multimedia Signal Processing, pp.18-20, 2006. [6] I. Chong and A. Ortega, "Hardware testing for error tolerant multimedia compression based on linear transforms," Proc. Int’l. Symp. on Defect and Fault Tolerance in VLSI Systems, pp.523-531, 2005. [7] C.-L. Hsu, Y.-S. Huang and T.-H. Liu, "SSD-based testing scheme for error tolerance analysis in H.264/AVC encoder," Proc. Int’l. Conf. on Communications, Circuits and Systems, pp.684-688, 2008. [8] H. Chung and A. Ortega, "Analysis and testing for error tolerant motion estimation," Proc. Int’l. Symp. on Defect and Fault Tolerance in VLSI Systems, pp.514-522, 2005. [9] W. Stallings, Computer organization and architecture: designing for performance (8th edition), Prentice Hall Publishers, 2009. [10] J. J. Tang, (2012, January). 創造殺手級3D IC產品　CPU/記憶體堆疊勢在必行. 新電子科技雜誌. Retrieved July 13, 2014, from http://www.mem.com.tw/article_content.asp?sn=1201130015 [11] M. Manoochehri, M. Annavaram and M. Dubois, "CPPC: Correctable parity protected cache," Computer Architecture (ISCA), 2011 38th Annual International Symposium on, pp.223,234, 4-8, 2011. [12] S. Almukhaizim, T. Verdel, and Y. Makris, "Cost-effective graceful degradation in speculative processor subsystems: the branch prediction case," Proc. Int’l Conf. on Computer Design, pp.194, 197, 13-15, 2003. [13] A. Perais, and A. Seznec, "Revisiting value prediction, "Technical Report, INRIA, 2012. [14] M. Burtscher, and B.G. Zorn, "Hybrid load-value predictors, " IEEE Trans. Computers, vol.51, no.7, pp.759-774, 2002. [15] M. Burtscher. Improving Context-Based Load Value Prediction. PhD thesis, University of Colorado, 2000. [16] Y. Sazeides, J. E. Smith, "The predictability of data values, " Proc. Int’l. Conf. on Microarchitecture, pp.248,258, 1997. [17] H. Lee, S. Cho, B. R. Childers, "Performance of graceful degradation for cache faults," in Symposium on VLSI, pp.409-415, 2007. [18] J. Jiang, Y. Zhang, "Accepting performance degradation in fault-tolerant control system design," Control Systems Technology, IEEE Transactions on , vol.14, no.2, pp.284,292, 2006 [19] A. Seznec, P. Michaud, "A case for (partially) tagged geometric history length branch prediction," Journal of Instruction Level Parallelism, vol.8, pp.1–23, 2006. [20] T. Austin, E. Larson, and D. Ernst, "SimpleScalar: An Infrastructure for Computer System Modeling," IEEE Computer, vol.35, pp.59-67, 2002. [21] Standard Performance Evaluation Corporation (http://www.spec.org/cpu95) [22] W. Suntiamorntut, "Hamming Distance and Normalization Circuits," Int’l. Conf. on Communications, Circuits and Systems, pp.1053,1056, 11-13, 2007.

電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的，進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定，切勿任意重製、散佈、改作、轉貼、播送，以免觸法。論文使用權限 Thesis access permission：自定論文開放時間 user define 開放時間 Available：校內 Campus：已公開 available 校外 Off-campus：已公開 available etd-0801114-230511.pdf
紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊，請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。開放時間 available 已公開 available

QR Code

國立中山大學圖書與資訊處 │ 諮詢服務：2452 論文審查小組 │ 服務信箱 │ 系統開發維運：圖資處知識創新組

Office of Library and Information Services, National Sun Yat-sen University │ Contact Us : 2452 Thesis Format Review Team , Mail │ Development and operations : Knowledge Innovation Division, LIS