Responsive image
博碩士論文 etd-0721117-173334 詳細資訊
Title page for etd-0721117-173334
論文名稱
Title
基於Tag Array之快取記憶體過度集中存取的有效解決機制
A Hybrid SPM-Cache with Tag-Based Hot Cache Sets Support
系所名稱
Department
畢業學年期
Year, semester
語文別
Language
學位類別
Degree
頁數
Number of pages
59
研究生
Author
指導教授
Advisor
召集委員
Convenor
口試委員
Advisory Committee
口試日期
Date of Exam
2017-07-14
繳交日期
Date of Submission
2017-09-11
關鍵字
Keywords
衝突性失誤、過度集中存取、草稿記憶體、快取記憶體
Conflict Miss, Hot Cache Set, Scratchpad Memory, Cache
統計
Statistics
本論文已被瀏覽 5643 次,被下載 20
The thesis/dissertation has been browsed 5643 times, has been downloaded 20 times.
中文摘要
因應製程的進步,科技的演變,嵌入式系統在現今電腦架構當中扮演著十分重要的角色,因其具有低成本與面積小的優勢提供可靠的效能執行特定的應用程式與任務。但受限於資源方面的限制,嵌入式系統需在效能與功耗之間取得完美的平衡以利系統可以長久的運作。快取記憶體為嵌入式系統當中負責快速提供中央處理器資料的記憶體,具有足夠的命中率與低延遲的優點,但卻也有著因應容量要求而使面積龐大與漏電的高功耗缺點。草稿記憶體(SPM:Scratchpad Memory)為一種進階使用快取記憶體之方法,具有高功耗效率與可預期性的存取延遲之優點,但是使用草稿記憶體需要占領一般快取記憶體一部分的容量並放置使用者選擇的資料,因此使用草稿記憶體將會使某一快取記憶體之set因未擁有足夠的空間放置資料而發生過度集中存取之狀況,導致命中率下降且效能降低。由於草稿記憶體使用之空間為data RAM,因此於此篇論文中,提出利用草稿記憶體之tag RAM做為過度存取之快取記憶體set的暫時性額外放置空間,實驗結果顯示最佳情況可降低衝突性失誤(conflict miss)至原本的0.85%,平均可降低至原本的28%。
Abstract
Embedded system plays an important role in modern computer architecture since it can provide reliable performance with lower cost and area compared to the general computer. Due to its resource limitations, embedded systems need to take the balance between energy consumption and performance. Cache consumes most of the chip area to provide enough hit rate and lower latency than memory. However, the cache also takes a large amount of energy due to its chip area and leakage power. Scratchpad Memory (SPM) is commonly used in the embedded system and has the advantages of energy-efficiency and predictable timing, but using SPM requires normal cache ways to hold the specific data during program execution. Each program has its unique cache requirement from execution start to end which leads to different proportion of normal cache ways and SPM ways. Hot cache sets are caused when some cache sets have fewer cache blocks to place data which leads to higher conflict miss rate. In this paper, we propose an adaptive tag-based spare cache architecture to solve hot cache sets problem during each program execution. The experiment results show 99.15% conflict miss reduction in the best case and 75.33% in average of MiBench.
目次 Table of Contents
論文審定書 i
摘要 iii
Abstract iv
Contents v
List of Figures vii
List of Tables ix
Listing of Listing x
Chapter 1. Introduction 1
1.1 Background 1
1.2 Motivation 2
1.3 Organization of the Thesis 3
Chapter 2. Related Works 5
2.1 Hybrid SPM-Cache Architecture 6
2.2 Cache Sets Utilization 7
2.3 Cache Miss Reduction 8
2.4 Additional Functions 10
2.5 Problem Discussions 12
Chapter 3. Hot Cache Sets Analysis 13
3.1 Program Execution Address 13
3.2 GEM5 14
3.3 Dinero IV 15
3.3.1 Normal Cache Operation 15
3.3.2 SPM Data Address and Removement 16
3.3.3 Spare Cache Data Block 17
Chapter 4. Spare Cache Architecture 19
4.1 Architecture Overview 20
4.2 Tag RAM Input Classifier 22
4.2.1 Spare Cache Access Table 24
4.2.2 Tag Signals Selection 27
Chapter 5. Experimental Results 30
5.1 Conflict Miss Reduction 31
5.1.1 Experiment Setup 31
5.1.2 Reduced Conflict Miss Results 32
5.1.3 Data Interference 34
5.2 Timing Analysis - Critical Path 37
5.3 Hardware Implementation - Gate Counts 38
5.4 Power Consumption Analysis 39
Chapter 6. Conclusion 41
Chapter 7. Future Work 42
References 43
Appendix A Cache Miss Calculation 46
參考文獻 References
[1] Farmahini-Farahani, Amin, Nam Sung Kim, and Katherine Morrow. "Energy-efficient reconfigurable cache architectures for accelerator-enabled embedded systems." Performance Analysis of Systems and Software (ISPASS), 2014 IEEE International Symposium on. IEEE, 2014.
[2] Banakar, Rajeshwari, et al. "Scratchpad memory: design alternative for cache on-chip memory in embedded systems." Proceedings of the tenth international symposium on Hardware/software codesign. ACM, 2002.
[3] Zhang, Wei, and Yiqiang Ding. "Hybrid SPM-cache architectures to achieve high time predictability and performance." Application-Specific Systems, Architectures and Processors (ASAP), 2013 IEEE 24th International Conference on. IEEE, 2013.
[4] Cong, Jason, et al. "An energy-efficient adaptive hybrid cache." Low Power Electronics and Design (ISLPED) 2011 International Symposium on. IEEE, 2011.
[5] Jouppi, Norman P. "Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers." Computer Architecture, 1990. Proceedings., 17th Annual International Symposium on. IEEE, 1990.
[6] Deayton, Peter, and Chung-Ping Chung. "Set utilization based dynamic shared cache partitioning." Parallel and Distributed Systems (ICPADS), 2011 IEEE 17th International Conference on. IEEE, 2011.
[7] Zhang, Chuanjun. "Balanced cache: Reducing conflict misses of direct-mapped caches." ACM SIGARCH Computer Architecture News 34.2 (2006): 155-166.
[8] Rolán, Dyer, Basilio B. Fraguela, and Ramón Doallo. "Adaptive line placement with the set balancing cache." Microarchitecture, 2009. MICRO-42. 42nd Annual IEEE/ACM International Symposium on. IEEE, 2009.
[9] Rolán, Dyer, Basilio B. Fraguela, and Ramón Doallo. "Reducing capacity and conflict misses using set saturation levels." High Performance Computing (HiPC), 2010 International Conference on. IEEE, 2010.
[10] Deayton, Peter, and Chung-Ping Chung. "Set utilization based dynamic shared cache partitioning." Parallel and Distributed Systems (ICPADS), 2011 IEEE 17th International Conference on. IEEE, 2011.
[11] Yang, Yun-Chung, and Ing-Jer Huang. "A Reconfigurable Cache for Efficient Usage of the Tag RAM Space. " National Sun Yat-Sen University, http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search-c/view_etd?URN=etd-0728114-114613 (2014).
[12] Lai, Chun-Hung, Yun-Chung Yang, and Jer Huang. "A versatile data cache for trace buffer support." IEEE Transactions on Circuits and Systems I: Regular Papers 61.11 (2014): 3145-3154.
[13] Guthaus, Matthew R., et al. "MiBench: A free, commercially representative embedded benchmark suite." Workload Characterization, 2001. WWC-4. 2001 IEEE International Workshop on. IEEE, 2001.
[14] Binkert, Nathan, et al. "The gem5 simulator." ACM SIGARCH Computer Architecture News 39.2 (2011): 1-7.
[15] Edler, Jan, and Mark D. Hill. "Dinero IV Trace-Driven Uniprocessor Cache Simulator." http://pages.cs.wisc.edu/~markhill/DineroIV/ (1998).
[16] Banakar, Rajeshwari, et al. "Scratchpad memory: design alternative for cache on-chip memory in embedded systems." Proceedings of the tenth international symposium on Hardware/software codesign. ACM, 2002.
電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。
論文使用權限 Thesis access permission:自定論文開放時間 user define
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus: 已公開 available


紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。
開放時間 available 已公開 available

QR Code