Responsive image
博碩士論文 etd-0627114-221628 詳細資訊
Title page for etd-0627114-221628
論文名稱
Title
光線追蹤下基於SAH指標之低成本BVH建構電路設計
Design of Low-cost BVH Construction Circuit for Ray Tracing based on Surface Area Heuristic Metric
系所名稱
Department
畢業學年期
Year, semester
語文別
Language
學位類別
Degree
頁數
Number of pages
63
研究生
Author
指導教授
Advisor
召集委員
Convenor
口試委員
Advisory Committee
口試日期
Date of Exam
2014-07-25
繳交日期
Date of Submission
2014-08-15
關鍵字
Keywords
光線追蹤、階層包圍體、電腦圖學、階層包圍體建構、光線追蹤硬體
ray-tracing, BVH, computer graphics, BVH tree construction, ray-tracing hardware
統計
Statistics
本論文已被瀏覽 5667 次,被下載 381
The thesis/dissertation has been browsed 5667 times, has been downloaded 381 times.
中文摘要
在即時動態場景的光線追蹤研究中,一個重要的議題即如何快速的建構出物件分群結構。本論文提出了一個基於surface-area-heuristic (SAH)指標之低成本binned bounding volume hierarchy (BVH) 樹結構建構電路設計。此樹結構建構電路的主要 工作是將一個節點內所包含的三角形根據給定的分割位置分成兩群三角形,並建構出兩個子節點紀錄三角形分群的結果。子節點建構的同時也會決定其分割位置以進行下一個節點的分群工作。
本論文一開始提出一個dual-bank的三角形記憶體架構,其透過少許數量的輔助暫存空間以及特殊的記憶體讀寫排程演算法來提升記憶體空間使用率。整體的記憶體使用量,與傳統的ping-pong緩衝記憶體架構比較,只需要其一半的暫存記憶體空間。接下來,一個管線化二元搜尋電路模組則用於決定輸出的三角形要分類至哪一個分類盒(bin)內。當所有的三角形皆被掃描過並更新至分類盒後,每個分類盒的axis-aligned bounding box (AABB)以及包含的三角形數量會開始進行組合,以計算出每一個分割候選位置的SAH成本。整個SAH計算僅花費102個時間週期。為了要提升硬體使用率,本篇提出了一個雙子樹交織(dual subtree interleaving)的排程方式進行節點分割工作。其使用兩個堆疊來實現,並維持兩個堆疊內都有尚未完成的子樹可以進行節點分割工作。
本篇提出的BVH樹結構建構電路硬體成本僅約361K個閘,並且在90nm製程下可以運行至199MHz工作時脈。與先前的設計比較,其面積時間乘積比值可以達到約原本的0.74。本篇之設計貢獻在於嵌入式環境下的即時動態場景光線追蹤系統開發。
Abstract
The acceleration of tree construction is a very important issue for real-time ray-tracing
of dynamic scenes. This paper presents a low-cost design of bounding volume hierarchy (BVH) tree builder based on surface-area-heuristic (SAH) binned BVH algorithm. The core operation of the tree builder is to partition one node of primitives into two separate parts as their child nodes, and also determine the next split location for two new born nodes.
This paper first proposes an efficient dual-bank primitive buffer architecture. With the
help of little extra auxiliary buffer space and the proposed special buffer access algorithm, our design can save one half of buffer capacity compared with the traditional ping-pong buffer design. Next, a pipeline binary search module has also been proposed to efficiently determine the bin location of the incoming primitive. After all the primitives have been scanned, the primitive count and axis-aligned bounding box (AABB) of each individual bin have to be combined in order to find the SAH cost of each split candidate. By considering the bin combination computation of all the split candidates, an efficient bin combine unit can be developed by simply accumulating the incoming bin data. The total number of cycles required in SAH calculation process is only 102. In order to increase the hardware utilization, this paper also proposed a dual subtree interleaving approach which interleaves the node-split operations from two stacks, and there is always a subtree is built in
each stack.
The proposed treebuilder only costs about 358K gates, and can run up to 141MHz for
90nm technology. Our area-time-product performance can achieve about 0.74 comparing to previous works. The proposed design can contribute to the development of embedded real-time ray-tracing systems.
目次 Table of Contents
論文審定書 i
摘要 ii
Abstract iii
圖目錄 vi
表目錄 vii
第一章概論 1
1.1 研究動機 1
1.2 論文大綱 3
第二章背景知識與相關研究 4
2.1 光線追蹤 4
2.2 Bounding Volume Hierarchy (BVH) 5
2.3 Surface Area Heuristic 8
2.4 相關BVH結構與變形 8
2.5 BVH建構加速 11
第三章Binned BVH建構電路設計 12
3.1 整體電路架構 12
3.2 邊界產生單元(Border Generation Unit) 15
3.3 三角形分群單元(Partition Unit) 16
3.3.1 三角形分群工作 17
3.3.2 三角形分群單元電路架構 21
3.3.3 Dual Primitive Buffer讀寫排程 23
3.4 分類選擇單元(Bin-Decision Unit) 30
3.5 分類組合單元(Bin Combine Unit) 31
3.6 SAH計算單元與SAH比較單元 34
3.7 多緒執行與工作分配單元 36
第四章實作結果與實驗比較 40
第五章結論與未來目標 49
參考文獻 51
附錄A 三角形分群排程控制虛擬碼 54
參考文獻 References
[1] J. Kalojanov and P. Slusallek, “A parallel algorithm for construction of uniform grids,” in Proceedings of the Conference on High Performance Graphics 2009. ACM, 2009, pp. 23–28.
[2] Z. Wu, F. Zhao, and X. Liu, “SAH KD-tree construction on GPU,” in Proceedings of the ACM SIGGRAPH Symposium on High Performance Graphics. ACM, 2011, pp. 71–78.
[3] B. Choi, R. Komuravelli, V. Lu, H. Sung, R. L. Bocchino, S. V. Adve, and J. C. Hart, “Parallel SAH kD tree construction,” in Proceedings of the Conference on High Performance Graphics. Eurographics Association, 2010, pp. 77–86.
[4] M. Zlatuˇska and V. Havran, “Ray tracing on a GPU with CUDA–comparative study of three algorithms,” WSCG’2010, pp. 69–76, 2010.
[5] K. Garanzha, “Efficient clustered BVH update algorithm for highly-dynamic models,” in IEEE Symposium on Interactive Ray Tracing, Aug. 2008, pp. 123–130.
[6] D. Kopta, T. Ize, J. Spjut, E. Brunvand, A. Davis, and A. Kensler, “Fast, effective BVH updates for animated scenes,” in Proceedings of the ACM SIGGRAPH Symposium on Interactive 3D Graphics and Games, ser. I3D ’12. ACM, 2012, pp. 197–204.
[7] C. Lauterbach, S.-E. Yoon, D. Tuft, and D. Manocha, “RT-DEFORM: Interactive ray tracing of dynamic scenes using BVHs,” in IEEE Symposium on Interactive Ray Tracing, Sep. 2006, pp. 39–46.
[8] K. Garanzha, S. Premoˇze, A. Bely, and V. Galaktionov, “Grid-based SAH BVH construction on a GPU,” The Visual Computer, vol. 27, no. 6-8, pp. 697–706, Jun. 2011.
[9] C. Lauterbach, M. Garland, S. Sengupta, D. Luebke, , and D. Manocha, “Fast BVH construction on GPUs,” in Proc. Eurographics ’09, Computer Graphics Forum, vol. 28, Mar. 2009, pp. 375–384.
[10] J. Pantaleoni and D. Luebke, “HLBVH: Hierarchical LBVH construction for realtime ray tracing of dynamic geometry,” in Proceedings of the Conference on High Performance Graphics 2010. ACM, 2010, pp. 87–95.
[11] T. Karras and T. Aila, “Fast parallel construction of high-quality bounding volume hierarchies,” in Proceedings of the 5th High-Performance Graphics Conference. ACM, 2013, pp. 89–99.
[12] I. Wald, “Fast construction of SAH BVHs on the Intel many integrated core (MIC) architecture,” IEEE Transactions on Visualization and Computer Graphics, vol. 18, no. 1, pp. 47–57, Jan. 2012.
[13] M. J. Doyle, C. Fowler, and M. Manzke, “A hardware unit for fast SAH-optimised BVH construction,” ACM Transactions on Graphics, vol. 32, no. 4, pp. 139:1–139:10, Jul. 2013.
[14] B. D. Greve, “Reflections and refractions in ray tracing.”
[15] M. Hapala, T. Davidoviˇc, I. Wald, V. Havran, and P. Slusallek, “Efficient stack-less BVH traversal for ray tracing,” in Proceedings of the 27th Spring Conference on Computer Graphics. ACM, 2011, pp. 7–12.
[16] S. Laine, “Restart trail for stackless BVH traversal,” in Proceedings of the Conference on High Performance Graphics. Eurographics Association, 2010, pp. 107–111.
[17] J. D. MacDonald and K. S. Booth, “Heuristics for ray tracing using space subdivision,” The Visual Computer, vol. 6, no. 3, pp. 153–166, 1990.
[18] J. Goldsmith and J. Salmon, “Automatic creation of object hierarchies for ray tracing,” IEEE Computer Graphics and Applications, vol. 7, no. 5, pp. 14–20, May 1987.
[19] I. Wald, S. Boulos, and P. Shirley, “Ray tracing deformable scenes using dynamic bounding volume hierarchies,” ACM Transactions on Graphics, vol. 26, no. 1, pp. 6:1–6:18, Jan 2007.
[20] M. Stich, H. Friedrich, and A. Dietrich, “Spatial splits in bounding volume hierarchies,” in Proceedings of the Conference on High Performance Graphics 2009, 2009, pp. 7–13.
[21] M. Ernst and G. Greiner, “Multi bounding volume hierarchies,” in IEEE Symposium on Interactive Ray Tracing 2008. IEEE, 2008, pp. 35–40.
[22] T.-J. Kim, B. Moon, D. Kim, and S.-E. Yoon, “RACBVHs: Random-accessible compressed bounding volume hierarchies,” IEEE Transactions on Visualization and Computer Graphics, vol. 16, no. 2, pp. 273–286, 2010.
[23] T. Karras, “Maximizing parallelism in the construction of bvhs, octrees, and k-d trees,” in Proceedings of the Fourth ACM SIGGRAPH/Eurographics conference on High-Performance Graphics. Eurographics Association, 2012, pp. 33–37.
[24] D. Kopta, J. Spjut, E. Brunvand, and A. Davis, “Efficient MIMD architectures for high-performance ray tracing,” in IEEE International Conference on Computer Design, Oct 2010, pp. 9–16.
電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。
論文使用權限 Thesis access permission:自定論文開放時間 user define
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus: 已公開 available


紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。
開放時間 available 已公開 available

QR Code