國立中山大學,National Sun Yat-sen University,學位論文,thesis/dissertation,使用Stereo Matching立體視差估算之實作,Implementation of Disparity Estimation Using Stereo Matching

論文名稱 Title	使用Stereo Matching立體視差估算之實作 Implementation of Disparity Estimation Using Stereo Matching
系所名稱 Department	資訊工程學系 Department of Computer Science and Engineering
畢業學年期 Year, semester	99 學年度第 2 學期 The spring semester of Academic Year 99	語文別 Language	中文 Chinese
學位類別 Degree	碩士 Master	頁數 Number of pages	63
研究生 Author	王穎鈞 Ying-Chung Wang
指導教授 Advisor	蕭勝夫 Shen-Fu Hsiao
召集委員 Convenor	陳中和 Chen Chung-Ho
口試委員 Advisory Committee	陳銘志, 鄺獻榮, 張雲南 Ming-Chih Chen; Shiann-Rong Kuang; Yun-Nan Chang
口試日期 Date of Exam	2011-07-25	繳交日期 Date of Submission	2011-08-08
關鍵字 Keywords	視差值 (disparity), 極線幾何 (epipolar geometry), 像素匹配 (pixel matching), 動態規劃 (dynamic programming), 深度圖 (depth map); dynamic programming, depth-image-based rendering, autostereoscopic, stereo vision, stereo matching

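中文摘要 Chinese Abstract (English translation)
Broadly speaking, 3D stereo vision consists of two phases. The main goal of the first phase is to take the input images, obtain the disparity by stereo matching, and then derive the depth map with a simple similar-triangles formula. The second phase takes the depth map produced in the first phase and applies depth-image-based rendering (DIBR) to generate images at multiple view angles. Stereo matching is computation-intensive: it searches two different images for corresponding pixels in order to produce the depth map. In this thesis we implement three stereo matching methods: pixel-based, window-based, and dynamic programming (DP)-based. The pixel-based and window-based algorithms perform local optimization, while DP performs global optimization. DP consists of three main steps: matching cost computation, cost aggregation, and back-tracing. A hardware implementation of DP-based stereo matching usually needs a large memory to hold intermediate results, which prevents the area from being reduced. We propose a tile-based DP method that cuts the original image into smaller tiles, so that only a smaller memory is needed when computing the disparity.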
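The similar-triangles relation mentioned in the abstract is not spelled out on this page. A common form for a rectified camera pair is sketched below; the symbols are assumptions introduced here for illustration ($f$ focal length, $B$ baseline between the two cameras, $Z$ depth, $x_L$ and $x_R$ the horizontal image coordinates of the same scene point at lateral position $X$), and the thesis may use a different notation or camera model.

```latex
\begin{align}
  x_L &= \frac{f\,X}{Z}, \qquad x_R = \frac{f\,(X - B)}{Z} \\
  d   &= x_L - x_R = \frac{f\,B}{Z} \\
  Z   &= \frac{f\,B}{d}
\end{align}
```

Under these assumptions depth is inversely proportional to disparity, so nearby objects produce large disparities and distant objects produce small ones.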
Abstract
In general, 3D stereo vision is composed of two major phases. In the first phase, stereo matching produces an image and its corresponding depth map. In the second phase, depth-image-based rendering (DIBR) generates images at different view angles. Stereo matching is a computation-intensive operation that generates depth maps from two images captured at two different view positions. In this thesis, we present hardware designs of three stereo matching methods: pixel-based, window-based, and dynamic programming (DP)-based. The pixel-based and window-based methods are local optimization methods, while DP is a global optimization method that consists of three main processing steps: matching cost computation, cost aggregation, and back-tracing. Hardware implementations of DP-based stereo matching usually require a large memory to store intermediate results, which leads to a large area cost. We therefore propose a tile-based DP method that partitions the original image into smaller tiles, so that processing each tile requires less memory.
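The three DP steps named in the abstract, and the tiling idea, can be illustrated with a minimal software sketch for a single rectified scanline. Everything specific here is an assumption for illustration and not the thesis's hardware design: the absolute-difference cost, the linear smoothness penalty `smooth`, the `invalid` cost for out-of-image matches, the function names, and the choice to solve each tile independently without carrying costs across tile boundaries.

```python
def matching_cost(left, right, x, d, invalid=255):
    """Step 1: absolute intensity difference between left[x] and right[x - d]."""
    if x - d < 0:
        return invalid  # assumed penalty: the match would fall outside the image
    return abs(left[x] - right[x - d])


def scanline_dp(left, right, max_disp, start=0, stop=None, smooth=2):
    """Return one disparity per pixel for columns start..stop-1 of a scanline."""
    stop = len(left) if stop is None else stop
    disps = range(max_disp + 1)
    acc = [matching_cost(left, right, start, d) for d in disps]
    back = []  # back-pointer table: (stop - start - 1) rows of max_disp + 1 entries
    for x in range(start + 1, stop):
        new_acc, pointers = [], []
        for d in disps:
            # Step 2: accumulate the cheapest predecessor plus a jump penalty.
            prev = min(disps, key=lambda p: acc[p] + smooth * abs(d - p))
            new_acc.append(matching_cost(left, right, x, d)
                           + acc[prev] + smooth * abs(d - prev))
            pointers.append(prev)
        acc = new_acc
        back.append(pointers)
    # Step 3: back-trace from the cheapest final state.
    d = min(disps, key=lambda p: acc[p])
    path = [d]
    for pointers in reversed(back):
        d = pointers[d]
        path.append(d)
    path.reverse()
    return path


def tiled_scanline_dp(left, right, max_disp, tile_width, smooth=2):
    """Run the DP tile by tile so the back-pointer table covers one tile only."""
    path = []
    for start in range(0, len(left), tile_width):
        stop = min(start + tile_width, len(left))
        path += scanline_dp(left, right, max_disp, start, stop, smooth)
    return path


if __name__ == "__main__":
    right = [10, 50, 90, 30, 70, 20, 60, 40]
    left = [0, 0] + right[:-2]  # left row is the right row shifted by 2 pixels
    print(scanline_dp(left, right, max_disp=3))
    print(tiled_scanline_dp(left, right, max_disp=3, tile_width=4))
```

In this sketch the back-pointer table of the untiled version grows with the full scanline width, while the tiled version only ever holds one tile's worth of rows, which is the memory saving the abstract describes. On the toy rows above, both calls should assign disparity 2 to every pixel from index 2 onward, where a valid match exists.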

目次 Table of Contents
Chapter 1 Introduction
  1.1 Motivation
  1.2 System Architecture
  1.3 Thesis Organization
Chapter 2 Background and Related Work
  2.1 Background: Human Visual Imaging
    2.1.1 The Human Eye and Stereoscopic Imaging
    2.1.2 Background of Stereo Vision Research
  2.2 Related Work: 3D Image Displays
    2.2.1 Lenticular Display
    2.2.2 Parallax Barrier
  2.3 DIBR Warping
    2.3.1 Virtual Image Feature
Chapter 3 Algorithms and Design Principles
  3.1 Image Matching
    3.1.1 Matching Cost
    3.1.2 Cost Aggregation
    3.1.3 Disparity
  3.2 Local Stereo Matching
    3.2.1 Pixel-based Stereo Matching
    3.2.2 Window-based Stereo Matching
  3.3 Global Stereo Matching
    3.3.1 Dynamic Programming Algorithm
Chapter 4 Architecture Implementation and Overall Optimization
  4.1 Implementation Architecture of Local Stereo Matching
    4.1.1 Pixel-based Stereo Matching
    4.1.2 Window-based Stereo Matching
  4.2 Implementation Architecture of Global Stereo Matching
    4.2.1 Dynamic Programming
  4.3 Proposed Implementation Method: Tile-based
  4.4 Performance Analysis and Comparison
Chapter 5 Conclusion and Future Work
  5.1 Conclusion
  5.2 Future Work
References


電子全文 Fulltext
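This electronic full text is licensed to users only for personal, non-profit searching, reading, and printing for the purpose of academic research. Please comply with the Copyright Act of the Republic of China and do not reproduce, distribute, adapt, repost, or broadcast it without authorization.
論文使用權限 Thesis access permission: user-defined release time
開放時間 Available: 校內 Campus: available; 校外 Off-campus: available
etd-0808111-103145.pdf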
紙本論文 Printed copies
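Availability information for printed theses is relatively complete from academic year 102 onward. For printed theses from academic year 101 or earlier, please contact the printed thesis service desk of the Office of Library and Information Services.
開放時間 Available: available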

Office of Library and Information Services, National Sun Yat-sen University │ Contact Us : 2452 Thesis Format Review Team , Mail │ Development and operations : Knowledge Innovation Division, LIS