Responsive image
博碩士論文 etd-0408114-051506 詳細資訊
Title page for etd-0408114-051506
論文名稱
Title
視訊編碼系統之畫面內預測、殘餘值預測及轉碼效能強化之研究
Performance Enhancement of Video Coding System: Intra Prediction, Residual Prediction, and Transcoding
系所名稱
Department
畢業學年期
Year, semester
語文別
Language
學位類別
Degree
頁數
Number of pages
110
研究生
Author
指導教授
Advisor
召集委員
Convenor
口試委員
Advisory Committee
口試日期
Date of Exam
2014-04-10
繳交日期
Date of Submission
2014-05-08
關鍵字
Keywords
殘餘值預測、畫面內預測、高效能視訊編碼、H.264高階視訊編碼、視訊轉碼、畫面選擇
intra prediction, High Efficiency Video Coding, H.264/AVC, frame selection, video transcoding, residual prediction
統計
Statistics
本論文已被瀏覽 5824 次,被下載 39
The thesis/dissertation has been browsed 5824 times, has been downloaded 39 times.
中文摘要
視訊編碼與傳輸在資訊傳遞中佔有非常重要的角色。便利的通訊環境及顯示設備的發展使得多媒體內容分享與取得十分普及,提升了大眾對高解析度視訊播放的需求,同時也提昇了現今對應用於高解析度視訊之高效能視訊編碼解決方案的迫切。本篇論文提出了視訊編轉碼之改善技術,第一部份主要探討畫面內編碼之改良,我們藉由適應性樣板比對找出良好的預測畫面,並藉由有限狀態機降低畫面內編碼之位元數。畫面殘餘值的資料量隨著量化參數變小而大幅增加,本論文第二部份提出一殘餘值預測演算法以增強畫面間編碼之預測準確性,並藉由拉格朗日成本函數選擇出較好的編碼方式。根據不同網路特性調整視訊內容是行動視訊通訊中重要的課題,本論文第三部份提出藉由畫面複雜度分析選擇重要畫面之時域視訊轉碼。簡而言之,本論文提出對於視訊編碼系統效能強化之研究,我們的視訊編碼系統針對畫面內及畫面間區塊相似度做探討,並研究由視覺複雜度及時域一致性對於畫面選擇的重要性。實驗結果可顯示我們的研究可提昇視訊編碼與傳輸之品質。
Abstract
The applications of video coding to wire or wireless communication play important roles in information transmission. With the convenient communication environment and development of the display device, the multimedia contents are much easier to be acquired and shared from the internet than the past. The increase of the screen resolution also gives a high demand for the high resolution video coding. In this dissertation, we aim at performance enhancements of video coding as well as video transcoding. In the first part of this dissertation, a new approach that aims to improve the coding performance of intra block is discussed. We use an adaptive template matching to find a better prediction block and employ the finite state machine to reduce the number of bits required for intra encoding. The data size of residual is raised rapidly when the number of the quantization parameter decrease. In the second part of this dissertation, a residual prediction algorithm is proposed to improve the prediction accuracy of inter encoding. Through the Lagrangian cost function, the better coding mode is selected in the proposed algorithm. Video content adaption of different network characteristics is a significant problem in the mobile video communication. The third part of this dissertation proposes a frame selection scheme for temporal video transcoding through frame complexity analysis that consists of visual complexity and temporal coherence. In brief, this dissertation analyzes the block correlation in same frame and in different frame and proves the importance of visual complexity and temporal coherence. Experimental results show that the proposed system can enhance the video quality of coding and transmission a video.
目次 Table of Contents
中文摘要 i
Abstract ii
Contents iv
List of Figures vii
List of Tables x
Chapter 1 Introduction 1
1.1 Introduction to Video Coding 1
1.2 Motivation 4
1.3 Contributions 6
1.4 Organization 8
Chapter 2 A New Intra Prediction with Adaptive Template Matching through Finite State Machine 9
2.1 Introduction 9
2.2 Intra Prediction of Video Coding Standard 13
2.2.1 H.264/AVC Intra Prediction 13
2.2.2 HEVC Intra Prediction 15
2.2.3 Background Review 17
2.2.4 Intra Prediction Using Intra-Macroblock Motion Compensation 17
2.2.5 Finite State Machine In Vector Quantization Coding 19
2.3 Adaptive Template Matching-Based Intra Prediction 22
2.3.1 Observation 22
2.3.2 Adaptive Template Selection 23
2.3.3 Adaptive Template Matching Based Intra Prediction 25
2.3.4 The Early Termination Of The Proposed HEVC 28
2.3.5 Algorithm Summary 31
2.4 Experimental Results 36
2.4.1 Experimental Results of H.264/AVC 36
2.4.2 Experimental Results of HEVC 40
Chapter 3 Second-Order-Residual Prediction for HEVC Inter Coding 44
3.1 Introduction 44
3.2 HEVC Inter Prediction 47
3.3 Proposed Second-Order-Residual Prediction 49
3.3.1 Observation 49
3.3.2 The Proposed Algorithm 51
3.4 Experimental Results 55
Chapter 4 Temporal Video Transcoding based on Frame Complexity Analysis for Mobile Video Communication 58
4.1 Introduction 58
4.2 Background Review 61
4.3 Frame Rate Decision 63
4.4 Proposed Frame Selection Scheme 64
4.5 Experimental Results 70
Chapter 5 Conclusions and Future Work 81
5.1 Conclusions 81
5.2 Feature Work 83
References 85
Publication List 94
參考文獻 References
[1] ISO/IEC IS 11172, Information Technology - Coding of Moving Pictures and Associated Audio for Digital Storage Media Up to about 1.5 Mbits/s, 1992.
[2] ISO/IEC IS 13818, Information Technology - Generic Coding of Moving Pictures and Associated Audio Information, 1994.
[3] ISO/IEC IS 14496, Coding of Moving Pictures and Audio, 1998.
[4] ITU-T Recommendation H.261. Line transmission of non-telephone signals. Video codec for audiovisual services at px64 kbits, 1993.
[5] ITU-T Recommendation H.263. Video coding for low bitrate communications, 1995.
[6] Video Coding for Low Bit-Rate Communication, ITU-T Recommendation H.263 Version 2 (H.263+), January 1998.
[7] ITU-T Recommendation H.264, “Advanced video coding for generic audiovisual services,” May 2003.
[8] T. Wiegand, G. J. Sullivan, G. Bjøntegaard, and A. Luthra, “Overview of the H.264/AVC video coding standard,” IEEE Transactions Circuits and Systems for Video Technology, vol. 13, no. 7, pp. 560-576, 2003.
[9] J. Jia, D. Yoon, and H. K Kim, “An efficient coding of intra picture prediction modes for H.264/AVC,” in Proceedings of Third International Conference on Multimedia and Ubiquitous Engineering, pp. 49-53, 2009.
[10] K. Zhang, X. Ji, Q. Huang, D. Zhao, and W. Gao, “An efficient coding method for intra prediction mode information,” in Proceedings of IEEE International Symposium on Circuits and Systems, pp. 2814-2817, 2009.
[11] P. Zhang, D. Zhao, S. Ma, Y. Lu, and W. Gao, “Multiple modes intra-prediction in intra coding,” in Proceedings of IEEE International Conference on Multimedia and Expo, vol. 1, pp. 419-422, 2004.
[12] L. Wang, L.-M. Po, Y.M.S. Uddin, K.-M. Wong, and S. Li, “A novel weighted cross prediction for H.264 intra coding,” in Proceedings of IEEE International Conference on Multimedia and Expo, pp. 165-168, 2009.
[13] S. Yu and C. Chrtsafis, “New intra prediction using intra-macroblock motion compensation,” JVT-C151r1.doc, April, 2004.
[14] T. K. Tan, C. S. Boon, and Y. Suzuki, “Intra prediction by template matching,” in Proceedings of IEEE International Conference on Image Processing, pp. 1693- 1696, 2006.
[15] T. K. Tan, C. S. Boon, and Y. Suzuki, “Intra prediction by averaged template matching predictors,” in Proceedings of 4th IEEE Consumer Communications and Networking Conference, pp. 405-409, 2007.
[16] C. Lan, J. Xu, F. Wu, and G. Shi, “Intra frame coding with template matching prediction and adaptive transform,” in Proceedings of IEEE International Conference on Image Processing, pp. 1221-1224, 2010.
[17] C.-S. Wu, S.-J. F. Jiang, and C.-H. Yeh, “New intra prediction with finite state machine for H.264/AVC,” in Proceedings of SPIE Visual Communications and Image Processing, vol. 7744, pp. 774414-1-774414-8, 2010.
[18] C. J. Kuo, C.-H. Yeh, and S. F. Odeh, “Polynomial search algorithm for motion estimation,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 10, no. 5, pp.813-818, 2000.
[19] J. Foster, R. M. Grey, and M. D. Dunham, ‘‘Finite-state vector quantization for waveform coding,’’ IEEE Transactions on Information Theory, vol. 31, pp. 348-359, 1985.
[20] N. M. Nasrabadi, C. Y. Choo, and Y. Feng, ‘‘Dynamic finite-state vector quantization of digital images,’’ IEEE Transactions on Communications, vol. 42, pp. 2145-2154, 1994.
[21] G. Bjontegaard, “Calculation of average PSNR differences between RD-curves,” ITU-T Q6/SG16, Doc. VCEG-M33, Apr. 2001.
[22] Y. Suzuki, C. Seng Boon, and T. K. Tan, “Inter frame coding reference with template matching averaging,” in Proceedings of IEEE International Conference on Image Processing, pp. III-409-III-412, 2007.
[23] T. Chen, X. Sun, and F. Wu, “Predictive patch matching for inter frame coding,” in Proceedings of SPIE Visual Communications and Image Processing, vol. 7744, pp. 774412-1-774412-8, 2010.
[24] C. Zhang, K. Ugur, J. Lainema, A. Hallapuro, and M. Gabbouj, “Video coding using spatially varying transform,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 21, no. 2, pp. 127-140, Feb. 2011.
[25] X. Zhao, L. Zhang, S. Ma, and W. Gao, “Video coding with rate-distortion optimized transform,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 22, no. 1, pp. 138-151, 2012.
[26] L. Xu and K. N. Ngan, “Video content dependent directional transform for high performance video coding,” in Proceedings of IEEE International Conference on Multimedia and Expo Workshops (ICMEW), pp. 79-83, 2012.
[27] S. D. Kim, S. J. Hwang, and M. H. Sunwoo, “Novel residual prediction scheme for hybrid video coding,” in Proceedings of IEEE International Conference on Image Processing (ICIP), pp. 617-620, 2009.
[28] Q. Zhang, S.-H. Kim, Y. Dai and C.-C. J. Kuo, “Multi-Order-Residual (MOR) video coding: framework, analysis and performance,” in Proceedings of SPIE Visual Communications and Image Processing, vol. 7744, 2010.
[29] C.-H. Yeh, Y.-H. Chen, M.-C. Chi, and M.-J. Chen, “Parabolic motion-vector re-estimation algorithm for compressed video downscaling,” Journal of Signal Processing Systems, vol. 61, no. 3, pp. 375-386, 2010.
[30] I. Ahmad, X. Wei, Y. Sun, and Y.-Q. Zhang, “Video transcoding: an overview of various techniques and research issues,” IEEE Transactions on Multimedia, vol. 7, no. 5, pp. 793-804, 2005.
[31] F. Al-Abri, E. Edirisingh, J. De Cock, S. Notebaert, and R. Van de Walle, “Optimal H.264/AVC video transcoding system,” in Proceedings of IEEE International Conference on Consumer Electronics, pp. 335-336, 2011.
[32] R. Feghali, F. Speranza, D. Wang, and A. Vincent, “Video quality metric for bit rate control via joint adjustment of quantization and frame rate,” IEEE Transactions on Broadcasting, vol. 53, no. 1, pp. 441-446, 2007.
[33] S. H. Hong, S. J. Yoo, S. W. Lee, H. S. Kang, and S. Y. Hong, “Rate control of MPEG video for consistent picture quality,” IEEE Transactions on Broadcasting, vol. 49, no. 1, pp. 1-13, 2003.
[34] H. Xiong, J. Sun, S. Yu, J. Zhou, and C. Chen, “Rate control for realtime video network transmission on end-to-end rate-distortion and application-oriented QoS,” IEEE Transactions on Broadcasting, vol. 51, no. 1, pp. 122-132, 2005.
[35] F.-X. Coudoux, M. G. Gazalet, C. Mouton-Goudemand, P. Corlay, and M. Gharbi, “Extended coverage for DSL video distribution using a quality-oriented JSCC architecture,” IEEE Transactions on Broadcasting, vol. 54, no. 3, pp. 525-531, 2008.
[36] M. A. Bonuccelli, F. Lonetti, and F. Martelli, “Temporal transcoding for mobile video communication,” in Proceedings of The Second Annual International Conference on Mobile and Ubiquitous Systems: Networking and Services, pp. 502-506, 2005.
[37] Z. Cui and X. Zhu, “SSIM-based content adaptive frame skipping for low bit rate H.264 video coding,” in Proceedings of IEEE International Conference on Communication Technology, pp. 484-487, 2010.
[38] Z. Wu, H. Yu, B. Tang, and C. W. Chen, “Adaptive initial quantization parameter determination for H.264/AVC video transcoding,” IEEE Transactions on Broadcasting, vol. 58, no. 2, pp. 277-284, 2012.
[39] A.-S. Bacquet, C. Deknudt, P. Corlay, F.-X. Coudoux, and M. Gazalet, “Low complexity H.264/AVC spatial resolution transcoding in the transform domain,” in Proceedings of IEEE International Conference on Electronics, Circuits, and Systems, pp. 491-494, 2011.
[40] S.-L. Liu and C.-C. Jay Kuo, “Joint temporal-spatial rate control for adaptive video transcoding,” in Proceedings of International Conference on Multimedia and Expo, vol. 2, pp. 225-228, 2003.
[41] J. Harel, C. Koch, and P. Perona, “Graph-based visual saliency,” in Proceedings of Neural Information Processing Systems, pp. 545-552, 2006.
[42] K. Seshadrinathan, and A. C. Bovik, “Motion tuned spatio-temporal quality assessment of natural videos,” IEEE Transactions on Image Processing, vol. 19, no. 2, pp. 335-350, 2010.
[43] Q. Huynh-Thu and M. Ghanbari, “Temporal aspect of perceived quality in mobile video broadcasting,” IEEE Transactions on Broadcasting, vol. 54, no. 3, pp. 641-651, 2008.
[44] J.-N. Hwang, T.-D. Wu, and C.-W. Lin, “Dynamic frame-skipping in video transcoding,” in Proceedings of IEEE Workshop on Multimedia Signal Processing, pp. 377-383, 1997.
[45] H. Shu and L.-P. Chau, “Dynamic frame-skipping transcoding with motion information considered,” IET Image Processing, vol. 1, no. 4, pp. 335-342, 2007.
[46] C.-T. Hsu, C.-H. Yeh, C.-Y. Chen, and M.-J. Chen, “Arbitrary frame rate transcoding through temporal and spatial complexity,” IEEE Transactions on Broadcasting, vol. 55, no. 4, pp. 767-775, 2009.
[47] K. A. Peker and A. Divakaran, “Adaptive fast playback-based video skimming using a compressed-domain visual complexity measure,” in Proceedings of IEEE International Conference on Multimedia and Expo, vol. 3, pp. 2055-2058, 2004.
[48] Y.-M. Chen and I. V. Bajić, “A joint approach to global motion estimation and motion segmentation from a coarsely sampled motion vector field,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 21, no. 9, pp. 1316-1328, 2011.
[49] Y. Zhou, “An adaptive error-resilient scheme for temporal video transcoding,” in Proceedings of International Conference on Multimedia Technology, pp. 141-144, 2011.
[50] Y. Zhou, H. Ma, and Y. Chen, “A frame skipping transcoding method based on optimum frame allocation in sliding window,” in Proceedings of International Conference on Signal Processing Systems, vol. 1, pp. V1-83-V1-86, 2010.
[51] J.-W. Kim, K.-I. Ji, G.-R. Kwon, J.-S. Lee, and S.-J. Ko, “R-D optimized frame-skipping transcoder for low bit rate video transmission,” in Proceedings of International Conference on Consumer Electronics, pp. 421-422, 2006.
[52] Z. Zhang, H. Shi, and S. Wan, “Dynamic frame-skipping scheme for live video encoders,” in Proceedings of International Conference on Multimedia Technology, pp. 1-3, 2010.
[53] K. Sühring, Joint Video Team software JM18.2 [Online]. Available: http://iphome.hhi.de/suehring/tml/download/
電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。
論文使用權限 Thesis access permission:自定論文開放時間 user define
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus: 已公開 available


紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。
開放時間 available 已公開 available

QR Code