Responsive image
博碩士論文 etd-0806104-132010 詳細資訊
Title page for etd-0806104-132010
論文名稱
Title
以週邊文字支援圖件擷取之研究
Retrieval of Line-drawing Images Based on Surrounding Text
系所名稱
Department
畢業學年期
Year, semester
語文別
Language
學位類別
Degree
頁數
Number of pages
57
研究生
Author
指導教授
Advisor
召集委員
Convenor
口試委員
Advisory Committee
口試日期
Date of Exam
2004-07-28
繳交日期
Date of Submission
2004-08-06
關鍵字
Keywords
查詢字詞擴展、週邊文字、圖件擷取、資訊擷取、圖件資料庫
Line-drawing image retrieval, Query expansion, Line-drawing image database, Information retrieval, Surrounding text
統計
Statistics
本論文已被瀏覽 5717 次,被下載 5
The thesis/dissertation has been browsed 5717 times, has been downloaded 5 times.
中文摘要
隨著資訊科技的發展,工程顧問公司開始逐步地將組織內的文件與圖件進行數位化的工作,並建構組織的數位圖書館,以協助文件之擷取。然而,即使在數位化圖書館的環境下,工程師仍面臨著一個重大的挑戰–如何在數位圖書館中尋找所需的圖件。如果所在之數位圖書館僅儲存了少量的圖件,工程師或許可以藉由瀏覽圖件縮圖的方式來尋找所需之圖件,但隨著圖件數量的快速增加,人工瀏覽式的圖件找尋,勢將成為一項費時的工作,並且可能因為無法找到適當圖件而令人感到挫折。

為了因應圖件快速增加的數位圖書館環境下,有效率與有效地擷取圖件的需求與重要性,本論文旨在發展一套圖件擷取系統。一般而言,工程文件中的圖件常伴隨著一些用以說明該圖件的文字一同出現。這些用以說明圖件的週邊文字,事實上提供了進行自動化圖件索引的重要資訊,藉由自週邊文字中適當地萃取索引字(關鍵字),我們便能夠以傳統的資訊擷取技術,來進行圖件擷取的工作。

具體而言,本論文提出了以圖件週邊文字做為基礎的圖件擷取技術。我們利用相同的週邊文字選取中心,以段落或句子的文法基本單位,建構了四種不同資訊量的週邊文字模型,同時以兩種不同的擷取方法–非擴展式與擴展式–進行了實證評估,以驗證不同週邊文字選取量與擷取方法對於圖件擷取效能的影響。實證的結果顯示,以圖件編號為週邊文字選取中心,配合三個句子的週邊文字選取量,可以得到較佳的圖件擷取效能。
Abstract
As advances of information technology, engineering consulting firms have gradually digitalized their documents and line-drawing images. Such digital libraries greatly facilitate document retrievals. However, engineers still face a challenging issue: searches and retrievals of line-drawing images in a digital library. With a small number of line-drawing images in a digital library, engineers can browse thumbnails for locating relevant images. As the number of line-drawing images increases, the manual browsing process is time-consuming and frustrated.

In response to the need and importance of supporting efficient and effective retrieval of line-drawing images, this thesis aims to develop a line-drawing image retrieval system. Typically, a line-drawing image within an engineering document is associated with surrounding text for description or illustration purpose. Such surrounding text provides important information for automatically indexing the line-drawing image. With extracted indexes (or keywords), retrieval of line-drawing images can be accomplished using a traditional information retrieval technique.

Specifically, in this study, we propose a line-drawing image retrieval system based on surrounding text. We develop four models for defining surrounding text boundaries for line-drawing images. Furthermore, two information retrieval techniques (one with and one without query expansion) are implemented and evaluated. According to our empirical evaluations, the surrounding text boundary model with image caption together with three sentences (preceding, image anchoring, and successive sentences) would result in the best retrieval effectiveness, as measured by recall and precision rates.
目次 Table of Contents
第一章 緒論
第一節 研究背景
第二節 研究動機與目的
第三節 論文架構

第二章 文獻探討
第一節 資訊擷取
壹、 布林模型
貳、 向量模型
參、 機率模型
第二節 影像的資訊擷取
壹、 以文字作為基礎的影像擷取
貳、 以影像內容為基礎的影像擷取
參、 相關性回饋
第三節 影像與其週邊文字之關係

第三章 以週邊文字為基礎之圖件擷取技術與模型的建立
第一節 文字模型的建立
第二節 資訊擷取架構
壹、 非擴展式的資訊擷取方法
貳、 擴展式的資訊擷取方法

第四章 研究實證與步驟
第一節 圖件、文字資料的取得與轉換
壹、 資料來源
貳、 圖件、文件模型的取得與轉換
參、 圖件、文件模型的篩選
第二節 查詢字詞的取得與篩選及相關圖件勾選
壹、 查詢字詞的取得與篩選
貳、 受測者進行相關圖件勾選
第三節 進行實證
壹、 圖件擷取
貳、 評估標準
第四節 實證結果
壹、 非擴展式的資訊擷取效能
貳、 擴展式的資訊擷取效能
參、 模型2的非擴展式與擴展式方法之資訊擷取效能比較

第五章 結論
第一節 研究貢獻
第二節 研究限制與未來研究方向

參考文獻
參考文獻 References
Abbadeni, N., Ziou, D., and Wang, S., ”Image Classification and Retrieval on the World Wide Web,” Proceedings of the Fourth ACM Conference on Digital Libraries, August 1999, pp.208-209.

Baeza-Yates, R., and Ribeiro-Neto, B., Modern Information Retrieval, Addison Wesley/ACM Pres, Harlow, England, 1999.

Baum, L. S., Boose, J. H., and Kelly, R. J., ”Graphics Recognition for a Large-Scale Airplane Information System,” Lecture Notes in Computer Science, August 1997, pp.291-301.

Brill, E. “A Simple Rule-based Part of Speech Tagger,” Proceedings of the Third Conference on Applied Natural Language Processing, Trento, Italy, 1992, pp.152-155.

Brill, E. “Some Advances in Rule-based Part of Speech Tagging,” Proceedings of the Twelfth National Conference on Artificial Intelligence (AAAI-94), Seattle, WA, 1994, pp.722-727.

Chen, Z., Liu, W., and Zhang, F., “Mining for Web Image Retrieval,” Journal of the American Society of Information Science, Vol. 52, No. 10, 2001, pp.831-839.

Favela, J. and Meza, V., “Image Retrieval Agent Integrating Image Content and Text,” IEEE Intelligent Systems, September 1999, pp.36-39.

Flank, S., “Sentences vs. Phrases: Syntactic Complexity in Multimedia Information Retrieval,” NAACL-ANLP 2000 Workshop: Syntactic and Semantic Complexity in Natural Language Processing Systems, 2000, pp.1-5.

Freeman, H., “Computer Processing of Line-Drawing Images,” ACM Computing Surveys, Vol. 6 , No. 1, March 1974, pp.57-97.

Goodrum, A. A., “Image Information Retrieval: An Overview of Current Research,” Informing Science, Vol. 3, No. 2, February 2000, pp.63-66.

Harmandas, H., Sanderson, M., and Dunlop, M. D., “Image Retrieval by Hypertext Links,” Proceedings of the 20th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 1997, pp.296-303.

Jing, F., Li, M., and Zhang, H. J. , “An Effective Region-Based Image Retrieval Framework,” Proceedings of the Tenth ACM International Conference on Multimedia, 2002, pp.456-465.

Kobayashi, M. and Takeda, K., “Information Retrieval On The Web,” ACM Computing Surveys, Vol. 32, No. 2, June 2000, pp.144-173.

Lawrence, S. and Giles, C. L., “Text and Image Metasearch on the Web,” International Conference on Parallel and Distributed Processing Techniques and Applications, CSREA Press, 1999, pp. 829-835.

Markkula, M. and Sormunen, E., “End-User Searching Challenges Indexing Practices in the Digital Newspaper Photo Archive,” Information Retrieval, Vol. 1 , No. 4, January 2000, pp.259-285.

Rijsbergen, C. J. van, Information Retrieval, Butterworths, 1979.

Roberson, S. E., “The Probability Ranking Principle in IR,” Journal of Documentation 33, December 1977, pp.294-304.

Rui, Y., Huang, T. S., and Chang, S. F., “Image Retrieval: Past, Present, and Future,” Journal of Visual Communication and Image Representation, Vol. 10, 1999, pp.1-23.

Shen, H. T., Ooi, B. C., and Tan, K. L., “Giving Meanings to WWW Images,” Proceedings of the Eighth ACM International Conference on Multimedia, 2000, pp.39-47.

Soo , V. W., Lee, C. Y., and Li, C. C., “Automated Semantic Annotation and Retrieval Based on Sharable Ontology and Case-based Learning Techniques,” Proceedings of the third ACM/IEEE-CS Joint Conference on Digital Libraries, 2003, pp.61-72.

Su, Z., Li, S., and Zhang, H. J., “Extraction of Feature Subspaces for Content-Based Retrieval Using Relevance Feedback,” Proceedings of the Ninth ACM International Conference on Multimedia, 2001, pp.98-106.

Voultilainen, A., “Nptool: A Detector of English Noun Phrases,” Proceedings of Workshop on Very Large Corpora, Ohio, June 1993.

Wang, J. Z., “Scalable Integrated Region-Based Image Retrieval Using IRM and Statistical Clustering,” Proceedings of the First ACM/IEEE-CS Joint Conference on Digital Libraries, 2001, pp.268-277.

Worring, M., Wielinga, B., and Anjewierden, A., “Automatic Indexing of Text and Graphics in Technical Manuals,” IEEE International Conference on Multimedia and Expo , August 2001, pp.1156-1159.

Wu, P. and Manjunath, B. S., “Adaptive Nearest Neighbor Search For Relevance Feedback in Large Image Databases,” Proceedings of the Ninth ACM International Conference on Multimedia, 2001, pp.89-97.

Xu, J., “Solving the Word Mismatch Problem Through Automatic Text Analysis,” PhD Thesis, University of Massachusetts at Amherst, 1997.

Xu, J. and Croft, W. B., “Query Expansion Using Local and Global Document Analysis,” Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1996, pp. 4-11.

Zheng, C., Liu, W. Y., and Feng, Z., “Web Mining for Web Image Retrieval,” Journal of the American Society for Information Science and Technology, Vol. 52 , No. 10, August 2001, pp.831-839.
電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。
論文使用權限 Thesis access permission:校內一年後公開,校外永不公開 campus withheld
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus:永不公開 not available

您的 IP(校外) 位址是 44.200.39.110
論文開放下載的時間是 校外不公開

Your IP address is 44.200.39.110
This thesis will be available to you on Indicate off-campus access is not available.

紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。
開放時間 available 已公開 available

QR Code