Use of Text Summarization for Supporting Event Detection
text summarization, event detection, environmental scanning
Environmental scanning, which acquires and uses information about events, trends, and changes in an organization’s external environment, is an important process in the strategic management of an organization and permits the organization to adapt quickly to changes in its external environment. Event detection, which detects the onset of new events from news documents, is essential to facilitating an organization’s environmental scanning activity. However, traditional feature-based event detection techniques detect events by comparing the similarity between features of news stories, and this approach incurs several problems. For example, a news story may contain sentences or paragraphs, included for illustration or comparison purposes, that are not highly relevant to defining its event. Unless such less relevant sentences or paragraphs are removed before detection, the effectiveness of traditional event detection techniques may suffer. In this study, we developed a summary-based event detection (SED) technique that filters out less relevant sentences or paragraphs in a news story before performing feature-based event detection. With a traditional feature-based event detection technique (i.e., INCR) as the benchmark, the empirical evaluation results showed that the proposed SED technique achieved comparable or even better detection effectiveness (measured by miss and false alarm rates) than the INCR technique for data corpora in which the percentage of news stories discussing old events is high.
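The two effectiveness measures named in the abstract can be illustrated with a minimal sketch. It assumes the usual convention from new-event detection evaluations: a miss is a story on a new event that the system labels as old, and a false alarm is a story on an old event that the system labels as new. The function name and the boolean-list input format are hypothetical, chosen only for illustration, and are not taken from the thesis.

```python
def detection_error_rates(truth, predicted):
    """Return (miss_rate, false_alarm_rate) for new-event detection.

    truth and predicted are equal-length lists of booleans, where
    True means "this story reports a new event" (assumed encoding).
    """
    if len(truth) != len(predicted):
        raise ValueError("truth and predicted must have the same length")

    new_total = sum(1 for t in truth if t)   # stories that really are new events
    old_total = len(truth) - new_total       # stories that discuss old events

    # Miss: a new-event story that the system labeled as old.
    misses = sum(1 for t, p in zip(truth, predicted) if t and not p)
    # False alarm: an old-event story that the system labeled as new.
    false_alarms = sum(1 for t, p in zip(truth, predicted) if not t and p)

    miss_rate = misses / new_total if new_total else 0.0
    false_alarm_rate = false_alarms / old_total if old_total else 0.0
    return miss_rate, false_alarm_rate


if __name__ == "__main__":
    truth = [True, False, False, True, False]
    predicted = [True, True, False, False, False]
    print(detection_error_rates(truth, predicted))
```

Lower values are better for both rates. In a corpus where most stories discuss old events, the false alarm rate is computed over many more stories than the miss rate.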
Table of Contents
Chapter 1 Introduction
1.1 Background
1.2 Research Motivation and Objective
1.3 Organization of the Thesis
Chapter 2 Literature Review
2.1 Event Detection
2.2 Text Summarization
2.2.1 Edmundson’s Approach
2.2.2 Kupiec et al.’s Approach
2.2.3 Teufel and Moens’ Approach
2.2.4 Mani and Bloedorn’s Approach
2.2.5 Neto et al.’s Approach
2.2.6 Myaeng and Jang’s Approach
2.2.7 Summary of Text Summarization Approaches
Chapter 3 Development of Summary-based Event Detection (SED) Technique
3.1 Process of Summary-based Event Detection (SED) Technique
3.2 News Summarization Phase
3.2.1 News Summarization Learning Task
3.2.2 News Summary Generation Task
3.3 Event Detection Phase
Chapter 4 Empirical Evaluation
4.1 Evaluation Design
4.1.1 Data Collection and Summary Preparation
4.1.2 Evaluation Criteria for Event Detection
4.1.3 Performance Benchmarks for Event Detection
4.2 Evaluation Result
4.2.1 Parameter Tuning
4.2.2 Comparative Evaluation of Event Detection Techniques
Chapter 5 Conclusions and Future Research Directions
Appendix A: List of Stop Words
Appendix B: Sentence Representation Schemes Employed by Existing Text Summarization Approaches
References
Full Text
Thesis access permission: available on campus immediately; available off campus after one year
Availability:
Campus: available
Off-campus: available

Printed Copies
Availability: available

