(54.173.237.152) 您好!臺灣時間:2019/02/22 22:58
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果

詳目顯示:::

我願授權國圖
本論文永久網址: 
line
研究生:施威州
研究生(外文):Wei-Chou Shih
論文名稱:多媒體內容擷取之事件偵測
論文名稱(外文):Event Detection for Content-based Multimedia Retrieval
指導教授:王元凱王元凱引用關係
指導教授(外文):Yuan-Kai Wang
學位類別:碩士
校院名稱:輔仁大學
系所名稱:電子工程學系
學門:工程學門
學類:電資工程學類
論文出版年:2003
畢業學年度:91
語文別:中文
論文頁數:67
中文關鍵詞:多媒體檢索電影
外文關鍵詞:Multimedia RetrievalFilm
相關次數:
  • 被引用被引用:1
  • 點閱點閱:357
  • 評分評分:系統版面圖檔系統版面圖檔系統版面圖檔系統版面圖檔系統版面圖檔
  • 下載下載:0
  • 收藏至我的研究室書目清單書目收藏:0
多媒體內容擷取是一種以內容為基礎做成索引以供以後檢索查詢的研究技術,但是做成索引時是不需人工註釋定義,而採電腦自動分析處理製成索引。早期的研究多採用低階的影像特徵做為內容擷取的索引,而本篇論文是以研究在電影檢索系統中有關於場景切割以及將切割後的內容加以分析整理成數個特定的情境以供快速的瀏覽、查詢。本方法分為5個步驟:(1)藉由偵測影片中的Shot邊界將影片做正確的切割分段;(2)在每一個Shot中選出一張Key-frame以代表此一Shot的影像內容;(3)對每一張Key-frame取出特徵值;(4)對每一個Shot取出特定的情境因素特徵供查詢;(5)利用Key-frame對多個Shot組合做語意上的分類(Grouping)。經由以上五個步驟,我們可以自動化的完成一部電影的內容情境分析,且實驗的結果顯示在這套系統中對事件偵測有著非常好的正確率。

Content-based multimedia retrieval is a technology that automatically transforms multimedia content into indices for the purpose of inquiry. Early studies often used low-level image features as indices. This paper studies scene segmentation in film indexing system and organization of content after indexing for fast query. The approach: 1) shot boundaries detection; 2) key frame extraction; 3) features extraction; 4) event inference. Based on the 4 stages, we can automate scene analysis and segmentation of a film. Experimental results have shown positive outcomes.

中文摘要 i
英文摘要 ii
誌謝 iii
目錄 iv
圖目錄 vi
第一章 導論 1
1.1 研究動機 1
1.2 研究目的 1
1.3 前言 1
1.3.1 視訊資料擷取的分類 4
1.3.2 內容特徵萃取的方式 5
1.3.3 搜尋方法 6
1.4 本論文架構 7
第二章 場景切割 9
2.1 Shot邊界的種類 9
2.2我們的偵測方法 12
第三章 關鍵畫面擷取 15
3.1 一般畫面擷取 15
3.2 動態畫面辨識與選擇(Shot Identify and Select) 16
3.3 挑選關鍵畫面的流程 17
第四章 特徵萃取 18
4.1影像特徵萃取(Image Feature Extraction) 18
4.1.1 Color histogram 18
4.1.2 Fourier transform 22
4.1.3 Wavelet transform 30
4.1.4 Edge direction 34
4.2視訊特徵萃取(Video Features Extraction) 36
4.2.1 Shot length 36
4.2.2 Shot difference 37
4.2.3時間序列(Time Series) 37
4.3語意特徵的萃取(Semantic Feature) 38
第五章 事件推導 40
5.1室內及戶外(Indoor vs. Outdoor) 41
5.2城市及風景(City vs. Landscape) 43
5.3緊張及放鬆(Excitement vs. Release) 45
5.4追逐及對話(Chase vs. Conversation) 48
5.5天空(Sky) 51
第六章 實驗與結果分析 53
6.1系統介紹 53
6.1.1批次分析系統 54
6.1.2個別影像分析系統 56
6.1.3瀏覽查詢系統 58
6.2實驗討論 60
第七章 結論 62
7.1結論 62
7.2未來研究 62
參考文獻 64

[1] 沈錳坤, “Content-Based Retrieval of Video Database Systems,” 影像與識別 pp. 65~86, 87.09
[2] Z.-N. Li, O.R. Zaiane, Z. Tauber, “Illumination invariance and object model in content-based image and video retrieval,” J. Visual Commun. Image Representation, 10(3) pp. 219-244, 1999.
[3] Milind Ramesh Naphade and Thomas S. Huang, “A Probabilistic Framework for Semantic Video Indexing, Filtering, and Retrieval,” IEEE Transactions on multimedia, VOL. 3, No. 1, March 2001.
[4] Candemir Toklu and Shih-Ping Liou, “Image And Audio Sequence Visualization and Interaction Mechanisms for Structured video Browsing and Editing,” Siemens Corporate Research, USA
[5] Thomas G. Aguierre Smith Natalio C.Pincever, “Parsing Movies in Context,” USENIX-Summer ’91-Nashville, TN
[6] A. Del Bimbo, “Semantics-Based Retrieval By Content,” ICIP-2000, September 10 - 13, 2000.
[7] David Crandall and Rangachar Kasturi, “Robust Detection of Stylized Text Events in Digital Video,” Proceedings of the Sixth International Conference on Document Analysis and Recognition, Seattle, pp. 865-869, September 2001
[8] Sansanee Auephanwiriyakul, Anupam Joshi, and Raghu Krishnapuram, “Fuzzy Shot Clustering to Support Networked Video Databases,” IEEE FUZZ-IEEE 98/WCCI98, May 98.
[9] Giridharan Iyengar and Andrew Lippman, “Models for automatic classification of video sequences,” SPIE Storage and Retrieval for Image and Video Databases VI, pp. 3312-3334, 1998.
[10] G.Medioni, I.Cohen, F.Bremond, S.Hongeng, R.Nevatia, “Event Detection and Analysis from Video Streams,” IEEE Transactions on Pattern Analysis and Machine Intelligence 23(8): 873-889 (2001)
[11] A. Yoshitaka, T. Ishii, M. Hirakawa and T. Ichikawa, “Content-Based Retrieval of Video Data by the Grammar of Film,” IEEE Symposium on Visual Languages (1997) pp. 310 - 317.
[12] Niels Haering and Niels da Vitoria Lobo, “A Framework For Designing Event Detectors,” Fourth Asian Conference on Computer Vision, Taipei, Taiwan, Jan 2000.
[13] Niels Haering, Richard J.Qian, and M. Ibrahim Sezan, “A Semantic Event-Detection Approach and Its Application to Detecting Hunts in Wildlife Video,” IEEE Transactions on circuits and system for video technology, VOL. 10, No. 6, SEPTEMBER 2000
[14] Richard Qian, Niels Haering, Ibrahim Sezan, “A Computational Approach to Semantic Event Detection,” Computer Vision and Pattern Recognition-Volume 1, June 23 - 25, 1999.
[15] G.Sudhir, John C.M.Lee and Anil K. Jain, “Automatic Classification of Tennis Video for High-level Content-based Retrieval,” Content-Based Access of Image and Video Database, 1998. Proceedings., IEEE International Workshop., pp. 81-90, 1998.
[16] H. Zhang, Y. Gong, S. W. Smoliar and S. Y. Tan, “Automatic Parsing of News Video,” Proceedings of the International Conference on Multimedia Computing and Systems., pp. 45-54, 1994.
[17] 曾羽鴻, “News Video Content Search System,” 國立交通大學 資訊工程學系 88年碩士論文
[18] 江志釧, “The Study of Video Segmentation and Summary in News Video,” 國立交通大學 資訊工程學系 88年碩士論文
[19] Loong-Fah Cheong, “Scene-Based Shot Change Detection and Comparative Evaluation,” Computer Vision and Image Understanding 79, 224—235 (2000).
[20] Alberto Machi, Mario Tripiciano, “Video Shot Detection and Characterization in Semi-automatic Digital Video Restoration,” Proceedings of the International Conference on Pattern Recognition (ICPR’00)
[21] A.C. Kokaram, “Motion Picture Restoration,” Springer Verlag London, 1998.
[22] H. Sundaram and S.-F. Chang, “Video Scene Segmentation Using Video and Audio Features,” IEEE International Conference on Multimedia and Expo, New York, July, 2000.
[23] J.R. Kender B.L, Yeo, “Video Scene Segmentation Via Continuous Video Coherence,” CVPR ’98, Santa Barbara CA, Jun. 1998.
[24] 蘇志文, “Automated Digital Movie Skimming,” 輔仁大學 資訊工程學系 90年碩士論文
[25] H. Jiang, A. Helal, A. K. Elmagarmid, and A. Joshi. “Scene Change Detection Techniques for Video Database Systems, ” ACM Multimedia Systems, 6:3, May 1998.
[26] Yueting Zhuang, Yong Rui and Thomas S.Huang. “Adaptive Key Frame Extraction Using Unsupervised Clustering,” IEEE ICIP'98, Oct. 1998, Chicago, USA.
[27] Eung Kwan, Sung Joo, and Jong Soo, “Video Retrieval Based On Key Frame Extraction in Compressed Domain,” IEEE ICIP 1999, pp. 260-263
[28] F. Dufaux, “Key frame selection to represent a video, ” IEEE Int. Conf. Image Processing , Vancouver, Canada, Sept. 2000, vol. 2, pp. 275-278.
[29] A. K. Jain and A. Vailaya, “Image retrieval using color and shape,” Pattern Recognition, vol. 29, no. 8, pp. 1233 ~ 1244, 1996.
[30] 陳錫動, “Video Indexing Using Color Histogram and Camera Operation,” 國立中央大學 資訊工程研究所 88年碩士論文
[31] X. Wan and C.c.j. Kuo, “A New Approach to Image Retrieval with Hierarchical Color Clustering,” IEEE trans. Circuits and systems for video technology, vol. 8, no. 5, Sep. 1998.
[32] R. Jain, R. Kasturi and B. Schunck, “Machine Vision,” McGraw Hill, 1995.
[33] K. R. Rao and J. J. Hwang, “Techniques and standards for image、video and audio coding,” Prentice-Hall. Inc, New Jersey, U.S.A. 1996.
[34] M. Craizer, E. A. B. D. Silva and E. G. Ramos, “Convergent algorithms for successive approximation vector quantisation with applications to wavelet image compression,” IEEE Proceedings-Vision Image and Signal Processing, Vol. 146, No. 3, Jun. 1999, pp. 159-164.
[35] H. J. Wang and C. C. J. Huo, “A multi-threshold wavelet coder (MTWC) for high fidelity image compression,” IEEE Proceeding International Conference on Image Processing, Vol. 1, 1997, pp. 652-655.
[36] J. R. Parker, “Algorithms for Image Processing and Computer Vision,” WILEY
[37] J. M. Zhong, C. H. Leung and Y. Y. Tang, “Wavelet image coding based on significance extraction using morphological operation,” IEE Proceedings Image and Signal Processing, Vol. 146, No. 4, Aug. 1999, pp. 206-210.
[38] A. Yoshitaka, T. Ishii, M. Hirakawa, T. Ichikawa, “Content-Based Retrieval of Video Data by the Grammar of Film, ” Visual Languages, 1997. Proceedings. 1997 IEEE Symposium , 1997 pp. 310 —317
[39] Y. Gong, L. T. Sin, C. H. Chuan, H. Zhang and M. Sakauchi, “Automatic Parsing of TV Soccer Programs,” Proceedings of the International Conference on Multimedia Computing and Systems, pp. 167-174, 1995.
[40] Huai-Rong Shao, Wenwu Zhu, and Ya-Qin Zhang, “Scalable Object-Based Video Multicasting Over The Internet,” IEEE Int. Conf. Image Processing (ICIP2000), Canada, Sept. 2000.
[41] Aditya Vailaya, Mirio Figueiredo, Anil Jain, HongJiang Zhang, “Content-Based Hierarchical Classification of Vacation Images, ” Multimedia Computing and Systems, 1999. IEEE International Conference, Volume: 1 , 1999 pp. 518 -523 vol.1
[42] Sameer Antani, Rangachar Kasturi, Ramesh Jain, “A survey on the use of pattern recognition methods for abstraction, indexing and retrieval of images and video,” Pattern Recognition 35 (2002) pp. 945-965

QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top
系統版面圖檔 系統版面圖檔