跳到主要內容

臺灣博碩士論文加值系統

(54.224.133.198) 您好!臺灣時間:2022/01/29 21:59
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

: 
twitterline
研究生:蕭銘和
研究生(外文):Ming-Ho Hsiao
論文名稱:利用自動化字幕偵測與字幕處理來擷取結構化之視訊內容
論文名稱(外文):Visual Structuring and Retrieval Based on Automatic Closed Caption Detection and Caption Processing
指導教授:李素瑛李素瑛引用關係
指導教授(外文):Suh-Yin Lee
學位類別:碩士
校院名稱:國立交通大學
系所名稱:資訊工程系
學門:工程學門
學類:電資工程學類
論文種類:學術論文
論文出版年:2002
畢業學年度:90
語文別:英文
論文頁數:57
中文關鍵詞:視訊切割字幕偵測字體大小辨識
外文關鍵詞:Video SegmentationScene IdentificationCaption LocalizationFont Size Differentiation
相關次數:
  • 被引用被引用:0
  • 點閱點閱:375
  • 評分評分:
  • 下載下載:73
  • 收藏至我的研究室書目清單書目收藏:1
我們利用階層式架構,提出一種結構化的網球影片內容瀏覽與索引方法。經過視訊切割,自動化字幕偵測與字幕字體大小辨識等方法,將影片做結構化的分析並建立在數位影片資料庫中。對數位影片資料庫而言,影片內容的結構化提供了瀏覽的能力而影片的字幕則提供更有意義的資訊。
為了建構影片的階層式架構,我們提出並整合了一些視訊處理的技術,包括影片的視訊切割,選擇適當的視訊片段,偵測視訊片段是否有字幕以及字體大小的辨識的方法。我們選擇網球影片當作研究的實例,而且利用我們所設計的自動化選擇適當視訊片段的方法,來作進一步的字幕偵測。我們可在偵測到有字幕的視訊片段,做更精確地自動化字幕檢測。利用我們提出的字幕字體大小辨識的方法,使用者可以利用此技術來過濾及選擇更有意義的字幕資訊,如比賽分數、球員名字等。具有意義的字幕資訊不僅可提供對於高階層的視訊影片架構分析和視訊影片索引,更可作為MPEG7中內容描述的資訊。我們所有提出的方法都可直接在MPEG壓縮影片中做處理,不僅節省計算的時間,更可提高視訊影片處理的效率。此研究實驗結果證明了提出的方法令人滿意。

An efficient indexing and retrieval of tennis video content is proposed using hierarchical structure. The hierarchical structure is constructed through video segmentation, shots selection and closed caption detection. The video content representation provides browsing capabilities for digital video databases. The video indexing supports more efficient content-based queries and retrieval capabilities for digital video databases.
In this thesis, a novel approach of automatic closed caption detection and font size differentiation among localized text regions in I-frames of MPEG videos is proposed. The approach consists of five modules: video segmentation, shot selection, caption frame detection, caption localization and font size differentiation. Tennis videos are selected as the case study and the module of shot selection is designed to automatically select specific type of shot for further closed caption detection. The noise of potential captions is filtered out based on the long-term consistency of the constant potential caption regions detection over consecutive frames. While the general closed captions are localized, the designed tool — font size differentiation is used as a filter to assist users in the selection of the specific and significant text captions. The significant closed captions, e.g. scores, can support high-level video structuring, video browsing, video indexing and video content description in MPEG-7. Experimental results show the effectiveness and the feasibility of the proposed scheme.

Chapter 1 Introduction…………………………………………………1
1.1 Motivation…………………………………………………….1
1.2 Organization………………………………………………….3
Chapter 2 Background……………………………………………………4
2.1 Overview of MPEG-IIstandard………………………………4
2.1.1 Codec structure of video data……………………5
2.1.2 Picture types of video data………………………6
2.2 Scene change detection method……………………………8
2.2.1 Uncompressed-domain scene change detection….8
2.2.2 Compressed-domain scene change detection…..10
2.3 Text Caption Localization Method……………........12
2.4 Overview of MPEG-7 multimedia description schemes.15
Chapter 3 Automatic Closed Caption Detection and Font Size
Differentiation in MPEG Videos………..…………….18
3.1 Overview of The proposed Scheme……………………….18
3.2 Scene change detection …………….……………………19
3.2.1 GOP-based scene change detection approach….19
3.2.1.1 Inter-GOP scene change detection…………21
3.2.1.2 Intra-GOP scene change detection…………23
3.3 Shots Selecting……………….……………………………25
3.4 Text Caption Localization…………….………………..26
3.4.1 Caption Frame Detecting……….………...….…28
3.4.2 Closed Caption Detection…………………………31
3.4.3 Font Size Differentiation……….………………33
Chapter 4 System architecture and experiment………………….40
4.1 Overview of video structure analysis and indexing
system…………………………………………………………40
4.2 Tennis video structure analysis and indexing module
…………………………………………………………………43
4.2.1 Component of video scene change detection….44
4.2.2 Component of shots grouping...…………………45
4.2.3 Component of text caption detecting…….……46
4.3 Experiment and analysis…………….……………………47
Chapter 5 Conclusion and future work…………….………………56
Bibliography………………………………………………………….…58

[1] H. Wang and S. F. Chang, “A Highly Efficient System for Automatic Face Region Detection in MPEG Video,” IEEE Transactions on Circuits and Systems for Video Technology, Vol. 7, No. 4, pp. 615-628, Aug. 1997,.
[2] Y. Zhong, H. Zhang and A. K. Jain, “Automatic Caption Localization in Compressed Video,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No. 4, pp. 385-392,Apr. 2000.
[3] H. Luo and A. Eleftheriadis, “On Face Detection in the Compressed Domain,” Proc. of ACM Multimedia , pp. 285-294, 2000.
[4] Y. Zhang and T. S. Chua, “Detection of Text Captions in Compressed Domain Video,” Proc. of ACM Multimedia Workshop, pp. 201-204, 2000.
[5] S. W. Lee, Y. M. Kim and S. W. Choi, “Fast Scene Change Detection using Direct Feature Extraction from MPEG Compressed Videos,” IEEE Transactions on Multimedia, Vol. 2, No. 4, pp. 240-254,Dec. 2000.
[6] X. Chen and H. Zhang, “Text Area Detection from Video Frames,” Proc. of 2nd IEEE Pacific Rim Conference on Multimedia, pp. 222-228, Oct. 2001.
[7] J. Nang, O. Kwon and S. Hong, “Caption Processing for MPEG Video in MC-DCT Compressed Domain,” Proc of ACM Multimedia Workshop, pp. 211-214, 2000.
[8] S. Y. Lee, J. L. Lian and D. Y. Chen, “Video Summary and Browsing Based on Story-Unit for Video-on-Demand Service,” Proc. International Conference on ICICS, Oct. 2001.
[9] J. L. Mitchell, W. B. Pennebaker, C. E. Fogg, and D. J. LeGall, “MPEG VIDEO COMPRESSION STANDARD,” Chapman&Hall, NY, USA, 1997.
[10] J. Meng, Y. Juan, S.F. Chang, “Scene Change Detection in a MPEG Compressed Video Sequence,” Proc. IS&T/SPIE, Vol. 2419, pp.14-25, 1995.
[11] H. J. Zhang, C. Y. Low, S. W. Smoliar and J. H. Wu, “Video Parsing and Browsing Using Compressed Data,” Multimedia Tools and Applications, pp. 89-111,1995.
[12] H. Li, D. Doermann and O. Kia, “Automatic Text Detection and Tracking in Digital Video,” IEEE Transactions on Image Processing, Vol. 9, No. 1, pp. 147-156,Jan. 2000.
[13] J. C. Shim, C. Dorai and R. Bollee, “Automatic Text Extraction from Video for Content-Based Annotation and Retrieval,” Proc. 14th International Conference on Pattern Recognition, pp. 618-620, 1998.
[14] J. Ohya, A. Shio and S. Akamastsu, “Recognizing Characters in Scene Images,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 16, No. 2, pp. 214-220, February 1994,.
[15] U. Gargi, S. Antani and R. Kasturi, “Indexing Text Events in Digital Video Databases,” Proc. 14th International Conference on Pattern Recognition, pp. 916-918, 1998.
[16] S. Kannangara, E. Asbun, R. X. Browning and E. J. Delp, “The Use of Nonlinear Filtering in Automatic Video Title Capture,” Proc. IEEE/EURASIP Workshop on Nonlinear Signal and Image Processing, 1997.
[17] V. Wu, R. Manmatha and E. M. Riseman, “TextFinder: An Automatic System to Detect and Recognize Text in Images,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 21, No. 11, pp. 1224-1229, November 1999.
[18] S. W. Lee and D. S. Ryu, “Parameter-Free Geometric Document Layout Analysis,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 23, No. 11, pp. 1240-1256, November 2001.
[19] R. G. Casey and E. Lecolinet, “A Survey of Methods and Strategies in Character Segmentation,” IEEE Transactions on Pattern Analysis and Machine Intelligence,” Vol. 18, No. 7, pp. 690-706, July 1996.
[20] D.Y. Chen, S. Y. Lee, “Motion-Based Semantic Event Detection for Video Content Description in MPEG-7,” Proc. of 2nd IEEE Pacific Rim Conference on Multimedia, pp. 110-117, Oct. 2001.
[21] J. L. Mitchell, W. B. Pennebaker, Chad E.Fogg, and Didier J. LeGall, “MPEG VIDEO COMPRESSION STANDARD,” Chapman&Hall, NY, USA, 1997.
[22] B. Furht, “Multimedia Systems: An Overview,” IEEE Multimedia, Vol. 1, No. 1, pp. 47-59, 1994.
[23] A. Nagasaka, and Y. Tanaka, “Automatic Video Indexing and Full-Video Search for Object Appearances,” Visual Database Systems, II, Eds. E. Knuth, and L.M. Wegner, Elsevier Science Publishers B.B., IFIP, pp. 113-127, 1992.
[24] H. J. Zhang, A. Kankanhalli, and S.W. Smoliar, “Automatic Partitioning of Full-Motion Viedo,” Multimedia Systems, Vol. 1, No. 1, pp. 10-28, 1993.
[25] F. Arman, A.Hsu, and M.Y. Chiu, “Image processing on Compressed Data for large Video Databases,” Proceedings First ACM International Conference on Multimedia, Anaheim, CA, pp. 267-272, 1993.
[26] J. Meng, Y. Juan, S.F. Chang, “Scene Change Detection in a MPEG Compressed Video Sequence,” Proceedings IS&T/SPIE, Vol. 2419, pp. 14-25, 1995.
[27] B. L. Yeo and B. Liu, “Rapid Scene Analysis on compressed Video,” IEEE Transactions on Circuits and Systems for Video Technology, Vol. 5, No. 6, pp. 533-544, 1995.
[28] MPEG Software Simulation Group, HTTP://www.mpeg.org.
[29] Xinying Wang; Zhengke Weng, " Scene abrupt change detection," Electrical and Computer Engineering, 2000 Canadian Conference on , Vol. 2 , pp.880 —883, 2000.
[30] I. K. Sethi and N. V. Patel, “A statistical approach to scene change detection,” in IS&T SPIE: Storage and Retrieval for Image and Video Databases III, vol. 2420, pp. 329—339, San Jose, CA, 1995.
[31] Salembier, P.; Smith, J.R., “MPEG-7 multimedia description schemes,” Circuits and Systems for Video Technology, IEEE Transactions on , Volume: 11 Issue: 6 , pp.748 —759, June 2001.
[32] Seong-Whan Lee; Young-Min Kim; Sung Woo Choi ,”Fast scene change detection using direct feature extraction from MPEG compressed videos,” Multimedia, IEEE Transactions on , Volume: 2 Issue: 4 , pp. 240 —254, Dec. 2000.
[33] V. Kobla, D. S. Doermann, and K.-I. Lin, “Archiving, indexing, and retrieval
of video in the compressed domain,” Proc. SPIE: MultimediaStorage and Archiving Systems, vol. 2916, pp. 78—89, 1996.
[34] R. Zabih, J. Miller, and K. Mai, “A Feature-Based Algorithm for Detecting and Classifying Scene Breaks,” Proc. ACM Multimedia, , pp.189-200, 1993.

QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top