
National Digital Library of Theses and Dissertations in Taiwan (臺灣博碩士論文加值系統)

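Graduate student: 鄭煒平
Graduate student (romanized): WeiPing Cheng
Thesis title: MPEG電影音訊自動分段方法
Thesis title (English): A Method for Automatic Audio Segmentation in MPEG Movies
Advisor: 劉志俊
Advisor (romanized): C.C. Liu
Degree: Master's
Institution: Chung Hua University (中華大學)
Department: Master's Program, Department of Computer Science and Information Engineering (資訊工程學系碩士班)
Discipline: Engineering
Field: Electrical and Computer Engineering
Thesis type: Academic thesis
Year of publication: 2005
Academic year of graduation: 93 (ROC calendar)
Language: Chinese
Pages: 48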
Keywords: MPEG-7; MPEG audio; audio segmentation; sound effect database; content-based retrieval
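With the rapid development of information software, hardware, and Internet technology, the digital multimedia available online today is highly diverse and easy to obtain, so preserving and organizing such abundant digital data has become an important problem. Among these media, movies are one of the most interesting and also most complex types of multimedia data. Describing the content features of the movies in an Internet movie database is the prerequisite for all research on movie content analysis.
In previous work we proposed an effective and novel method for identifying movie genres. That work used MPEG-7 audio features to classify all the sound effects in a movie, built an Internet movie database, and constructed audio profiles of movies. Its shortcoming was that all the sound effects in a movie had to be segmented manually. This thesis addresses that problem: we propose a method that uses a set of MPEG-7 audio features to segment movie audio automatically, so that the earlier movie profile analysis system can perform automatic segmentation and retrieval of movie audio and large collections of digital movies can be analyzed automatically.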
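The abstract does not spell out the segmentation procedure, but the thesis outline mentions a threshold constant on feature boundary difference values and per-feature weights. As a rough illustration of that general idea only, the sketch below marks a breakpoint wherever the weighted distance between adjacent feature vectors exceeds a threshold. Everything here is an assumption made for illustration: the function names, the weighted Euclidean distance, the two-dimensional toy features, and the threshold value are hypothetical and are not taken from the thesis.

```python
from math import sqrt


def boundary_scores(frames, weights):
    """Weighted Euclidean distance between each pair of adjacent feature vectors.

    `frames` is a sequence of equal-length feature vectors, one per audio frame.
    The distance measure is an assumption; the thesis may use a different one.
    """
    scores = []
    for prev, curr in zip(frames, frames[1:]):
        d = sqrt(sum(w * (a - b) ** 2 for w, a, b in zip(weights, prev, curr)))
        scores.append(d)
    return scores


def detect_breakpoints(frames, weights, threshold):
    """Return every index i such that a new segment starts at frames[i]."""
    scores = boundary_scores(frames, weights)
    return [i + 1 for i, s in enumerate(scores) if s > threshold]


# Toy data: hypothetical 2-D feature vectors, not real MPEG-7 descriptors.
frames = [
    (0.10, 0.20), (0.12, 0.21), (0.11, 0.19),  # first segment
    (0.80, 0.90), (0.82, 0.88),                # second segment
    (0.20, 0.10),                              # third segment
]
weights = (1.0, 1.0)   # hypothetical equal weighting of the two features
breakpoints = detect_breakpoints(frames, weights, threshold=0.5)
print(breakpoints)     # [3, 5]
```

In a scheme like this, raising the threshold yields fewer and longer segments, while changing the weights shifts which features dominate the boundary decision. The outline suggests the thesis tunes both experimentally.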
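1. Introduction 5
2. Related Work 7
3. Introduction to MPEG Movie Audio 10
3.1 Introduction to MPEG Audio 10
3.2 Characteristics of MPEG Audio Decoding Coefficients 11
3.3 MP3 Feature Vectors 11
4. System Architecture for Movie Sound Effect Segmentation 13
5. Audio Features 15
5.1 MPEG-7 Audio Feature Set 15
5.2 Non-MPEG-7 Audio Feature Set 15
6. Audio Breakpoint Detection Methods 19
6.1 Heuristic (Rule-of-Thumb) Breakpoint Detection 19
6.2 Classification-Based Breakpoint Detection 20
6.2.1 kNN Classifier 21
6.2.2 RCE Neural Network Classifier 23
6.2.3 Movie Sound Effect Feature Database 26
7. Experiments 28
7.1 Experimental Setup 28
7.1.1 Software and Hardware 28
7.1.2 Experimental Samples 28
7.1.3 Performance Evaluation Method 29
7.1.4 Factors Affecting Segmentation Performance 30
7.2 Results of Heuristic Segmentation 30
7.2.1 Selecting the Threshold Constant for Audio Feature Boundary Difference Values 30
7.2.2 Setting the Weights of the Audio Features 35
7.2.3 Size of Audio Data Frame Sections 37
7.3 Results of Classification-Based Segmentation 37
7.4 Comparison of the Two Methods 38
9. References 41
Appendix A: MPEG-7 Feature Formulas 45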